Apple releases new family of Open-source Efficient Language Models as AI work progresses

Share via:


Ahead of iOS 18’s debut at WWDC in June, Apple has released a family of open-source large language models. Called OpenELM, Apple describes these as: a family of Open-source Efficient Language Models.

In its testing, Apple says that OpenELM offers similar performance to other open language models, but with less training data.

Apple explains:

To this end, we release OpenELM, a state-of-the-art open language model. OpenELM uses a layer-wise scaling strategy to efficiently allocate parameters within each layer of the transformer model, leading to enhanced accuracy. For example, with a parameter budget of approximately one billion parameters, OpenELM exhibits a 2.36% improvement in accuracy compared to OLMo while requiring 2× fewer pre-training tokens.

Diverging from prior practices that only provide model weights and inference code, and pre-train on private datasets, our release includes the complete framework for training and evaluation of the language model on publicly available datasets, including training logs, multiple checkpoints, and pre-training configurations. We also release code to convert models to MLX library for inference and fine-tuning on Apple devices. This comprehensive release aims to empower and strengthen the open research community, paving the way for future open research endeavors.

You can find more details at the links below:

iOS 18 will include a collection of new artificial intelligence features, and today’s OpenELM release is just the latest piece of Apple’s behind-the-scenes work in preparation.

Bloomberg reported last week that iOS 18’s AI features will be powered by an entirely on-device large language model, which will offer privacy and speed benefits.

Follow ChanceThreadsTwitterInstagram, and Mastodon

FTC: We use income earning auto affiliate links. More.





Source link

Disclaimer

We strive to uphold the highest ethical standards in all of our reporting and coverage. We StartupNews.fyi want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It’s possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.

Popular

More Like this

Apple releases new family of Open-source Efficient Language Models as AI work progresses


Ahead of iOS 18’s debut at WWDC in June, Apple has released a family of open-source large language models. Called OpenELM, Apple describes these as: a family of Open-source Efficient Language Models.

In its testing, Apple says that OpenELM offers similar performance to other open language models, but with less training data.

Apple explains:

To this end, we release OpenELM, a state-of-the-art open language model. OpenELM uses a layer-wise scaling strategy to efficiently allocate parameters within each layer of the transformer model, leading to enhanced accuracy. For example, with a parameter budget of approximately one billion parameters, OpenELM exhibits a 2.36% improvement in accuracy compared to OLMo while requiring 2× fewer pre-training tokens.

Diverging from prior practices that only provide model weights and inference code, and pre-train on private datasets, our release includes the complete framework for training and evaluation of the language model on publicly available datasets, including training logs, multiple checkpoints, and pre-training configurations. We also release code to convert models to MLX library for inference and fine-tuning on Apple devices. This comprehensive release aims to empower and strengthen the open research community, paving the way for future open research endeavors.

You can find more details at the links below:

iOS 18 will include a collection of new artificial intelligence features, and today’s OpenELM release is just the latest piece of Apple’s behind-the-scenes work in preparation.

Bloomberg reported last week that iOS 18’s AI features will be powered by an entirely on-device large language model, which will offer privacy and speed benefits.

Follow ChanceThreadsTwitterInstagram, and Mastodon

FTC: We use income earning auto affiliate links. More.





Source link

Disclaimer

We strive to uphold the highest ethical standards in all of our reporting and coverage. We StartupNews.fyi want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It’s possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.

Website Upgradation is going on for any glitch kindly connect at office@startupnews.fyi

More like this

Shanghai startup begins mass-producing its humanoid robots

AgiBot is backed by major investors such as...

Israeli fintech firm acquired by Italian company for $150m

Morning was originally founded in 2011 under the...

xAI is testing a standalone iOS app for its...

Elon Musk’s AI company, xAI, is testing out...

Popular

Upcoming Events

Startup Information that matters. Get in your inbox Daily!