Google launches two new open LLMs

Barely a week after launching the latest iteration of its Gemini models, Google today announced the launch of Gemma, a new family of lightweight open-weight models. Starting with Gemma 2B and Gemma 7B, these new models were “inspired by Gemini” and are available for commercial and research usage.

Google did not provide us with a detailed paper on how these models perform against similar models from Meta and Mistral, for example, and only noted that they are “state-of-the-art.” The company did note that these are dense decoder-only models, though, which is the same architecture it used for its Gemini models (and its earlier PaLM models) and that we will see the benchmarks later today on Hugging Face’s leaderboard.

To get started with Gemma, developers can get access to ready-to-use Colab and Kaggle notebooks, as well as integrations with Hugging Face, MaxText and Nvidia’s NeMo. Once pre-trained and tuned, these models can then run everywhere.

While Google highlights that these are open models, it’s worth noting that they are not open-source. Indeed, in a press briefing ahead of today’s announcement, Google’s Janine Banks stressed the company’s commitment to open source but also noted that Google is very intentional about how it refers to the Gemma models.

“[Open models] has become pretty pervasive now in the industry,” Banks said. “And it often refers to open weights models, where there is wide access for developers and researchers to customize and fine-tune models but, at the same time, the terms of use — things like redistribution, as well as ownership of those variants that are developed — vary based on the model’s own specific terms of use. And so we see some difference between what we would traditionally refer to as open source and we decided that it made the most sense to refer to our Gemma models as open models.”

That means developers can use the model for inferencing and fine-tune them at will and Google’s team argues that even though these model sizes are a good fit for a lot of use cases.

“The generation quality has gone significantly up in the last year,” Google DeepMind product management director Tris Warkentin said. “things that previously would have been the remit of extremely large models are now possible with state-of-the-art smaller models. This unlocks completely new ways of developing AI applications that we’re pretty excited about, including being able to run inference and do tuning on your local developer desktop or laptop with your RTX GPU or on a single host in GCP with Cloud TPUs, as well.”

That is true of the open models from Google’s competitors in this space as well, so we’ll have to see how the Gemma models perform in real-world scenarios.

In addition to the new models, Google is also releasing a new responsible generative AI toolkit to provide “guidance and essential tools for creating safer AI applications with Gemma,” as well as a debugging tool.

Source link

Previous News

Samphire Neuroscience is building a brain stimulating wearable for period pain

Next News

Klub invests in animal nutrition brand eFeed, securing funding

Google launches two new open LLMs

Disclaimer

Popular

Vodafone Idea Plan for 56 Days which is Good Value

The stark divide in the UAE and India war info systems

Today’s NYT Mini Crossword Answers for March 21

CBS News Shutters Radio Service After Nearly a Century

Accenture’s Q2FY26 revenues of $18 billion point to stable demand for Indian IT

More Like this

Linux kernel scale is swamping an already-flawed CVE system

WhatsApp will let messages disappear after being read on Android

Credilio kicks off Series A; valuation jumps 2.7X

A Top Democrat Is Urging Colleagues to Support Trump’s Spy Machine

Today’s NYT Mini Crossword Answers for March 21

The Best Micro Four Thirds Lenses We’ve Tested for 2026

Google launches two new open LLMs

Disclaimer

More like this

Linux kernel scale is swamping an already-flawed CVE system

WhatsApp will let messages disappear after being read on...

Credilio kicks off Series A; valuation jumps 2.7X

Popular

Block title

How Much?! The Complete Guide to Streaming Service Costs and Price Hikes

Xbox Ally X gets Auto SR with up to 30% better performance

DrinkPrime secures $2.2M from Artha Continuum Fund and Mirabilis Investment Trust to scale its...

Crypto.com cuts 12% of staff

Why some experts don't think AI can cure cancer

Vivo V70 FE India Launch Timeline and Price Leaked

Microsoft Says It Is Fixing Windows 11

Startup Events

Trending News

Linux kernel scale is swamping an already-flawed CVE system

WhatsApp will let messages disappear after being read on Android

Credilio kicks off Series A; valuation jumps 2.7X

A Top Democrat Is Urging Colleagues to Support Trump’s Spy Machine

Today’s NYT Mini Crossword Answers for March 21

About

Partnership

Contact us