Google open-sources tools to support AI model development

In a typical year, Cloud Next — one of Google’s two major annual developer conferences, the other being I/O — almost exclusively features managed and otherwise closed-source, gated-behind-locked-down-APIs products and services. But this year, whether to foster developer goodwill or advance its ecosystem ambitions (or both), Google debuted a number of open-source tools primarily aimed at supporting generative AI projects and infrastructure.

The first, MaxDiffusion, which Google actually quietly released in February, is a collection of reference implementations of various diffusion models (models like the image generator Stable Diffusion) that run on XLA devices. “XLA” stands for Accelerated Linear Algebra, an admittedly awkward acronym referring to a compiler that optimizes and speeds up specific types of AI workloads, including fine-tuning and serving.

Google’s own tensor processing units (TPUs) are XLA devices, as are recent Nvidia GPUs.

Beyond MaxDiffusion, Google’s launching JetStream, a new engine to run generative AI models, specifically text-generating models (so not Stable Diffusion). Currently limited to TPUs, with GPU compatibility supposedly coming in the future, JetStream offers up to 3x higher “performance per dollar” for models like Google’s own Gemma 7B and Meta’s Llama 2, Google claims.

“As customers bring their AI workloads to production, there’s an increasing demand for a cost-efficient inference stack that delivers high performance,” Mark Lohmeyer, Google Cloud’s GM of compute and machine learning infrastructure, wrote in a blog post shared with TechCrunch. “JetStream helps with this need … and includes optimizations for popular open models such as Llama 2 and Gemma.”

Now, “3x” improvement is quite a claim to make, and it’s not exactly clear how Google arrived at that figure. Using which generation of TPU? Compared to which baseline engine? And how’s “performance” being defined here, anyway?

I’ve asked Google all these questions and will update this post if I hear back.

Second-to-last on the list of Google’s open-source contributions are new additions to MaxText, Google’s collection of text-generating AI models targeting TPUs and Nvidia GPUs in the cloud. MaxText now includes Gemma 7B, OpenAI’s GPT-3 (the predecessor to GPT-4), Llama 2 and models from AI startup Mistral — all of which Google says can be customized and fine-tuned to developers’ needs.

“We’ve heavily optimized [the models’] performance on TPUs and also partnered closely with Nvidia to optimize performance on large GPU clusters,” Lohmeyer said. “These improvements maximize GPU and TPU utilization, leading to higher energy efficiency and cost optimization.”

Finally, Google’s collaborated with Hugging Face, the AI startup, to create Optimum TPU, which provides tooling to bring certain AI workloads to TPUs. The goal, according to Google, is to lower the barrier to entry for getting generative AI models, in particular text-generating models, onto TPU hardware.

But at present, Optimum TPU is a bit bare bones. The only model it works with is Gemma 7B. And Optimum TPU doesn’t yet support training generative models on TPUs — only running them.

Google’s promising improvements down the line.
