Mistral AI Unveils Mistral Large 2, Beats Llama 3.1 on Code and Math

A day after Meta released Llama 3.1, Mistral AI has announced Mistral Large 2, the latest generation of its flagship model, offering substantial improvements in code generation, mathematics, and multilingual support. The model introduces advanced function-calling capabilities and is available on la Plateforme.

With a 128k-token context window and support for dozens of languages, including French, German, Spanish, and Chinese, Mistral Large 2 aims to cater to diverse linguistic needs. It also supports more than 80 programming languages, including Python, Java, and C++. At 123 billion parameters, the model is designed for single-node inference and long-context applications.

Mistral Large 2 is released under the Mistral Research License, which permits research and non-commercial use. It achieves 84.0% accuracy on the MMLU benchmark, setting a new standard for performance and cost efficiency among open models. In code generation and reasoning, it competes with leading models such as GPT-4o and Llama 3.

Training focused on reducing hallucinations and ensuring accurate outputs, which significantly enhanced the model’s reasoning and problem-solving skills. Mistral Large 2 is also trained to acknowledge when it cannot provide a solution, reflecting Mistral AI’s emphasis on accuracy.

Improvements in instruction-following and conversational capabilities are evident, with the model excelling in benchmarks such as MT-Bench, Wild Bench, and Arena Hard. Mistral AI emphasizes concise responses, vital for business applications.

Mistral Large 2’s multilingual proficiency includes languages like Russian, Japanese, and Arabic, performing strongly on the multilingual MMLU benchmark. It also features enhanced function calling skills, making it suitable for complex business applications.
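To illustrate what function calling looks like in practice, here is a minimal sketch against Mistral’s public chat completions endpoint, which accepts OpenAI-style tool definitions. The get_exchange_rate tool and the prompt are hypothetical, invented for this example; only the endpoint URL and the mistral-large-2407 model name come from the release.

```python
import os
import requests

# A hypothetical tool the model can choose to call, described in the
# JSON-schema format that Mistral's chat API accepts for function calling.
tools = [{
    "type": "function",
    "function": {
        "name": "get_exchange_rate",
        "description": "Look up the current exchange rate between two currencies.",
        "parameters": {
            "type": "object",
            "properties": {
                "base": {"type": "string", "description": "ISO code, e.g. EUR"},
                "quote": {"type": "string", "description": "ISO code, e.g. USD"},
            },
            "required": ["base", "quote"],
        },
    },
}]

resp = requests.post(
    "https://api.mistral.ai/v1/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['MISTRAL_API_KEY']}"},
    json={
        "model": "mistral-large-2407",
        "messages": [{"role": "user", "content": "How many US dollars is 50 euros?"}],
        "tools": tools,
        "tool_choice": "auto",  # let the model decide whether to call the tool
    },
    timeout=60,
)
resp.raise_for_status()
message = resp.json()["choices"][0]["message"]

# If the model opted to call the tool, each call carries the function
# name and its arguments serialized as a JSON string.
for call in message.get("tool_calls") or []:
    print(call["function"]["name"], call["function"]["arguments"])
```

In the usual flow, the application then executes the requested function and sends the result back in a follow-up message so the model can compose its final answer.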

Users can access Mistral Large 2 via la Plateforme under the name mistral-large-2407. Mistral AI is consolidating its offerings around the general-purpose models Mistral NeMo and Mistral Large and the specialist models Codestral and Embed. Fine-tuning capabilities have also been extended to these models.
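As a rough sketch of what that access looks like, the request below calls the model by that name over Mistral’s public chat completions endpoint, assuming an API key is set in the MISTRAL_API_KEY environment variable; the prompt is only illustrative.

```python
import os
import requests

# Minimal chat completion against la Plateforme using the new model name.
resp = requests.post(
    "https://api.mistral.ai/v1/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['MISTRAL_API_KEY']}"},
    json={
        "model": "mistral-large-2407",
        "messages": [
            {"role": "user",
             "content": "Write a Python function that checks whether a number is prime."},
        ],
    },
    timeout=60,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```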

The model is available through partnerships with Google Cloud Platform, Azure AI Studio, Amazon Bedrock, and IBM watsonx.ai. This expansion aims to bring Mistral AI’s advanced models to a global audience, enhancing accessibility and application development.

Mistral Large 2 is the fourth model from the company in the past week, following the release of MathΣtral, a specialized 7B model designed for advanced mathematical reasoning and scientific exploration. 

The company also released Codestral Mamba 7B, based on the advanced Mamba 2 architecture, which is trained with a context length of 256k tokens and built for code generation tasks for developers worldwide. Additionally, Mistral AI introduced Mistral NeMo, a 12-billion parameter model with a 128k token context length, developed in partnership with NVIDIA.
