xAI’s Grok-2 Ranks Second on the Chatbot Arena Leaderboard, Competing with Gemini 1.5 and GPT-4o

In an exciting development from the xAI team, Grok-2 and Grok-Mini have officially secured positions on the LMSys Chatbot Arena leaderboard. Grok-2 has taken the #2 spot, surpassing GPT-4o (May) and tying with the latest Gemini model, driven by over 6,000 community votes.

Meanwhile, Grok-2-Mini has earned the #5 position.

Chatbot Arena update❤️‍🔥

Exciting news—@xAI‘s Grok-2 and Grok-mini are now officially on the leaderboard!

With over 6000 community votes, Grok-2 has claimed the #2 spot, surpassing GPT-4o (May) and tying with the latest Gemini! Grok-2-mini also impresses at #5.

Grok-2 excels in… pic.twitter.com/5lyQgratJQ

— lmsys.org (@lmsysorg) August 23, 2024

Grok-2 has excelled particularly in mathematical tasks, ranking #1 in this category, and secured the #2 positions across various other tasks, including hard prompts, coding, and instruction-following.

Additionally, Grok-2-Mini has undergone significant speed enhancements, now performing twice as fast as before. This boost was achieved after xAI’s inference team as they completely rewrote the inference stack using SGLang, enabling more efficient multi-host inference and improved accuracy.

The team also introduced new algorithms for computation and communication kernels, alongside better batch scheduling and quantisation, further enhancing the models’ performance.

Several people are still sceptical about the performance. OpenAI’s GPT-4o, which claims the top spot, does not perform as well as Claude 3.5, which is at the 5th spot. Though, people have started experimenting with Grok-2 and claim that the model is actually brilliant in coding and maths related tasks.

Released in Beta this month, the Grok-2 family of models are also available for testing on X. The model also allows users to generate images using the FLUX.1 image generation model.

Source link

Previous News

No Spl Purpose For CBDC, UPI Already Prevalent: Ex-RBI Dy Guv

Next News

Polygon’s Discord channel hacked, team works to regain control

Disclaimer

Popular

Microsoft to Introduce Voice Reporting Feature for Xbox

Adobe teams up with India’s Education Ministry for creative learning initiative

Meta May Allow Instagram and Facebook Users in Europe to Pay to Avoid Ads

Indian fintechs amplify payments soundbox pitches to woo merchants

Fintech Unicorn Pine Labs Launches Mini — A QR-First Device With Card Support

More Like this

Chinese Tether laundromat, Bhutan enjoys recent Bitcoin boost: Asia Express

First iPhone 16 pre-orders arrive as lines form at Apple Stores around the world

OpenAI o1 “Strawberry” Finally Available on GitHub Copilot Chat with VS Code Integration

Here is what’s illegal under California’s 8 (and counting) new AI laws

Californians can now add their driver’s licenses to Apple Wallet

Chinese Tether laundromat, Bhutan enjoys recent Bitcoin boost: Asia Express

xAI’s Grok-2 Ranks Second on the Chatbot Arena Leaderboard, Competing with Gemini 1.5 and GPT-4o

Disclaimer

More like this

Chinese Tether laundromat, Bhutan enjoys recent Bitcoin boost: Asia...

First iPhone 16 pre-orders arrive as lines form at...

OpenAI o1 “Strawberry” Finally Available on GitHub Copilot Chat...

Popular

Apple releases new firmware version for AirPods Pro 2 and AirPods 4

Railways Developing A Super App: Ashwini Vaishnaw

Moneyboxx To Raise INR 176 Cr To Expand Its Lending Play

Wealthtech Centricity Bags $20 Mn To Build GenAI Modules

MCA Exempts Startups Looking To Reverse Flip From NCLT Nod

iPhone users can stay on iOS 17 and get security patches

Xiaomi India Ropes In Ex-Motorola Exec Sudhin Mathur As COO

Upcoming Events

Fintech Revolution Summit | Jakarta | October 24

International Technology Congress 2024 Moscow | Russia | September 17 - 19

Token 2049 | Singapore | Sept 18-19

ECODOX 4.0 | Delhi | September 18 - 19

Startup Meetup (RTF) | Gurugram | September 20

StartupNews.fyi

StartupNews.fyi

xAI’s Grok-2 Ranks Second on the Chatbot Arena Leaderboard, Competing with Gemini 1.5 and GPT-4o

Disclaimer

Popular

More Like this

xAI’s Grok-2 Ranks Second on the Chatbot Arena Leaderboard, Competing with Gemini 1.5 and GPT-4o

Disclaimer

More like this

Popular

Upcoming Events

Newsletter Signup Form!

Newsletter Signup Form!