Microsoft Releases Phi-2, Outperforms Gemini Nano, Mistral 7B, and Llama 2 Models


Microsoft has released its Small Language Model (SLM) Phi-2, a 2.7-billion-parameter language model showcasing exceptional reasoning and language-understanding abilities.

Phi-2 is a Transformer-based model with a next-word prediction objective, trained on 1.4T tokens drawn from a mix of synthetic and web datasets covering NLP and coding. Training took 14 days on 96 A100 GPUs, and the result is a base model that has undergone neither alignment through reinforcement learning from human feedback (RLHF) nor instruction fine-tuning.
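For readers who want to try the model, below is a minimal sketch of loading Phi-2 with the Hugging Face transformers library. The microsoft/phi-2 checkpoint name is where Microsoft publishes the weights; the GPU assumption and the example prompt are ours, not from the announcement. Because Phi-2 is a base model with no instruction tuning, it is prompted completion-style rather than chat-style.

```python
# Minimal sketch: loading Phi-2 from the Hugging Face Hub.
# Assumes the "microsoft/phi-2" checkpoint and a CUDA-capable GPU;
# on CPU-only machines, drop the .to("cuda") calls and use float32.
# Older transformers releases may additionally need trust_remote_code=True.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained(
    "microsoft/phi-2", torch_dtype=torch.float16
).to("cuda")
tokenizer = AutoTokenizer.from_pretrained("microsoft/phi-2")

# A base (non-instruct) model is given text to complete,
# not a chat-formatted conversation.
prompt = 'def fibonacci(n):\n    """Return the n-th Fibonacci number."""\n'
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
outputs = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```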

Despite its modest 2.7 billion parameters, Phi-2 outperforms the 7B-parameter Mistral model and the Llama-2 models at both 7B and 13B parameters across various aggregated benchmarks. Particularly noteworthy is its superior performance, compared with the significantly larger 70B-parameter Llama-2 model, on multi-step reasoning tasks such as coding and math.

Furthermore, Phi-2 matches or outperforms the recently announced Google Gemini Nano 2, despite being the smaller model.

Microsoft couldn’t help but make a subtle reference to Google’s staged demo video for Gemini, which received significant criticism. In the video, Google showcased its upcoming AI model, Gemini Ultra, solving complex physics problems and rectifying students’ errors. 

Interestingly, Microsoft highlighted that Phi-2, despite likely being a fraction of the size of Gemini Ultra, was able to provide accurate answers and correct students' mistakes using similar prompts.
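As an illustration of that kind of prompt, here is a hedged sketch in the same setup as above, reusing the model and tokenizer loaded earlier. The physics problem and the student's incorrect answer are invented for this example; Microsoft's actual demo prompts are not reproduced here.

```python
# Illustrative only: the problem and the student's wrong answer below are
# made up. Reuses `model` and `tokenizer` from the loading sketch above.
prompt = (
    "A ball is dropped from a height of 20 m. Taking g = 10 m/s^2, "
    "how long does the ball take to reach the ground?\n\n"
    "Student's answer: t = 20 / (10 / 2) = 4 s.\n\n"
    "Is the student's answer correct? If not, explain the mistake and "
    "give the correct solution.\n"
)
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
outputs = model.generate(**inputs, max_new_tokens=150)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

(The correct answer is t = sqrt(2h/g) = 2 s; the invented student forgot the square root.)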

The post Microsoft Releases Phi-2, Outperforms Gemini Nano, Mistral 7B, and Llama 2 Models appeared first on Analytics India Magazine.

Disclaimer

We strive to uphold the highest ethical standards in all of our reporting and coverage. We at StartupNews.fyi want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It's possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.
