NexusRaven Outperforms GPT-4 for Zero-shot Function Calling

Nexusflow.ai, has recently launched NexusRaven-V2, a powerful 13-billion parameter LLM that outperforms GPT-4 in zero-shot function calling. The open source model showcases a remarkable capability to transform natural language instructions into executable code, facilitating the utilisation of software tools by copilots and agents.

NexusRaven-V2 demonstrates superiority over GPT-4 by achieving up to a 7% higher success rate in function calling in human-generated use cases involving nested and composite functions. Notably, NexusRaven-V2 accomplishes this without prior training on the specific functions used in the evaluation.

Check out the model on GitHub here, and on Hugging Face here.

Nexusflow.ai introduces the Nexus-Function-Calling benchmark, establishing a Hugging Face leaderboard. This includes a diverse collection of real-life human-curated function-calling examples, with eight out of the nine benchmarks open-sourced.

Open models now starting to surpass GPT4 for specialized tasks. Let’s go!

Model by @NexusflowX: https://t.co/TBUBrevTpJ

Leaderboard: https://t.co/jbvk3U8Ibt pic.twitter.com/G3tEtB5zyp

— clem (@ClementDelangue) December 5, 2023

Built on top of Llama 2, leveraging CodeLlama-13B-instruct, NexusRaven-V2 is instruction-tuned and utilises curated data from Nexusflow’s pipeline. The model is commercially permissive, encouraging both community developers and enterprises to explore its capabilities.

Nexusflow.ai provides open-source utility artefacts, enabling users to seamlessly replace mainstream proprietary function calling APIs with NexusRaven-V2 in their software workflows. Online demos and Colab notebooks are also available for onboarding and integration demonstrations.

NexusRaven-V2 showcases a 4% higher success rate in function calling on average compared to the latest GPT-4 model, as observed in a human-curated benchmark. In tasks involving nested and composite function calls, NexusRaven-V2 exhibits a significant 7% advantage over GPT-4, highlighting its robustness in handling variations in developers’ descriptions of functions.

To ensure reproducibility and standardisation, Nexusflow.ai releases the benchmark and associated leaderboard along with model weights. The evaluation benchmark prioritises human-generated samples with meticulous checks on executability and encompasses a diverse representation of function calling use cases and difficulties.

Nexusflow.ai is also providing a Python package, “nexusraven,” facilitating easy integration with copilots or agents. Developers can quickly ingest API function descriptions and send natural language queries to the model with a single line of code. The nexusraven package also supports converting function calling code to JSON format for seamless integration with downstream software.

The post NexusRaven Outperforms GPT-4 for Zero-shot Function Calling appeared first on Analytics India Magazine.

Disclaimer

We strive to uphold the highest ethical standards in all of our reporting and coverage. We StartupNews.fyi want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It’s possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.

Website Upgradation is going on for any glitch kindly connect at office@startupnews.fyi

Previous News

GitHub Releases Enterprise Server 3.11

Next News

Google Play removes 17 fake loan apps: Why users need to delete these apps right away

Editorial Team

StartupNews.fyi is a leading global startup and technology media platform known for its end-to-end coverage of the startup ecosystem across India and key international markets. Launched with the vision of becoming a single gateway for founders, investors, and ecosystem enablers, StartupNews.fyi has grown steadily over the years by publishing tens of thousands of verified news stories, insights, and ecosystem updates, reaching millions of startup enthusiasts every month through its digital platforms and communities.

More like this

NexusRaven Outperforms GPT-4 for Zero-shot Function Calling

Disclaimer

Popular

Govt Revises PM E-DRIVE’s Subsidy Timelines For E2Ws, E3Ws

9to5Mac Overtime 065: Untelling parts of the story w/ special guest David Pogue

Bellatrix Aerospace Nets $20 Mn To Scale Satellite Propulsion

OpenAI Sora dead, new funding “Ponzi scheme” … is the end of OpenAI nigh, and will Microsoft end up picking up the pieces?

The MacRumors Show: Apple Announces WWDC 2026

More Like this

Transporting Antimatter On a Truck Is Tricky…

The INIU Pocket Rocket P50 is the ultra-portable 10,000mAh power bank you’ve been waiting for

AirPods Pro 3 Hit $199 Record Low Price in Amazon’s Big Spring Sale

This RTX 5050 gaming laptop with an 18” FHD+ display, 16GB RAM, and 1TB SSD strikes a “happy medium between performance and price”

Three-Day Sanskrit Short Film Training Workshop Concludes Successfully in Sarvajanik University, Surat

RBI Lays Out Its Payments Vision For 2028

NexusRaven Outperforms GPT-4 for Zero-shot Function Calling

Disclaimer

More like this

Transporting Antimatter On a Truck Is Tricky…

The INIU Pocket Rocket P50 is the ultra-portable 10,000mAh...

AirPods Pro 3 Hit $199 Record Low Price in...

Popular

Block title

WhatsApp beta is testing Liquid Glass design for voice messages

Airtel Rs 399 Plan Reintroduced in Some Circles

Nuclear Fusion Startup Pranos Fusion Nets $6.8 Mn To Fast Track R&D & Commercialisation

Here’s how Xbox says devs should get ready for Project Helix

Apple Notifying WWDC 2026 Swift Student Challenge Winners

Best VPN for Windows PCs 2026: Browse the Web, Torrent, Stream and Game Privately

The drone market is about to go parabolic because of AI and war. How...

Startup Events

Trending News

Transporting Antimatter On a Truck Is Tricky…

The INIU Pocket Rocket P50 is the ultra-portable 10,000mAh power bank you’ve been waiting for

AirPods Pro 3 Hit $199 Record Low Price in Amazon’s Big Spring Sale

This RTX 5050 gaming laptop with an 18” FHD+ display, 16GB RAM, and 1TB SSD strikes a “happy medium between performance and price”

Three-Day Sanskrit Short Film Training Workshop Concludes Successfully in Sarvajanik University, Surat

About

Partnership

Contact us