NexusRaven Outperforms GPT-4 for Zero-shot Function Calling

All News General

Nexusflow.ai, has recently launched NexusRaven-V2, a powerful 13-billion parameter LLM that outperforms GPT-4 in zero-shot function calling. The open source model showcases a remarkable capability to transform natural language instructions into executable code, facilitating the utilisation of software tools by copilots and agents.

NexusRaven-V2 demonstrates superiority over GPT-4 by achieving up to a 7% higher success rate in function calling in human-generated use cases involving nested and composite functions. Notably, NexusRaven-V2 accomplishes this without prior training on the specific functions used in the evaluation.

Check out the model on GitHub here, and on Hugging Face here.

Nexusflow.ai introduces the Nexus-Function-Calling benchmark, establishing a Hugging Face leaderboard. This includes a diverse collection of real-life human-curated function-calling examples, with eight out of the nine benchmarks open-sourced.

Open models now starting to surpass GPT4 for specialized tasks. Let’s go!

Model by @NexusflowX: https://t.co/TBUBrevTpJ

Leaderboard: https://t.co/jbvk3U8Ibt pic.twitter.com/G3tEtB5zyp

— clem (@ClementDelangue) December 5, 2023

Built on top of Llama 2, leveraging CodeLlama-13B-instruct, NexusRaven-V2 is instruction-tuned and utilises curated data from Nexusflow’s pipeline. The model is commercially permissive, encouraging both community developers and enterprises to explore its capabilities.

Nexusflow.ai provides open-source utility artefacts, enabling users to seamlessly replace mainstream proprietary function calling APIs with NexusRaven-V2 in their software workflows. Online demos and Colab notebooks are also available for onboarding and integration demonstrations.

NexusRaven-V2 showcases a 4% higher success rate in function calling on average compared to the latest GPT-4 model, as observed in a human-curated benchmark. In tasks involving nested and composite function calls, NexusRaven-V2 exhibits a significant 7% advantage over GPT-4, highlighting its robustness in handling variations in developers’ descriptions of functions.

To ensure reproducibility and standardisation, Nexusflow.ai releases the benchmark and associated leaderboard along with model weights. The evaluation benchmark prioritises human-generated samples with meticulous checks on executability and encompasses a diverse representation of function calling use cases and difficulties.

Nexusflow.ai is also providing a Python package, “nexusraven,” facilitating easy integration with copilots or agents. Developers can quickly ingest API function descriptions and send natural language queries to the model with a single line of code. The nexusraven package also supports converting function calling code to JSON format for seamless integration with downstream software.

The post NexusRaven Outperforms GPT-4 for Zero-shot Function Calling appeared first on Analytics India Magazine.

Previous News

GitHub Releases Enterprise Server 3.11

Next News

Google Play removes 17 fake loan apps: Why users need to delete these apps right away

Disclaimer

Popular

Microsoft to Introduce Voice Reporting Feature for Xbox

Adobe teams up with India’s Education Ministry for creative learning initiative

Meta May Allow Instagram and Facebook Users in Europe to Pay to Avoid Ads

Indian fintechs amplify payments soundbox pitches to woo merchants

Fintech Unicorn Pine Labs Launches Mini — A QR-First Device With Card Support

More Like this

National Guard Discord leaker sentenced to 15 years in prison

Twitter’s succession: all the news about alternative social media platforms

Cloud management firm ScaleOps secures $58m in funding

Trader who lost $26M to copy-paste error says it’s been ‘max pain’

Apple begins selling new Gold Link Bracelet for Apple Watch

SBF to get the Girls treatment in Going Infinite film adaptation

NexusRaven Outperforms GPT-4 for Zero-shot Function Calling

Disclaimer

More like this

National Guard Discord leaker sentenced to 15 years in...

Twitter’s succession: all the news about alternative social media...

Cloud management firm ScaleOps secures $58m in funding

Popular

CRED Forays Into Insurance Vertical

Seizing A Trillion-Dollar Opportunity By 2030

Prediction markets are not being manipulated — Kalshi founder

8i Ventures Exits M2P Fintech With 12X Returns

US has 26M strong ‘crypto voting bloc’ ahead of elections — Survey

Elon Musk’s X is changing its privacy policy to allow third parties to train...

59 Cleantech Startups Working Towards Making India Greener

Upcoming Events

Cityscape Global | Riyadh | November 11 - 14

Product Marketing Summit | Chicago | November 13 - 14

L2con | Bangkok | November 13

AgriNext Conference 2024 | Dubai | November 13-14

KSC India Demo Day 2024 | Gurugram | November 14

StartupNews.fyi

StartupNews.fyi

NexusRaven Outperforms GPT-4 for Zero-shot Function Calling

Disclaimer

Popular

More Like this

NexusRaven Outperforms GPT-4 for Zero-shot Function Calling

Disclaimer

More like this

Popular

Upcoming Events

Newsletter Signup Form!

Newsletter Signup Form!