Databricks releases open-source Dolly 2.0, an instruction-following large language model for commercial use

Databricks, the company founded by the creators of Apache Spark, has released Dolly 2.0, reportedly the first open-source, instruction-following large language model (LLM) for commercial use that has been fine-tuned on a human-generated data set. Dolly could serve as a compelling starting point for homebrew ChatGPT competitors.

Dolly 2.0 is a 12-billion-parameter model based on EleutherAI’s Pythia model family. It is fine-tuned exclusively on a training data set called “databricks-dolly-15k,” which was crowdsourced from Databricks employees. That fine-tuning brings its abilities more in line with OpenAI’s ChatGPT, making it better at answering questions and engaging in dialogue as a chatbot than a raw LLM that has not been fine-tuned.
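To make the idea of instruction fine-tuning concrete, here is a minimal Python sketch of how a single human-written demonstration might be turned into one training string. The record, its field names (`instruction`, `context`, `response`), the `format_example` helper, and the section markers are illustrative assumptions, not Databricks’ actual training code or prompt template.

```python
def format_example(record: dict) -> str:
    """Join one instruction record into a single training string.

    The "###" section markers are a hypothetical template chosen for
    illustration; a real fine-tuning pipeline may use a different one.
    """
    parts = [f"### Instruction:\n{record['instruction']}"]
    # Optional background text is included only when it is non-empty.
    if record.get("context"):
        parts.append(f"### Context:\n{record['context']}")
    parts.append(f"### Response:\n{record['response']}")
    return "\n\n".join(parts)


# A made-up demonstration, not taken from the real data set.
example = {
    "instruction": "Name the company that released Dolly 2.0.",
    "context": "",
    "response": "Databricks released Dolly 2.0.",
}

text = format_example(example)
print(text)
```

During fine-tuning, a base model is shown many strings of this kind so that it learns to continue an instruction with a suitable response rather than with arbitrary text.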

Dolly 1.0 faced limitations regarding commercial use due to the training data, which contained output from ChatGPT and was subject to OpenAI’s terms of service. To address this issue, Databricks crowdsourced over 13,000 demonstrations of instruction-following behavior from more than 5,000 of its employees between March and April 2023.

The resulting data set, along with Dolly’s model weights and training code, has been released fully open source under a Creative Commons license, enabling anyone to use, modify, or extend the data set for any purpose, including commercial applications.

Dolly’s open-source nature sets it apart from proprietary models like OpenAI’s ChatGPT, which requires users to pay for API access and adhere to specific terms of service. Additionally, Meta’s LLaMA, which recently spawned a wave of derivatives after its weights leaked on BitTorrent, does not allow commercial use.

AI researcher Simon Willison called Dolly 2.0 “a really big deal” on Mastodon, praising its fine-tuning instruction set, which was hand-built by 5,000 Databricks employees and released under a CC license. This release could inspire more companies to develop and release their own LLMs, enabling businesses and organizations to create and customize their own chatbots without relying on third-party services.
