Thursday, April 13, 2023

“A really big deal”—Dolly is a free, open source, ChatGPT-style AI model


The Databricks Dolly logo

Enlarge (credit: Databricks)

On Wednesday, Databricks released Dolly 2.0, reportedly the first open source, instruction-following large language model (LLM) for commercial use that's been fine-tuned on a human-generated data set. It could serve as a compelling starting point for homebrew ChatGPT competitors.

Databricks is an American enterprise software company founded in 2013 by the creators of Apache Spark. They provide a web-based platform for working with Spark for big data and machine learning. By releasing Dolly, Databricks hopes to allow organizations to create and customize LLMs "without paying for API access or sharing data with third parties," according to the Dolly launch blog post.

Dolly 2.0, its new 12-billion parameter model, is based on EleutherAI's pythia model family and exclusively fine-tuned on training data (called "databricks-dolly-15k") crowdsourced from Databricks employees. That calibration gives it abilities more in line with OpenAI's ChatGPT, which is better at answering questions and engaging in dialogue as a chatbot than a raw LLM that has not been fine-tuned.

Read 8 remaining paragraphs | Comments

Reference : https://ift.tt/gJCAEGF

No comments:

Post a Comment

Google calls for halting use of WHOIS for TLS domain verifications

Enlarge (credit: Getty Images) Certificate authorities and browser makers are planning to end the use of WHOIS data verifying domai...