Stable Diffusion creators release StableLM language model for text and code generation

By Nick Heyman On Apr 21, 2023

StabilityAI, the research group behind the Stable Diffusion AI image generator, is releasing the first of its StableLM suite of Language Models.

The new open-source language model is called StableLM, and it is available for developers on GitHub. “Developers can freely inspect, use, and adapt our StableLM base models for commercial or research purposes, subject to the terms of the CC BY-SA-4.0 license,” wrote StabilityAI in a blog post.

StableLM works similarly to OpenAI’s ChatGPT and is trained to generate text and code. It is trained on a larger version of the open-source dataset known as the Pile. “The richness of this dataset gives StableLM surprisingly high performance in conversational and coding tasks, despite its small size of 3 to 7 billion parameters (by comparison, GPT-3 has 175 billion parameters),” wrote StabilityAI.

In addition to StableLM, StabilityAI is also releasing research models “that are instruction fine-tuned,” and use a combination of five recent open-source datasets for conversational agents: Alpaca, GPT4All, Dolly, ShareGPT, and HH.

Check out some examples of StableLM’s capabilities below:

A demo version of StableLM is available to try out now on HuggingFace. Learn more about the new model here.

Image credit: StabilityAI, Shutterstock

Source: StabilityAI

For all the latest Technology News Click Here

For the latest news and updates, follow us on Google News.

Read original article here

Denial of responsibility! TheDailyCheck is an automatic aggregator around the global media. All the content are available free on Internet. We have just arranged it in one platform for educational purpose only. In each content, the hyperlink to the primary source is specified. All trademarks belong to their rightful owners, all materials to their authors. If you are the owner of the content and do not want us to publish your materials on our website, please contact us by email – [email protected] The content will be deleted within 24 hours.