Stability.ai launches Stable Audio

September 15, 2023


Stability.ai has launched a new AI model called Stable Audio that can generate audio of variable length, conditioned on text. This means that Stable Audio can be used to create high-quality music, sound effects, and other types of audio simply by providing it with a text description.

Stable Audio is a diffusion model, a type of AI model that learns to generate data by gradually removing noise from a random input. Diffusion models have proven very effective at generating realistic images, video, and audio.
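To make the denoising idea concrete, here is a minimal toy sketch in plain Python. It is not Stable Audio's code: the schedule, the function names (`make_schedule`, `add_noise`, `denoise`), and the "oracle" noise predictor are all assumptions made for illustration. A real model replaces the oracle with a trained neural network that has to guess the noise without seeing the clean signal.

```python
import math
import random


def make_schedule(steps):
    """alpha_bar[t]: share of the clean signal kept at noise level t (1.0 = clean)."""
    return [1.0 - 0.99 * t / steps for t in range(steps + 1)]


def add_noise(clean, t, alpha_bar, rng):
    """Forward process: blend the clean signal with Gaussian noise at level t."""
    keep = math.sqrt(alpha_bar[t])
    noise_scale = math.sqrt(1.0 - alpha_bar[t])
    return [keep * x + noise_scale * rng.gauss(0.0, 1.0) for x in clean]


def denoise(noisy, predict_noise, alpha_bar):
    """Reverse process: step from the noisiest level down to level 0.

    Each step estimates the clean signal from the predicted noise, then
    re-blends that estimate at the next, lower noise level (a deterministic,
    DDIM-style update).
    """
    x = list(noisy)
    for t in range(len(alpha_bar) - 1, 0, -1):
        eps = predict_noise(x, t)
        keep_t = math.sqrt(alpha_bar[t])
        noise_t = math.sqrt(1.0 - alpha_bar[t])
        keep_prev = math.sqrt(alpha_bar[t - 1])
        noise_prev = math.sqrt(1.0 - alpha_bar[t - 1])
        clean_est = [(xi - noise_t * ei) / keep_t for xi, ei in zip(x, eps)]
        x = [keep_prev * ci + noise_prev * ei for ci, ei in zip(clean_est, eps)]
    return x


# Toy "audio": a short 440 Hz sine wave sampled at 44.1 kHz.
clean = [math.sin(2 * math.pi * 440 * n / 44_100) for n in range(256)]
alpha_bar = make_schedule(50)
noisy = add_noise(clean, 50, alpha_bar, random.Random(0))


def oracle(x, t):
    """Hypothetical perfect predictor: it cheats by looking at the clean signal."""
    keep = math.sqrt(alpha_bar[t])
    noise_scale = math.sqrt(1.0 - alpha_bar[t])
    return [(xi - keep * ci) / noise_scale for xi, ci in zip(x, clean)]


recovered = denoise(noisy, oracle, alpha_bar)
max_error = max(abs(r - c) for r, c in zip(recovered, clean))
```

With a perfect predictor the loop recovers the sine wave up to floating-point error. The quality of a real diffusion model depends on how well its network approximates that prediction.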

One of the main challenges in generating audio with diffusion models is that they are usually trained to produce a fixed-size output. For example, an audio diffusion model trained on 30-second clips can only generate audio in 30-second chunks. This becomes a problem when training on, and trying to generate, audio of widely varying lengths, as is the case with full songs.

Stable Audio addresses this issue by conditioning the diffusion model on text metadata, as well as audio file duration and start time. This allows users to control the content and length of the generated audio.
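One way to picture this timing conditioning is the sketch below. It is an illustration under stated assumptions, not Stability.ai's implementation: the field names `seconds_start` and `seconds_total`, the 95-second window, and the trim-after-generation step are all hypothetical choices made for the example.

```python
import random
from dataclasses import dataclass

# Assumed fixed model window in seconds (hypothetical value for this sketch).
WINDOW_SECONDS = 95.0


@dataclass
class Conditioning:
    prompt: str           # text metadata describing the audio
    seconds_start: float  # where the training chunk begins inside the file
    seconds_total: float  # full duration of the file (or of the requested output)


def training_example(prompt, file_seconds, rng):
    """Pick a random fixed-size chunk of a file and record where it came from."""
    latest_start = max(0.0, file_seconds - WINDOW_SECONDS)
    start = rng.uniform(0.0, latest_start)
    return Conditioning(prompt, start, file_seconds)


def inference_request(prompt, wanted_seconds):
    """Ask for audio that starts at 0 and lasts wanted_seconds in total."""
    if not 0 < wanted_seconds <= WINDOW_SECONDS:
        raise ValueError("requested length must fit inside the model window")
    return Conditioning(prompt, 0.0, wanted_seconds)


def trim(frames, cond, sample_rate):
    """Cut the fixed-size output down to the requested duration."""
    keep = int(round(cond.seconds_total * sample_rate))
    return frames[:keep]


rng = random.Random(0)
example = training_example("solo piano, slow tempo", 240.0, rng)
request = inference_request("solo piano, slow tempo", 30.0)

# Stand-in for model output: a full window at a tiny sample rate of 10 Hz.
fake_output = [0.0] * int(WINDOW_SECONDS * 10)
clip = trim(fake_output, request, sample_rate=10)
```

Because the model sees the start time and total duration during training, it can learn what the beginning, middle, and end of a piece sound like. A request for a shorter total length can then be generated inside the fixed window and trimmed.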

To train Stable Audio, the researchers used a dataset of over 800,000 audio files containing music, sound effects, and single-instrument stems, as well as corresponding text metadata. This dataset adds up to over 19,500 hours of audio.

The Stable Audio model is able to generate 95 seconds of stereo audio at a 44.1 kHz sample rate in less than one second on an NVIDIA A100 GPU. This makes it much faster than previous audio diffusion models.
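For a sense of scale, the figures above can be turned into raw sample counts. The short sketch below only does arithmetic on the numbers quoted in this article. Treating "less than one second" as an upper bound of exactly one second is an assumption, so the resulting speed-up is a lower bound.

```python
SAMPLE_RATE_HZ = 44_100       # samples per second per channel, from the article
CHANNELS = 2                  # stereo
CLIP_SECONDS = 95             # generated clip length, from the article
MAX_GENERATION_SECONDS = 1.0  # assumed upper bound for "less than one second"


def frames_per_clip(seconds, sample_rate_hz):
    """Number of time steps (one value per channel at each step)."""
    return seconds * sample_rate_hz


def samples_per_clip(seconds, sample_rate_hz, channels):
    """Total individual sample values across all channels."""
    return frames_per_clip(seconds, sample_rate_hz) * channels


def min_realtime_factor(clip_seconds, generation_seconds):
    """How many seconds of audio are produced per second of compute, at least."""
    return clip_seconds / generation_seconds


frames = frames_per_clip(CLIP_SECONDS, SAMPLE_RATE_HZ)               # 4,189,500
samples = samples_per_clip(CLIP_SECONDS, SAMPLE_RATE_HZ, CHANNELS)   # 8,379,000
speedup = min_realtime_factor(CLIP_SECONDS, MAX_GENERATION_SECONDS)  # 95.0
```

In other words, one clip is roughly 8.4 million sample values, produced at least 95 times faster than real time on the stated hardware.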

Stable Audio can be used to create new and innovative types of music, sound effects, and other audio content. It can also be used to improve the quality of existing audio content.

The sources for this piece include an article from Stability.ai.

Tags
Development

TND Newsdesk
