
audiocraft 0.0.1


Audio research library for PyTorch

Audiocraft is a library for deep-learning research on audio processing and generation. Developed by Facebook Research, it includes EnCodec, an audio compressor and tokenizer, and MusicGen, a language model for music generation that can be conditioned on text and melody. It is aimed at developers and researchers working on music generation and audio processing. The library is built on PyTorch and requires a GPU to run.

Stars: 8913, Watchers: 8913, Forks: 804, Open Issues: 110

The facebookresearch/audiocraft repo was created 1 month ago and the last code push was 2 weeks ago.
The project is extremely popular, with 8913 GitHub stars.

How to Install audiocraft

You can install audiocraft using pip:

pip install audiocraft

or add it to a project with poetry:

poetry add audiocraft
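
After installing, it can be useful to confirm that the package is visible to the interpreter you intend to use. The following is a minimal sketch using only the Python standard library; the helper name installed_version is hypothetical and is not part of audiocraft.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed.

    Hypothetical helper for illustration; not part of audiocraft.
    """
    try:
        return version(package)
    except PackageNotFoundError:
        return None


# Prints the version string if audiocraft is installed, otherwise None.
print("audiocraft version:", installed_version("audiocraft"))
```

If this prints None, the package was most likely installed into a different environment than the one running the script.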

Package Details

Author
FAIR Speech & Audio
License
MIT License
Homepage
https://github.com/fairinternal/audiocraft
PyPI
https://pypi.org/project/audiocraft/
GitHub Repo
https://github.com/facebookresearch/audiocraft

Classifiers

  • Multimedia/Sound/Audio
  • Scientific/Engineering/Artificial Intelligence

Errors

A list of common audiocraft errors.

Code Examples

Here are some audiocraft code examples and snippets.
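
As a starting point, here is a minimal sketch of text-to-music generation with MusicGen. It assumes audiocraft and its PyTorch dependencies are installed, that a GPU is available, and that the pretrained checkpoint name "small" is accepted by the installed version (newer releases may expect a different name). The wrapper function generate_music, its parameters, and the output file names are hypothetical and only for illustration.

```python
from typing import List


def generate_music(descriptions: List[str],
                   duration: float = 8.0,
                   model_name: str = "small") -> List[str]:
    """Generate one audio clip per text description and save each as a WAV file.

    Hypothetical wrapper for illustration; not part of audiocraft.
    """
    if not descriptions:
        raise ValueError("at least one text description is required")

    # Imported lazily so this module can be loaded without audiocraft installed.
    from audiocraft.models import MusicGen
    from audiocraft.data.audio import audio_write

    # Assumption: "small" is a valid pretrained model name for this version.
    model = MusicGen.get_pretrained(model_name)
    model.set_generation_params(duration=duration)  # clip length in seconds

    # Returns a batch of waveforms, one per description.
    wavs = model.generate(descriptions)

    written = []
    for idx, one_wav in enumerate(wavs):
        stem = f"sample_{idx}"  # hypothetical output name
        # Writes <stem>.wav with loudness normalization.
        audio_write(stem, one_wav.cpu(), model.sample_rate, strategy="loudness")
        written.append(stem + ".wav")
    return written
```

A call such as generate_music(["happy rock", "energetic EDM"]) would download the checkpoint on first use and write sample_0.wav and sample_1.wav to the current directory.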

GitHub Issues

The audiocraft package has 110 open issues on GitHub, including:

  • Audio Embeddings
  • Can't find Seed?
  • FFMPEG not found?
  • Version conflicts with hydra-core
  • --listen - how to make it work?
  • Is it possible to use multiple GPU for single generate?
  • Release Date of Traning Code
  • Can anyone explain how the top_p and top_k parameter affects the output of the model?
  • Update Readme: Add links to Replicate deployment of MusicGen
  • how to train and how much data to invest so that there is no noise?
  • Non-Usable download from Gradio?
  • Any chance for adjust the sample and bit rate of WAVs up?
  • help by install
  • How to download file using api
  • HuggingFace

See more issues on GitHub

Related Packages & Articles

audioflux 0.1.6

A library for audio and music analysis and feature extraction.

pyo 1.0.5

A Python module for building digital signal processing programs.

pydub 0.25.1

Manipulate audio with a simple and easy high-level interface.

deeplake 3.6.14

Deep Lake is a Database for AI powered by a unique storage format optimized for deep-learning and Large Language Model (LLM) based applications. It simplifies the deployment of enterprise-grade LLM-based products by offering storage for all data types (embeddings, audio, text, videos, images, pdfs, annotations, etc.), querying and vector search, data streaming while training models at scale, data versioning and lineage for all workloads, and integrations with popular tools such as LangChain, LlamaIndex, Weights & Biases, and many more.

monkeyplug 1.3.2

monkeyplug is a little script to mute profanity in audio files.