Contents

torchaudio 2.10.0

0

An audio package for PyTorch

An audio package for PyTorch

Stars: 2831, Watchers: 2831, Forks: 763, Open Issues: 319

The pytorch/audio repo was created 8 years ago and the last code push was 9 hours ago.
The project is very popular with an impressive 2831 github stars!

How to Install torchaudio

You can install torchaudio using pip

pip install torchaudio

or add it to a project with poetry

poetry add torchaudio

Package Details

Author
Soumith Chintala, David Pollack, Sean Naren, Peter Goldsborough, Moto Hira, Caroline Chen, Jeff Hwang, Zhaoheng Ni, Xiaohui Zhang
License
None
Homepage
https://github.com/pytorch/audio
PyPi:
https://pypi.org/project/torchaudio/
GitHub Repo:
https://github.com/pytorch/audio

Classifiers

  • Multimedia/Sound/Audio
  • Scientific/Engineering/Artificial Intelligence
No  torchaudio  pypi packages just yet.

Errors

A list of common torchaudio errors.

Code Examples

Here are some torchaudio code examples and snippets.

GitHub Issues

The torchaudio package has 319 open issues on GitHub

  • Fix ITokenizer return type annotation
  • Compatibility issues with torchaudio on NVIDIA Blackwell (GB10) / aarch64 using NGC PyTorch 25.12
  • Is the Speech Command Classification tutorial still available? If not, why has it been removed?
  • ITokenizer type hint error in wav2vec2 aligner pipeline
  • [Codemod][CppQualityCommon] Fix CQS signal facebook-unused-include-check in fbcode/pytorch/audio/src/libtorchaudio/forced_align
  • export-friendly check in melscale_fbanks
  • [Codemod][[cqs] Fix CQS signal. Id] 55563409 – performance-faster-string-find in fbcode/pytorch/audio/src/libtorchaudio/sox
  • Update on TorchAudio’s future
  • Testing….
  • Add Positional Encoding in Conformer Implementation
  • Cannot batch inference with WavLM in torchaudio.pipelines

See more issues on GitHub

Related Packages & Articles

datasets 4.5.0

HuggingFace community-driven open-source library of datasets

transformers 5.2.0

Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

nlp 0.4.0

HuggingFace/NLP is an open library of NLP datasets.

sagemaker 3.4.1

Open source library for training and deploying models on Amazon SageMaker.

pennylane 0.44.0

PennyLane is a cross-platform Python library for quantum computing, quantum machine learning, and quantum chemistry. Train a quantum computer the same way as a neural network.