Contents

speechbrain 1.0.1

0

All-in-one speech toolkit in pure Python and Pytorch

All-in-one speech toolkit in pure Python and Pytorch

Stars: 8709, Watchers: 8709, Forks: 1378, Open Issues: 141

The speechbrain/speechbrain repo was created 4 years ago and the last code push was Yesterday.
The project is extremely popular with a mindblowing 8709 github stars!

How to Install speechbrain

You can install speechbrain using pip

pip install speechbrain

or add it to a project with poetry

poetry add speechbrain

Package Details

Author
Mirco Ravanelli & Others
License
None
Homepage
https://speechbrain.github.io/
PyPi:
https://pypi.org/project/speechbrain/
GitHub Repo:
https://github.com/speechbrain/speechbrain

Classifiers

No  speechbrain  pypi packages just yet.

Errors

A list of common speechbrain errors.

Code Examples

Here are some speechbrain code examples and snippets.

GitHub Issues

The speechbrain package has 141 open issues on GitHub

  • convert "AISHELL-1/ASR/transformer" to onnx or nvidia tensorRT engine
  • [Bug]: inference issue when I run ksponSpeech using colab
  • [Bug]: RuntimeError: The size of tensor a (140590) must match the size of tensor b (2500) at non-singleton dimension 1
  • [Feature Request]: Missing DPTNet recipe?
  • Comments on the ESC50 recipes
  • K2 interface for speechbrain
  • Update DropBox Links
  • [Bug]: Python version
  • [Bug]: training on VoxCeleb2 cannot reach the 0.8 EER of ECAPA-TDNN on multi-gpu
  • Run "on_stage_end" on all processes and save on only a single process
  • Training Speaker_ID SpeechBrain
  • [Bug]: cannot reach the 0.8 EER of ECAPA-TDNN on multi-gpu
  • [Feature Request]: ECAPA-TDNN Model update
  • Updating early stopping for more intuitive use
  • [Bug]: Huggingface hosted inference API not working for speechbrain/emotion-recognition-wav2vec2-IEMOCAP

See more issues on GitHub

Related Packages & Articles

vosk 0.3.45

Offline open source speech recognition API based on Kaldi and Vosk

audioflux 0.1.9

A library for audio and music analysis, feature extraction.

SpeechRecognition 3.10.4

Library for performing speech recognition, with support for several engines and APIs, online and offline.

farm-haystack 1.26.3

LLM framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data.