nlp 0.4.0


HuggingFace/NLP is an open library of NLP datasets.

Stars: 18320, Watchers: 18320, Forks: 2496, Open Issues: 672

The huggingface/datasets repo was created 4 years ago and the last code push was 2 hours ago.
The project is very popular, with 18320 GitHub stars.

How to Install nlp

You can install nlp using pip:

pip install nlp

Or add it to a project with Poetry:

poetry add nlp
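
After installing, it can be useful to confirm which version actually ended up in the environment. The following is a minimal sketch using only the Python standard library (`importlib.metadata`, available from Python 3.8). The helper name `installed_version` is hypothetical and not part of nlp itself.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version string of `package`, or None if it is absent.

    `installed_version` is an illustrative helper, not an nlp API.
    """
    try:
        return version(package)
    except PackageNotFoundError:
        # The distribution is not installed in the current environment.
        return None


if __name__ == "__main__":
    found = installed_version("nlp")
    print(found if found is not None else "nlp is not installed")
```

If the helper returns `None`, the install step above most likely targeted a different interpreter or virtual environment than the one running the script.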

Package Details

Author: HuggingFace Inc.
License: Apache 2.0
  • Scientific/Engineering/Artificial Intelligence


GitHub Issues

The nlp package has 672 open issues on GitHub. Recent ones include:

  • Consider adding ipywidgets as a dependency.
  • Add Fon language tag
  • fix meta in mls
  • TIMIT Dataset not working with GPU
  • PR for the CFPB Consumer Complaints dataset
  • Make streamable the BnL Historical Newspapers dataset
  • Dataset BnL Historical Newspapers does not work in streaming mode
  • wikifix
  • Indexing bug after
  • Fixes to pubmed dataset download function
  • Add support for continuous metrics (RMSE, MAE)
  • Add MIT Scene Parsing Benchmark
  • Adding Turkic X-WMT evaluation set for machine translation
  • Dataset Viewer not showing Previews for Private Datasets
  • Add British Library books dataset

Related Packages & Articles

keras 3.2.0

Keras is a deep learning API written in Python, running on top of the machine learning platform TensorFlow. The core data structures of Keras are layers and models. The philosophy is to keep simple things simple, while allowing the user to be fully in control when they need to (the ultimate control being the easy extensibility of the source code via subclassing).

gensim 4.3.2

Python framework for fast Vector Space Modelling