Contents

sentencepiece 0.2.0

0

SentencePiece python wrapper

SentencePiece python wrapper

Stars: 10165, Watchers: 10165, Forks: 1168, Open Issues: 36

The google/sentencepiece repo was created 7 years ago and the last code push was 1 weeks ago.
The project is extremely popular with a mindblowing 10165 github stars!

How to Install sentencepiece

You can install sentencepiece using pip

pip install sentencepiece

or add it to a project with poetry

poetry add sentencepiece

Package Details

Author
Taku Kudo
License
Apache
Homepage
https://github.com/google/sentencepiece
PyPi:
https://pypi.org/project/sentencepiece/
GitHub Repo:
https://github.com/google/sentencepiece

Classifiers

  • Software Development/Libraries/Python Modules
  • Text Processing/Linguistic
No  sentencepiece  pypi packages just yet.

Errors

A list of common sentencepiece errors.

Code Examples

Here are some sentencepiece code examples and snippets.

GitHub Issues

The sentencepiece package has 36 open issues on GitHub

  • Fixed typo error
  • Expose CalculateEntropy and SampleEncodeAndScore methods to python
  • Training with sentence_iterator yields vocabulary size error
  • set num_threads = 56 but only 1 thread working most of the time
  • fix address sanitizers on clang problem
  • address sanitizers on clang
  • SentencePiece Installation Failed on M1 Mac
  • fix freebsd can't find endian.h problem
  • Treating tabs ('\t') as user defined symbols
  • Is the loss computation in UnigramTrainer correct?

See more issues on GitHub

Related Packages & Articles

thinc 9.1.1

A refreshing functional take on deep learning, compatible with your favorite libraries

textblob 0.18.0.post0

Simple, Pythonic text processing. Sentiment analysis, part-of-speech tagging, noun phrase parsing, and more.

spacy 3.8.2

Industrial-strength Natural Language Processing (NLP) in Python

nlp 0.4.0

HuggingFace/NLP is an open library of NLP datasets.

gensim 4.3.3

Python framework for fast Vector Space Modelling