Contents

jieba 0.42.1

0

Chinese Words Segmentation Utilities

Jieba is a Chinese text segmentation module, it strives to be the best Python Chinese word segmentation module. Jieba supports three types of segmentation modes, accurate mode, full mode and search engine mode.

Stars: 32300, Watchers: 32300, Forks: 6702, Open Issues: 668

The fxsjy/jieba repo was created 11 years ago and the last code push was 3 weeks ago.
The project is extremely popular with a mindblowing 32300 github stars!

How to Install jieba

You can install jieba using pip

pip install jieba

or add it to a project with poetry

poetry add jieba

Package Details

Author
Sun, Junyi
License
MIT
Homepage
https://github.com/fxsjy/jieba
PyPi:
https://pypi.org/project/jieba/
GitHub Repo:
https://github.com/fxsjy/jieba

Classifiers

  • Text Processing
  • Text Processing/Indexing
  • Text Processing/Linguistic
No  jieba  pypi packages just yet.

Errors

A list of common jieba errors.

Code Examples

Here are some jieba code examples and snippets.

GitHub Issues

The jieba package has 668 open issues on GitHub

  • 取消星星了,这个项目已经死了
  • 它这个自定义字典的话,是什么原理啊
  • >
  • How to use jieba in vue
  • 不相信 jieba cut 这么好用
  • 命令行分词仅使用单线程
  • 请问是否可以把一个句子分成多个有用词?
  • 无法导入paddle
  • 请问词典中的zg词性是什么意思?
  • 带中横杆"-"的词如何把它当做一个词
  • 【分享】好多人需要的:关键词带空格和特殊字符方法~~

See more issues on GitHub

Related Packages & Articles

huggingface-hub 0.22.2

Client library to download and publish models, datasets and other repos on the huggingface.co hub

htmldate 1.8.0

Fast and robust extraction of original and updated publication dates from URLs and web pages.

flair 0.13.1

A very simple framework for state-of-the-art NLP

datasets 2.18.0

HuggingFace community-driven open-source library of datasets

conllu 4.5.3

CoNLL-U Parser parses a CoNLL-U formatted string into a nested python dictionary