Contents

/pkg/j/jieba/jieba-banner.webp

jieba 0.42.1

0

Chinese Words Segmentation Utilities

Python Packages

08-10-2021 178 words One minute 0 views

Contents

Jieba is a Chinese text segmentation module, it strives to be the best Python Chinese word segmentation module. Jieba supports three types of segmentation modes, accurate mode, full mode and search engine mode.

33186 Stars :star:

Stars: 33186, Watchers: 33186, Forks: 6724, Open Issues: 677

The fxsjy/jieba repo was created 12 years ago and the last code push was 1 months ago.
The project is extremely popular with a mindblowing 33186 github stars!

How to Install jieba

You can install jieba using pip

pip install jieba

or add it to a project with poetry

poetry add jieba

Package Details

Author: Sun, Junyi
License: MIT
Homepage: https://github.com/fxsjy/jieba
PyPi:: https://pypi.org/project/jieba/
GitHub Repo:: https://github.com/fxsjy/jieba

Classifiers

Text Processing
Text Processing/Indexing
Text Processing/Linguistic

No jieba pypi packages just yet.

Errors

A list of common jieba errors.

Code Examples

Here are some jieba code examples and snippets.

GitHub Issues

The jieba package has 677 open issues on GitHub

取消星星了，这个项目已经死了
它这个自定义字典的话，是什么原理啊
>
How to use jieba in vue
不相信 jieba cut 这么好用
命令行分词仅使用单线程
请问是否可以把一个句子分成多个有用词？
无法导入paddle
请问词典中的zg词性是什么意思？
带中横杆"-"的词如何把它当做一个词
【分享】好多人需要的：关键词带空格和特殊字符方法~~

See more issues on GitHub

Related Packages & Articles

/pkg/h/huggingface-hub/huggingface-hub-banner.webp

huggingface-hub 0.25.2

Client library to download and publish models, datasets and other repos on the huggingface.co hub

/pkg/h/htmldate/htmldate-banner.webp

htmldate 1.9.1

Fast and robust extraction of original and updated publication dates from URLs and web pages.

/pkg/g/gluonnlp/gluonnlp-banner.webp

gluonnlp 0.10.0

MXNet Gluon NLP Toolkit

/pkg/f/flair/flair-banner.webp

flair 0.14.0

A very simple framework for state-of-the-art NLP

/pkg/d/datasets/datasets-banner.webp

datasets 3.0.1

HuggingFace community-driven open-source library of datasets

/pkg/c/conllu/conllu-banner.webp

conllu 5.0.2

CoNLL-U Parser parses a CoNLL-U formatted string into a nested python dictionary

/pkg/c/compound-word-splitter/compound-word-splitter-banner.webp

compound-word-splitter 0.4

Splits compound words, like German "Effektivitätsberechnung

/pkg/a/autogluon-core/autogluon-core-banner.webp

autogluon.core 1.1.1

Fast and Accurate ML in 3 Lines of Code

/pkg/a/allennlp/allennlp-banner.webp

allennlp 2.10.1

An open-source NLP research library, built on PyTorch.