Contents

recordlinkage 0.16

0

A record linkage toolkit for linking and deduplication

A record linkage toolkit for linking and deduplication

Stars: 953, Watchers: 953, Forks: 153, Open Issues: 63

The J535D165/recordlinkage repo was created 8 years ago and the last code push was 7 months ago.
The project is popular with 953 github stars!

How to Install recordlinkage

You can install recordlinkage using pip

pip install recordlinkage

or add it to a project with poetry

poetry add recordlinkage

Package Details

Author
License
BSD-3-Clause
Homepage
PyPi:
https://pypi.org/project/recordlinkage/
GitHub Repo:
https://github.com/J535D165/recordlinkage

Classifiers

No  recordlinkage  pypi packages just yet.

Errors

A list of common recordlinkage errors.

Code Examples

Here are some recordlinkage code examples and snippets.

GitHub Issues

The recordlinkage package has 63 open issues on GitHub

  • Address Matching Conditional on value of another column
  • ECMClassifier returns almost all candidate pairs
  • Add support for pandas==2

See more issues on GitHub

Related Packages & Articles

dedupe 3.0.3

A python library for accurate and scaleable data deduplication and entity-resolution

pyswarms 1.3.0

A Python-based Particle Swarm Optimization (PSO) library.

optimuspyspark 2.2.32

Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion with pyspark.

vosk 0.3.45

Offline open source speech recognition API based on Kaldi and Vosk

huggingface-hub 0.25.2

Client library to download and publish models, datasets and other repos on the huggingface.co hub