/pkg/d/donut-python/donut-python-banner.webp

donut-python 1.0.9

OCR-free Document Understanding Transformer

06-24-2023 275 words 2 minutes 0 views

Contents

OCR-free Document Understanding Transformer

Stars: 5761, Watchers: 5761, Forks: 468, Open Issues: 195

The clovaai/donut repo was created 2 years ago and the last code push was 3 months ago.
The project is extremely popular with a mindblowing 5761 github stars!

How to Install donut-python

You can install donut-python using pip

pip install donut-python

or add it to a project with poetry

poetry add donut-python

Package Details

Author: Geewook Kim, Teakgyu Hong, Moonbin Yim, JeongYeon Nam, Jinyoung Park, Jinyeong Yim, Wonseok Hwang, Sangdoo Yun, Dongyoon Han, Seunghyun Park
License: MIT
Homepage: https://github.com/clovaai/donut
PyPi:: https://pypi.org/project/donut-python/
GitHub Repo:: https://github.com/clovaai/donut

Classifiers

Scientific/Engineering/Artificial Intelligence
Software Development/Libraries
Software Development/Libraries/Python Modules

No donut-python pypi packages just yet.

Errors

A list of common donut-python errors.

Code Examples

Here are some donut-python code examples and snippets.

GitHub Issues

The donut-python package has 195 open issues on GitHub

Issue while training with custom decoder and tokenizer
RuntimeError: CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling `cublasGemmStridedBatchedExFix
No answer in docVQA
Error in running sample colab - "Something went wrong Unexpected end of JSON input"
How to generate own dataset in parquet format, just like the data format given by readme
Are the parameters of swinTransformer trained during fine-tuning?
SwinTransformer' object has no attribute 'pos_drop'
size mismatch for encoder.model.layers.1.downsample.norm.weight: copying a param with shape torch.Size([1024]) from checkpoint, the shape in current model is torch.Size([512]).
Backslash Rendered as Boxes with Cross Inside
Data Extraction Fine Tuning
base donut model keeps producing repeated characters
Synthdog generates special symbols
receipt scanning - issue with same line items
Neither of the DocVQA Task1 (Document VQA) demos work
What are some available prompts