petastorm 0.12.1
Petastorm is a library enabling the use of Parquet storage from TensorFlow, PyTorch, and other Python-based ML training frameworks.
Stars: 1787, Watchers: 1787, Forks: 286, Open Issues: 178
The uber/petastorm repo was created 6 years ago and the last code push was 10 months ago.
The project is very popular, with 1787 GitHub stars.
How to Install petastorm
You can install petastorm using pip:
pip install petastorm
Or add it to a project with Poetry:
poetry add petastorm
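To confirm that either command worked, one option is to ask Python for the installed version. The following is a minimal sketch using only the standard library (`importlib.metadata`, Python 3.8+); the helper name `installed_version` is illustrative and not part of petastorm.

```python
from importlib import metadata


def installed_version(package_name):
    """Return the installed version string of a package, or None if it is absent."""
    try:
        return metadata.version(package_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version("petastorm")
    if version is None:
        print("petastorm is not installed in this environment")
    else:
        print(f"petastorm {version} is installed")
```

Run it with the same interpreter or virtual environment that pip or Poetry installed into; otherwise it may report the package as missing.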
Package Details
- Author: Uber Technologies, Inc.
- License: Apache License, Version 2.0
- Homepage: https://github.com/uber/petastorm
- PyPI: https://pypi.org/project/petastorm/
- GitHub Repo: https://github.com/uber/petastorm
Code Examples
Here are some petastorm code examples and snippets.
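As an illustration of reading Parquet data through petastorm, here is a minimal sketch that counts the rows in an existing Parquet directory using `make_batch_reader`. The helper names `to_dataset_url` and `count_rows` and the example path are assumptions made for this sketch. It also assumes the directory holds plain Parquet files readable by petastorm and that each batch is a namedtuple of column arrays.

```python
from pathlib import Path


def to_dataset_url(path):
    """Convert a local directory path into a file:// URL for a petastorm reader."""
    return Path(path).resolve().as_uri()


def count_rows(dataset_url):
    """Count rows in a Parquet store by iterating over it once, batch by batch."""
    # Imported lazily so this module still loads where petastorm is not installed.
    from petastorm import make_batch_reader

    total = 0
    # num_epochs=1 makes the reader stop after a single pass over the data.
    with make_batch_reader(dataset_url, num_epochs=1) as reader:
        for batch in reader:
            # Assumption: each batch is a namedtuple of column arrays,
            # so the length of the first column is the batch's row count.
            total += len(batch[0])
    return total


# Hypothetical usage, assuming /tmp/example_parquet_dir contains Parquet files:
# print(count_rows(to_dataset_url("/tmp/example_parquet_dir")))
```

The reader is used as a context manager so its worker pool is shut down when iteration finishes. Datasets written with petastorm's own schema support are typically read with `make_reader` instead.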
GitHub Issues
The petastorm package has 178 open issues on GitHub, including:
- Varying number of examples passed by DataLoader to Pytorch Lightning network
- Remove very old pickle compatibility code modifying old atg package names
- Support for parquet files with nested structures
- Support for Azure Blob Storage and Azure Data Lake