petastorm 0.12.1
0
Petastorm is a library enabling the use of Parquet storage from Tensorflow, Pytorch, and other Pytho
Contents
Petastorm is a library enabling the use of Parquet storage from Tensorflow, Pytorch, and other Python-based ML training frameworks.
Stars: 1742, Watchers: 1742, Forks: 279, Open Issues: 176The uber/petastorm
repo was created 5 years ago and the last code push was 4 months ago.
The project is very popular with an impressive 1742 github stars!
How to Install petastorm
You can install petastorm using pip
pip install petastorm
or add it to a project with poetry
poetry add petastorm
Package Details
- Author
- Uber Technologies, Inc.
- License
- Apache License, Version 2.0
- Homepage
- https://github.com/uber/petastorm
- PyPi:
- https://pypi.org/project/petastorm/
- GitHub Repo:
- https://github.com/uber/petastorm
Classifiers
Related Packages
Errors
A list of common petastorm errors.
Code Examples
Here are some petastorm
code examples and snippets.
GitHub Issues
The petastorm package has 176 open issues on GitHub
- Varying number of examples passed by DataLoader to Pytorch Lightning network
- Remove very old pickle compatibility code modifying old atg package names
- Support for parquet files with nested structures
- Support for Azure Blob Storage and Azure Data Lake