koalas 1.8.2
Koalas: pandas API on Apache Spark
Stars: 3334, Watchers: 3334, Forks: 356, Open Issues: 114
The databricks/koalas repo was created 5 years ago and the last code push was 6 months ago.
The project is popular, with 3334 GitHub stars.
How to Install koalas
You can install koalas using pip:
pip install koalas
or add it to a project with Poetry:
poetry add koalas
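After installing, it can be useful to confirm that the distribution is visible to the interpreter you intend to use. The following is a minimal sketch using only the standard library (Python 3.8+); the helper name `installed_version` is made up for this illustration. Note that the distribution is named `koalas`, while the module is imported as `databricks.koalas`.

```python
from importlib import metadata


def installed_version(distribution):
    """Return the installed version string, or None if it is not installed.

    `installed_version` is a hypothetical helper, not part of koalas.
    """
    try:
        return metadata.version(distribution)
    except metadata.PackageNotFoundError:
        return None


# Look up the distribution by its PyPI name, not its import name.
version = installed_version("koalas")
if version is None:
    print("koalas is not installed; run `pip install koalas` first")
else:
    print(f"koalas {version} is installed")
```

This only checks package metadata; it does not verify that a working Spark installation is available at runtime.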
Package Details
- Author: Databricks
- License: http://www.apache.org/licenses/LICENSE-2.0
- Homepage: https://github.com/databricks/koalas
- PyPI: https://pypi.org/project/koalas/
- Documentation: https://koalas.readthedocs.io/
- GitHub Repo: https://github.com/databricks/koalas
Code Examples
Here are some koalas code examples and snippets.
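As one illustration, the sketch below builds a Koalas DataFrame and runs a pandas-style group-by on it. It assumes koalas and a compatible PySpark are installed; the function name `total_by_key` and the column names `key` and `value` are invented for this example. The import is done inside the function so that the module still loads on machines without Spark.

```python
def total_by_key(rows):
    """Sum the `value` column per `key` using Koalas; return a plain dict.

    `total_by_key`, `key`, and `value` are hypothetical names for this sketch.
    """
    # Lazy import: requires the koalas package and a working PySpark setup.
    import databricks.koalas as ks

    kdf = ks.DataFrame(rows)                     # pandas-like DataFrame backed by Spark
    totals = kdf.groupby("key")["value"].sum()   # same syntax as pandas
    return totals.to_pandas().to_dict()          # collect the result to the driver


# Example call (needs Spark at runtime):
# total_by_key({"key": ["a", "b", "a"], "value": [1, 2, 3]})
```

Collecting with `to_pandas()` is reasonable here because the aggregated result is small; for large results it is usually better to keep working with the Koalas objects.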
GitHub Issues
The koalas package has 114 open issues on GitHub, including:
- read_excel's parameter - mangle_dupe_cols is used to handle duplicate columns but fails if the duplicate columns are case sensitive.
- Write custom metadata to output files with dataframe.to_parquet?
- Series.to_json(orient='records') does not return records-based JSON
- ValueError: Cannot describe a DataFrame without columns