dask 2024.9.1
Parallel PyData with Task Scheduling
Stars: 12454, Watchers: 12454, Forks: 1698, Open Issues: 1109
The dask/dask repo was created 9 years ago and the last code push was 5 hours ago.
The project is very popular, with 12454 GitHub stars.
How to Install dask
You can install dask using pip
pip install dask
or add it to a project with poetry
poetry add dask
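After either install command, it can be useful to confirm that the package is visible to the interpreter you are using. The snippet below is a minimal sketch that relies only on the standard library; the helper name installed_version is hypothetical and chosen for illustration.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed.

    `installed_version` is a hypothetical helper, not part of dask.
    """
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version("dask")
    print(f"dask version: {version}" if version else "dask is not installed")
```

If this reports that dask is not installed, the install most likely targeted a different Python environment than the one running the script.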
Package Details
- Author: None
- License: BSD-3-Clause
- Homepage: None
- PyPi: https://pypi.org/project/dask/
- GitHub Repo: https://github.com/dask/dask
Classifiers
- Scientific/Engineering
- System/Distributed Computing
Code Examples
Here are some dask code examples and snippets.
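As one illustration of the task scheduling mentioned in the package summary, the sketch below builds a small task graph with dask.delayed and then computes it. The function names inc, add and run_with_dask are hypothetical; dask is imported lazily inside run_with_dask, on the assumption that it may not be installed everywhere the plain functions are used.

```python
def inc(x):
    """Add one to x."""
    return x + 1


def add(x, y):
    """Add two numbers."""
    return x + y


def run_with_dask():
    """Build a three-task graph lazily, then execute it with dask."""
    # Lazy import: inc and add stay usable even without dask installed.
    from dask import delayed

    a = delayed(inc)(1)        # nothing runs yet; this records a task
    b = delayed(inc)(2)        # independent of `a`, so it may run in parallel
    total = delayed(add)(a, b) # depends on both `a` and `b`
    return total.compute()     # triggers execution of the whole graph


if __name__ == "__main__":
    print(run_with_dask())
```

Wrapping a function with delayed defers the call and records it as a node in a graph; calling compute() hands that graph to a scheduler, which can run independent tasks such as the two inc calls concurrently.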
GitHub Issues
The dask package has 1109 open issues on GitHub.
- Better error message for unsupported Array reshape operations
- Apply eager predicate-pushdown optimizations in Dask-DataFrame
- dask.array.core.map_blocks mis-handles kwargs in task name
- Pin coverage in CI
- gpuCI out of memory error
- Really allow any iterable to be passed as a meta
- Downgrade meta error in #8563 to warning
- Create dashboard documentation page
- Dropped Support for Custom Metadata Types
- AttributeError: 'DataFrame' object has no attribute 'name'; Various stack overflow / github suggested fixes not working
- Broken usage of capsys for inspecting logging output
- read_sql_query with meta converts dtypes from 32 to 64.
- Improve to_parquet "Schemas are inconsistent" error messages
- Add first and last aggregate functions to dask.dataframe.pivot_table
- [DISCUSSION] Layer-by-Layer Graph Execution