dask 2024.5.1


Parallel PyData with Task Scheduling

Parallel PyData with Task Scheduling

Stars: 12075, Watchers: 12075, Forks: 1676, Open Issues: 1063

The dask/dask repo was created 9 years ago and the last code push was 3 hours ago.
The project is extremely popular with a mindblowing 12075 github stars!

How to Install dask

You can install dask using pip

pip install dask

or add it to a project with poetry

poetry add dask

Package Details

GitHub Repo:


  • Scientific/Engineering
  • System/Distributed Computing
No  dask  pypi packages just yet.


A list of common dask errors.

Code Examples

Here are some dask code examples and snippets.

GitHub Issues

The dask package has 1063 open issues on GitHub

  • Better error message for unsupported Array reshape operations
  • Apply eager predicate-pushdown optimizations in Dask-DataFrame
  • dask.array.core.map_blocks mis-handles kwargs in task name
  • Pin coverage in CI
  • gpuCI out of memory error
  • Really allow any iterable to be passed as a meta
  • Downgrade meta error in #8563 to warning
  • Create dashboard documentation page
  • Dropped Support for Custom Metadata Types
  • AttributeError: 'DataFrame' object has no attribute 'name'; Various stack overflow / github suggested fixes not working
  • Broken usage of capsys for inspecting logging output
  • read_sql_query with meta converts dtypes from 32 to 64.
  • Improve to_parquet "Schemas are inconsistent" error messages
  • Add first and last aggregate functions to dask.dataframe.pivot_table
  • [DISCUSSION] Layer-by-Layer Graph Execution

See more issues on GitHub

Related Packages & Articles

nlp 0.4.0

HuggingFace/NLP is an open library of NLP datasets.

pandas 2.2.2

Powerful data structures for data analysis, time series, and statistics

numpy 1.26.4

Fundamental package for array computing in Python