Contents

dask 2024.9.1

0

Parallel PyData with Task Scheduling

Parallel PyData with Task Scheduling

Stars: 12454, Watchers: 12454, Forks: 1698, Open Issues: 1109

The dask/dask repo was created 9 years ago and the last code push was 5 hours ago.
The project is extremely popular with a mindblowing 12454 github stars!

How to Install dask

You can install dask using pip

pip install dask

or add it to a project with poetry

poetry add dask

Package Details

Author
None
License
BSD-3-Clause
Homepage
None
PyPi:
https://pypi.org/project/dask/
GitHub Repo:
https://github.com/dask/dask

Classifiers

  • Scientific/Engineering
  • System/Distributed Computing
No  dask  pypi packages just yet.

Errors

A list of common dask errors.

Code Examples

Here are some dask code examples and snippets.

GitHub Issues

The dask package has 1109 open issues on GitHub

  • Better error message for unsupported Array reshape operations
  • Apply eager predicate-pushdown optimizations in Dask-DataFrame
  • dask.array.core.map_blocks mis-handles kwargs in task name
  • Pin coverage in CI
  • gpuCI out of memory error
  • Really allow any iterable to be passed as a meta
  • Downgrade meta error in #8563 to warning
  • Create dashboard documentation page
  • Dropped Support for Custom Metadata Types
  • AttributeError: 'DataFrame' object has no attribute 'name'; Various stack overflow / github suggested fixes not working
  • Broken usage of capsys for inspecting logging output
  • read_sql_query with meta converts dtypes from 32 to 64.
  • Improve to_parquet "Schemas are inconsistent" error messages
  • Add first and last aggregate functions to dask.dataframe.pivot_table
  • [DISCUSSION] Layer-by-Layer Graph Execution

See more issues on GitHub

Related Packages & Articles

nlp 0.4.0

HuggingFace/NLP is an open library of NLP datasets.

pandas 2.2.3

Powerful data structures for data analysis, time series, and statistics

numpy 2.1.1

Fundamental package for array computing in Python