Tags
- scipy 85
- Python 83
- Programming 85
- dask 71
- pangeo 1
- HPC 3
- distributed 5
- jobqueue 1
- GPU 8
- array 2
- cupy 1
- Pandas 1
- dataframe 8
- release 3
- MPI 1
- RAPIDS 3
- Dask 1
- Dask-GLM 1
- CuPy 2
- Sparse 1
- numba 1
- python 3
- scikit-image 1
- dask-image 1
- IO 3
- User Survey 4
- imaging 5
- machine-learning 2
- dask-ml 2
- SciPy 1
- Community 3
- Talk 1
- config 1
- ray 1
- Tutorials 1
- Helm 2
- Dask Gateway 1
- Deployment 1
- memory 1
- profiling 1
- ram 1
- deep learning 1
- PyTorch 1
- life science 5
- skan 1
- skeleton analysis 1
- Dask Summit 2
- Distributed 1
- Tools 1
- Organisations 1
- Australia 1
- geoscience 1
- performance 2
- image analysis 1
- Kubernetes 2
- deployment 2
- kubernetes 1
- dask-kubernetes 1
- clusters 1
- Flyte 1
- p2p 1
- shuffling 1
- ecosystem 1
- pydata 1
- query optimizer 2
- dask array 1
- xarray 1
scipy
- Load Large Image Data with Dask Array
- Python and GPUs: A Status Update
- Dask on HPC
- Experiments in High Performance Networking with UCX and DGX
- Dask Version 1.0
- Refactor Documentation
- Dask Development Log
- Dask Release 0.19.0
- High level performance of Pandas, Dask, Spark, and Arrow
- Building SAGA optimization for Dask arrays
- Dask Development Log
- Pickle isn't slow, it's a protocol
- Dask Development Log, Scipy 2018
- Who uses Dask?
- Dask Development Log
- Dask Scaling Limits
- Dask Release 0.18.0
- Beyond Numpy Arrays in Python
- Dask Release 0.17.2
- Craft Minimal Bug Reports
- Dask Release 0.17.0
- Pangeo: JupyterHub, Dask, and XArray on the Cloud
- Dask Development Log
- Dask Release 0.16.0
- Optimizing Data Structure Access in Python
- Streaming Dataframes
- Notes on Kafka in Python
- Dask Release 0.15.3
- Fast GeoSpatial Analysis in Python
- Dask on HPC - Initial Work
- Dask Release 0.15.2
- Scikit-Image and Dask Performance
- Dask Benchmarks
- Use Apache Parquet
- Dask Release 0.15.0
- Dask Release 0.14.3
- Dask Development Log
- Asynchronous Optimization Algorithms with Dask
- Dask and Pandas and XGBoost
- Dask Release 0.14.1
- Developing Convex Optimization Algorithms in Dask
- Dask Release 0.14.0
- Dask Development Log
- Experiment with Dask and TensorFlow
- Two Easy Ways to Use Scikit Learn and Dask
- Dask Development Log
- Custom Parallel Algorithms on a Cluster with Dask
- Dask Development Log
- Distributed NumPy on a Cluster with Dask Arrays
- Distributed Pandas on a Cluster with Dask DataFrames
- Dask Release 0.13.0
- Dask Development Log
- Dask Development Log
- Dask Development Log
- Dask Development Log
- Dask Cluster Deployments
- Dask and Celery
- Dask Distributed Release 1.13.0
- Dask for Institutions
- Ad Hoc Distributed Random Forests
- Fast Message Serialization
- Distributed Dask Arrays
- Pandas on HDFS with Dask Dataframes
- Introducing Dask distributed
- Distributed Prototype
- Caching
- Custom Parallel Workflows
- Write Complex Parallel Algorithms
- Distributed Scheduling
- State of Dask
- Towards Out-of-core DataFrames
- Towards Out-of-core ND-Arrays -- Dask + Toolz = Bag
- Towards Out-of-core ND-Arrays -- Slicing and Stacking
- Towards Out-of-core ND-Arrays -- Spilling to Disk
- Towards Out-of-core ND-Arrays -- Benchmark MatMul
- Towards Out-of-core ND-Arrays -- Multi-core Scheduling
- Towards Out-of-core ND-Arrays -- Frontend
- Towards Out-of-core ND-Arrays
Python
- Managing dask workloads with Flyte
- Dask on HPC
- Dask Version 1.0
- Refactor Documentation
- Dask Development Log
- Dask Release 0.19.0
- High level performance of Pandas, Dask, Spark, and Arrow
- Building SAGA optimization for Dask arrays
- Dask Development Log
- Pickle isn't slow, it's a protocol
- Dask Development Log, Scipy 2018
- Who uses Dask?
- Dask Development Log
- Dask Scaling Limits
- Dask Release 0.18.0
- Beyond Numpy Arrays in Python
- Dask Release 0.17.2
- Craft Minimal Bug Reports
- Dask Release 0.17.0
- Credit Modeling with Dask
- Pangeo: JupyterHub, Dask, and XArray on the Cloud
- Dask Development Log
- Dask Release 0.16.0
- Optimizing Data Structure Access in Python
- Streaming Dataframes
- Notes on Kafka in Python
- Dask Release 0.15.3
- Fast GeoSpatial Analysis in Python
- Dask on HPC - Initial Work
- Dask Release 0.15.2
- Scikit-Image and Dask Performance
- Dask Benchmarks
- Use Apache Parquet
- Dask Release 0.15.0
- Dask Release 0.14.3
- Dask Development Log
- Asynchronous Optimization Algorithms with Dask
- Dask and Pandas and XGBoost
- Dask Release 0.14.1
- Developing Convex Optimization Algorithms in Dask
- Dask Release 0.14.0
- Dask Development Log
- Experiment with Dask and TensorFlow
- Two Easy Ways to Use Scikit Learn and Dask
- Dask Development Log
- Custom Parallel Algorithms on a Cluster with Dask
- Dask Development Log
- Distributed NumPy on a Cluster with Dask Arrays
- Distributed Pandas on a Cluster with Dask DataFrames
- Dask Release 0.13.0
- Dask Development Log
- Dask Development Log
- Dask Development Log
- Dask Development Log
- Dask Cluster Deployments
- Dask and Celery
- Dask Distributed Release 1.13.0
- Dask for Institutions
- Ad Hoc Distributed Random Forests
- Fast Message Serialization
- Distributed Dask Arrays
- Pandas on HDFS with Dask Dataframes
- Introducing Dask distributed
- Distributed Prototype
- Caching
- Custom Parallel Workflows
- Write Complex Parallel Algorithms
- Distributed Scheduling
- Towards Out-of-core DataFrames
- Towards Out-of-core ND-Arrays -- Dask + Toolz = Bag
- Towards Out-of-core ND-Arrays -- Slicing and Stacking
- Towards Out-of-core ND-Arrays -- Spilling to Disk
- Towards Out-of-core ND-Arrays -- Benchmark MatMul
- Towards Out-of-core ND-Arrays -- Multi-core Scheduling
- Towards Out-of-core ND-Arrays -- Frontend
- Towards Out-of-core ND-Arrays
Programming
- Dask on HPC
- Dask Version 1.0
- Refactor Documentation
- Dask Development Log
- Dask Release 0.19.0
- High level performance of Pandas, Dask, Spark, and Arrow
- Building SAGA optimization for Dask arrays
- Dask Development Log
- Pickle isn't slow, it's a protocol
- Dask Development Log, Scipy 2018
- Who uses Dask?
- Dask Development Log
- Dask Scaling Limits
- Dask Release 0.18.0
- Beyond Numpy Arrays in Python
- Dask Release 0.17.2
- Craft Minimal Bug Reports
- Dask Release 0.17.0
- Credit Modeling with Dask
- Pangeo: JupyterHub, Dask, and XArray on the Cloud
- Dask Development Log
- Dask Release 0.16.0
- Optimizing Data Structure Access in Python
- Streaming Dataframes
- Notes on Kafka in Python
- Dask Release 0.15.3
- Fast GeoSpatial Analysis in Python
- Dask on HPC - Initial Work
- Dask Release 0.15.2
- Scikit-Image and Dask Performance
- Dask Benchmarks
- Use Apache Parquet
- Dask Release 0.15.0
- Dask Release 0.14.3
- Dask Development Log
- Asynchronous Optimization Algorithms with Dask
- Dask and Pandas and XGBoost
- Dask Release 0.14.1
- Developing Convex Optimization Algorithms in Dask
- Dask Release 0.14.0
- Dask Development Log
- Experiment with Dask and TensorFlow
- Two Easy Ways to Use Scikit Learn and Dask
- Dask Development Log
- Custom Parallel Algorithms on a Cluster with Dask
- Dask Development Log
- Distributed NumPy on a Cluster with Dask Arrays
- Distributed Pandas on a Cluster with Dask DataFrames
- Dask Release 0.13.0
- Dask Development Log
- Dask Development Log
- Dask Development Log
- Dask Development Log
- Dask Cluster Deployments
- Dask and Celery
- Dask Distributed Release 1.13.0
- Dask for Institutions
- Dask and Scikit-Learn -- Model Parallelism
- Ad Hoc Distributed Random Forests
- Fast Message Serialization
- Distributed Dask Arrays
- Pandas on HDFS with Dask Dataframes
- Introducing Dask distributed
- Dask is one year old
- Distributed Prototype
- Caching
- Custom Parallel Workflows
- Write Complex Parallel Algorithms
- Distributed Scheduling
- State of Dask
- Towards Out-of-core DataFrames
- Towards Out-of-core ND-Arrays -- Dask + Toolz = Bag
- Towards Out-of-core ND-Arrays -- Slicing and Stacking
- Towards Out-of-core ND-Arrays -- Spilling to Disk
- Towards Out-of-core ND-Arrays -- Benchmark MatMul
- Towards Out-of-core ND-Arrays -- Multi-core Scheduling
- Towards Out-of-core ND-Arrays -- Frontend
- Towards Out-of-core ND-Arrays
dask
- Dask DataFrame is Fast Now
- High Level Query Optimization in Dask
- Upstream testing in Dask
- Shuffling large data at constant memory in Dask
- Managing dask workloads with Flyte
- Measuring Dask memory usage with dask-memusage
- Comparing Dask-ML and Ray Tune's Model Selection Algorithms
- DataFrame Groupby Aggregations
- Dask on HPC
- Composing Dask Array with Numba Stencils
- cuML and Dask hyperparameter optimization
- Extension Arrays in Dask DataFrame
- Dask Version 1.0
- Refactor Documentation
- Dask Development Log
- Dask Release 0.19.0
- High level performance of Pandas, Dask, Spark, and Arrow
- Building SAGA optimization for Dask arrays
- Dask Development Log
- Pickle isn't slow, it's a protocol
- Dask Development Log, Scipy 2018
- Who uses Dask?
- Dask Development Log
- Dask Scaling Limits
- Dask Release 0.18.0
- Beyond Numpy Arrays in Python
- Dask Release 0.17.2
- Dask Release 0.17.0
- Pangeo: JupyterHub, Dask, and XArray on the Cloud
- Dask Development Log
- Dask Release 0.16.0
- Optimizing Data Structure Access in Python
- Streaming Dataframes
- Notes on Kafka in Python
- Dask Release 0.15.3
- Fast GeoSpatial Analysis in Python
- Dask on HPC - Initial Work
- Dask Release 0.15.2
- Scikit-Image and Dask Performance
- Dask Benchmarks
- Use Apache Parquet
- Dask Release 0.15.0
- Dask Release 0.14.3
- Dask Release 0.14.1
- Dask Distributed Release 1.13.0
- Dask for Institutions
- Dask and Scikit-Learn -- Model Parallelism
- Ad Hoc Distributed Random Forests
- Fast Message Serialization
- Distributed Dask Arrays
- Pandas on HDFS with Dask Dataframes
- Introducing Dask distributed
- Dask is one year old
- Distributed Prototype
- Caching
- Custom Parallel Workflows
- Write Complex Parallel Algorithms
- Distributed Scheduling
- State of Dask
- Towards Out-of-core DataFrames
- Towards Out-of-core ND-Arrays -- Dask + Toolz = Bag
- Towards Out-of-core ND-Arrays -- Slicing and Stacking
- Towards Out-of-core ND-Arrays -- Spilling to Disk
- Towards Out-of-core ND-Arrays -- Benchmark MatMul
- Towards Out-of-core ND-Arrays -- Multi-core Scheduling
- Towards Out-of-core ND-Arrays -- Frontend
- Towards Out-of-core ND-Arrays
pangeo
HPC
distributed
- Shuffling large data at constant memory in Dask
- Data Proximate Computation on a Dask Cluster Distributed Between Data Centres
- Measuring Dask memory usage with dask-memusage
- Configuring a Distributed Dask Cluster
- Dask-jobqueue
jobqueue
GPU
- Easy CPU/GPU Arrays and Dataframes
- Large SVDs
- cuML and Dask hyperparameter optimization
- Building GPU Groupby-Aggregations for Dask
- Single-Node Multi-GPU Dataframe Joins
- Dask, Pandas, and GPUs: first steps
- GPU Dask Arrays, first steps
array
cupy
Pandas
dataframe
- Dask DataFrame is Fast Now
- Do you need consistent environments between the client, scheduler and workers?
- Deep Dive into creating a Dask DataFrame Collection with from_map
- Understanding Dask’s meta keyword argument
- DataFrame Groupby Aggregations
- Building GPU Groupby-Aggregations for Dask
- Single-Node Multi-GPU Dataframe Joins
- Extension Arrays in Dask DataFrame
release
MPI
RAPIDS
Dask
Dask-GLM
CuPy
Sparse
numba
python
- Load Large Image Data with Dask Array
- Python and GPUs: A Status Update
- Experiments in High Performance Networking with UCX and DGX
scikit-image
dask-image
IO
- Do you need consistent environments between the client, scheduler and workers?
- Deep Dive into creating a Dask DataFrame Collection with from_map
- Extracting fsspec from Dask
User Survey
- 2021 Dask User Survey
- The 2021 Dask User Survey is out now
- 2020 Dask User Survey
- 2019 Dask User Survey
imaging
- Skeleton analysis
- Dask with PyTorch for large scale image analysis
- Image segmentation with Dask
- Getting to know the life science community
- Dask and ITK for large scale image analysis
machine-learning
- Comparing Dask-ML and Ray Tune's Model Selection Algorithms
- Better and faster hyperparameter optimization with Dask
dask-ml
- Comparing Dask-ML and Ray Tune's Model Selection Algorithms
- Better and faster hyperparameter optimization with Dask
SciPy
Community
Talk
config
ray
Tutorials
Helm
Dask Gateway
Deployment
memory
profiling
ram
deep learning
PyTorch
life science
- Reflections on one year as the Dask life science fellow
- Mosaic Image Fusion
- CZI EOSS Update
- Life sciences at the 2021 Dask Summit
- Skeleton analysis
skan
skeleton analysis
Dask Summit
Distributed
Tools
Organisations
Australia
geoscience
performance
image analysis
Kubernetes
deployment
- Dask Kubernetes Operator
- Data Proximate Computation on a Dask Cluster Distributed Between Data Centres