The Wayback Machine - https://web.archive.org/web/20220529114545/https://github.com/topics/dask
Skip to content
#

dask

Here are 290 public repositories matching this topic...

kloczek
kloczek commented May 24, 2022

On building my packages I'm using sphinx-build command with -n switch which shows warmings about missing references. These are not critical issues.

Before sphinx starts reporting broken refs it reports problems with importing dask submodules. All it is despite correct sys.path alteration in
https://github.com/dask/dask/blob/4c0cdfdbdb013cc57cd702c8d79b52f37203f8f3/docs/source/conf.py#L

good first issue documentation
dchudz
dchudz commented May 14, 2022

Currently users call client.wait to wait for a certain number of workers.

Currently a common pattern is:

cluster.scale(100)
client.wait_for_workers(100)

Cluster managers (like Coiled) would like to include custom handling to give users nice information about what's happening (including especially failures) while waiting for workers. At the moment, this isn't possible.

Prop

good first issue p2 feature
gerritholl
gerritholl commented Jan 12, 2022

Feature Request

Is your feature request related to a problem? Please describe.

Whenever I report a bug, I need to confirm what satpy version I am using. This is of course important, but it's also an extra step that could be semi-automated.

Describe the solution you'd like

I would like that debug_on() prints the relevant versions. When we report bugs, we anyway call `debu

enhancement good first issue
djhoese
djhoese commented Feb 22, 2021

Code Sample, a minimal, complete, and verifiable piece of code

from pyresample.boundary import Boundary
b = Boundary(my_lons, my_lats)
print(b.contour_poly.area())

Problem description

The above code doesn't fail if the provided lons/lats are 2D (not sure on 3D+), but the class and all functions/utilities underneath it assume 1D arrays. The end results are incor

jdye64
jdye64 commented May 23, 2022

There are several places in the code base where Calcite is mentioned in the comments. All of those instances should be remove and replaced with the correlating DataFusion comment. This is not a simple find/replace but rather the wording needs to be adjusted to ensure it is still valid in DataFusion. If something doesn't fit it can just be removed.

documentation good first issue datafusion
lesteve
lesteve commented May 19, 2020
from dask_jobqueue import SLURMCluster 
cluster = SLURMCluster(cores=1, memory='1GB') 
print(cluster.job_script()) 
#!/usr/bin/env bash

#SBATCH -J dask-worker
#SBATCH -n 1
#SBATCH --cpus-per-task=1
#SBATCH --mem=954M
#SBATCH -t 00:30:00

/home/lesteve/miniconda3/bin/python -m distributed.cli.dask_worker tcp://192.168.0.11:44065 --nthreads 1 --memory-limit 1000.00MB -
help wanted good first issue
NeroCorleone
NeroCorleone commented Aug 11, 2020

Problem description

Reading a dataset with eager's read functionality raises a ValueError when providing columns.

Example code (ideally copy-pastable)

import pandas as pd

from tempfile import TemporaryDirectory
from functools import partial
from storefact import get_store_from_url

from kartothek.io.eager import store_dataframes_as_dataset, read_dataset_as_data
good first issue usability
climpred
RichardScottOZ
RichardScottOZ commented Mar 25, 2021

Without thinking I put resampling="bilinear" and got an error when I called .compute()

Traceback (most recent call last):
  File "carajas.py", line 92, in <module>
    band_medianNP = band_median.compute()
  File "/home/ubuntu/anaconda3/envs/richard/lib/python3.8/site-packages/xarray/core/dataarray.py", line 899, in compute
    return new.load(**kwargs)
  File "/home/ubuntu/anaco
good first issue

Improve this page

Add a description, image, and links to the dask topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the dask topic, visit your repo's landing page and select "manage topics."

Learn more