rapids
Here are 26 public repositories matching this topic...
We no longer need to control the number of concurrent kernels, since now we control the number of concurrent tasks
-
Updated
Aug 16, 2021 - Python
-
Updated
Aug 20, 2021 - Jupyter Notebook
Most classes are either under the com.nvidia.spark.rapids, org.apache.spark.sql.rapids, or some other Spark package prefix that ends in .rapids, but there are a few classes in the plugin that appear directly within an Apache Spark package:
- org.apache.spark.sql.catalyst.CudfUnsafeRow
- org.apache.spark.shuffle.RapidsShuffleExceptions.scala
Ideally these classes should be moved to an existi
It would be nice to be able to set the initial_pool_size with a string like "500mb" or "2gb" as opposed to integer sizes like 500000000. We could vendor the code Dask uses to accomplish this:
https://github.com/dask/dask/blob/31af7f7040643c447a72c87a8f12457094ec15ff/dask/utils.py#L1171
Let's show some examples of integration with kgextension
https://kgextension.readthedocs.io/en/latest/
Could be another notebook added to the tutorial.
Where it fits, we might also integrate as a dependency?
-
Updated
Aug 20, 2021 - Jupyter Notebook
In trying to write tests for #189, I'm finding very difficult to add columns to existing tests, as in some cases like the all_types table, the table is defined in a separate file than the tests and multiple tests try to write to the same table.
Additionally, our test suite doesn't prove that the data that are uploaded are the same as the data downloaded for all types.
We should consider m
-
Updated
Oct 4, 2019 - Jupyter Notebook
-
Updated
Mar 30, 2021 - Jupyter Notebook
-
Updated
Aug 17, 2021 - Shell
-
Updated
Oct 2, 2019
As seen with gumdropsteve/turbo-telegram@3e2f3b3, the data for the first half of 2016 can be downloaded & preprocessed just like that of 2015. Is there any other data in the effective range? I.e. is pre-2015 data recorded the same?
If so, let's add it.
-
Updated
Jun 23, 2021 - Jupyter Notebook
-
Updated
Apr 2, 2021 - Python
-
Updated
Feb 25, 2021 - Jupyter Notebook
-
Updated
Mar 13, 2020 - Shell
Improve this page
Add a description, image, and links to the rapids topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the rapids topic, visit your repo's landing page and select "manage topics."


Describe the bug
Clipping a DataFrame or Series using ints causes a cudf Failure because it won't handle the different dtypes (int and float)
Steps/Code to reproduce bug