Pull requests: huggingface/datasets
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix docs phrasing about supported formats when sharing a dataset
#6486
opened Dec 11, 2023 by
albertvillanova
Loading 1�7
Deprecate Beam API and download from HF GCS bucket
#6474
opened Dec 5, 2023 by
mariosasko
•
Draft
4 tasks
Add concurrent loading of shards to datasets.load_from_disk
#6464
opened Dec 1, 2023 by
kkoutini
Loading 1�7
Fix deprecation warning when building conda package
#6425
opened Nov 15, 2023 by
albertvillanova
Loading 1�7
Fix for continuation behaviour on broken dataset archives due to starving download connections via HTTP-GET
#6380
opened Nov 2, 2023 by
RuntimeRacer
Loading 1�7
Overwrite legacy default config name in
dataset_infos.json in packaged datasets
#6231
opened Sep 11, 2023 by
polinaeterna
Loading 1�7
feat: Return the name of the currently loaded file
#6170
opened Aug 23, 2023 by
Amitesh-Patel
Loading 1�7
Add
fsspec support for to_json, to_csv, and to_parquet
#6096
opened Jul 28, 2023 by
alvarobartt
Loading 1�7
Implement proper checkpointing for dataset uploading with resume function that does not require remapping shards that have already been uploaded
#6056
opened Jul 21, 2023 by
AntreasAntoniou
Loading 1�7
Previous Next
ProTip!
no:milestone will show everything without a milestone.

