-
Notifications
You must be signed in to change notification settings - Fork 2.5k
Insights: huggingface/datasets
Overview
Could not load contribution data
Please try again later
1 Release published by 1 person
-
2.19.2
published
Jun 3, 2024
8 Pull requests merged by 2 people
-
Update yanked version of minimum requests requirement
#6945 merged
Jun 3, 2024 -
Set dev version
#6944 merged
Jun 3, 2024 -
Release 2.19.2
#6943 merged
Jun 3, 2024 -
Revert ci user
#6934 merged
May 30, 2024 -
update ci user
#6933 merged
May 30, 2024 -
[WebDataset] Support compressed files
#6931 merged
May 29, 2024 -
Preserve JSON column order and support list of strings field
#6914 merged
May 29, 2024
3 Pull requests opened by 3 people
-
Update process.mdx: Code Listings Fixes
#6928 opened
May 29, 2024 -
Update dataset_dict.py
#6932 opened
May 30, 2024 -
Re-enable import sorting disabled by flake8:noqa directive when using ruff linter
#6946 opened
Jun 3, 2024
3 Issues closed by 1 person
-
ExpectedMoreSplits error when using data_dir
#6939 closed
May 31, 2024 -
NonMatchingSplitsSizesError when using data_dir
#6918 closed
May 31, 2024 -
Column order is nondeterministic when loading from JSON
#6913 closed
May 29, 2024
10 Issues opened by 9 people
-
FileNotFoundError:error when loading C4 dataset
#6947 opened
Jun 3, 2024 -
Import sorting is disabled by flake8 noqa directive after switching to ruff linter
#6942 opened
Jun 2, 2024 -
Supporting FFCV: Fast Forward Computer Vision
#6941 opened
Jun 1, 2024 -
Enable Sharding to Equal Sized Shards
#6940 opened
May 31, 2024 -
JSON loader implicitly coerces floats to integers
#6937 opened
May 31, 2024 -
save_to_disk() freezes when saving on s3 bucket with multiprocessing
#6936 opened
May 30, 2024 -
Support for pathlib.Path in datasets 2.19.0
#6935 opened
May 30, 2024 -
Avoid downloading the whole dataset when only README.me has been touched on hub.
#6929 opened
May 29, 2024 -
Caching map result of DatasetDict.
#6924 opened
May 28, 2024
6 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
Add MedImg for streaming
#6912 commented on
Jun 3, 2024 • 3 new comments -
support LargeListArray in pyarrow
#4800 commented on
May 30, 2024 • 2 new comments -
Feature request: IterableDataset.push_to_hub
#5665 commented on
May 31, 2024 • 1 new comment -
Add a data type for labeled images (image segmentation)
#3838 commented on
May 29, 2024 • 0 new comments -
[Resumable IterableDataset] Add IterableDataset state_dict
#6658 commented on
May 31, 2024 • 0 new comments -
Allow polars as valid output type
#6762 commented on
May 31, 2024 • 0 new comments