Skip to content

Pull requests: huggingface/datasets

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

fix bug #6877
#6889 opened May 9, 2024 by arthasking123 Loading…
Create function to convert to parquet
#6878 opened May 7, 2024 by albertvillanova Loading…
Unpin hfh
#6876 opened May 6, 2024 by lhoestq Loading…
Support folder-based datasets with large metadata.jsonl
#6859 opened May 2, 2024 by gbenson Loading…
fix webdataset filename split
#6849 opened Apr 29, 2024 by Bowser1704 Loading…
LargeListType support #6834
#6835 opened Apr 24, 2024 by Modexus Loading…
Make Image cast storage faster
#6786 opened Apr 5, 2024 by Modexus Loading…
Allow polars as valid output type
#6762 opened Mar 28, 2024 by psmyth94 Loading…
3x Faster Text Preprocessing
#6711 opened Mar 3, 2024 by ashvardanian Loading…
Persist IterableDataset epoch in workers
#6710 opened Mar 2, 2024 by lhoestq Loading…
__add__ for Dataset, IterableDataset
#6694 opened Feb 26, 2024 by oh-gnues-iohc Loading…
Run download_and_prepare if missing splits
#6639 opened Feb 2, 2024 by lhoestq Loading…
ProTip! no:milestone will show everything without a milestone.