Skip to content

Issues: Lightning-AI/litdata

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Assignee
Filter by who’s assigned
Sort

Issues list

Subsample StreamingDataset enhancement New feature or request help wanted Extra attention is needed
#135 opened May 21, 2024 by yhl48
Adding breakpoint in random_images function crashes pdb bug Something isn't working help wanted Extra attention is needed
#134 opened May 21, 2024 by cgebbe
StreamingDataset incompatibility with PyTorch Lightning bug Something isn't working help wanted Extra attention is needed
#133 opened May 20, 2024 by enrico-stauss
DataChunkRecipe is not working when used in litgpt's TinyLlama pretraining example bug Something isn't working help wanted Extra attention is needed
#130 opened May 15, 2024 by wen020
Pytorch lighting Fabric + lit data + DDP hangs when finishing epoch bug Something isn't working help wanted Extra attention is needed
#129 opened May 13, 2024 by miguelalba96
Stream selected channels enhancement New feature or request help wanted Extra attention is needed
#128 opened May 13, 2024 by robmarkcole
Cache directory resolution issues in Google Colab bug Something isn't working help wanted Extra attention is needed
#126 opened May 8, 2024 by awaelchli
Time per sample grows as processed samples grows bug Something isn't working help wanted Extra attention is needed
#119 opened May 5, 2024 by scritter
Slow Dataset Preprocessing due to CPU affinity (?) issues bug Something isn't working help wanted Extra attention is needed
#118 opened May 2, 2024 by mgolub2
optimize function on multiple machine writing to local pathes enhancement New feature or request help wanted Extra attention is needed
#105 opened Apr 22, 2024 by rakro101
Dataloading is not working when used in litgpt's debug pretraining example bug Something isn't working help wanted Extra attention is needed
#103 opened Apr 18, 2024 by iloshchilov
ValueError: buffer size must be a multiple of element size bug Something isn't working help wanted Extra attention is needed
#102 opened Apr 18, 2024 by awaelchli
Compression using the optimize function from litdata bug Something isn't working help wanted Extra attention is needed
#97 opened Apr 11, 2024 by rakro101
litdata.optimize accidentally deletes files from the local filesystem bug Something isn't working help wanted Extra attention is needed
#93 opened Apr 5, 2024 by hubertsiuzdak
Assert when deserializing no_header_numpy or no_header_tensor. bug Something isn't working help wanted Extra attention is needed
#92 opened Apr 4, 2024 by ouj
TPU support enhancement New feature or request
#79 opened Mar 26, 2024 by miguelalba96
Prints inside the worker processes mess up the progress bar bug Something isn't working help wanted Extra attention is needed
#76 opened Mar 24, 2024 by carmocca
litdata with huggingface instead of S3 enhancement New feature or request
#64 opened Mar 8, 2024 by ehartford
The tested speed is not as fast as expected. bug Something isn't working help wanted Extra attention is needed
#60 opened Mar 7, 2024 by tikboaHIT
Resuming StreamingDataloader with num_workers=0 fails bug Something isn't working
#24 opened Feb 26, 2024 by tchaton
Append data to pre-optimized dataset enhancement New feature or request
#23 opened Feb 26, 2024 by tchaton
ProTip! Updated in the last three days: updated:>2024-05-18.