Issues: Lightning-AI/litdata
Issues list
Subsample StreamingDataset
enhancement
New feature or request
help wanted
Extra attention is needed
#135
opened May 21, 2024 by
yhl48
Adding breakpoint in random_images function crashes pdb
bug
Something isn't working
help wanted
Extra attention is needed
#134
opened May 21, 2024 by
cgebbe
StreamingDataset incompatibility with PyTorch Lightning
bug
Something isn't working
help wanted
Extra attention is needed
#133
opened May 20, 2024 by
enrico-stauss
DataChunkRecipe is not working when used in litgpt's TinyLlama pretraining example
bug
Something isn't working
help wanted
Extra attention is needed
#130
opened May 15, 2024 by
wen020
PyTorch Lightning Fabric + litdata + DDP hangs when finishing epoch
bug
Something isn't working
help wanted
Extra attention is needed
#129
opened May 13, 2024 by
miguelalba96
Stream selected channels
enhancement
New feature or request
help wanted
Extra attention is needed
#128
opened May 13, 2024 by
robmarkcole
Cache directory resolution issues in Google Colab
bug
Something isn't working
help wanted
Extra attention is needed
#126
opened May 8, 2024 by
awaelchli
Optimizing dictionary data structures fails when using a partially initialized function
#120
opened May 6, 2024 by
enrico-stauss
Time per sample grows as processed samples grows
bug
Something isn't working
help wanted
Extra attention is needed
#119
opened May 5, 2024 by
scritter
Slow Dataset Preprocessing due to CPU affinity (?) issues
bug
Something isn't working
help wanted
Extra attention is needed
#118
opened May 2, 2024 by
mgolub2
optimize function on multiple machines writing to local paths
enhancement
New feature or request
help wanted
Extra attention is needed
#105
opened Apr 22, 2024 by
rakro101
Dataloading is not working when used in litgpt's debug pretraining example
bug
Something isn't working
help wanted
Extra attention is needed
#103
opened Apr 18, 2024 by
iloshchilov
ValueError: buffer size must be a multiple of element size
bug
Something isn't working
help wanted
Extra attention is needed
#102
opened Apr 18, 2024 by
awaelchli
Question: is there a plan to support streaming from GCS?
enhancement
New feature or request
#101
opened Apr 13, 2024 by
dnnspark
Compression using the optimize function from litdata
bug
Something isn't working
help wanted
Extra attention is needed
#97
opened Apr 11, 2024 by
rakro101
GCSFuse mount + Vertex AI custom training jobs support
enhancement
New feature or request
#94
opened Apr 7, 2024 by
miguelalba96
litdata.optimize accidentally deletes files from the local filesystem
bug
#93
opened Apr 5, 2024 by
hubertsiuzdak
Assert when deserializing no_header_numpy or no_header_tensor.
bug
Something isn't working
help wanted
Extra attention is needed
#92
opened Apr 4, 2024 by
ouj
Prints inside the worker processes mess up the progress bar
bug
Something isn't working
help wanted
Extra attention is needed
#76
opened Mar 24, 2024 by
carmocca
Allow a StreamingDataset to wrap around when running in a CombinedStreamingDataset
enhancement
New feature or request
#74
opened Mar 14, 2024 by
lantiga
litdata with huggingface instead of S3
enhancement
New feature or request
#64
opened Mar 8, 2024 by
ehartford
The tested speed is not as fast as expected.
bug
Something isn't working
help wanted
Extra attention is needed
#60
opened Mar 7, 2024 by
tikboaHIT
Resuming StreamingDataloader with num_workers=0 fails
bug
Something isn't working
#24
opened Feb 26, 2024 by
tchaton
Append data to pre-optimized dataset
enhancement
New feature or request
#23
opened Feb 26, 2024 by
tchaton