Releases: activeloopai/deeplake
Releases · activeloopai/deeplake
Fixes to parallel computing 🌈
🧭 What's Changed
- Fix/num workers 0 bug (#1112) @davidbuniat
- Move hub.compute reporting to pipeline eval (#1116) @istranic
- Updates transform docstrings/variables to remove usage of the word "transform" (#1115) @AbhinavTuli
- Fixed bugout reporting paths and overreporting during Dataset initialization. (#1111) @davidbuniat
- Release/2.0.6 (#1107) @davidbuniat
- Faster chunk serialization (#1106) @AbhinavTuli
- Remove default logging code in client (#1097) @benchislett
⚙️ Who Contributes
@AbhinavTuli, @benchislett, @davidbuniat, @istranic and @mccrearyd
Refactors and Minor Updates
🩰 What's New
- Added hub.ingest for automatic creation of datasets
- Added hub.list to help users find publicly available datasets
- Lots of refactors to help developers
🧭 What's Changed
- kaggle argument fix (#1101) @thisiseshan
- Polish top directory (#1054) @kristinagrig06
- Ban dataset attributes as tensor names (#1103) @kristinagrig06
- update tensors (#1089) @mccrearyd
- More sample compressions (#1087) @farizrahman4u
- [small] all scalars have shape (1,) instead of () (#1102) @mccrearyd
- refactor input pipeline for samples (#1099) @mccrearyd
- Integrate hub auto + kaggle (#1075) @thisiseshan
- List datasets (#1048) @kristinagrig06
⚙️ Who Contributes
@AbhinavTuli, @farizrahman4u, @kristinagrig06, @mccrearyd and @thisiseshan
Adding metadata and parallel computations
🎁 What's New
- You can add metadata to datasets and tensors
- You can run computations in parallel using
hub.compute
- The dataset API is updated to be more intuitive
🧭 What's Changed
- Add static dataset delete (#1060) @benchislett
- 2.0.4 release Version update (#1094) @davidbuniat
- Adding back transforms for parallel dataset uploads (#1086) @AbhinavTuli
- Release 2.0.3 (#1084) @davidbuniat
- Update code snippets (#1088) @istranic
- [refactor] encoders base class (#1082) @mccrearyd
- Alias "jpg" to "jpeg" (#1073) @benchislett
- Updated readme (#1085) @istranic
- [small] implement
hub.like
in new api (#1083) @mccrearyd - Bugout reporting update (#1081) @istranic
- Info fixes (#1080) @farizrahman4u
- BUGGER_OFF=true when running tests in Circle CI. (#1079) @zomglings
- [small] Remove dataset link during creation of Hub datasets (#1078) @dhiganthrao
- Enable search in pdoc (#1074) @benchislett
- Added "dataset" class for interacting with underlying "Dataset" class (#1063) @AbhinavTuli
- [small] turn off activeloop reporting during circleci tests (#1076) @mccrearyd
- dataset/tensor
info
alongsidemeta
(#1066) @mccrearyd - [small] fix hub cloud throttling for tests (#1077) @mccrearyd
- Add back the history of master into main (#1061) @benchislett
- [Small PR] Removes tfds tests (#1070) @AbhinavTuli
- Old pytorch multiprocessing bug fix (#1068) @AbhinavTuli
- [Small PR] Renames hub.load to hub.read (#1064) @AbhinavTuli
- Made Dataset and LRUCache objects pickleable (#1049) @AbhinavTuli
- Alternate fix for tensor creation bug (#1065) @AbhinavTuli
⚙️ Who Contributes
@AbhinavTuli, @benchislett, @davidbuniat, @dhiganthrao, @farizrahman4u, @istranic, @mccrearyd, @tatevikh and @zomglings
Bug fix for .pytorch DataLoaders 🌈
🎁 What's New
- We mostly focused on refactoring and minor bugs.
- .pytorch() now works with pubic datasets hosted by team Activeloop (e.g. hub://activeloop/mnist-train).
- Underlying data format is now better! Since the new format is incompatible with the prior release, you should update to the new release using
pip3 install --upgrade hub
.
🧭 What's Changed
- version update (#1062) @davidbuniat
- Fixes an issue in which reporting configuration file was not being created if its parent directory didn't exist. (#1058) @zomglings
- Update PR template to new format (#1059) @benchislett
- Add back PR template from master (#1034) @benchislett
- Update htype docs (#1030) @benchislett
- Validate indexing when given, not at compute-time (#1033) @benchislett
- Update readme (#1057) @istranic
- fix meta non-persistence bug (adds test) (#1053) @mccrearyd
- Updating old pytorch warning message (#1055) @AbhinavTuli
- Changed from master to main (#1052) @Anselmoo
- [refactor] Tests/update fixtures (#1046) @mccrearyd
- NPZ replacement format (only) (#1047) @farizrahman4u
- Auto cast (#1041) @farizrahman4u
- Bring back tuple mode, this time serializable (#1028) @farizrahman4u
- Array interface for Tensor (#1042) @farizrahman4u
- Windows always uses old pytorch integration now (#1044) @AbhinavTuli
- [small] remove chunk sizes from htypes (#1037) @mccrearyd
- Small fix for Pytorch shared memory leak (#1040) @AbhinavTuli
- Fixes dataset creation bug with s3/hub cloud datasets having similar names (#1045) @AbhinavTuli
- [small] Update/2.0/hub cloud test (#1023) @mccrearyd
- Fix tensor creation bug (#1043) @farizrahman4u
- Refactor/fstrings (#1035) @dhiganthrao
- update sample compression API (#1038) @mccrearyd
- [small] Silence tensorflow logs in tests (#1029) @benchislett
- [small] update scalar test (#1022) @mccrearyd
🐛 Bug Fixes
- [small] pytorch readonly error bug fix (#1026) @mccrearyd
- [small] Fix/2.0/readonly (#1024) @mccrearyd
🔗 Dependency Updates
- Bump pillow from 7.2.0 to 8.2.0 in /requirements (#1018) @dependabot
⚙️ Who Contributes
@AbhinavTuli, @Anselmoo, @benchislett, @davidbuniat, @dependabot, @dhiganthrao, @farizrahman4u, @istranic, @mccrearyd, @tatevikh and @zomglings
Hub is in Beta!
What's New
- Hub core was redesigned to enable blazing-fast dataset creation. You can create a Hub dataset faster than copy/pasting files on your local machine
Features
- Super simple API
- Easy creation of datasets and hosting on Activellop Storage or S3
- Rapid dataset streaming to any machine
- Simple dataset integration to pytorch with no boilerplate code (Windows support will be added in the next release)
Pre-Release 2.0.1-alpha
Pre-release for Hub 2.0-alpha
2.0 Early Alpha
Merge pull request #916 from activeloopai/task/2.0/append-api-updates [2.0] Various API changes
1.3.7
🚀 New
- Pytorch data shuffling (#827) @AbhinavTuli
- Feature: multiple image loader function for extended folder structure in image classification (#799) @sparkingdark
- Add supervisely integration (#777) @haiyangdeperci
🐛 Bug Fixes
- GitHub Action CI/CD - Fixed issue #595 (#820) @Anselmoo
- Fix case with multiple class labels (#816) @kristinagrig06
- Fixed issue #843 by extending the classifier (#844) @Anselmoo
🔗 Dependency Updates
- Bump pytest from 6.2.3 to 6.2.4 (#830) @dependabot-preview
- Bump tiledb from 0.8.7 to 0.8.8 (#823) @dependabot-preview
1.3.5
🧭 What's Changed
- Added support for google objectron repos stored on GCS (#800) @AbhinavTuli
- Enhancements to how classlabels are stored (#744) @kristinagrig06
- Fixed issues with credentials getting expired (#784) @AbhinavTuli
- Added additional schema check (#788) @AbhinavTuli
- Changed default text dtype (#737) @AbhinavTuli
🐛 Bug Fixes
- Update .gitignore for .ddcache (#796) @Anselmoo
- Added image downloads within the tutorial notebook (#773) @AbhinavTuli
- Fix issue #778 (#794) @Anselmoo
- Update requirements-optional for fix issue #797 (#798) @Anselmoo
🔗 Dependency Updates
- Update fsspec requirement from <1,>=0.8 to >=0.8,<2022 (#782) @dependabot-preview
- Bump sphinx from 3.5.3 to 3.5.4 (#764) @dependabot-preview
- Update humbug requirement from <0.2,>=0.1.14 to >=0.1.14,<0.3 (#785) @dependabot-preview
- Bump boto3 from 1.17.54 to 1.17.59 (#808) @dependabot-preview
- Bump ray from 1.2.0 to 1.3.0 (#792) @dependabot-preview
- Bump tiledb from 0.8.6 to 0.8.7 (#765) @dependabot-preview
- Bump flake8 from 3.9.0 to 3.9.1 (#776) @dependabot-preview
- Bump boto3 from 1.17.43 to 1.17.54 (#787) @dependabot-preview
⚙️ Who Contributed
@AbhinavTuli, @Anselmoo, @istranic, @kristinagrig06 and @mynameisvinn
1.3.4
🐛 Bug Fixes
- Hotfix for pytorch slowdown issues (#781) @AbhinavTuli
- Fixes issue with image loading in docs (#771) @thisiseshan
- Fixes Hub conda release (#759) @haiyangdeperci
🧭 What's Changed
- Restructure webdataset benchmark setup and add new results (#767) @haiyangdeperci
- Prevent internal imports in setup (#769) @haiyangdeperci
- Update brew (#768) @haiyangdeperci
- Unify versioning info source (#741) @haiyangdeperci
- Notebook introduction to objectron dataset added (#749) @haiyangdeperci
🚀 New
- Hub now provided a link to the visualizer when a dataset is created (#755) @Diveafall
- Added WebDataset Hub benchmarks (#733) @DebadityaPal
🗂 Documentation
- Working with Images documentation (#743) @thisiseshan
🔗 Dependency Updates
- Bump sphinx-rtd-theme from 0.5.1 to 0.5.2 (#750) @dependabot-preview
⚙️ Who Contributed
@AbhinavTuli, @DebadityaPal, @Diveafall, @haiyangdeperci, @mikayelh and @thisiseshan