Releases: activeloopai/deeplake
1.3.7
🚀 New
- PyTorch data shuffling (#827) @AbhinavTuli
- Feature: multiple image loader function for extended folder structure in image classification (#799) @sparkingdark
- Add supervisely integration (#777) @haiyangdeperci
🐛 Bug Fixes
- GitHub Action CI/CD - Fixed issue #595 (#820) @Anselmoo
- Fix case with multiple class labels (#816) @kristinagrig06
- Fixed issue #843 by extending the classifier (#844) @Anselmoo
🔗 Dependency Updates
- Bump pytest from 6.2.3 to 6.2.4 (#830) @dependabot-preview
- Bump tiledb from 0.8.7 to 0.8.8 (#823) @dependabot-preview
1.3.5
🧭 What's Changed
- Added support for Google Objectron repos stored on GCS (#800) @AbhinavTuli
- Enhancements to how classlabels are stored (#744) @kristinagrig06
- Fixed issues with credentials expiring (#784) @AbhinavTuli
- Added additional schema check (#788) @AbhinavTuli
- Changed default text dtype (#737) @AbhinavTuli
🐛 Bug Fixes
- Update .gitignore for .ddcache (#796) @Anselmoo
- Added image downloads within the tutorial notebook (#773) @AbhinavTuli
- Fix issue #778 (#794) @Anselmoo
- Update requirements-optional for fix issue #797 (#798) @Anselmoo
🔗 Dependency Updates
- Update fsspec requirement from <1,>=0.8 to >=0.8,<2022 (#782) @dependabot-preview
- Bump sphinx from 3.5.3 to 3.5.4 (#764) @dependabot-preview
- Update humbug requirement from <0.2,>=0.1.14 to >=0.1.14,<0.3 (#785) @dependabot-preview
- Bump boto3 from 1.17.54 to 1.17.59 (#808) @dependabot-preview
- Bump ray from 1.2.0 to 1.3.0 (#792) @dependabot-preview
- Bump tiledb from 0.8.6 to 0.8.7 (#765) @dependabot-preview
- Bump flake8 from 3.9.0 to 3.9.1 (#776) @dependabot-preview
- Bump boto3 from 1.17.43 to 1.17.54 (#787) @dependabot-preview
⚙️ Who Contributed
@AbhinavTuli, @Anselmoo, @istranic, @kristinagrig06 and @mynameisvinn
1.3.4
🐛 Bug Fixes
- Hotfix for PyTorch slowdown issues (#781) @AbhinavTuli
- Fixes issue with image loading in docs (#771) @thisiseshan
- Fixes Hub conda release (#759) @haiyangdeperci
🧭 What's Changed
- Restructure webdataset benchmark setup and add new results (#767) @haiyangdeperci
- Prevent internal imports in setup (#769) @haiyangdeperci
- Update brew (#768) @haiyangdeperci
- Unify versioning info source (#741) @haiyangdeperci
- Notebook introduction to objectron dataset added (#749) @haiyangdeperci
🚀 New
- Hub now provides a link to the visualizer when a dataset is created (#755) @Diveafall
- Added WebDataset Hub benchmarks (#733) @DebadityaPal
🗂 Documentation
- Working with Images documentation (#743) @thisiseshan
🔗 Dependency Updates
- Bump sphinx-rtd-theme from 0.5.1 to 0.5.2 (#750) @dependabot-preview
⚙️ Who Contributed
@AbhinavTuli, @DebadityaPal, @Diveafall, @haiyangdeperci, @mikayelh and @thisiseshan
1.3.3 🚀
🧭 What's Changed
- to_pytorch now supports a new argument (key_list) that passes only the listed tensors to it, which speeds up iteration when multiple extra tensors are present. (#715) @AbhinavTuli
- Caching within to_pytorch has been improved for tensors with dynamic shapes (previously only the current sample was kept in the cache) (#715) @AbhinavTuli
- Added ability to store DatasetView as a new Dataset (#740) @AbhinavTuli
- Introduces Windows and macOS tests to CircleCI (#719) @haiyangdeperci
- Benchmark restructuring and memory profiling (#642) @benchislett
- Changed default dtype of ClassLabel from uint16 to uint8 (#745) @AbhinavTuli
- Updated humbug version (#728) @zomglings
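As a rough illustration of the key_list idea from the list above, the sketch below uses plain Python dictionaries as a stand-in for dataset samples. It does not call Hub: `iter_selected` and the sample layout are hypothetical names chosen for this example, and the real to_pytorch implementation may work differently.

```python
from typing import Any, Dict, Iterable, Iterator, List, Optional


def iter_selected(
    samples: Iterable[Dict[str, Any]],
    key_list: Optional[List[str]] = None,
) -> Iterator[Dict[str, Any]]:
    """Yield samples restricted to the tensors named in key_list.

    Hypothetical stand-in: with key_list=None every tensor is passed through.
    """
    for sample in samples:
        if key_list is None:
            yield dict(sample)
        else:
            # Only the requested tensors are copied; the rest are never read.
            yield {key: sample[key] for key in key_list}


samples = [
    {"image": [0, 1, 2], "label": 1, "metadata": "unused-a"},
    {"image": [3, 4, 5], "label": 0, "metadata": "unused-b"},
]

# Keep only the tensors a training loop would actually consume.
selected = list(iter_selected(samples, key_list=["image", "label"]))
```

Skipping unused tensors is where the iteration speed-up described above would come from, since fewer values have to be fetched and converted per sample.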
🗂️ Documentation
- Add examples of dataset generation and modification using transforms, trainings with TensorFlow and PyTorch (#675) @kristinagrig06
- Added code and testing notebook for running dataset transforms on a ray cluster. (#713) @kristinagrig06
🐛 Bug Fixes
- Fixed an issue when overwriting transform datasets (#724) @AbhinavTuli
🔗 Dependency Updates
- Bump boto3 from 1.17.41 to 1.17.43 (#742) @dependabot-preview
- Bump boto3 from 1.17.40 to 1.17.41 (#734) @dependabot-preview
- Bump torchvision from 0.9.0 to 0.9.1 (#720) @dependabot-preview
- Bump boto3 from 1.17.39 to 1.17.40 (#730) @dependabot-preview
- Bump boto3 from 1.17.36 to 1.17.39 (#726) @dependabot-preview
- Bump tiledb from 0.8.5 to 0.8.6 (#725) @dependabot-preview
⚙️ Who Contributed
@AbhinavTuli, @Diveafall, @benchislett, @haiyangdeperci, @imshashank, @kristinagrig06 and @zomglings
1.3.2
🚀 New
- Auto infer-schema & auto-directory ingestion! (#696) @McCrearyD
- Added a hello objectron notebook (#694) @haiyangdeperci
- Added ability to specify region in S3 (#715) @kevinlu1211
- CSV parsing added to hub.auto (#711) @dhiganthrao
- Added genomelake hub backend benchmarks (#680) @DebadityaPal
- Added unit test for utils.py (#668) @hakanbakacak
🧭 What's Changed
- to_tensorflow now supports a new argument (key_list) that passes only the listed tensors to it, which speeds up iteration when multiple extra tensors are present. (#689) @AbhinavTuli
- Caching within to_tensorflow has been improved for tensors with dynamic shapes (previously only the current sample was kept in the cache) (#689) @AbhinavTuli
- Adds the option to specify None as compressor while defining the schema (#689) @AbhinavTuli
- Adds the ability to slice dynamically shaped tensors and obtain a list instead of iterating over them one by one. (#689) @AbhinavTuli
- transform logic has been modified to work properly with multiple workers (#689) @AbhinavTuli
- Added tags to usage and crash reports (#697) @zomglings
- Added ipynb file with benchmark tests for dnafrag package (#676) @DebadityaPal
- Relaxed hub requirements (#659) @haiyangdeperci
- Updated Objectron dataset tensors from generic types to hub schema representations (#705) @haiyangdeperci
🐛 Bug Fixes
- Removed mutable default args in client/base.py (#699) @TakshPanchal
- Fixes Windows environment encoding (#671) @haiyangdeperci
- Fix/windows setup (#650) @haiyangdeperci
- Fixed README links (#682) @DebadityaPal
- Any dataset copy test that got interrupted midway through the test affected all subsequent test runs. This has now been fixed. (#689) @AbhinavTuli
- Fixed issue with resize in mode='a' (#718) @kristinagrig06
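The mutable default argument fix listed above refers to a general Python pitfall. The code from client/base.py is not reproduced here; the sketch below uses hypothetical function names to show the problem and the usual remedy.

```python
from typing import List, Optional


def append_shared(item: str, bucket: List[str] = []) -> List[str]:
    # Pitfall: the default list is created once, when the function is defined,
    # so every call that omits `bucket` mutates the same object.
    bucket.append(item)
    return bucket


def append_fresh(item: str, bucket: Optional[List[str]] = None) -> List[str]:
    # Common fix: use None as the sentinel and build a new list per call.
    if bucket is None:
        bucket = []
    bucket.append(item)
    return bucket


shared_first = append_shared("a")
shared_second = append_shared("b")  # same list object as shared_first

fresh_first = append_fresh("a")
fresh_second = append_fresh("b")  # independent list
```

With the shared default, state leaks between unrelated calls; the None sentinel keeps each call isolated.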
🗂 Documentation
- Russian translation for README (#656) @george-zakharov
- Update schema docs (#654) @thisiseshan
- Add Tutorial for Working with Text on Hub (#672) @dhiganthrao
- Include consent language in README (#666) @mynameisvinn
🔗 Dependency Updates
- Bumped humbug dependency version to ">=0.1.6" (#673) @zomglings
- Update zarr requirement from <2.7,>=2.4 to >=2.4,<2.8 (#717) @dependabot-preview
- Bump boto3 from 1.17.33 to 1.17.36 (#716) @dependabot-preview
- Bump boto3 from 1.17.30 to 1.17.33 (#701) @dependabot-preview
- Bump tensorflow from 2.4.0 to 2.4.1 (#706) @dependabot-preview
- Bump sphinx from 3.5.2 to 3.5.3 (#707) @dependabot-preview
- Bump tiledb from 0.7.6 to 0.8.5 (#703) @dependabot-preview
- Bump flake8 from 3.8.4 to 3.9.0 (#686) @dependabot-preview
- [Security] Bump tensorflow from 2.3.1 to 2.4.0 (#332) @dependabot-preview
- Bump pytest-cov from 2.10.1 to 2.11.1 (#474) @dependabot-preview
- Bump boto3 from 1.17.22 to 1.17.30 (#693) @dependabot-preview
⚙️ Who Contributed
@AbhinavTuli, @DebadityaPal, @McCrearyD, @TakshPanchal, @dependabot-preview, @dependabot-preview[bot], @dhiganthrao, @george-zakharov, @haiyangdeperci, @hakanbakacak, @kevinlu1211, @kristinagrig06, @madhucharan, @mynameisvinn, @thisiseshan, @zomglings
1.3.1
🚀 New
- Auto infer-schema & auto-directory ingestion! (#696) @McCrearyD
- Added a hello objectron notebook (#694) @haiyangdeperci
- Added ability to specify region in S3 (#715) @kevinlu1211
- CSV parsing added to hub.auto (#711) @dhiganthrao
- Added genomelake hub backend benchmarks (#680) @DebadityaPal
- Added unit test for utils.py (#668) @hakanbakacak
🧭 What's Changed
- to_tensorflow now supports a new argument (key_list) that passes only the listed tensors to it, which speeds up iteration when multiple extra tensors are present. (#689) @AbhinavTuli
- Caching within to_tensorflow has been improved for tensors with dynamic shapes (previously only the current sample was kept in the cache) (#689) @AbhinavTuli
- Adds the option to specify None as compressor while defining the schema (#689) @AbhinavTuli
- Adds the ability to slice dynamically shaped tensors and obtain a list instead of iterating over them one by one. (#689) @AbhinavTuli
- transform logic has been modified to work properly with multiple workers (#689) @AbhinavTuli
- Added tags to usage and crash reports (#697) @zomglings
- Added ipynb file with benchmark tests for dnafrag package (#676) @DebadityaPal
- Relaxed hub requirements (#659) @haiyangdeperci
- Updated Objectron dataset tensors from generic types to hub schema representations (#705) @haiyangdeperci
🐛 Bug Fixes
- Removed mutable default args in client/base.py (#699) @TakshPanchal
- Fixes Windows environment encoding (#671) @haiyangdeperci
- Fix/windows setup (#650) @haiyangdeperci
- Fixed README links (#682) @DebadityaPal
- Any dataset copy test that got interrupted midway through the test affected all subsequent test runs. This has now been fixed. (#689) @AbhinavTuli
- Fixed issue with resize in mode='a' (#718) @kristinagrig06
🗂 Documentation
- Russian translation for README (#656) @george-zakharov
- Update schema docs (#654) @thisiseshan
- Add Tutorial for Working with Text on Hub (#672) @dhiganthrao
- Include consent language in README (#666) @mynameisvinn
🔗 Dependency Updates
- Bumped humbug dependency version to ">=0.1.6" (#673) @zomglings
- Update zarr requirement from <2.7,>=2.4 to >=2.4,<2.8 (#717) @dependabot-preview
- Bump boto3 from 1.17.33 to 1.17.36 (#716) @dependabot-preview
- Bump boto3 from 1.17.30 to 1.17.33 (#701) @dependabot-preview
- Bump tensorflow from 2.4.0 to 2.4.1 (#706) @dependabot-preview
- Bump sphinx from 3.5.2 to 3.5.3 (#707) @dependabot-preview
- Bump tiledb from 0.7.6 to 0.8.5 (#703) @dependabot-preview
- Bump flake8 from 3.8.4 to 3.9.0 (#686) @dependabot-preview
- [Security] Bump tensorflow from 2.3.1 to 2.4.0 (#332) @dependabot-preview
- Bump pytest-cov from 2.10.1 to 2.11.1 (#474) @dependabot-preview
- Bump boto3 from 1.17.22 to 1.17.30 (#693) @dependabot-preview
⚙️ Who Contributed
@AbhinavTuli, @DebadityaPal, @McCrearyD, @TakshPanchal, @dependabot-preview, @dependabot-preview[bot], @dhiganthrao, @george-zakharov, @haiyangdeperci, @hakanbakacak, @kevinlu1211, @kristinagrig06, @madhucharan, @mynameisvinn, @thisiseshan, @zomglings
1.3.0
🧭 What's Changed
- Version Control has been added to Hub Datasets! (#610) @AbhinavTuli
- to_tensorflow now properly supports Text datasets (#658) @AbhinavTuli
- Hub crash and system information reports using Bugout (#624) @zomglings
- Added support for multiple BBox and Classlabel, instead of Sequences. (#658) @AbhinavTuli
- CLI name has been changed from hub to activeloop (#631) @haiyangdeperci
- Notebook example for creating a dataset for object detection and instance segmentation added (#629) @haritsahm
- Tutorial for working with Audio added (#592) @mynameisvinn
🚀 New
- Hub version command cli (#628) @sparkingdark
- Automatic Release Drafter added to repository (#598) @Anselmoo
- Improve Directory Structure of Examples (#630) @SauravMaheshkar
- Put zarr, tileDB, and hub benchmarks in one file (#534) @DebadityaPal
- Refactored Dataset Class (#576) @DebadityaPal
- Add Github Actions CI pipeline (#372) @ADI10HERO
🐛 Bug Fixes
- Removed Assertions from shape_detector.py and added exceptions (#616) @DebadityaPal
- Adds support for dataset views in sharded dataset (#557) @AbhinavTuli
- Advanced slicing added for Sharded Dataset (#558) @AbhinavTuli
🗂 Documentation
- README added in Korean (#621) @HyeongminLEE
- README added in Bahasa Indonesia (#645) @haritsahm
- README added in French (#640) @MargauxMasson
- README added in Turkish (#608) @hakanbakacak
- Chinese Readme Proofread and Update (#613) @Cynthia7979
- Change ds.commit() to ds.flush() throughout in README.md (#619) @galbwe
- Added explanation for local file system to docs (#634) @McCrearyD
- Replaced commit() with flush() in documentation. (#604) @dhiganthrao
- Add MinIO to Data Storage docs (#605) @gabriel-milan
- Updated example notebooks with pip (#585) @MojammelHossain
- Typos fixed (#591) @dPacc
🔗 Dependency Updates
- Bump pytest from 6.2.1 to 6.2.2 (#496) @dependabot-preview
- Bump ray from 1.0.0 to 1.2.0 (#554) @dependabot-preview
- Bump boto3 from 1.16.39 to 1.17.20 (#646) @dependabot-preview
⚙️ Who Contributed
@ADI10HERO, @AbhinavTuli, @Anselmoo, @Cynthia7979, @DebadityaPal, @HyeongminLEE, @MargauxMasson, @McCrearyD, @MojammelHossain, @SauravMaheshkar, @dPacc, @davidbuniat, @dhiganthrao, @gabriel-milan, @galbwe, @haiyangdeperci, @hakanbakacak, @haritsahm, @imshashank, @mikayelh, @mynameisvinn, @sparkingdark and @zomglings
1.2.3
Release Notes
- Reverting shape checks for Mask schema to maintain backward compatibility.
1.2.2
Release Notes
- Hotfix for a bug that resulted in incorrect slicing of TensorView.
1.2.1
Release Notes
- Dataset copying has been added, allowing you to copy your own and other users' datasets easily. Datasets can be copied across gcs, s3, aws, local storage and hub storage. #454 (@AbhinavTuli)
- Many improvements to the benchmarks #508 #512 #531 #545 #550 (@haiyangdeperci @DebadityaPal)
- Development Roadmap added #511 (@mynameisvinn)
- Improved message for Hub transforms by displaying shard size #523 (@DebadityaPal)
- All windows have now been fixed. #528 (@AbhinavTuli)
- Hub dataset filtering has been overhauled and a section has been added for the same in the documentation #539 (@AbhinavTuli)
- to_tensorflow issues with Datasets containing Sequences (such as COCO) have been fixed #540 (@AbhinavTuli)
- Adds get_label parameter to .compute() and .numpy(), to directly retrieve string label from ClassLabel #489 (@DebadityaPal)
- Tutorial added for using Hub with Hugging Face transformers #536 (@DebadityaPal)
- Some unit tests have now been parameterized to cover multiple datatypes #527 (@drewpotter)
- From directory function has been implemented to directly ingest categorical image data #459 (@sparkingdark)
- Example use case added for creating a Hub dataset for Deep Learning prediction of crop yield #559 (@MargauxMasson)
- MPL Headers have been added to source files #494 (@KrishnaChaitanya1)
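The get_label option mentioned in the list above can be pictured as a switch between stored integer indices and their string names. The sketch below is a minimal stand-in, not Hub's ClassLabel or its .compute() method: the `LabelColumn` class and its attributes are hypothetical and exist only for this illustration.

```python
from typing import List, Sequence, Union


class LabelColumn:
    """Hypothetical stand-in for a column of class labels stored as integers."""

    def __init__(self, names: Sequence[str], indices: Sequence[int]) -> None:
        self.names = list(names)
        self.indices = list(indices)

    def compute(self, get_label: bool = False) -> Union[List[int], List[str]]:
        # By default return the stored integer indices; with get_label=True,
        # map each index to its string name instead.
        if get_label:
            return [self.names[i] for i in self.indices]
        return list(self.indices)


column = LabelColumn(names=["cat", "dog", "bird"], indices=[2, 0, 1, 0])
as_indices = column.compute()
as_labels = column.compute(get_label=True)
```

Storing indices and resolving names on request keeps the stored data compact while still giving readable labels when needed.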