Releases: openvinotoolkit/datumaro
Release 1.8.0
What's Changed - Brief Version
New features
Enhancements
- Set label name with parents to avoid duplicates for AstypeAnnotations (#1492)
- Pass Keyword Argument to TabularDataBase (#1522)
- Support hierarchical structure for ImageNet dataset format (#1528)
- Enable dtype argument when calling media.data (#1546)
Bug fixes
- Preserve end_frame information of a video when it is zero. (#1541)
- Changed the Datumaro format to ensure exported videos have relative paths and to prevent the same video from being overwritten. (#1547)
What's Changed - Full Version
- Set label name with parents to avoid duplicates for AstypeAnnotations by @sooahleex in #1492
- Add TabularValidator by @sooahleex in #1498
- Add TblStats in Configurable Validator by @sooahleex in #1504
- Bump github/codeql-action from 3.25.4 to 3.25.6 by @dependabot in #1502
- Bump aquasecurity/trivy-action from 0.20.0 to 0.21.0 by @dependabot in #1506
- Bump ruff from 0.4.3 to 0.4.5 by @dependabot in #1505
- Bump pozil/auto-assign-issue from 1.14.0 to 2.0.0 by @dependabot in #1500
- Bump ossf/scorecard-action from 2.3.1 to 2.3.3 by @dependabot in #1496
- Bump ruff from 0.4.5 to 0.4.6 by @dependabot in #1512
- Bump github/codeql-action from 3.25.6 to 3.25.7 by @dependabot in #1516
- Bump ruff from 0.4.6 to 0.4.7 by @dependabot in #1517
- Doc update to replace --save-images with --save-media by @sooahleex in #1514
- Pass Keyword Argument to TabularDataBase by @sooahleex in #1522
- Add correct functionality for tabular data type by @sooahleex in #1513
- Add Clean Transform for tabular data type by @sooahleex in #1520
- Mergeback 1.7.0 to develop by @yunchu in #1538
- Bump aquasecurity/trivy-action from 0.21.0 to 0.23.0 by @dependabot in #1536
- Revert "Mergeback 1.7.0 to develop" by @yunchu in #1539
- Mergeback 1.7.0 to develop by @yunchu in #1540
- Bump pypa/gh-action-pypi-publish from 1.8.14 to 1.9.0 by @dependabot in #1535
- Bump ruff from 0.4.7 to 0.4.9 by @dependabot in #1532
- Bugfix when end_frame is zero by @jihyeonyi in #1541
- Bump github/codeql-action from 3.25.7 to 3.25.10 by @dependabot in #1531
- Bump opencv-python-headless from 4.9.0.80 to 4.10.0.84 by @dependabot in #1537
- Bump ruff from 0.4.9 to 0.4.10 by @dependabot in #1543
- Bump ipython from 8.24.0 to 8.25.0 by @dependabot in #1518
- Bump orjson from 3.10.3 to 3.10.5 by @dependabot in #1530
- Apply clean transform of updated annotations only for tabular annotation type by @sooahleex in #1533
- verify w and h input multiplication overflow to rleEncode() by @yunchu in #1548
- Video bug fix by @jihyeonyi in #1547
- Add notebook for data handling of kaggle dataset by @sooahleex in #1534
- Update pre-commit config to pin ruff dependency for nbqa-ruff by @yunchu in #1550
- Update for release 1.8.0rc0 by @yunchu in #1559
- Support hierarchical structure for ImageNet format by @itrushkin in #1562
- Fix typings in ImageNet format by @itrushkin in #1563
- Update tpp file by @yunchu in #1564
- Update version string to 1.8.0 by @yunchu in #1565
Full Changelog: v1.7.0...v1.8.0
Release 1.7.0
What's Changed - Brief Version
New features
- Support 'Video' media type in datumaro format (#1491)
- Add ann_types property for dataset (#1422, #1479)
- Add AnnotationType.rotated_bbox for oriented object detection (#1459)
- Add DOTA data format for oriented object detection task (#1475)
- Add AstypeAnnotations Transform (#1484)
- Enhance DatasetItem annotations for semantic segmentation model training use case (#1503)
Enhancements
- Fix ambiguous COCO format detector (#1442)
- Get target information for tabular dataset (#1471)
- Add ExtractedMask and update importers who can use it to use it (#1480)
- Improve PIL and COLOR_BGR context image decode performance (#1501)
- Improve get_area() of Polygon through Shoelace formula (#1507)
- Improve _Shape point converter (#1508)
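For readers unfamiliar with the Shoelace formula mentioned in #1507, the sketch below shows the general technique in plain Python. It is an illustration only, not Datumaro's implementation: the function name polygon_area and the flat [x0, y0, x1, y1, ...] point layout are assumptions made for this example.

```python
from typing import Sequence


def polygon_area(points: Sequence[float]) -> float:
    """Area of a simple polygon via the Shoelace formula.

    `points` is assumed to be a flat sequence [x0, y0, x1, y1, ...]
    (a hypothetical layout chosen for this sketch).
    """
    if len(points) % 2 != 0:
        raise ValueError("expected an even number of coordinates")

    n = len(points) // 2
    if n < 3:
        # Fewer than three vertices cannot enclose any area.
        return 0.0

    twice_area = 0.0
    for i in range(n):
        j = (i + 1) % n  # next vertex, wrapping around to the first
        x_i, y_i = points[2 * i], points[2 * i + 1]
        x_j, y_j = points[2 * j], points[2 * j + 1]
        # Cross product of consecutive vertices; the sum is twice the signed area.
        twice_area += x_i * y_j - x_j * y_i

    # The sign depends on vertex order (clockwise or counter-clockwise).
    return abs(twice_area) / 2.0


if __name__ == "__main__":
    # 4 x 3 axis-aligned rectangle
    print(polygon_area([0, 0, 4, 0, 4, 3, 0, 3]))
```

The formula needs a single pass over the vertices and no rasterization, which is why it is a common choice for computing polygon areas.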
Bug fixes
- Split the video directory into subsets to avoid overwriting (#1485)
What's Changed - Full Version
- Bump github/codeql-action from 3.24.9 to 3.24.10 by @dependabot in #1418
- Bump ipython from 8.22.2 to 8.23.0 by @dependabot in #1413
- Bump lxml from 5.2.0 to 5.2.1 by @dependabot in #1414
- Bump pozil/auto-assign-issue from 1.13.0 to 1.14.0 by @dependabot in #1417
- Add task type information when importing by @wonjuleee in #1422
- Bump black from 24.3.0 to 24.4.0 by @dependabot in #1433
- Bump ruff from 0.3.5 to 0.3.7 by @dependabot in #1434
- Bump orjson from 3.10.0 to 3.10.1 by @dependabot in #1441
- Bump github/codeql-action from 3.24.10 to 3.25.0 by @dependabot in #1440
- Mergeback releases/1.6.0 to develop by @yunchu in #1428
- Fix ambiguous coco format detector by @wonjuleee in #1442
- Bump github/codeql-action from 3.25.0 to 3.25.1 by @dependabot in #1451
- Update dependabot config to prevent redundant PR creation by @yunchu in #1455
- Add new annotation type RotatedBbox by @wonjuleee in #1459
- Bump ruff from 0.3.7 to 0.4.1 by @dependabot in #1464
- Bump actions/checkout from 3 to 4 by @dependabot in #1462
- Update dependabot config by @yunchu in #1469
- Bump black from 24.4.0 to 24.4.1 by @dependabot in #1473
- Bump github/codeql-action from 3.25.1 to 3.25.2 by @dependabot in #1470
- Get target information for tabular dataset by @sooahleex in #1471
- Support DOTA data format for oriented object detection task by @wonjuleee in #1475
- Mergeback 1.6.1rc4 to develop by @yunchu in #1478
- Bump ruff from 0.4.1 to 0.4.2 by @dependabot in #1476
- Bump github/codeql-action from 3.25.2 to 3.25.3 by @dependabot in #1477
- Bump black from 24.4.1 to 24.4.2 by @dependabot in #1482
- Bump ipython from 8.23.0 to 8.24.0 by @dependabot in #1481
- Add ExtractedMask and update importers who can use it to use it by @vinnamkim in #1480
- Support annotation types instead of task type by @wonjuleee in #1479
- Split video directory by subset in datumaro format. by @jihyeonyi in #1485
- Update stability tests by @yunchu in #1483
- Add AstypeAnnotations Transform by @sooahleex in #1484
- Bump ruff from 0.4.2 to 0.4.3 by @dependabot in #1490
- Download Kaggle datasets by @itrushkin in #1487
- Bump orjson from 3.10.1 to 3.10.3 by @dependabot in #1489
- Bump github/codeql-action from 3.25.3 to 3.25.4 by @dependabot in #1493
- Bump aquasecurity/trivy-action from 0.19.0 to 0.20.0 by @dependabot in #1494
- Update pillow constraint to >=10.3.0 by @yunchu in #1495
- Enabled support for 'Video' media type in the datumaro format by @jihyeonyi in #1491
- Improve PIL and COLOR_BGR context image decode performance by @vinnamkim in #1501
- Enhance DatasetItem annotations for semantic segmentation model training use case by @vinnamkim in #1503
- Improve get_area() for polygons through Shoelace formula by @wonjuleee in #1507
- Improve Shape point converter by @wonjuleee in #1508
- Update codeql workflow by @yunchu in #1515
- Update for release 1.7.0 by @yunchu in #1526
Full Changelog: v1.6.1...v1.7.0
v1.7.0rc1
What's Changed
- Bump github/codeql-action from 3.24.9 to 3.24.10 by @dependabot in #1418
- Bump ipython from 8.22.2 to 8.23.0 by @dependabot in #1413
- Bump lxml from 5.2.0 to 5.2.1 by @dependabot in #1414
- Bump pozil/auto-assign-issue from 1.13.0 to 1.14.0 by @dependabot in #1417
- Add task type information when importing by @wonjuleee in #1422
- Bump black from 24.3.0 to 24.4.0 by @dependabot in #1433
- Bump ruff from 0.3.5 to 0.3.7 by @dependabot in #1434
- Bump orjson from 3.10.0 to 3.10.1 by @dependabot in #1441
- Bump github/codeql-action from 3.24.10 to 3.25.0 by @dependabot in #1440
- Mergeback releases/1.6.0 to develop by @yunchu in #1428
- Fix ambiguous coco format detector by @wonjuleee in #1442
- Bump github/codeql-action from 3.25.0 to 3.25.1 by @dependabot in #1451
- Update dependabot config to prevent redundant PR creation by @yunchu in #1455
- Add new annotation type RotatedBbox by @wonjuleee in #1459
- Bump ruff from 0.3.7 to 0.4.1 by @dependabot in #1464
- Bump actions/checkout from 3 to 4 by @dependabot in #1462
- Update dependabot config by @yunchu in #1469
- Bump black from 24.4.0 to 24.4.1 by @dependabot in #1473
- Bump github/codeql-action from 3.25.1 to 3.25.2 by @dependabot in #1470
- Get target information for tabular dataset by @sooahleex in #1471
- Support DOTA data format for oriented object detection task by @wonjuleee in #1475
- Mergeback 1.6.1rc4 to develop by @yunchu in #1478
- Bump ruff from 0.4.1 to 0.4.2 by @dependabot in #1476
- Bump github/codeql-action from 3.25.2 to 3.25.3 by @dependabot in #1477
- Bump black from 24.4.1 to 24.4.2 by @dependabot in #1482
- Bump ipython from 8.23.0 to 8.24.0 by @dependabot in #1481
- Add ExtractedMask and update importers who can use it to use it by @vinnamkim in #1480
- Support annotation types instead of task type by @wonjuleee in #1479
- Split video directory by subset in datumaro format. by @jihyeonyi in #1485
- Update stability tests by @yunchu in #1483
- Add AstypeAnnotations Transform by @sooahleex in #1484
- Bump ruff from 0.4.2 to 0.4.3 by @dependabot in #1490
- Download Kaggle datasets by @itrushkin in #1487
- Bump orjson from 3.10.1 to 3.10.3 by @dependabot in #1489
- Bump github/codeql-action from 3.25.3 to 3.25.4 by @dependabot in #1493
- Bump aquasecurity/trivy-action from 0.19.0 to 0.20.0 by @dependabot in #1494
- Update pillow constraint to >=10.3.0 by @yunchu in #1495
- Enabled support for 'Video' media type in the datumaro format by @jihyeonyi in #1491
- Improve PIL and COLOR_BGR context image decode performance by @vinnamkim in #1501
- Enhance DatasetItem annotations for semantic segmentation model training use case by @vinnamkim in #1503
- Improve get_area() for polygons through Shoelace formula by @wonjuleee in #1507
- Improve Shape point converter by @wonjuleee in #1508
- Update codeql workflow by @yunchu in #1515
Full Changelog: v1.6.1...v1.7.0rc1
Release 1.6.1
What's Changed - Brief Version
Enhancements
- Prevent AcLauncher for OpenVINO 2024.0 (#1450)
Bug fixes
- Modify lxml dependency constraint (#1460)
- Fix CLI error occurring when installed with default option only (#1444, #1454)
- Relax Pillow dependency constraint (#1436)
- Modify Numpy dependency constraint (#1435)
- Relax old pandas version constraint (#1467)
What's Changed - Full Version
- Modify numpy dependency constraint by @yunchu in #1435
- Modify pillow constraint by @yunchu in #1436
- Update version string to 1.6.1rc2 by @yunchu in #1437
- Prevent AcLauncher for OpenVINO 2024.0 by @wonjuleee in #1450
- Handling tfds import part to prevent cli error when installing datumaro with default option only by @jihyeonyi in #1454
- Update lxml deps constraint by @yunchu in #1460
- Relax old pandas version constraint by @vinnamkim in #1467
- Update for release 1.6.1 by @yunchu in #1468
Full Changelog: v1.6.0...v1.6.1rc4
Release v1.6.1rc3
What's Changed
- Modify numpy dependency constraint by @yunchu in #1435
- Modify pillow constraint by @yunchu in #1436
- Update version string to 1.6.1rc2 by @yunchu in #1437
- Prevent AcLauncher for OpenVINO 2024.0 by @wonjuleee in #1450
- Handling tfds import part to prevent cli error when installing datumaro with default option only by @jihyeonyi in #1454
- Update lxml deps constraint by @yunchu in #1460
Full Changelog: v1.6.0...v1.6.1rc3
Release v1.6.1rc2
Release v1.6.1rc1
What's Changed
Full Changelog: v1.6.0...v1.6.1rc1
Release v1.6.0
What's Changed - Brief Version
New features
- Changed supported Python version range (>=3.9, <=3.11) (#1269)
- Support MMDetection COCO format (#1213)
- Develop JsonSectionPageMapper in Rust API (#1224)
- Add Filtering via User-Provided Python Functions (#1230, #1233)
- Remove supporting MacOS platform (#1235)
- Support Kaggle image data (KaggleImageCsvBase, KaggleImageTxtBase, KaggleImageMaskBase, KaggleVocBase, KaggleYoloBase) (#1240)
- Add __getitem__() for random accessing with O(1) time complexity (#1247)
- Add Data-aware Anchor Generator (#1251)
- Support bounding box import within Kaggle extractors and add KaggleCocoBase (#1273)
Enhancements
- Optimize Python import to make CLI entrypoint faster (#1182)
- Add ImageColorScale context manager (#1194)
- Enhance visualizer to toggle plot title visibility (#1228)
- Enhance Datumaro data format detect() to be memory-bounded and performant (#1229)
- Change RoIImage and MosaicImage to have np.uint8 dtype as default (#1245)
- Enable image backend and color channel format to be selectable (#1246)
- Boost up CityscapesBase and KaggleImageMaskBase by dropping np.unique (#1261)
- Enhance RISE algorithm for explainable AI (#1263)
- Enhance explore unit test to use real dataset from ImageNet (#1266)
- Fix each method of the comparator to be used separately (#1290)
- Bump ONNX version to 1.16.0 (#1376)
- Print the color channel format (RGB) for datum stats command (#1389)
- Add ignore_index argument to Mask.as_class_mask() and Mask.as_instance_mask() (#1409)
Bug fixes
- Fix wrong example of Datumaro dataset creation in document (#1195)
- Fix wrong command to install datumaro from github (#1202, #1207)
- Update document to correct wrong datum project import command and add filtering example to filter out items containing annotations. (#1210)
- Fix label compare of distance method (#1205)
- Fix Datumaro visualizer's import errors after introducing lazy import (#1220)
- Fix broken link to supported formats in readme (#1221)
- Fix Kinetics data format to have media data (#1223)
- Handling undefined labels at the annotation statistics (#1232)
- Add unit test for item rename (#1237)
- Fix a bug in the previous behavior when importing nested datasets in the project (#1243)
- Fix Kaggle importer when adding duplicated labels (#1244)
- Fix input tensor shape in model interpreter for OpenVINO 2023.3 (#1251)
- Add default value for target in prune cli (#1253)
- Remove deprecated MediaManager (#1262)
- Fix explore command without project (#1271)
- Fix enable COCO to import only bboxes (#1360)
- Fix resize transform for RleMask annotation (#1361)
- Fix import YOLO variants from extractor when urls is not specified (#1362)
What's Changed - Full Version
- Adjust pr-check on condition by @yunchu in #1146
- Infra/mergeback 1.5.0 by @yunchu in #1152
- Bump version string by @yunchu in #1156
- Modify coco to export annotations without media by @jihyeonyi in #1158
- fix github link in docs by @smidm in #1159
- Fix broken YOLO ultralytics links by @daniilpastukhov in #1161
- Upgrade openvino to 2023.1 by @yunchu in #1167
- Add SAMAutomaticMaskGeneration transform by @vinnamkim in #1168
- Mergeback 1.5.1rc2 by @yunchu in #1181
- Optimize Python import to make CLI entrypoint faster by @vinnamkim in #1182
- Add ImageColorScale context manager by @vinnamkim in #1194
- Fix wrong example of Datumaro dataset creation in document by @sooahleex in #1195
- Mergeback 1.5.1 to develop by @yunchu in #1200
- update document regarding installation command from github by @jihyeonyi in #1202
- hotfix indent of installation document by @jihyeonyi in #1207
- correct datum project import examples by @jihyeonyi in #1210
- Fix label compare of distance method by @sooahleex in #1205
- Add MMDetection COCO format importer by @wonjuleee in #1213
- Update code scanning wf for reporting bandit detection as eror by @yunchu in #1215
- Update version constraint for dvc and tfds by @yunchu in #1218
- Upgrade OpenVINO by @sovrasov in #1196
- fix broken link to supported formats by @jihyeonyi in #1221
- Fix Datumaro visualizer's import errors after introducing lazy import by @vinnamkim in #1220
- Enable fuzzing to the CLI by @yunchu in #1222
- Develop JsonSectionPageMapper in Rust API by @vinnamkim in #1224
- Add nosec B311 directives by @vinnamkim in #1226
- Enhance visualizer to toggle plot title visibility by @vinnamkim in #1228
- Move Rust API unit tests to a separate directory by @vinnamkim in #1227
- [Bug] Fix Kinetics format to have media by @wonjuleee in #1223
- Enhance Datumaro data format detect() to be memory-bounded and performant by @vinnamkim in #1229
- Add Filtering via User-Provided Python Functions by @vinnamkim in #1230
- Handling undefined labels at the annotation statistics by @jihyeonyi in #1232
- Update filtering skill up page by @vinnamkim in #1233
- Upgrade CI to MACOS-12 by @vinnamkim in #1234
- Remove supporting MacOS platform by @yunchu in #1235
- Bump version string by @yunchu in #1236
- Add unit test for item rename by @sooahleex in #1237
- Fix filtering skill up doc by @vinnamkim in #1238
- Add Kaggle data supports by @wonjuleee in #1240
- Fix a bug in the previous behavior when importing nested datasets in the project by @vinnamkim in #1243
- Fix duplicated label import for Kaggle importer by @wonjuleee in #1244
- Change RoIImage and MosaicImage to have np.uint8 dtype as default by @vinnamkim in #1245
- Enable image backend and color channel format to be selectable by @vinnamkim in #1246
- Implement dm.Dataset index access by @itrushkin in https://github....
Release v1.5.2
What's Changed - Brief Version
Enhancements
- Add memory bounded datumaro data format detect. (#1241)
- Set protobuf version constraint to 4.* (#1260)
What's Changed - Full Version
- Add memory bounded datumaro data format detect to release 1.5.1 by @vinnamkim in #1241
- Update protobuf version by @wonjuleee in #1248
- Update for 1.5.2 by @yunchu in #1250
- Update changelog and release note by @yunchu in #1256
- Set protobuf version constraint to 4.* by @yunchu in #1260
Full Changelog: v1.5.1...v1.5.2
Release v1.6.0rc1
What's Changed
- Add marker to new feature tests by @yunchu in #1148
- Add multi-threading option to ModelTransform and SAMBboxToInstanceMask by @vinnamkim in #1145
- Fix flaky test by setting the join timeout to infinite by @vinnamkim in #1149
- Update for release 1.5.0rc2 by @yunchu in #1150
- Update for release1.5.0 by @yunchu in #1151
- Fix typos in release notes and changelog by @yunchu in #1157
- Enhance Datumaro data format stream importer performance by @vinnamkim in #1153
- fix errata in voc doc by @jihyeonyi in #1163
- upgrade openvino version to 2023.1 by @yunchu in #1164
- Add SAMAutomaticMaskGeneration transform by @vinnamkim in #1168
- Fix doc links for 1.5.1 by @vinnamkim in #1165
- Update our CI OS from windows-2019 to windows-2022 by @vinnamkim in #1172
- Fix memory unbounded Arrow data format export/import by @vinnamkim in #1169
- Revert openvino version to 2023.0 by @yunchu in #1170
- Change image default dtype from float32 to uint8 by @vinnamkim in #1175
- update explore path by @jihyeonyi in #1176
- Mergeback 1.5.1rc2 by @yunchu in #1181
- Fix imagenet when importing a dataset with non-directory files by @wonjuleee in #1179
- warning instead of raising an error for unsupported annotation types by @jihyeonyi in #1180
- Update CVAT format doc to bypass warning by @sooahleex in #1183
- Add comparison level-up doc by @sooahleex in #1174
- Optimize Python import to make CLI entrypoint faster by @vinnamkim in #1182
- Add ImportError to catch GitPython import error by @vinnamkim in #1186
- Versionup/1.5.1rc3 by @vinnamkim in #1187
- [Bug fix] restore test for splitting video by @wonjuleee in #1188
- Update publish workflow to use internal pypi by @yunchu in #1191
- Update tag checking step to parse from release branch by @yunchu in #1192
- Update release note by @yunchu in #1193
- Add ImageColorScale context manager by @vinnamkim in #1194
- Fix wrong example of Datumaro dataset creation in document by @sooahleex in #1195
- Mergeback 1.5.1 to develop by @yunchu in #1200
- update document regarding installation command from github by @jihyeonyi in #1202
- hotfix indent of installation document by @jihyeonyi in #1207
- correct datum project import examples by @jihyeonyi in #1210
- Fix label compare of distance method by @sooahleex in #1205
- Add MMDetection COCO format importer by @wonjuleee in #1213
- Update code scanning wf for reporting bandit detection as eror by @yunchu in #1215
- Update version constraint for dvc and tfds by @yunchu in #1218
- Upgrade OpenVINO by @sovrasov in #1196
- fix broken link to supported formats by @jihyeonyi in #1221
- Fix Datumaro visualizer's import errors after introducing lazy import by @vinnamkim in #1220
- Enable fuzzing to the CLI by @yunchu in #1222
- Develop JsonSectionPageMapper in Rust API by @vinnamkim in #1224
- Add nosec B311 directives by @vinnamkim in #1226
- Enhance visualizer to toggle plot title visibility by @vinnamkim in #1228
- Move Rust API unit tests to a separate directory by @vinnamkim in #1227
- [Bug] Fix Kinetics format to have media by @wonjuleee in #1223
- Enhance Datumaro data format detect() to be memory-bounded and performant by @vinnamkim in #1229
- Add Filtering via User-Provided Python Functions by @vinnamkim in #1230
- Handling undefined labels at the annotation statistics by @jihyeonyi in #1232
- Update filtering skill up page by @vinnamkim in #1233
- Upgrade CI to MACOS-12 by @vinnamkim in #1234
- Remove supporting MacOS platform by @yunchu in #1235
- Bump version string by @yunchu in #1236
- Add unit test for item rename by @sooahleex in #1237
- Fix filtering skill up doc by @vinnamkim in #1238
- Add Kaggle data supports by @wonjuleee in #1240
New Contributors
Full Changelog: v1.6.0rc0...v1.6.0rc1