Releases · apacha/OMR-Datasets

12 Jun 12:29

apacha

1.3.0

6d7a5e3

Adding DeepScores dataset and custom download capabilities Latest

Latest

Added the download of the DeepScores V1 dataset (subset with extended vocabulary) to the available datasets.
Also slightly rewrote the downloader, so you can specify a custom URL to any zip file that will be downloaded and extracted, in case your dataset is hosted elsewhere.

Assets 2

03 Jun 08:00

apacha

1.2.1

21928b7

Corrected Dependency Management and Semantic Versioning

Fixed an error in the dependency management of setup.py which prevented normal installation.
Also changing to semantic versioning with three numbers Major.Minor.Revision.

Assets 2

30 Apr 12:38

apacha

1.1

aeaa5f5

Upgraded Muscima++ Symbol Generator to V2

Updated MuscimaPlusPlusSymbolImageGenerator to work with MUSCIMA++ 2.0.

Added quality-of-life improvement suggested by @yvan674 to make importing
common classes such as the downloader easier.

Assets 2

30 Apr 12:37

apacha

1.0

e0d447e

Release of 1.0 with simplified interface

This release is a major new version that is incompatible with previous versions.

The project was dramatically simplified to ease common tasks such as downloading datasets two become much more user-friendly.

Removed mostly unused code and re-organized project structure and documentation.

Assets 2

20 Aug 14:20

apacha

0.19

32c9df9

Support generation of MUSCIMA++ mask images

Release of MaskImageGenerator to support three types of masks being generated from the MUSCIMA++ dataset, Version 2.0:

Instance Segmentation of Staff Lines
Instance Segmentation of Staff Blobs
Semantic Segmentation of all Node objects

Assets 2

13 Aug 10:07

apacha

0.18

39ca0eb

Support for new MUSCIMA++ v2.0 dataset

This release includes support for the new MUSCIMA++ v2.0 dataset.

Assets 2

25 Feb 17:10

apacha

0.15

252f803

Publishing MUSCIMA++ measure annotation dataset

In this release, we added a derived datasest from the MUSCIMA++ dataset, that contains annotations for staves, system measures and stave measures.

Further changes:

Improved the documentation, so readthedocs actually renders the content properly.
Significantly improved the download-progress reporting by resorting to TQDM instead of simply printing lines.
Dropping Python 3.5 support, because we're using string-interpolation

Assets 2

16 Nov 12:38

apacha

0.13

b81ad3a

Adding direct download of MUSCIMA-PP images

MuscimaPlusPlusDatasetDownloader now downloads the annotations and the annotated images together, to avoid the need for downloading 2GB if you just need the 7MB of images, that were actually annotated.

Assets 2

02 Feb 13:01

apacha

datasets

d1fee48

Moving datasets to Github

Moving all datasets to Github and mirroring existing datasets to improve performance and ensure they will be preserved, even if the original sites shut down.

Mirroring:

AudiverisOMR
CVC-MUSCIMA - (Write identification, Staff removal, and Multi-Condition-Aligned).
HOMUS 1.0 and 2.0
MUSCIMA++ v1.0 including the images of the CVC-MUSCIMA dataset, that were annotated (CVC_MUSCIMA_PP_Annotated-Images.zip, Measure-Annotations)
OpenOMR
PrintedMusicSymbols
Rebelo (full, symbols1, and symbols2)
Fornes Music Symbols (Music_Symbols.zip)
Capitan (BimodelHandwrittenSymbols.zip)
Byrd (ByrdOmrTestCorpus.zip)
Bargheer (MEI Annotations for measures only)
FreischuetzDigital (MEI Annotations for measures only)
Baro (BaroMuscima.zip)
DeepScores V1 with extended vocabulary and selected 100 pages (deep-scores-v1-extended-100pages.zip)
DeepScores V1 extended (deep-scores-v1-extended.zip)
DeepScores V2 dense (deep-scores-v2-dense.tar.gz)
AudioLabs_v1
Choi Accidentals
DoReMi V1
OpenScore Lieder (OpenScore-Lieder-Snapshot-2023-10-30.zip), Snapshot 2023-10-30
OpenScore String Quartets (OpenScore-StringQuartets-Snapshot-2023-10-30.zip), Snapshot 2023-10-30
MScoreLib (MScoreLib_all.zip, MScoreLib_Sergey_Prokofiev.zip, MScoreLib_Aleksandr_Scriabin.zip)

Hosting:

AudioLabs_v2

Assets 34

10 Aug 10:25

apacha

0.12

645ef33

Introducing Multi-Condition version of CVC-MUSCIMA

This release features a multi-condition version of the CVC-MUSCIMA that consists of files from the Staff-Removal and the Writer-Identification datasets. The images from both datasets have been aligned to have exactly the same dimensions and overlap as good as possible. The dataset contains a total of 10000 png files with 10 different conditions: grayscale, binary, interrupted, kanungo, staffline-thickness-variation-v1, staffline-thickness-variation-v2, staffline-y-variation-v1, staffline-y-variation-v2, typeset-emulation and whitespeckles.
Note that these augmented version are just taken from the Staff-Removal dataset and inverted to be consistent with all other images black on white background.

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Releases: apacha/OMR-Datasets

Adding DeepScores dataset and custom download capabilities

Corrected Dependency Management and Semantic Versioning

Upgraded Muscima++ Symbol Generator to V2

Release of 1.0 with simplified interface

Support generation of MUSCIMA++ mask images

Support for new MUSCIMA++ v2.0 dataset

Publishing MUSCIMA++ measure annotation dataset

Adding direct download of MUSCIMA-PP images

Moving datasets to Github

Introducing Multi-Condition version of CVC-MUSCIMA