resolve #379 audio classification evaluator + docs #405
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.
Looks awesome. Would you mind running
Thanks a lot @Plutone11011, the PR looks very clean! I added @lewtun as a reviewer for the audio-specific part since he suggested this evaluator.
Thanks a lot for adding this very clean implementation @Plutone11011! Overall it looks great - my only question is whether we should enable support for raw audio data, since that would provide parity with the `audio-classification` pipeline.

Would you like to extend your PR to support that?
>>> model_or_pipeline="superb/wav2vec2-base-superb-ks",
>>> data=data,
>>> label_column="label",
>>> input_column="file",
Since the `audio-classification` pipeline supports inference on both raw waveforms and audio files, would it make sense to enable support for both in evaluation? In other words, `input_column` could be either an `audio` column of type `np.ndarray` or a `file` column of type `str`.

Also, if I'm not mistaken, using audio files requires `ffmpeg` to be installed - perhaps we should add a note somewhere in the example?
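To make the review point concrete, here is a minimal sketch of how an evaluator could route both input types to the pipeline. This is not the PR's actual code: `prepare_pipeline_inputs` is a hypothetical helper, and the dict shape mirrors the `datasets` Audio feature (`{"array": ..., "sampling_rate": ...}`).

```python
def prepare_pipeline_inputs(rows, input_column="file"):
    """Hypothetical helper: normalize dataset rows into inputs that the
    audio-classification pipeline can consume directly."""
    inputs = []
    for row in rows:
        value = row[input_column]
        if isinstance(value, str):
            # An audio file path; decoding it downstream requires ffmpeg.
            inputs.append(value)
        elif isinstance(value, dict) and "array" in value:
            # datasets-style Audio feature: keep only the raw waveform.
            inputs.append(value["array"])
        else:
            # Assume it is already a raw waveform (array/list of floats).
            inputs.append(value)
    return inputs
```

Either way, the pipeline receives a flat list of paths or waveforms, so the rest of the evaluation loop would not need to care which column type the user passed in.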
Ok, I'll add support for this. Do you think it makes sense to also add a test for the dataset with an `audio` column?
```python
>>> from evaluate import evaluator
>>> from datasets import load_dataset
>>> task_evaluator = evaluator("audio-classification")
```
Nit: let's add a space between the imports and code.
Audio classification evaluator.
This audio classification evaluator can currently be loaded from [`evaluator`] using the default task name `audio-classification`.
Methods in this class assume a data format compatible with the [`AudioClassificationPipeline`].
Nit: reference the full path:

> Methods in this class assume a data format compatible with the [`transformers.AudioClassificationPipeline`].
n_resamples: int = 9999,
device: int = None,
random_state: Optional[int] = None,
input_column: str = "file",
See comment above about extending this to support raw waveforms in an `audio` column.
@lewtun I made the changes. In the docs example I added a map operation to rearrange the dataset to be consistent with what the evaluator and pipeline expect, since audio datasets usually have an `Audio` column that looks like this.
EDIT:
Looks good to me :) I think the other commits will go away when we squash-merge the PR.
…gface#405)

* audio classification evaluator + docs
* fix styling issue
* [Docs] fixed a typo in bertscore readme (huggingface#386)
* Add max_length kwarg to docstring of Perplexity measurement (huggingface#411)
* adding support raw audio input audio classification evaluator

Co-authored-by: Hazrul Akmal <hazrulakmal121@gmail.com>
Co-authored-by: Kalyan Dutia <kalyan.dutia@gmail.com>
This is the PR addressing #379.
For testing, I used the audio file highlighted in the Pipelines API example, mainly because the superb dataset requires manual download, which I felt was more cumbersome and less aligned with the other evaluators' tests.