Add empty audio catch #223

Contributor

@chainyo chainyo commented Aug 30, 2023

Here is an example of the response returned for a request with empty audio:

{
    "utterances": [
        {
            "text": "Empty audio",
            "start": 0.0,
            "end": 27.52,
            "speaker": null,
            "words": null
        }
    ],
    "audio_duration": 27.52,
    "alignment": false,
    "num_speakers": -1,
    "diarization": false,
    "source_lang": "en",
    "timestamps": "s",
    "vocab": [],
    "word_timestamps": false,
    "internal_vad": false,
    "repetition_penalty": 1.2,
    "compression_ratio_threshold": 2.4,
    "log_prob_threshold": -1.0,
    "no_speech_threshold": 0.6,
    "condition_on_previous_text": true,
    "process_times": {
        "total": 0.0,
        "transcription": 0.0,
        "diarization": null,
        "alignment": null,
        "post_processing": 0.0
    },
    "dual_channel": false
}
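A client could recognize this placeholder response with a check along the following lines. This is only a minimal sketch: `is_empty_audio` and its `markers` argument are hypothetical names, not part of the API, and the two marker strings are the `"Empty audio"` text from the sample above and the `<EMPTY AUDIO>` text proposed later in this thread.

```python
from typing import Any, Dict, Iterable


def is_empty_audio(
    response: Dict[str, Any],
    markers: Iterable[str] = ("Empty audio", "<EMPTY AUDIO>"),
) -> bool:
    """Return True if the response looks like the empty-audio placeholder.

    Hypothetical client-side helper: it assumes the placeholder is a single
    utterance whose text is one of `markers` and whose `words` field is null.
    """
    utterances = response.get("utterances") or []
    if len(utterances) != 1:
        return False
    only = utterances[0]
    return only.get("text") in set(markers) and only.get("words") is None


# Trimmed copy of the sample response shown above.
sample = {
    "utterances": [
        {"text": "Empty audio", "start": 0.0, "end": 27.52, "speaker": None, "words": None}
    ],
    "audio_duration": 27.52,
    "num_speakers": -1,
}

empty = is_empty_audio(sample)
```

Matching on the utterance text is brittle if the placeholder wording changes again, so a client relying on this would want to keep the marker list in one place.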

@chainyo added the labels api (Everything related to the API implementation) and diarization (Everything related to the diarization part) on Aug 30, 2023
@aleksandr-smechov
Contributor

Let's do <EMPTY AUDIO>

Also why num_speakers -1 instead of 0?

@chainyo
Contributor Author

chainyo commented Aug 30, 2023

> Let's do <EMPTY AUDIO>

The changes have been made.

> Also why num_speakers -1 instead of 0?

@aleksandr-smechov This is the API request parameter, not the number of detected speakers. There is no feature to extract the number of detected speakers yet.
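To illustrate the distinction, here is a rough sketch of how a response could be assembled so that `num_speakers` is simply echoed from the request while the empty-audio catch only touches `utterances`. It is not the code merged in this PR: `build_response`, `request_params`, and `EMPTY_AUDIO_TEXT` are hypothetical names chosen for the example, and the placeholder fields mirror the sample response above.

```python
from typing import Any, Dict, List

EMPTY_AUDIO_TEXT = "<EMPTY AUDIO>"  # placeholder text agreed on in this thread


def build_response(
    request_params: Dict[str, Any],
    utterances: List[Dict[str, Any]],
    audio_duration: float,
) -> Dict[str, Any]:
    """Hypothetical helper: catch empty transcripts and echo request parameters."""
    if not utterances:
        # No speech found: return one placeholder utterance spanning the audio.
        utterances = [
            {
                "text": EMPTY_AUDIO_TEXT,
                "start": 0.0,
                "end": audio_duration,
                "speaker": None,
                "words": None,
            }
        ]
    # Request parameters such as num_speakers are copied through unchanged;
    # nothing here counts the speakers actually detected in the audio.
    return {
        "utterances": utterances,
        "audio_duration": audio_duration,
        **request_params,
    }


response = build_response({"num_speakers": -1, "diarization": False}, [], 27.52)
```

In this sketch `response["num_speakers"]` stays `-1` because that is the value sent with the request, even though the audio contains no speech.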

@chainyo chainyo merged commit adb2704 into main Aug 30, 2023
3 checks passed
@chainyo chainyo deleted the 222-find-a-way-to-return-a-blank-transcript-or-a-silence-annotation-on-no-speech-calls branch August 30, 2023 08:47
Development

Successfully merging this pull request may close these issues.

Find a way to return a blank transcript or a SILENCE annotation on no-speech calls