Releases: Wordcab/rtasr

v0.0.7

13 Oct 08:35

This release fixes a few minor issues on the Wordcab side.

  • Adjust the pricing for Wordcab paid services #101
  • Fix a bug in the WordcabWord schema: score -> probability #104

v0.0.6

22 Sep 12:21

🚀 ASR Providers

  • Moved from Deepgram Nova to Nova 2 model #96
  • Allowed users to run transcription on local audio files or folders #90
  • Added the option to run transcription on a self-hosted wordcab-transcribe instance #94
  • Improved the way we handle errors for ASR providers #83
  • Fixed the retry strategy #81
  • Decreased the waiting time between job status checks #93

🎯 Evaluation

  • Added the wrr metric #87
  • Updated the results with the new Deepgram Nova 2 model #98

📁 Datasets

  • Fixed the file path lookup for the fleurs dataset #80

💬 CLI commands

  • Added the pricing feature to the audio-length command #88

v0.0.5

30 Aug 19:27

🚀 ASR Providers

  • Simplified the launch/get_transcription function for each ASR provider #55
  • Implemented WER data preparation for 5 providers #66 #67

🎯 Evaluation

  • Implemented the WER evaluation process #70
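For readers unfamiliar with the metric, WER (word error rate) is conventionally the word-level edit distance between a reference transcript and a provider's output, divided by the number of reference words. The sketch below illustrates that standard definition only. The function name `word_error_rate` is hypothetical and is not taken from rtasr, whose actual evaluation code may normalize text and compute the metric differently.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Illustrative WER: word-level Levenshtein distance / reference length.

    Hypothetical helper for explanation only, not part of rtasr.
    Words are split on whitespace with no text normalization.
    """
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (minus the current one) and the first j hypothesis words.
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]  # deleting all i reference words to reach an empty hypothesis
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,                       # deletion
                    curr[j - 1] + 1,                   # insertion
                    prev[j - 1] + substitution_cost,   # match or substitution
                )
            )
        prev = curr

    return prev[-1] / len(ref_words)


if __name__ == "__main__":
    # One substitution ("sat" -> "sit") and one deletion ("the") over 6 words.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Because insertions count as errors, this ratio can exceed 1.0 when the hypothesis is much longer than the reference.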

📁 Datasets

  • Added a new dataset google/fleurs for WER #77
  • Fixed the AMI dataset to avoid overloading the server #59
  • Added the number of files per dataset split #52
  • Added compatible metrics for each dataset #60
  • Removed one audio type for AMI to simplify transcription #59
  • Added code to prepare AMI dataset for WER evaluation #57

💬 CLI commands

  • Created a command to create plots from evaluation results #73
  • Improved list command #60
  • Added an audio-length command to get the duration of a dataset per split #74

⚙️ Tests

  • Added a lot of unit tests #51

v0.0.4

25 Aug 15:47
  • Added RevAI and Speechmatics as ASR providers for transcription #24 #25
  • Fixed the manifest file creation for AMI dataset #27
  • Added AsrOutputs for 5 providers #30 #32
  • Added the results_to_rttm for 5 providers #30 #34
  • Implemented use_cache for transcription #35
  • Fixed concurrency feature #32
  • Fixed AMI dataset download by enabling concurrency to avoid saturating the server #32
  • Defined the speaker mapping feature for all providers and datasets #37
  • Added the evaluation command #38
  • Fixed AssemblyAI transcription diarization parameters #42
  • Added DER evaluation #38 #42 #48
  • Fixed UU speaker problem for Speechmatics #48
  • Added retries for providers #48
  • Added a script to generate plots for DER #48

v0.0.3

21 Aug 16:33
  • Define a base process for each ASR provider #2
  • Add Wordcab, AssemblyAI and Deepgram #12 #13 #20
  • Add tests for the utils functions #15
  • Write the full API transcription step for the benchmark cli command #2 #17
  • Fix a bug with the AMI dataset files #22

v0.0.2

17 Aug 20:56

This release:

  • Initialized the CLI f42c380
  • Added the AMI dataset e137f9d
  • Added the VoxConverse dataset edb9d9d
  • Docs are now also deployed on every push to the main branch 3c635f6

v0.0.1

17 Aug 08:38

Initialized the repository and reserved the package name on PyPI.