Releases: Wordcab/rtasr
Releases · Wordcab/rtasr
v0.0.7
v0.0.6
🚀 ASR Providers
- Moved from Deepgram Nova to Nova 2 model #96
- Allowed user to run transcription on local audio files or folders #90
- Added the possibility to run transcription on self-hosted wordcab-transcribe version #94
- Improved the way we handle errors for ASR providers #83
- Fixed the retries strategy #81
- Decreased the waiting time between job status #93
🎯 Evaluation
📁 Datasets
- Fixed the
fleurs
dataset file path finding #80
💬 CLI commands
- Added the pricing feature to the
audio-length
command #88
v0.0.5
🚀 ASR Providers
- Simplified the
launch
/get_transcription
function for each ASR provider #55 - Implemented WER data preparation for 5 providers #66 #67
🎯 Evaluation
- Implemented the WER evaluation process #70
📁 Datasets
- Added a new dataset
google/fleurs
for WER #77 - Fixes for AMI to avoid server overload #59
- Added the number of files per dataset splits #52
- Added compatible metrics for each dataset #60
- Removed one audio type for AMI to simplify transcription #59
- Added code to prepare AMI dataset for WER evaluation #57
💬 CLI commands
- Created a command to create plots from evaluation results #73
- Improved
list
command #60 - Added an
audio-length
command to get the duration of a dataset per split #74
⚙️ Tests
- Added a lot of unit tests #51
v0.0.4
- Added
RevAI
andSpeechmatics
as asr providers for transcription #24 #25 - Fixed the manifest file creation for AMI dataset #27
- Added AsrOutputs for 5 providers #30 #32
- Added the
results_to_rttm
for 5 providers #30 #34 - Implemented
use_cache
for transcription #35 - Fixed concurrency feature #32
- Fixed AMI dataset download by enabling concurrency to avoid saturating the server #32
- Define speaker mapping feature for all providers and datasets #37
- Add the
evaluation
command #38 - Fixed AssemblyAI transcription diarization params. #42
- Added DER evaluation #38 #42 #48
- Fixed
UU
speaker problem for Speechmatics #48 - Added
retries
for providers #48 - Added a script to generate plots for DER #48