Baselines for the Zero-Resources Speech Challenge using VisuallyGrounded Models of Spoken Language, 2021 edition
challenge
deep-neural-networks
pytorch
representation-learning
speech-processing
weakly-supervised-learning
multimodal-learning
librispeech
visually-grounded-speech
spokencoco
-
Updated
Jun 1, 2021 - Python