audio-captioning

PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"

transformers pytorch audio-captioning clotho-dataset dcase-challenge

Updated Jan 6, 2024
Jupyter Notebook

minguinho26 / Prefix_AAC_ICASSP2023

Star

Official Implementation of "Prefix tuning for Automated Audio Captioning(ICASSP 2023)"

deep-learning pytorch-implementation audio-captioning icassp2023

Updated Dec 6, 2023
Jupyter Notebook

ExplainableML / ZerAuCap

Star

[NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords

audio zero-shot opt audio-captioning clotho-dataset large-language-models neurips-2023 audiocaps

Updated Nov 20, 2023

audio-captioning / dcase-2020-baseline

Star

Audio captioning baseline system for DCASE 2020 challenge.

machine-learning deep-neural-networks deep-learning signal-processing audio-signal-processing captioning dcase machine-listening audio-captioning dcase2020

Updated Aug 22, 2023
Python

lukewys / dcase_2020_T6

Star

2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning-results#wuyusong2020_t6

deep-learning audio-captioning

Updated Aug 3, 2023
Python

zelaki / wsac

Star

This reporsitory code form Weakly Supervised Automaed Audio Captioning via Text Only Training

clap audio-captioning dcase2023

Updated Jun 12, 2023
Python

TheoCoombes / ClipCap

Star

Using pretrained encoder and language models to generate captions from multimedia inputs.

vqa image-captioning language-model encoder-decoder audio-captioning vision-transformer

Updated Mar 11, 2023
Python

blmoistawinde / fense

Star

Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval.

benchmark evaluation-metrics audio-captioning audiocaption

Updated Feb 1, 2023
Python

soham97 / sound_ai_progress

Star

Tracking states of the arts and recent results (bibliography) on sound tasks.

audio-processing sound-event-detection music-classification acoustic-scene-classification audio-captioning audio-generation audio-retrieval

Updated Jan 10, 2023

paniquex / Automated_Audio_Captioning_DCASE2020

Star

6-th task solution of DCASE2020

audio gru attention audio-processing mixup audio-captioning

Updated Jun 22, 2022
Python

ilaria-manco / muscaps

Star

Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)

music-information-retrieval mir multimodal-deep-learning audio-captioning

Updated Apr 30, 2022
Jupyter Notebook

Improve this page

Add a description, image, and links to the audio-captioning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the audio-captioning topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

audio-captioning

Here are 26 public repositories matching this topic...

Labbeti / aac-metrics

iOPENCap / awesome-unimodal-training

abikaki / DCASE-Workshop-Papers

Labbeti / aac-datasets

Labbeti / conette-audio-captioning

soham97 / awesome-sound_event_detection

Sreyan88 / RECAP

ilaria-manco / song-describer

Labbeti / dcase2024-task6-baseline

slSeanWU / beats-conformer-bart-audio-captioner

minguinho26 / Prefix_AAC_ICASSP2023

ExplainableML / ZerAuCap

audio-captioning / dcase-2020-baseline

lukewys / dcase_2020_T6

zelaki / wsac

TheoCoombes / ClipCap

blmoistawinde / fense

soham97 / sound_ai_progress

paniquex / Automated_Audio_Captioning_DCASE2020

ilaria-manco / muscaps

Improve this page

Add this topic to your repo