captioning

Here are 66 public repositories matching this topic...

mrazhou / SEN

Single-stream Extractor Network with Contrastive Pre-training for Remote Sensing Change Captioning

deep-learning pytorch remote-sensing image-captioning captioning change-captioning

Updated Aug 2, 2024
Python

nssharmaofficial / reddit-hole

Automated Reddit Scraper and Video Creator

aws automation reddit reddit-bot tts openai amazon-polly whisper reddit-crawler captioning reddit-scraper amazon-polly-api openai-whisper

Updated Jul 29, 2024
Python

ArchAngelAries / TagScribeR

A tool to streamline AI image captioning

Updated Jul 15, 2024
Python

AMfeta99 / NLP_LLM

This repository is dedicated to small projects and some theoretical material that I used to get into NLP and LLM in a practical and efficient way.

Updated Jul 4, 2024
Jupyter Notebook

Labbeti / aac-metrics

Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.

audio metrics text captioning audio-captioning

Updated Jun 28, 2024
Python

WhoIsJayD / Socio-Caption

python flask natural-language-processing image-processing transformers torch captioning flask-python flask-pymongo flask-bcrypt deepai deepai-api b2sdk

Updated Jun 18, 2024
Python

Labbeti / aac-datasets

Audio Captioning datasets for PyTorch.

audio deep-learning pytorch dataset caption datasets captioning audio-captioning

Updated Aug 2, 2024
Python

aimagelab / PMA-Net

With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning. ICCV 2023

transformer image-captioning captioning-images captioning vision-and-language vision-language memory-augmented-neural-networks iccv2023

Updated Jun 7, 2024
Python

facebookresearch / mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

deep-learning dialog pytorch vqa pretrained-models captioning multimodal multi-tasking textvqa hateful-memes

Updated May 25, 2024
Python

42lux / CaptainCaption

A Gradio-based image captioning tool that uses the GPT-4-Vision API to generate detailed descriptions of images.

tagging gradio captioning openai-api gpt-4-vision

Updated Apr 27, 2024
Python

DavidHuji / CapDec

CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)

clip zero-shot-learning captioning multimodal-deep-learning gpt-2 clipcap

Updated Jan 28, 2024
Python

ZhaoPeiduo / BLIP2-Japanese

Modifying LAVIS' BLIP2 Q-former with models pretrained on Japanese datasets.

japanese pytorch captioning multimodal-deep-learning blip2

Updated Jan 16, 2024
Python

Anshler / ICG_sd_extension

Image caption extension for A1111 Webui 👁️📜🖋️

gradio captioning caption-generation gpt-2 stable-diffusion-web-ui

Updated Dec 30, 2023
Python

sourceduty / Video_Caption_Summary

📺 Software concept for summarizing YouTube video captions.

youtube youtube-video captions idea caption concept summary captioning youtubers video-caption

Updated Oct 27, 2023

trucaption / trucaption

A real-time captioning system with support for large and small screen display.

accessibility speech captions speech-recognition speech-to-text cart transcription captioning caption-generation caption-generator

Updated Oct 22, 2023
JavaScript

DavidMChan / caption-by-committee

Using LLMs and pre-trained caption models for super-human performance on image captioning.

python machine-learning image ai deep-learning captioning chatgpt

Updated Oct 13, 2023
Python

ebu / ebu-tt-live-toolkit

Toolkit for supporting the EBU-TT Live specification

python video live captions subtitles broadcast ebu-tt subtitling captioning

Updated Oct 11, 2023
Python

ImKeTT / ZeroGen

[NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation

decoding zero-shot captioning multimodal gpt2 vision-language nlpcc controllable-text-generation

Updated Oct 7, 2023
Python

nikhilkumarsingh / MemeGenerator

Python program to generate memes.

python generator memes pillow captioning

Updated Oct 3, 2023
Jupyter Notebook

mitvis / vistext

VisText is a benchmark dataset for semantically rich chart captioning.

charts dataset captioning-images captioning t5

Updated Oct 3, 2023
Jupyter Notebook

