A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
-
Updated
Jul 12, 2024 - Python
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
This is an open collection of state-of-the-art (SOTA), novel Text to X (X can be everything) methods (papers, codes and datasets).
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Generate a video script, voice and a talking face completely with AI
Generate video from text using AI
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
Faceless Video Engine
Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text".
Telegram Bot For txt to Video @AshutoshGoswami24
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Stable Diffusion, SDXL, LoRA Training, DreamBooth Training, Automatic1111 Web UI, DeepFake, Deep Fakes, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya LoRA, Kandinsky 2, DeepFloyd IF, Midjourney
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
Diffusion model papers, survey, and taxonomy
[Arxiv] A Survey on Video Diffusion Models
Official implementation of "AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising"
Cassette is designed to create 30-second explanatory videos suitable for Instagram Reels or YouTube Shorts.
Add a description, image, and links to the text-to-video topic page so that developers can more easily learn about it.
To associate your repository with the text-to-video topic, visit your repo's landing page and select "manage topics."