🥬
lose weight
PhD student, ShowLab @ NUS.
Video Multimodal.
- National University of Singapore
- Singapore
- qinghonglin.github.io
- @KevinQHLin
Pinned
- showlab/Awesome-GUI-Agent (Public): 💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
- showlab/EgoVLP (Public): [NeurIPS2022] Egocentric Video-Language Pretraining
- showlab/UniVTG (Public): [ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding
- facebookresearch/EgoVLPv2 (Public): Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]
- showlab/VLog (Public): Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.
- showlab/videogui (Public): Official repo of "VideoGUI: A Benchmark for GUI Automation from Instructional Videos"
JavaScript 19