Code and data for the benchmark "Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models"
-
Updated
Jun 28, 2024 - Python
Code and data for the benchmark "Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models"
Pressure Testing Large Video-Language Models (LVLM): Doing multimodal retrieval from LVLM at various video lengths to measure accuracy
Add a description, image, and links to the needle-in-a-haystack topic page so that developers can more easily learn about it.
To associate your repository with the needle-in-a-haystack topic, visit your repo's landing page and select "manage topics."