The Eval Index / Benchmarks / #163
mbzuai-oryx/Video-ChatGPT
by mbzuai-oryx ยท Benchmarks ยท updated 10mo ago
[ACL 2024 ๐ฅ] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
39
momentum
1,504
stars
129
forks
#163
rank
chatbotclipgpt-4llamallavamulit-modalvicunavideo-chatboatvideo-conversationvision-languagevision-language-pretraining
View on GitHub โ