Official Tempo-6B collection: a query-aware framework that addresses the mismatch between massive video streams and bounded LLM context windows.
Models (20)
Vision-CAIR/Tempo-6B
Video-Text-to-Text • Updated • 143 • 2
Vision-CAIR/Tempo-6B-Stage2
Video-Text-to-Text • Updated • 47
Vision-CAIR/Tempo-6B-Stage1
Video-Text-to-Text • Updated • 31
Vision-CAIR/Tempo-6B-Stage0
Video-Text-to-Text • Updated • 38
Vision-CAIR/BFPO-Mistral-7b-v0.1
Text Generation • 7B • Updated • 11 • 1
Vision-CAIR/LongVU_Llama3_2_1B
Video-Text-to-Text • Updated • 30 • 12
Vision-CAIR/LongVU_Llama3_2_3B_img
Updated • 3 • 6
Vision-CAIR/LongVU_Qwen2_7B_img
Updated • 8 • 5
Vision-CAIR/LongVU_Llama3_2_3B
Video-Text-to-Text • Updated • 25 • 8
Vision-CAIR/LongVU_Qwen2_7B
Video-Text-to-Text • 8B • Updated • 187 • 76