918博天堂(中国)

918博天堂(中国)BIGAI

科研成果

Efficient Temporal Extrapolation of Multimodal Large Language Models with Temporal Grounding Bridge

Varying Sentence Representations via Condition-Specified Routers

Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL

下载

SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields

F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions

VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding