• 918博天堂(中国)

    918博天堂(中国)BIGAI

    论文索引

    Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL

    下载

    SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields

    F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions

    VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding

    SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding

    Unifying 3D Vision-Language Understanding via Promptable Queries