918博天堂(中国)BIGAI

Paper Index

F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions

VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding

SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding

Unifying 3D Vision-Language Understanding via Promptable Queries

LangSuit·E: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments

Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels
