GMOS: Grounding Moving Object Segmentation in 3D Space and Time
👁 165 📚 16
From Pixels to Words -- Towards Native One-Vision Models at Scale
👁 172 📚 4
G3T Up! Gravity Aligned Coordinate Frames Simplify Pointmap Processing
👁 104 📚 11
TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction
👁 153 📚 17
Geo-Align: Video Generation Alignment via Metric Geometry Reward
👁 44 📚 10
MotiMotion: Motion-Controlled Video Generation with Visual Reasoning
👁 142 📚 17
Cambrian-P: Pose-Grounded Video Understanding
👁 143 📚 16
Which Way Did It Move? Diagnosing and Overcoming Directional Motion Blindness in Video-LLMs
👁 49 📚 3
Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning
👁 53 📚 2
PiG-Avatar: Hierarchical Neural-Field-Guided Gaussian Avatars
👁 171 📚 17
Can These Views Be One Scene? Evaluating Multiview 3D Consistency when 3D Foundation Models Hallucin...
👁 187 📚 12
IVGT: Implicit Visual Geometry Transformer for Neural Scene Representation
👁 179 📚 3
EntityBench: Towards Entity-Consistent Long-Range Multi-Shot Video Generation
👁 63 📚 12
R-DMesh: Video-Guided 3D Animation via Rectified Dynamic Mesh Flow
👁 170 📚 27
Covering Human Action Space for Computer Use: Data Synthesis and Benchmark
👁 45 📚 6
Power Reinforcement Post-Training of Text-to-Image Models with Super-Linear Advantage Shaping
👁 179 📚 0
123D: Unifying Multi-Modal Autonomous Driving Data at Scale
👁 201 📚 10
Relit-LiVE: Relight Video by Jointly Learning Environment Video
👁 150 📚 18
BAMI: Training-Free Bias Mitigation in GUI Grounding
👁 61 📚 0
Syn4D: A Multiview Synthetic 4D Dataset
👁 79 📚 8