Audio-Visual Intelligence in Large Foundation Models
👁 184 📚 5
AlbumFill: Album-Guided Reasoning and Retrieval for Personalized Image Completion
👁 109 📚 14
Posterior Augmented Flow Matching
👁 157 📚 29
Generalizable Sparse-View 3D Reconstruction from Unconstrained Images
👁 175 📚 29
OmniRobotHome: A Multi-Camera Platform for Real-Time Multiadic Human-Robot Interaction
👁 87 📚 23
HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation
👁 164 📚 13
Three-Step Nav: A Hierarchical Global-Local Planner for Zero-Shot Vision-and-Language Navigation
👁 152 📚 15
Robust Deepfake Detection: Mitigating Spatial Attention Drift via Calibrated Complementary Ensembles
👁 173 📚 28
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation
👁 81 📚 29
Inter-Stance: A Dyadic Multimodal Corpus for Conversational Stance Analysis
👁 77 📚 29
Context Unrolling in Omni Models
👁 164 📚 22
Seeing Without Eyes: 4D Human-Scene Understanding from Wearable IMUs
👁 120 📚 27
Seeing Fast and Slow: Learning the Flow of Time in Videos
👁 167 📚 19
DeVI: Physics-based Dexterous Human-Object Interaction via Synthetic Video Imitation
👁 169 📚 15
Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items
👁 187 📚 24
MUA: Mobile Ultra-detailed Animatable Avatars
👁 71 📚 16
Repurposing 3D Generative Model for Autoregressive Layout Generation
👁 61 📚 13
TokenLight: Precise Lighting Control in Images using Attribute Tokens
👁 154 📚 20
LeapAlign: Post-Training Flow Matching Models at Any Generation Step by Building Two-Step Trajectori...
👁 175 📚 14
Bidirectional Cross-Modal Prompting for Event-Frame Asymmetric Stereo
👁 99 📚 27