One Token per Highly Selective Frame: Towards Extreme Compression for Long Video Understanding
👁 56 📚 2
Lyra 2.0: Explorable Generative 3D Worlds
👁 204 📚 2
Who Handles Orientation? Investigating Invariance in Feature Matching
👁 199 📚 29
Tango: Taming Visual Signals for Efficient Video Large Language Models
👁 57 📚 13
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models
👁 90 📚 8
ETCH-X: Robustify Expressive Body Fitting to Clothed Humans with Composable Datasets
👁 188 📚 24
GaussiAnimate: Reconstruct and Rig Animatable Categories with Level of Dynamics
👁 50 📚 9
MoRight: Motion Control Done Right
👁 26 📚 17
Action Images: End-to-End Policy Learning via Multiview Video Generation
👁 49 📚 15
Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision
👁 157 📚 21
CoME-VL: Scaling Complementary Multi-Encoder Vision-Language Learning
👁 187 📚 14
Generative World Renderer
👁 117 📚 16
EventHub: Data Factory for Generalizable Event-Based Stereo Networks without Active Sensors
👁 93 📚 5
HippoCamp: Benchmarking Contextual Agents on Personal Computers
👁 56 📚 19
SurgNavAR: An Augmented Reality Surgical Navigation Framework for Optical See-Through Head Mounted D...
👁 15 📚 0
Conditional Polarization Guidance for Camouflaged Object Detection
👁 12 📚 0
Benchmarking PhD-Level Coding in 3D Geometric Computer Vision
👁 14 📚 0
Video Models Reason Early: Exploiting Plan Commitment for Maze Solving
👁 10 📚 0
OmniRoam: World Wandering via Long-Horizon Panoramic Video Generation
👁 12 📚 0
Focal plane wavefront control with model-based reinforcement learning
👁 15 📚 0