Marine Research - International Top Journal Literature Statistics Platform

One Token per Highly Selective Frame: Towards Extreme Compression for Long Video Understanding

Authors: Zheyu Zhang | Ziqi Pang | Shixing Chen | Xiang Hao...
Journal: arXiv
Year: 2026
Category: Computer Vision

👁 56 📚 2

Lyra 2.0: Explorable Generative 3D Worlds

Authors: Tianchang Shen | Sherwin Bahmani | Kai He | Sangee...
Journal: arXiv
Year: 2026
Category: Computer Vision

👁 204 📚 2

Who Handles Orientation? Investigating Invariance in Feature Matching

Authors: David Nordström | Johan Edstedt | Fredrik Kahl | G...
Journal: arXiv
Year: 2026
Category: Computer Vision

👁 199 📚 29

Tango: Taming Visual Signals for Efficient Video Large Language Models

Authors: Shukang Yin | Sirui Zhao | Hanchao Wang | Baozhi J...
Journal: arXiv
Year: 2026
Category: Computer Vision

👁 57 📚 13

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Authors: Zhengyang Sun | Yu Chen | Xin Zhou | Xiaofan Li | ...
Journal: arXiv
Year: 2026
Category: Computer Vision

👁 90 📚 8

ETCH-X: Robustify Expressive Body Fitting to Clothed Humans with Composable Datasets

Authors: Xiaoben Li | Jingyi Wu | Zeyu Cai | Yu Siyuan | Bo...
Journal: arXiv
Year: 2026
Category: Computer Vision

👁 188 📚 24

GaussiAnimate: Reconstruct and Rig Animatable Categories with Level of Dynamics

Authors: Jiaxin Wang | Dongxin Lyu | Zeyu Cai | Zhiyang Dou...
Journal: arXiv
Year: 2026
Category: Computer Vision

👁 50 📚 9

MoRight: Motion Control Done Right

Authors: Shaowei Liu | Xuanchi Ren | Tianchang Shen | Huan ...
Journal: arXiv
Year: 2026
Category: Computer Vision

👁 26 📚 17

Action Images: End-to-End Policy Learning via Multiview Video Generation

Authors: Haoyu Zhen | Zixian Gao | Qiao Sun | Yilin Zhao | ...
Journal: arXiv
Year: 2026
Category: Computer Vision

👁 49 📚 15

Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision

Authors: Hyunsoo Cha | Wonjung Woo | Byungjun Kim | Hanbyul...
Journal: arXiv
Year: 2026
Category: Computer Vision

👁 157 📚 21

CoME-VL: Scaling Complementary Multi-Encoder Vision-Language Learning

Authors: Ankan Deria | Komal Kumar | Xilin He | Imran Razza...
Journal: arXiv
Year: 2026
Category: Computer Vision

👁 187 📚 14

Generative World Renderer

Authors: Zheng-Hui Huang | Zhixiang Wang | Jiaming Tan | Ru...
Journal: arXiv
Year: 2026
Category: Computer Vision

👁 117 📚 16

EventHub: Data Factory for Generalizable Event-Based Stereo Networks without Active Sensors

Authors: Luca Bartolomei | Fabio Tosi | Matteo Poggi | Stef...
Journal: arXiv
Year: 2026
Category: Computer Vision

👁 93 📚 5

HippoCamp: Benchmarking Contextual Agents on Personal Computers

Authors: Zhe Yang | Shulin Tian | Kairui Hu | Shuai Liu | H...
Journal: arXiv
Year: 2026
Category: Computer Vision

👁 56 📚 19

SurgNavAR: An Augmented Reality Surgical Navigation Framework for Optical See-Through Head Mounted D...