Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language 模型 (Model)s
Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language Models

作者
Authors Cheng-Yu Yang | Shao-Yuan Lo | Yu-Lun Liu

期刊
Journal arXiv

年份
Year 2026

分类
Category 计算机视觉
Computer Vision

国家
Country -

🔗 访问原文
🔗 Access Paper

📝 摘要
Abstract

Vision-language models (VLMs) project images into hundreds to thousands of visual tokens, making decoder inference expensive in both attention computation and KV-cache memory. Existing visual-token reduction methods largely follow a rank-and-remove paradigm: they score visual tokens, keep a compact subset, and permanently discard the rest. We show that this irreversible action is fragile because visual-token importance changes across decoder depth; tokens ranked low at one stage may become relevant in later layers, especially for grounding-sensitive queries. We propose Reroute, a training-free plug-in that replaces removal with recoverable routing. At each routing stage, selected vision tokens pass through decoder blocks, while deferred tokens bypass the stage and re-enter the candidate pool at the next routing decision. Reroute reuses existing attention-score ranking rules and stage-wise schedules, preserving the theoretical TFLOPs and KV-cache budget class of the pruning method it augments. Across FastV, PDrop, and Nüwa variants on LLaVA-1.5 and Qwen backbones, reroute improves grounding under aggressive token reduction while maintaining general VQA performance. These results suggest that VLM token reduction should not be viewed only as irreversible pruning, but also as recoverable routing. The code can be found here: https://github.com/elmma/mllm-reroute/

📊 文章统计
Article Statistics

基础数据
Basic Stats

37 浏览
Views

0 下载
Downloads

29 引用
Citations

引用趋势
Citation Trend

阅读国家分布
Country Distribution

阅读机构分布
Institution Distribution

月度浏览趋势
Monthly Views

影响因子分析
Impact Analysis

8.20 综合评分
Overall Score

引用影响力
Citation Impact

浏览热度
View Popularity

下载频次
Download Frequency

Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language 模型 (Model)s
Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language Models

📝 摘要
Abstract

📊 文章统计
Article Statistics

基础数据
Basic Stats

引用趋势
Citation Trend

阅读国家分布
Country Distribution

阅读机构分布
Institution Distribution

月度浏览趋势
Monthly Views

相关关键词
Related Keywords

影响因子分析
Impact Analysis

📄 相关文章
Related Articles

Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language 模型 (Model)sReroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language Models

📝 摘要Abstract

📊 文章统计Article Statistics

基础数据Basic Stats

引用趋势Citation Trend

阅读国家分布Country Distribution

阅读机构分布Institution Distribution

月度浏览趋势Monthly Views

相关关键词Related Keywords

影响因子分析Impact Analysis

📄 相关文章Related Articles

海洋智能分析Ocean AI Analysis

Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language 模型 (Model)s
Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language Models

📝 摘要
Abstract

📊 文章统计
Article Statistics

基础数据
Basic Stats

引用趋势
Citation Trend

阅读国家分布
Country Distribution

阅读机构分布
Institution Distribution

月度浏览趋势
Monthly Views

相关关键词
Related Keywords

影响因子分析
Impact Analysis

📄 相关文章
Related Articles