DV-World:真实世界情景中数据可视化代理的基准化
DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios
作者
Authors
Jinxiang Meng | Shaoping Huang | Fangyu Lei | Jingyu Guo | Haoxiang Liu | Jiahao Su | Sihan Wang | Yao Wang | Enrui Wang | Ye Yang | Hongze Chai | Jinming Lv | Anbang Yu | Huangjing Zhang | Yitong Zhang | Yiming Huang | Zeyao Ma | Shizhu He | Jun Zhao | Kang Liu
期刊
Journal
暂无期刊信息
年份
Year
2026
分类
Category
国家
Country
-
📝 摘要
Abstract
Real-world data visualization (DV) requires native environmental grounding, cross-platform evolution, and proactive intent alignment. Yet, existing benchmarks often suffer from code-sandbox confinement, single-language creation-only tasks, and assumption of perfect intent. To bridge these gaps, we introduce DV-World, a benchmark of 260 tasks designed to evaluate DV agents across real-world professional lifecycles. DV-World spans three domains: DV-Sheet for native spreadsheet manipulation including chart and dashboard creation as well as diagnostic repair; DV-Evolution for adapting and restructuring reference visual artifacts to fit new data across diverse programming paradigms and DV-Interact for proactive intent alignment with a user simulator that mimics real-world ambiguous requirements. Our hybrid evaluation framework integrates Table-value Alignment for numerical precision and MLLM-as-a-Judge with rubrics for semantic-visual assessment. Experiments reveal that state-of-the-art models achieve less than 50% overall performance, exposing critical deficits in handling the complex challenges of real-world data visualization. DV-World provides a realistic testbed to steer development toward the versatile expertise required in enterprise workflows. Our data and code are available at \href{https://github.com/DA-Open/DV-World}{this project page}.
📊 文章统计
Article Statistics
基础数据
Basic Stats
172
浏览
Views
0
下载
Downloads
23
引用
Citations
引用趋势
Citation Trend
阅读国家分布
Country Distribution
阅读机构分布
Institution Distribution
月度浏览趋势
Monthly Views