MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation
Authors
Yan Li | Zezi Zeng | Yifan Yang | Yuqing Yang | Ning Liao | Weiwei Guo | Lili Qiu | Mingxi Cheng | Qi Dai | Zhendong Wang | Zhengyuan Yang | Xue Yang | Ji Li | Lijuan Wang | Chong Luo
Journal
No journal information available
Year
2026
Category
Country
-
Abstract
The rapid progress of Artificial Intelligence Generated Content (AIGC) tools enables images, videos, and visualizations to be created on demand for webpage design, offering a flexible and increasingly adopted paradigm for modern UI/UX. However, directly integrating such tools into automated webpage generation often leads to style inconsistency and poor global coherence, as elements are generated in isolation. We propose MM-WebAgent, a hierarchical agentic framework for multimodal webpage generation that coordinates AIGC-based element generation through hierarchical planning and iterative self-reflection. MM-WebAgent jointly optimizes global layout, local multimodal content, and their integration, producing coherent and visually consistent webpages. We further introduce a benchmark for multimodal webpage generation and a multi-level evaluation protocol for systematic assessment. Experiments demonstrate that MM-WebAgent outperforms code-generation and agent-based baselines, especially on multimodal element generation and integration. Code & Data: https://aka.ms/mm-webagent.
Article Statistics
Basic Stats
Views: 39
Downloads: 0
Citations: 24