CartoAgent: a multimodal large language model-powered multi-agent cartographic framework for map style transfer and evaluation

May 15, 2025 · Declared Dead · 🏛 International Journal of Geographical Information Science

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Chenglong Wang, Yuhao Kang, Zhaoya Gong, Pengjun Zhao, Yu Feng, Wenjia Zhang, Ge Li arXiv ID 2505.09936 Category cs.HC: Human-Computer Interaction Cross-listed cs.GR, cs.MA, cs.MM Citations 14 Venue International Journal of Geographical Information Science Last Checked 4 months ago

Abstract

The rapid development of generative artificial intelligence (GenAI) presents new opportunities to advance the cartographic process. Previous studies have either overlooked the artistic aspects of maps or faced challenges in creating both accurate and informative maps. In this study, we propose CartoAgent, a novel multi-agent cartographic framework powered by multimodal large language models (MLLMs). This framework simulates three key stages in cartographic practice: preparation, map design, and evaluation. At each stage, different MLLMs act as agents with distinct roles to collaborate, discuss, and utilize tools for specific purposes. In particular, CartoAgent leverages MLLMs' visual aesthetic capability and world knowledge to generate maps that are both visually appealing and informative. By separating style from geographic data, it can focus on designing stylesheets without modifying the vector-based data, thereby ensuring geographic accuracy. We applied CartoAgent to a specific task centered on map restyling-namely, map style transfer and evaluation. The effectiveness of this framework was validated through extensive experiments and a human evaluation study. CartoAgent can be extended to support a variety of cartographic design decisions and inform future integrations of GenAI in cartography.