AI & ML interests

Multimodal Large Language Models, Unified SVG Tasks

Recent Activity

We are the InternSVG team from the Shanghai AI Laboratory, dedicated to empowering the InternVL series models with unified capabilities for SVG vector graphic understanding, editing, and generation.

Current Work:

InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models

The InternSVG Family — a comprehensive suite that unifies data, benchmarks, and models for SVG understanding, editing, and generation. It consists of:

🧩 SAgoge — the largest and most diverse multimodal SVG dataset, covering icons, illustrations, chemistry diagrams, and dynamic animations;

šŸ† SArena — a companion benchmark offering unified task definitions and standardized evaluation protocols across SVG domains;

šŸ¤– InternSVG Models — multimodal large language models trained for SVG understanding, editing, and generation.

Project Links

🌐 Project Page: https://hmwang2002.github.io/release/internsvg/

šŸ“„ ArXiv Paper: https://arxiv.org/abs/2510.11341

šŸ’» GitHub Repository: https://github.com/hmwang2002/InternSVG

šŸ“Š SArena Benchmark: https://huggingface.co/datasets/InternSVG/SArena

šŸ“¦ SAgoge Dataset and InternSVG Model Weights — coming soon

models 0

None public yet