Goto

Collaborating Authors

 vector graphic


cdf1035c34ec380218a8cc9a43d438f9-Paper.pdf

Neural Information Processing Systems

In contrast to raster graphics which are widely used in image recognition, vector graphics can be scaled up or down into any resolution without aliasing or information loss, due totheanalytic representation oftheprimitivesinthe document.



MatthewFisher

Neural Information Processing Systems

In the first case, the non-standard representation prevents benefiting from latest network architectures for neural representations; while, in the latter case, therasterized representation, when encoded vianetworks, results inlossof data fidelity, as font-specific discontinuities like edges and corners are difficult torepresent using neural networks.


Recognizing Vector Graphics without Rasterization

Neural Information Processing Systems

In this paper, we consider a different data format for images: vector graphics. In contrast to raster graphics which are widely used in image recognition, vector graphics can be scaled up or down into any resolution without aliasing or information loss, due to the analytic representation of the primitives in the document. Furthermore, vector graphics are able to give extra structural information on how low-level elements group together to form high level shapes or structures. These merits of graphic vectors have not been fully leveraged in existing methods. To explore this data format, we target on the fundamental recognition tasks: object localization and classification. We propose an efficient CNN-free pipeline that does not render the graphic into pixels (i.e.





DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation

Neural Information Processing Systems

Despite recent success of rasterized image generation and content creation, little effort has been directed towards generation of vector graphics. Y et, vector images, often in the form of Scalable V ector Graphics [20] (SVG), have become a standard in digital graphics, publication-ready image assets, and web-animations. The main advantage over their rasterized counterpart is their scaling ability, making the same image file suitable for both tiny web-icons or billboard-scale graphics.


VectorEdits: A Dataset and Benchmark for Instruction-Based Editing of Vector Graphics

Kuchař, Josef, Kadlčík, Marek, Spiegel, Michal, Štefánik, Michal

arXiv.org Artificial Intelligence

We introduce a large-scale dataset for instruction-guided vector image editing, consisting of over 270,000 pairs of SVG images paired with natural language edit instructions. Our dataset enables training and evaluation of models that modify vector graphics based on textual commands. We describe the data collection process, including image pairing via CLIP similarity and instruction generation with vision-language models. Initial experiments with state-of-the-art large language models reveal that current methods struggle to produce accurate and valid edits, underscoring the challenge of this task. To foster research in natural language-driven vector graphic generation and editing, we make our resources created within this work publicly available.


From Fragment to One Piece: A Survey on AI-Driven Graphic Design

Zou, Xingxing, Zhang, Wen, Zhao, Nanxuan

arXiv.org Artificial Intelligence

This survey provides a comprehensive overview of the advancements in Artificial Intelligence in Graphic Design (AIGD), focusing on integrating AI techniques to support design interpretation and enhance the creative process. We categorize the field into two primary directions: perception tasks, which involve understanding and analyzing design elements, and generation tasks, which focus on creating new design elements and layouts. The survey covers various subtasks, including visual element perception and generation, aesthetic and semantic understanding, layout analysis, and generation. We highlight the role of large language models and multimodal approaches in bridging the gap between localized visual features and global design intent. Despite significant progress, challenges remain to understanding human intent, ensuring interpretability, and maintaining control over multilayered compositions. This survey serves as a guide for researchers, providing information on the current state of AIGD and potential future directions\footnote{https://github.com/zhangtianer521/excellent\_Intelligent\_graphic\_design}.