AITopics

Country:

North America > United States > Massachusetts (0.28)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.15)

Industry: Information Technology (0.69)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.90)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)

Neural Information Processing Systems, Feb-9-2026, 21:46:31 GMT

My document

diffsketcher, sketch, text prompt, (16 more...)

Country: Asia > Middle East > Republic of Türkiye > Batman Province > Batman (0.04)

Genre: Research Report > New Finding (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Neural Information Processing Systems, Feb-7-2026, 21:24:04 GMT

21f76686538a5f06dc431efea5f475f5-Paper-Conference.pdf

arxiv preprint arxiv, clipdraw, synthesis, (14 more...)

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.15)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > Canada > Quebec > Montreal (0.04)

Industry: Information Technology (0.69)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.70)

Neural Information Processing Systems, Dec-23-2025, 21:51:36 GMT

CLIPDraw: Exploring Text-to-Drawing Synthesis through Language-Image Encoders

clipdraw, exploring text-to-drawing synthesis, name change, (3 more...)

Technology: Information Technology > Artificial Intelligence (0.63)

Neural Information Processing Systems, Oct-8-2025, 10:12:52 GMT

My document

diffsketcher, sketch, text prompt, (16 more...)

Country:

North America > United States > New York (0.04)
Asia > Middle East > Republic of Türkiye > Batman Province > Batman (0.04)

Genre: Research Report > New Finding (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.96)

Neural Information Processing Systems, Oct-10-2024, 07:35:59 GMT

CLIPDraw: Exploring Text-to-Drawing Synthesis through Language-Image Encoders

clipdraw, exploring text-to-drawing synthesis, language-image encoder

Technology: Information Technology > Artificial Intelligence (0.77)

arXiv.org Artificial Intelligence, Oct-1-2024

Khattat: Enhancing Readability and Concept Representation of Semantic Typography

Hussein, Ahmed, Elsetohy, Alaa, Hadhoud, Sama, Bakr, Tameem, Rohaim, Yasser, AlKhamissi, Badr

Designing expressive typography that visually conveys a word's meaning while maintaining readability is a complex task, known as semantic typography. It involves selecting an idea, choosing an appropriate font, and balancing creativity with legibility. We introduce an end-to-end system that automates this process. First, a Large Language Model (LLM) generates imagery ideas for the word, useful for abstract concepts like "freedom." Then, the FontCLIP pre-trained model automatically selects a suitable font based on its semantic understanding of font attributes. The system identifies optimal regions of the word for morphing and iteratively transforms them using a pre-trained diffusion model. A key feature is our OCR-based loss function, which enhances readability and enables simultaneous stylization of multiple characters. We compare our method with other baselines, demonstrating great readability enhancement and versatility across multiple languages and writing scripts.

diffusion model, large language model, machine learning, (20 more...)

2410.03748

Country:

North America > United States > New York > New York County > New York City (0.14)
Africa > Middle East > Egypt (0.05)
Europe > Switzerland > Vaud > Lausanne (0.04)

Genre: Research Report (0.85)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Vinker, Yael, Pajouheshgar, Ehsan, Bo, Jessica Y., Bachmann, Roman Christian, Bermano, Amit Haim, Cohen-Or, Daniel, Zamir, Amir, Shamir, Ariel

CLIPasso: Semantically-Aware Object Sketching

arXiv.org Artificial Intelligence, Feb-11-2022

Abstraction is at the heart of sketching due to the simple and minimal nature of line drawings. Abstraction entails identifying the essential visual properties of an object or scene, which requires semantic understanding and prior knowledge of high-level concepts. Abstract depictions are therefore challenging for artists, and even more so for machines. We present an object sketching method that can achieve different levels of abstraction, guided by geometric and semantic simplifications. While sketch generation methods often rely on explicit sketch datasets for training, we utilize the remarkable ability of CLIP (Contrastive-Language-Image-Pretraining) to distill semantic concepts from sketches and images alike. We define a sketch as a set of Bézier curves and use a differentiable rasterizer to optimize the parameters of the curves directly with respect to a CLIP-based perceptual loss. The abstraction degree is controlled by varying the number of strokes. The generated sketches demonstrate multiple levels of abstraction while maintaining recognizability, underlying structure, and essential visual components of the subject drawn.

abstraction, input image, sketch, (17 more...)

2202.05822

Country:

Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)

Ali, Safinah, Parikh, Devi

Telling Creative Stories Using Generative Visual Aids

arXiv.org Artificial Intelligence, Oct-27-2021

Can visual artworks created using generative visual algorithms inspire human creativity in storytelling? We asked writers to write creative stories from a starting prompt, and provided them with visuals created by generative AI models from the same prompt. Compared to a control group, writers who used the visuals as a story-writing aid wrote significantly more creative, original, complete and visualizable stories, and found the task more fun. Of the generative algorithms used (BigGAN, VQGAN, DALL-E, CLIPDraw), VQGAN was the most preferred. The control group that did not view the visuals did significantly better in integrating the starting prompts. Findings indicate that cross-modality inputs by AI can benefit divergent aspects of creativity in human-AI co-creation, but hinder convergent thinking.

creative story, creativity, vqgan, (11 more...)

2110.14810

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > California > San Mateo County > Menlo Park (0.04)
Europe > France > Bourgogne-Franche-Comté > Doubs > Besançon (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.42)
Health & Medicine > Therapeutic Area > Immunology (0.42)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.71)

Mihai, Daniela, Hare, Jonathon

Shared Visual Representations of Drawing for Communication: How do different biases affect human interpretability and intent?

arXiv.org Artificial Intelligence, Oct-15-2021

We present an investigation into how representational losses can affect the drawings produced by artificial agents playing a communication game. Building upon recent advances, we show that a combination of powerful pretrained encoder networks, with appropriate inductive biases, can lead to agents that draw recognisable sketches, whilst still communicating well. Further, we start to develop an approach to help automatically analyse the semantic content being conveyed by a sketch and demonstrate that current approaches to inducing perceptual biases lead to a notion of objectness being a key feature despite the agent training being self-supervised.

agent, mihai & hare, sketch, (12 more...)

2110.08203

Country:

Europe > United Kingdom > England > Hampshire > Southampton (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)

Genre: Research Report (0.83)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.95)