


SVGCraft: Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout

Banerjee, Ayan, Mathur, Nityanand, Lladós, Josep, Pal, Umapada, Dutta, Anjan

arXiv.org Artificial Intelligence

Generating vector art from text prompts is a challenging vision task, requiring diverse yet realistic depictions of seen as well as unseen entities. However, existing research has been mostly limited to the generation of single objects rather than comprehensive scenes comprising multiple elements. In response, this work introduces SVGCraft, a novel end-to-end framework for creating vector graphics that depict entire scenes from textual descriptions. Using a pre-trained LLM for layout generation from text prompts, the framework introduces a technique for producing masked latents in specified bounding boxes for accurate object placement. It further employs a fusion mechanism for integrating attention maps and a diffusion U-Net for coherent composition, speeding up the drawing process. The resulting SVG is optimized using a pre-trained encoder and an LPIPS loss with opacity modulation to maximize similarity. Additionally, this work explores the potential of primitive shapes for facilitating canvas completion in constrained environments. Through both qualitative and quantitative assessments, SVGCraft is demonstrated to surpass prior work in abstraction, recognizability, and detail, as evidenced by its performance metrics (CLIP-T: 0.4563, Cosine Similarity: 0.6342, Confusion: 0.66, Aesthetic: 6.7832). The code will be available at github.com/SVGCraft.
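
The final optimization stage described in the abstract can be illustrated with a minimal sketch: a toy differentiable rasterizer (soft circles standing in for SVG primitives) whose parameters, including a per-shape opacity, are fitted to a target raster image under an LPIPS loss. All names here are illustrative assumptions, and the real SVGCraft renderer and encoder may differ; the `lpips` package supplies the perceptual loss.

```python
import torch
import lpips  # pip install lpips

def render(centers, radii, colors, opacities, size=64):
    """Toy differentiable rasterizer: alpha-composites soft circles."""
    ys, xs = torch.meshgrid(torch.linspace(0, 1, size),
                            torch.linspace(0, 1, size), indexing="ij")
    canvas = torch.ones(3, size, size)
    for c, r, col, a in zip(centers, radii, colors, opacities):
        dist2 = (xs - c[0]) ** 2 + (ys - c[1]) ** 2
        # Soft circle mask, scaled by a learnable opacity squashed into [0, 1].
        mask = torch.sigmoid(200.0 * (r ** 2 - dist2)) * torch.sigmoid(a)
        canvas = canvas * (1 - mask) + col.view(3, 1, 1) * mask
    return canvas.unsqueeze(0)  # (1, 3, H, W), values roughly in [0, 1]

perceptual = lpips.LPIPS(net="vgg")  # pre-trained encoder backing the LPIPS loss

def fit_to_target(target, n_shapes=8, steps=300, lr=0.05):
    """Fit shape parameters and opacities to a (1, 3, 64, 64) target image."""
    centers = torch.rand(n_shapes, 2, requires_grad=True)
    radii = torch.full((n_shapes,), 0.1, requires_grad=True)
    colors = torch.rand(n_shapes, 3, requires_grad=True)
    opacities = torch.zeros(n_shapes, requires_grad=True)
    opt = torch.optim.Adam([centers, radii, colors, opacities], lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        img = render(centers, radii, colors, opacities)
        loss = perceptual(img, target, normalize=True).mean()  # LPIPS on [0, 1] inputs
        loss.backward()
        opt.step()
    return render(centers, radii, colors, opacities).detach()
```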


Asking the Right Question at the Right Time: Human and Model Uncertainty Guidance to Ask Clarification Questions

Testoni, Alberto, Fernández, Raquel

arXiv.org Artificial Intelligence

Clarification questions are an essential dialogue tool for signalling misunderstanding, ambiguity, and under-specification in language use. While humans learn to resolve uncertainty by asking questions from childhood, modern dialogue systems struggle to generate effective questions. To make progress in this direction, in this work we take a collaborative dialogue task as a testbed and study how model uncertainty relates to human uncertainty -- an as yet under-explored problem. We show that model uncertainty does not mirror human clarification-seeking behavior, which suggests that using human clarification questions as supervision for deciding when to ask may not be the most effective way to resolve model uncertainty. To address this issue, we propose an approach to generating clarification questions based on model uncertainty estimation, compare it to several alternatives, and show that it leads to significant improvements in task success. Our findings highlight the importance of equipping dialogue systems with the ability to assess their own uncertainty and exploit it in interaction.
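
As a concrete illustration of the "ask when uncertain" idea, the sketch below thresholds the entropy of a model's distribution over candidate interpretations; the paper's actual uncertainty estimator and decision rule may differ, and the threshold here is an arbitrary assumption.

```python
import math

def entropy(probs):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def decide_action(candidate_probs, threshold=0.8):
    """Ask a clarification question if the model is too uncertain,
    otherwise commit to the most probable candidate."""
    if entropy(candidate_probs) > threshold:
        return "ask_clarification"
    best = max(range(len(candidate_probs)), key=candidate_probs.__getitem__)
    return f"commit:{best}"

# A peaked distribution commits; a flat one triggers a question.
print(decide_action([0.9, 0.05, 0.05]))   # commit:0
print(decide_action([0.3, 0.35, 0.35]))   # ask_clarification
```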


Taking Action Towards Graceful Interaction: The Effects of Performing Actions on Modelling Policies for Instruction Clarification Requests

Madureira, Brielen, Schlangen, David

arXiv.org Artificial Intelligence

Clarification requests are a mechanism to help solve communication problems, e.g. due to ambiguity or underspecification, in instruction-following interactions. Despite their importance, even skilful models struggle to produce or interpret such repair acts. In this work, we test three hypotheses concerning the effects of action taking as an auxiliary task in modelling Instruction Clarification Request (iCR) policies. Contrary to initial expectations, we conclude that its contribution to learning an iCR policy is limited, but that some information can still be extracted from prediction uncertainty. We present further evidence that even well-motivated, Transformer-based models fail to learn good policies for when to ask iCRs, while the task of determining what to ask about can be modelled more successfully. Considering the implications of these findings, we further discuss the shortcomings of the data-driven paradigm for learning meta-communication acts.
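
The auxiliary-task setup the abstract tests could look roughly like the following multi-task sketch: a shared encoder with a main head deciding whether to ask an iCR and an auxiliary head predicting the next action. The architecture and loss weighting are illustrative assumptions, not the authors' exact model.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ICRPolicy(nn.Module):
    """Shared encoder with a main iCR-decision head and an auxiliary action head."""
    def __init__(self, input_dim=512, hidden=256, n_actions=20):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(input_dim, hidden), nn.ReLU())
        self.icr_head = nn.Linear(hidden, 2)             # ask an iCR vs. continue
        self.action_head = nn.Linear(hidden, n_actions)  # auxiliary: next action

    def forward(self, x):
        h = self.encoder(x)
        return self.icr_head(h), self.action_head(h)

def multitask_loss(icr_logits, action_logits, icr_y, action_y, aux_weight=0.5):
    # Main objective plus a down-weighted auxiliary action-prediction term.
    return (F.cross_entropy(icr_logits, icr_y)
            + aux_weight * F.cross_entropy(action_logits, action_y))
```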


SynCDR: Training Cross Domain Retrieval Models with Synthetic Data

Mishra, Samarth, Saenko, Kate, Saligrama, Venkatesh

arXiv.org Artificial Intelligence

In cross-domain retrieval, a model must identify images of the same semantic category across two visual domains. For instance, given a sketch of an object, a model needs to retrieve a real image of it from an online store's catalog. A standard approach to this problem is to learn a feature space of images in which Euclidean distances reflect similarity. Even without human annotations, which may be expensive to acquire, prior methods function reasonably well when trained on unlabeled images. Our problem setting takes this further to scenarios where the two domains do not necessarily share any common categories in the training data. This can occur when the two domains in question come from different versions of some biometric sensor recording the identities of different people. We posit a simple solution: generate synthetic data to fill in the missing category examples across domains. We do this via category-preserving translation of images from one visual domain to another. We compare approaches specifically trained for this translation for a pair of domains, as well as those that can use large-scale pre-trained text-to-image diffusion models via prompts, and find that the latter generate better replacement synthetic data, leading to more accurate cross-domain retrieval models. Code for our work is available at https://github.com/samarth4149/SynCDR.
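
The retrieval step the abstract assumes can be sketched as nearest-neighbor lookup in the learned feature space, matching by Euclidean distance. The features below are random stand-ins; in practice they would come from embedding the sketch and the catalog images with the trained model.

```python
import torch

def retrieve(query_feat, gallery_feats, k=5):
    """Indices of the k gallery features closest to the query in Euclidean distance."""
    dists = torch.cdist(query_feat.unsqueeze(0), gallery_feats).squeeze(0)
    return torch.topk(dists, k, largest=False).indices

# Example: one sketch query vs. 100 catalog photos, with stand-in features.
query = torch.randn(128)         # would come from embedding the sketch
gallery = torch.randn(100, 128)  # would come from embedding the catalog images
print(retrieve(query, gallery))
```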


Multi-Domain Long-Tailed Learning by Augmenting Disentangled Representations

Yang, Xinyu, Yao, Huaxiu, Zhou, Allan, Finn, Chelsea

arXiv.org Artificial Intelligence

There is an inescapable long-tailed class-imbalance issue in many real-world classification problems. Current methods for addressing this problem only consider scenarios where all examples come from the same distribution. However, in many cases, there are multiple domains, each with a distinct class imbalance. We study this multi-domain long-tailed learning problem and aim to produce a model that generalizes well across all classes and domains. Towards that goal, we introduce TALLY, a method that addresses this multi-domain long-tailed learning problem. Built upon a proposed selective balanced sampling strategy, TALLY achieves this by mixing the semantic representation of one example with the domain-associated nuisances of another, producing a new representation for use as data augmentation. To improve the disentanglement of semantic representations, TALLY further utilizes a domain-invariant class prototype that averages out domain-specific effects. We evaluate TALLY on several benchmarks and real-world datasets and find that it consistently outperforms other state-of-the-art methods under both subpopulation and domain shift. Our code and data have been released at https://github.com/huaxiuyao/TALLY.
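
A minimal sketch of the augmentation idea: each example's representation is split into a semantic part and a domain "nuisance" part, and new training examples pair one example's semantics with another's nuisances. The fixed-index split and the prototype below are illustrative assumptions; TALLY's actual disentanglement and prototype computation are more involved.

```python
import torch

def swap_nuisance(feats_a, feats_b, sem_dim):
    """Pair batch A's semantic dimensions with batch B's domain nuisances."""
    semantic = feats_a[:, :sem_dim]   # class-relevant content (keeps A's labels)
    nuisance = feats_b[:, sem_dim:]   # domain-specific style/statistics
    return torch.cat([semantic, nuisance], dim=1)

def class_prototype(feats, labels, cls, sem_dim):
    """Domain-invariant prototype: mean semantic vector of one class,
    averaging out domain-specific effects across its examples."""
    return feats[labels == cls, :sem_dim].mean(dim=0)

# Example: augment tail-class examples from domain A with nuisances from domain B.
fa, fb = torch.randn(4, 64), torch.randn(4, 64)
augmented = swap_nuisance(fa, fb, sem_dim=32)  # labels follow batch A
```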


"Are you telling me to put glasses on the dog?'' Content-Grounded Annotation of Instruction Clarification Requests in the CoDraw Dataset

Madureira, Brielen, Schlangen, David

arXiv.org Artificial Intelligence

Instruction Clarification Requests (iCRs) are a mechanism for solving communication problems and are highly functional in instruction-following interactions. Recent work has argued that the CoDraw dataset is a valuable source of naturally occurring iCRs. Beyond identifying when iCRs should be made, dialogue models should also be able to generate them with suitable form and content. In this work, we introduce CoDraw-iCR (v2), extending the existing iCR annotation with fine-grained information grounded in the underlying dialogue game items and possible actions. Our annotation can serve to model and evaluate the repair capabilities of dialogue agents.


GitHub - aiskunks/Generative_AI_Clipart: Generative AI Clipart

#artificialintelligence

Visual Notes for Machine Learning: this folder contains images produced with generative AI that illustrate various concepts in AI and machine learning. We use them to make visual notes, for slide decks, and for articles. They can be used for any purpose, in case somebody finds them useful.


What is being transferred in transfer learning?

Neyshabur, Behnam, Sedghi, Hanie, Zhang, Chiyuan

arXiv.org Machine Learning

One desired capability of machines is the ability to transfer knowledge of one domain to another, where data is (usually) scarce. Despite the wide adoption of transfer learning in various deep learning applications, we still do not understand what enables a successful transfer and which parts of the network are responsible for it. In this paper, we provide new tools and analyses to address these fundamental questions. Through a series of analyses on transferring to block-shuffled images, we separate the effect of feature reuse from that of learning the low-level statistics of the data, and show that some of the benefit of transfer learning comes from the latter. We also show that when training from pre-trained weights, the model stays in the same basin of the loss landscape, and different instances of such a model are similar in feature space and close in parameter space.
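
The block-shuffling probe the abstract describes can be sketched as follows: the image is cut into a grid of blocks and the blocks are randomly permuted, destroying high-level structure (and hence feature reuse) while preserving low-level statistics. The block size and permutation details here are illustrative, not necessarily the paper's exact settings.

```python
import numpy as np

def block_shuffle(img, block=8, seed=0):
    """Randomly permute (block x block) patches of an H x W x C image."""
    rng = np.random.default_rng(seed)
    h, w = img.shape[:2]
    assert h % block == 0 and w % block == 0
    gh, gw = h // block, w // block
    # Cut into a (gh x gw) grid of blocks, flatten the grid, permute, reassemble.
    blocks = (img.reshape(gh, block, gw, block, -1)
                 .swapaxes(1, 2)
                 .reshape(gh * gw, block, block, -1))
    blocks = blocks[rng.permutation(gh * gw)]
    return (blocks.reshape(gh, gw, block, block, -1)
                  .swapaxes(1, 2)
                  .reshape(h, w, -1))

# Example: shuffle a 64x64 RGB image into 8x8 blocks.
shuffled = block_shuffle(np.zeros((64, 64, 3)), block=8)
```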


This Google AI Turns Your Bad Doodles Into Polished Drawings

#artificialintelligence

This week, Google released a new AI experiment called AutoDraw, which turns your half-baked scribbles into poster-ready clipart. The tool uses machine learning to guess what you're trying to draw and then gives you the option to replace your bad drawing with more polished ones created by illustrators and design studios, including HAWRAF, Erin Butner, Julia Melograna, Pei Liew, Simone Noronha, Tori Hinn, and Selman Design. It's a simple tool that gives those of us without fancy (and expensive) design programs a way to make reasonably professional graphics. In my first attempt to draw something, I started to draw a person. After I sketched a blob that distantly resembled a foot, the program quickly showed me several different types of feet and socks.