AITopics

2305.09067

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Mantri, Krishna Sri Ipsit, Sasikumar, Nevasini

Interactive Fashion Content Generation Using LLMs and Latent Diffusion Models

arXiv.org Artificial IntelligenceMay-15-2023

Fashionable image generation aims to synthesize images of diverse fashion prevalent around the globe, helping fashion designers in real-time visualization by giving them a basic customized structure of how a specific design preference would look in real life and what further improvements can be made for enhanced customer satisfaction. Moreover, users can alone interact and generate fashionable images by just giving a few simple prompts. Recently, diffusion models have gained popularity as generative models owing to their flexibility and generation of realistic images from Gaussian noise. Latent diffusion models are a type of generative model that use diffusion processes to model the generation of complex data, such as images, audio, or text. They are called "latent" because they learn a hidden representation, or latent variable, of the data that captures its underlying structure. We propose a method exploiting the equivalence between diffusion models and energy-based models (EBMs) and suggesting ways to compose multiple probability distributions. We describe a pipeline on how our method can be used specifically for new fashionable outfit generation and virtual try-on using LLM-guided text-to-image generation. Our results indicate that using an LLM to refine the prompts to the latent diffusion model assists in generating globally creative and culturally diversified fashion styles and reducing bias.

computer vision, large language model, machine learning, (15 more...)

2306.05182

Country:

Europe > France (0.04)
Asia > India > Karnataka > Bengaluru (0.04)

Genre:

Research Report (0.70)
Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

arXiv.org Artificial IntelligenceMay-15-2023

Uncovering the Potential of ChatGPT for Discourse Analysis in Dialogue: An Empirical Study

Fan, Yaxin, Jiang, Feng

Large Language Models (LLMs) like ChatGPT have proven a great shallow understanding of many traditional NLP tasks, such as translation, summarization, etc. However, its performance on high-level understanding, such as dialogue discourse analysis task that requires a higher level of understanding and reasoning, remains less explored. This study investigates ChatGPT's capabilities in three dialogue discourse tasks: topic segmentation, discourse relation recognition, and discourse parsing, of varying difficulty levels. To adapt ChatGPT to these tasks, we propose discriminative and generative paradigms and introduce the Chain of Thought (COT) approach to improve ChatGPT's performance in more difficult tasks. The results show that our generative paradigm allows ChatGPT to achieve comparative performance in the topic segmentation task comparable to state-of-the-art methods but reveals room for improvement in the more complex tasks of discourse relation recognition and discourse parsing. Notably, the COT can significantly enhance ChatGPT's performance with the help of understanding complex structures in more challenging tasks. Through a series of case studies, our in-depth analysis suggests that ChatGPT can be a good annotator in topic segmentation but has difficulties understanding complex rhetorical structures. We hope these findings provide a foundation for future research to refine dialogue discourse analysis approaches in the era of LLMs.

large language model, machine learning, natural language, (20 more...)

2305.08391

Country:

North America > United States > Ohio (0.04)
North America > United States > Missouri (0.04)
North America > Dominican Republic (0.04)
(5 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

TIME - TechMay-14-2023, 11:00:12 GMT

Microsoft's Satya Nadella Doesn't Think Now Is the Time to Stop on AI

The last year has been characterized by a rush of new artificial intelligence (AI) programs being released into the world since OpenAI, a lab backed by Microsoft, launched ChatGPT in November 2022. Both Microsoft and Google rolled out products in March that they say will use AI to transform work, and IBM's CEO Arvind Krishna said the company's AI tool will be able to reduce 30 to 50% of repetitive office work. Since taking the helm at Microsoft in 2014, at a time when its market dominance with traditional software offerings was waning, Satya Nadella has focused on ensuring the company remains relevant. . The company has invested heavily in Azure, its cloud computing platform, and in AI, pouring at least $13 billion in the leading lab OpenAI. Microsoft's share price has risen nearly tenfold since Nadella became CEO, outperforming the S&P 500, which has merely doubled its value over the same time.

conséquence, economic growth, microsoft, (16 more...)

TIME - Tech

Country:

Europe (0.16)
North America > United States (0.05)

Industry:

Information Technology (1.00)
Banking & Finance > Trading (0.55)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.90)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.75)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.55)

Daily Mail - Science & techMay-14-2023, 09:37:50 GMT

AI do! Meet the couples using artificial intelligence in their WEDDINGS

Writing speeches, sending out invites and finding the perfect dress are among the numerous things to juggle when planning a wedding. But now, many couples are saying'AI do' to ChatGPT, as they rely on artificial intelligence to pull together their big day. One surprised bride-to-be admitted the bot's ability to write touching wedding vows was better than her own when tested in the lead up to her wedding. In a TikTok, Lynnzee Highland from Seattle prompted ChatGPT to write words that were'funny and romantic', adding: 'My fiance really knows how to make me laugh. We love running together even though I can hardly keep up.

artificial intelligence, chatgpt, wedding vow, (14 more...)

Daily Mail - Science & tech

Country:

Europe > Italy (0.05)
Europe > Denmark (0.05)
Asia > Middle East > Republic of Türkiye > Batman Province > Batman (0.05)

Industry: Leisure & Entertainment > Social Events (0.86)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Semantic Composition in Visually Grounded Language Models

Pandey, Rohan

What is sentence meaning and its ideal representation? Much of the expressive power of human language derives from semantic composition, the mind's ability to represent meaning hierarchically & relationally over constituents. At the same time, much sentential meaning is outside the text and requires grounding in sensory, motor, and experiential modalities to be adequately learned. Although large language models display considerable compositional ability, recent work shows that visually-grounded language models drastically fail to represent compositional structure. In this thesis, we explore whether & how models compose visually grounded semantics, and how we might improve their ability to do so. Specifically, we introduce 1) WinogroundVQA, a new compositional visual question answering benchmark, 2) Syntactic Neural Module Distillation, a measure of compositional ability in sentence embedding models, 3) Causal Tracing for Image Captioning Models to locate neural representations vital for vision-language composition, 4) Syntactic MeanPool to inject a compositional inductive bias into sentence embeddings, and 5) Cross-modal Attention Congruence Regularization, a self-supervised objective function for vision-language relation alignment. We close by discussing connections of our work to neuroscience, psycholinguistics, formal semantics, and philosophy.

large language model, machine learning, natural language, (16 more...)

2305.16328

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > India (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(6 more...)

Genre: Research Report (0.81)

Industry:

Education (0.67)
Health & Medicine > Therapeutic Area (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Su, Chia-Yi, Bansal, Aakash, Jain, Vijayanta, Ghanavati, Sepideh, McMillan, Collin

A Language Model of Java Methods with Train/Test Deduplication

This tool demonstration presents a research toolkit for a language model of Java source code. The target audience includes researchers studying problems at the granularity level of subroutines, statements, or variables in Java. In contrast to many existing language models, we prioritize features for researchers including an open and easily-searchable training set, a held out test set with different levels of deduplication from the training set, infrastructure for deduplicating new examples, and an implementation platform suitable for execution on equipment accessible to a relatively modest budget. Our model is a GPT2-like architecture with 350m parameters. Our training set includes 52m Java methods (9b tokens) and 13m StackOverflow threads (10.5b tokens). To improve accessibility of research to more members of the community, we limit local resource requirements to GPUs with 16GB video memory. We provide a test set of held out Java methods that include descriptive comments, including the entire Java projects for those methods. We also provide deduplication tools using precomputed hash tables at various similarity thresholds to help researchers ensure that their own test examples are not in the training set. We make all our tools and data open source and available via Huggingface and Github.

large language model, machine learning, natural language, (16 more...)

2305.08286

Country:

North America > United States > Indiana > St. Joseph County > Notre Dame (0.04)
North America > United States > Maine > Penobscot County > Orono (0.04)
North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Large Language Model Guided Tree-of-Thought

Long, Jieyi

artificial intelligence, language model guided tree-of-thought, large language model, (1 more...)

In this paper, we introduce the Tree-of-Thought (ToT) framework, a novel approach aimed at improving the problem-solving capabilities of auto-regressive large language models (LLMs). The ToT technique is inspired by the human mind's approach for solving complex reasoning tasks through trial and error. In this process, the human mind explores the solution space through a tree-like thought process, allowing for backtracking when necessary. To implement ToT as a software system, we augment an LLM with additional modules including a prompter agent, a checker module, a memory module, and a ToT controller. In order to solve a given problem, these modules engage in a multi-round conversation with the LLM. The memory module records the conversation and state history of the problem solving process, which allows the system to backtrack to the previous steps of the thought-process and explore other directions from there. To verify the effectiveness of the proposed technique, we implemented a ToT-based solver for the Sudoku Puzzle. Experimental results show that the ToT framework can significantly increase the success rate of Sudoku puzzle solving. Our implementation of the ToT-based Sudoku solver is available on GitHub: \url{https://github.com/jieyilong/tree-of-thought-puzzle-solver}.

2305.08291

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Games > Sudoku (0.64)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Thakkar, Gaurish, Preradovic, Nives Mikelic, Tadić, Marko

Croatian Film Review Dataset (Cro-FiReDa): A Sentiment Annotated Dataset of Film Reviews

This paper introduces Cro-FiReDa, a sentiment-annotated dataset for Croatian in the domain of movie reviews. The dataset, which contains over 10,000 sentences, has been annotated at the sentence level. In addition to presenting the overall annotation process, we also present benchmark results based on the transformer-based fine-tuning approach

large language model, machine learning, natural language, (19 more...)

2305.08173

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.05)
Europe > Croatia > Zagreb County > Zagreb (0.05)
(12 more...)

Genre: Research Report (0.64)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)
(2 more...)

AI for Agile development: a Meta-Analysis

Cabrero-Daniel, Beatriz

This study explores the benefits and challenges of integrating Artificial Intelligence with Agile software development methodologies, focusing on improving continuous integration and delivery. A systematic literature review and longitudinal meta-analysis of the retrieved studies was conducted to analyse the role of Artificial Intelligence and it's future applications within Agile software development. The review helped identify critical challenges, such as the need for specialised socio-technical expertise. While Artificial Intelligence holds promise for improved software development practices, further research is needed to better understand its impact on processes and practitioners, and to address the indirect challenges associated with its implementation.

large language model, machine learning, natural language, (18 more...)

2305.08093

Country:

North America > United States > New York > New York County > New York City (0.05)
Europe > Sweden > Vaestra Goetaland > Gothenburg (0.05)
Europe > Netherlands (0.04)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Software Engineering (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.69)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)