AITopics

2310.17567

Country:

North America > United States > New York (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
Asia > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)

Genre:

Research Report (1.00)
Personal > Interview (0.46)

Industry:

Education (1.00)
Government (0.67)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Gholami, Peyman, Xiao, Robert

Diffusion Brush: A Latent Diffusion Model-based Editing Tool for AI-generated Images

arXiv.org Artificial IntelligenceOct-26-2023

Text-to-image generative models have made remarkable advancements in generating high-quality images. However, generated images often contain undesirable artifacts or other errors due to model limitations. Existing techniques to fine-tune generated images are time-consuming (manual editing), produce poorly-integrated results (inpainting), or result in unexpected changes across the entire image (variation selection and prompt fine-tuning). In this work, we present Diffusion Brush, a Latent Diffusion Model-based (LDM) tool to efficiently fine-tune desired regions within an AI-synthesized image. Our method introduces new random noise patterns at targeted regions during the reverse diffusion process, enabling the model to efficiently make changes to the specified regions while preserving the original context for the rest of the image. We evaluate our method's usability and effectiveness through a user study with artists, comparing our technique against other state-of-the-art image inpainting techniques and editing software for fine-tuning AI-generated imagery.

adjustment, diffusion brush, participant, (14 more...)

2306.00219

Country:

North America > Canada > British Columbia (0.04)
Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)

Genre:

Research Report (0.64)
Questionnaire & Opinion Survey (0.57)
Personal (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning

Sprague, Zayne, Ye, Xi, Bostrom, Kaj, Chaudhuri, Swarat, Durrett, Greg

While large language models (LLMs) equipped with techniques like chain-of-thought prompting have demonstrated impressive capabilities, they still fall short in their ability to reason robustly in complex settings. However, evaluating LLM reasoning is challenging because system capabilities continue to grow while benchmark datasets for tasks like logical deduction have remained static. We introduce MuSR, a dataset for evaluating language models on multistep soft reasoning tasks specified in a natural language narrative. This dataset has two crucial features. First, it is created through a novel neurosymbolic synthetic-to-natural generation algorithm, enabling the construction of complex reasoning instances that challenge GPT-4 (e.g., murder mysteries roughly 1000 words in length) and which can be scaled further as more capable LLMs are released. Second, our dataset instances are free text narratives corresponding to real-world domains of reasoning; this makes it simultaneously much more challenging than other synthetically-crafted benchmarks while remaining realistic and tractable for human annotators to solve with high accuracy. We evaluate a range of LLMs and prompting techniques on this dataset and characterize the gaps that remain for techniques like chain-of-thought to perform robust reasoning.

dataset, reasoning, winston, (16 more...)

2310.16049

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
North America > Canada > Ontario > Toronto (0.04)
(4 more...)

Genre:

Research Report (1.00)
Personal > Interview (0.45)
Personal > Obituary (0.45)

Industry:

Transportation > Air (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

RAPL: A Relation-Aware Prototype Learning Approach for Few-Shot Document-Level Relation Extraction

Meng, Shiao, Hu, Xuming, Liu, Aiwei, Li, Shu'ang, Ma, Fukun, Yang, Yawen, Wen, Lijie

How to identify semantic relations among entities in a document when only a few labeled documents are available? Few-shot document-level relation extraction (FSDLRE) is crucial for addressing the pervasive data scarcity problem in real-world scenarios. Metric-based meta-learning is an effective framework widely adopted for FSDLRE, which constructs class prototypes for classification. However, existing works often struggle to obtain class prototypes with accurate relational semantics: 1) To build prototype for a target relation type, they aggregate the representations of all entity pairs holding that relation, while these entity pairs may also hold other relations, thus disturbing the prototype. 2) They use a set of generic NOTA (none-of-the-above) prototypes across all tasks, neglecting that the NOTA semantics differs in tasks with different target relation types. In this paper, we propose a relation-aware prototype learning method for FSDLRE to strengthen the relational semantics of prototype representations. By judiciously leveraging the relation descriptions and realistic NOTA instances as guidance, our method effectively refines the relation prototypes and generates task-specific NOTA prototypes. Extensive experiments demonstrate that our method outperforms state-of-the-art approaches by average 2.61% $F_1$ across various settings of two FSDLRE benchmarks.

extraction, prototype, relation extraction, (15 more...)

2310.15743

Country:

South America > Venezuela (0.04)
Oceania > Australia > Australian Capital Territory > Canberra (0.04)
North America > United States > Connecticut > Windham County > Windham (0.04)
(2 more...)

Genre:

Research Report (0.84)
Personal (0.67)

Industry:

Media (1.00)
Leisure & Entertainment (1.00)
Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Abdin, Marah I, Gunasekar, Suriya, Chandrasekaran, Varun, Li, Jerry, Yuksekgonul, Mert, Peshawaria, Rahee Ghosh, Naik, Ranjita, Nushi, Besmira

KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval

We study the ability of state-of-the art models to answer constraint satisfaction queries for information retrieval (e.g., 'a list of ice cream shops in San Diego'). In the past, such queries were considered to be tasks that could only be solved via web-search or knowledge bases. More recently, large language models (LLMs) have demonstrated initial emergent abilities in this task. However, many current retrieval benchmarks are either saturated or do not measure constraint satisfaction. Motivated by rising concerns around factual incorrectness and hallucinations of LLMs, we present KITAB, a new dataset for measuring constraint satisfaction abilities of language models. KITAB consists of book-related data across more than 600 authors and 13,000 queries, and also offers an associated dynamic data collection and constraint verification approach for acquiring similar test data for other authors. Our extended experiments on GPT4 and GPT3.5 characterize and decouple common failure modes across dimensions such as information popularity, constraint types, and context availability. Results show that in the absence of context, models exhibit severe limitations as measured by irrelevant information, factual errors, and incompleteness, many of which exacerbate as information popularity decreases. While context availability mitigates irrelevant information, it is not helpful for satisfying constraints, identifying fundamental barriers to constraint satisfaction. We open source our contributions to foster further research on improving constraint satisfaction abilities of future models.

book constraint, constraint, query, (15 more...)

2310.15511

Country:

North America > United States > California > San Diego County > San Diego (0.24)
North America > Central America (0.04)
South America > Uruguay (0.04)
(14 more...)

Genre:

Personal > Honors (0.46)
Research Report > New Finding (0.34)
Research Report > Promising Solution (0.34)

Industry:

Government (0.46)
Consumer Products & Services > Food, Beverage, Tobacco & Cannabis (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Nousi, Paraskevi, Avramelou, Loukia, Rodinos, Georgios, Tzelepi, Maria, Manousis, Theodoros, Tsampazis, Konstantinos, Stefanidis, Kyriakos, Spanos, Dimitris, Kirtas, Manos, Tosidis, Pavlos, Tsantekidis, Avraam, Passalis, Nikolaos, Tefas, Anastasios

Leveraging Deep Learning and Online Source Sentiment for Financial Portfolio Management

Financial markets analysis has been and remains a topic of intense research interest since the seminal work of Markowitz [1] detailing his theory on portfolio choice, for which he was awarded the Nobel Prize in 1990. The rapid advancements of Machine Learning (ML) and, more specifically those made in the field of Deep Learning (DL) and Deep Reinforcement Learning (DRL), further fueled interest in the field. Financial markets analysts began using ML-based techniques and combining them with their own knowledge of the field [2]. As early as 1992, Neural Networks (NNs) were already being used for equity index futures trading [3]. More recently, DL research in financial market analysis has focused on high frequency trading, i.e., an algorithmic financial trading method where high speeds and large volumes are the main characteristics. The kind of data used in works that focus on this type of trading include Limit Order Book (LOB) data [4] as well as candle data for assets such as FOREX or Cryptocurrencies [5]. Candle data contain the Open, High, Low and Close prices for assets in a requested frequency, e.g., at the minute or hour level. Price forecasting is a first step towards solving the very complex task of portfolio management, and has proved to be a sufficiently difficult problem to tackle itself. One way to sufficiently solve it is by transforming the problem into one of classification, i.e., predicting the price movement instead of its actual value in the next step [4].

agent, information, nikolao passalis, (15 more...)

2309.16679

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > Greece > Central Macedonia > Thessaloniki (0.04)
North America > Canada (0.04)
Asia (0.04)

Genre:

Overview (1.00)
Personal > Honors (0.54)
Research Report > New Finding (0.46)

Industry: Banking & Finance > Trading (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Models

Kim, Gangwoo, Kim, Sungdong, Jeon, Byeongguk, Park, Joonsuk, Kang, Jaewoo

Questions in open-domain question answering are often ambiguous, allowing multiple interpretations. One approach to handling them is to identify all possible interpretations of the ambiguous question (AQ) and to generate a long-form answer addressing them all, as suggested by Stelmakh et al., (2022). While it provides a comprehensive response without bothering the user for clarification, considering multiple dimensions of ambiguity and gathering corresponding knowledge remains a challenge. To cope with the challenge, we propose a novel framework, Tree of Clarifications (ToC): It recursively constructs a tree of disambiguations for the AQ -- via few-shot prompting leveraging external knowledge -- and uses it to generate a long-form answer. ToC outperforms existing baselines on ASQA in a few-shot setup across the metrics, while surpassing fully-supervised baselines trained on the whole training set in terms of Disambig-F1 and Disambig-ROUGE. Code is available at https://github.com/gankim/tree-of-clarifications.

clarification, disambiguation, long-form answer, (15 more...)

2310.14696

Country:

Europe > Russia (0.05)
Asia > Russia (0.05)
Asia > Middle East > Qatar (0.04)
(10 more...)

Genre:

Research Report (0.82)
Personal > Honors (0.46)

Industry:

Leisure & Entertainment > Sports > Soccer (1.00)
Leisure & Entertainment > Sports > Baseball (1.00)
Media > Television (0.70)
Media > Film (0.69)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Lee, V Vien, van der Lubbe, Stephanie C. C., Goh, Lay Hoon, Valderas, Jose M.

Harnessing ChatGPT for thematic analysis: Are we ready?

ChatGPT is an advanced natural language processing tool with growing applications across various disciplines in medical research. Thematic analysis, a qualitative research method to identify and interpret patterns in data, is one application that stands to benefit from this technology. This viewpoint explores the utilization of ChatGPT in three core phases of thematic analysis within a medical context: 1) direct coding of transcripts, 2) generating themes from a predefined list of codes, and 3) preprocessing quotes for manuscript inclusion. Additionally, we explore the potential of ChatGPT to generate interview transcripts, which may be used for training purposes. We assess the strengths and limitations of using ChatGPT in these roles, highlighting areas where human intervention remains necessary. Overall, we argue that ChatGPT can function as a valuable tool during analysis, enhancing the efficiency of the thematic analysis and offering additional insights into the qualitative data.

chatgpt, thematic analysis, transcript, (14 more...)

2310.14545

Country:

North America > United States (0.14)
Asia > Singapore > Central Region > Singapore (0.04)
Europe > United Kingdom (0.04)
(5 more...)

Genre:

Research Report (0.83)
Personal > Interview (0.68)

Industry:

Health & Medicine > Health Care Technology > Medical Record (0.68)
Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (0.55)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Chatzimparmpas, Angelos, Martins, Rafael M., Telea, Alexandru C., Kerren, Andreas

DeforestVis: Behavior Analysis of Machine Learning Models with Surrogate Decision Stumps

As the complexity of machine learning (ML) models increases and their application in different (and critical) domains grows, there is a strong demand for more interpretable and trustworthy ML. A direct, model-agnostic, way to interpret such models is to train surrogate models, such as rule sets and decision trees, that sufficiently approximate the original ones while being simpler and easier-to-explain. Yet, rule sets can become very lengthy, with many if-else statements, and decision tree depth grows rapidly when accurately emulating complex ML models. In such cases, both approaches can fail to meet their core goal, providing users with model interpretability. To tackle this, we propose DeforestVis, a visual analytics tool that offers user-friendly summarization of the behavior of complex ML models by providing surrogate decision stumps (one-level decision trees) generated with the adaptive boosting (AdaBoost) technique. DeforestVis helps users to explore the complexity vs fidelity trade-off by incrementally generating more stumps, creating attribute-based explanations with weighted stumps to justify decision making, and analyzing the impact of rule overriding on training instance allocation between one or more stumps. An independent test set allows users to monitor the effectiveness of manual rule changes and form hypotheses based on case-by-case analyses. We show the applicability and usefulness of DeforestVis with two use cases and expert interviews with data analysts and model developers.

decision stump, stump, surrogate model, (16 more...)

2304.00133

Country:

North America > United States > Wisconsin (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
(3 more...)

Genre:

Research Report (1.00)
Personal > Interview (0.66)

Industry: Health & Medicine > Therapeutic Area (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Chung, Chanyoung, Whang, Joyce Jiyoung

Learning Representations of Bi-level Knowledge Graphs for Reasoning beyond Link Prediction

Knowledge graphs represent known facts using triplets. While existing knowledge graph embedding methods only consider the connections between entities, we propose considering the relationships between triplets. For example, let us consider two triplets $T_1$ and $T_2$ where $T_1$ is (Academy_Awards, Nominates, Avatar) and $T_2$ is (Avatar, Wins, Academy_Awards). Given these two base-level triplets, we see that $T_1$ is a prerequisite for $T_2$. In this paper, we define a higher-level triplet to represent a relationship between triplets, e.g., $\langle T_1$, PrerequisiteFor, $T_2\rangle$ where PrerequisiteFor is a higher-level relation. We define a bi-level knowledge graph that consists of the base-level and the higher-level triplets. We also propose a data augmentation strategy based on the random walks on the bi-level knowledge graph to augment plausible triplets. Our model called BiVE learns embeddings by taking into account the structures of the base-level and the higher-level triplets, with additional consideration of the augmented triplets. We propose two new tasks: triplet prediction and conditional link prediction. Given a triplet $T_1$ and a higher-level relation, the triplet prediction predicts a triplet that is likely to be connected to $T_1$ by the higher-level relation, e.g., $\langle T_1$, PrerequisiteFor, ?$\rangle$. The conditional link prediction predicts a missing entity in a triplet conditioned on another triplet, e.g., $\langle T_1$, PrerequisiteFor, (Avatar, Wins, ?)$\rangle$. Experimental results show that BiVE significantly outperforms all other methods in the two new tasks and the typical base-level link prediction in real-world bi-level knowledge graphs.

higher-level triplet, knowledge graph, triplet, (16 more...)

doi: 10.1609/aaai.v37i4.25538

2302.02601

Country:

North America > United States > New York (0.05)
Europe > Sweden > Stockholm > Stockholm (0.04)
Europe > Italy (0.04)
(12 more...)

Genre:

Research Report > New Finding (0.34)
Personal > Honors (0.34)

Industry:

Media > Music (1.00)
Media > Film (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
(2 more...)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (1.00)