AITopics

The Integration of Semantic and Structural Knowledge in Knowledge Graph Entity Typing

Li, Muzhi, Hu, Minda, King, Irwin, Leung, Ho-fung

The Knowledge Graph Entity Typing (KGET) task aims to predict missing type annotations for entities in knowledge graphs. Recent works only utilize the \textit{\textbf{structural knowledge}} in the local neighborhood of entities, disregarding \textit{\textbf{semantic knowledge}} in the textual representations of entities, relations, and types that are also crucial for type inference. Additionally, we observe that the interaction between semantic and structural knowledge can be utilized to address the false-negative problem. In this paper, we propose a novel \textbf{\underline{S}}emantic and \textbf{\underline{S}}tructure-aware KG \textbf{\underline{E}}ntity \textbf{\underline{T}}yping~{(SSET)} framework, which is composed of three modules. First, the \textit{Semantic Knowledge Encoding} module encodes factual knowledge in the KG with a Masked Entity Typing task. Then, the \textit{Structural Knowledge Aggregation} module aggregates knowledge from the multi-hop neighborhood of entities to infer missing types. Finally, the \textit{Unsupervised Type Re-ranking} module utilizes the inference results from the two models above to generate type predictions that are robust to false-negative samples. Extensive experiments show that SSET significantly outperforms existing state-of-the-art methods.

artificial intelligence, knowledge, natural language, (19 more...)

2404.08313

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > New York > New York County > New York City (0.05)
Asia > China > Hong Kong (0.04)
(8 more...)

Genre:

Personal > Honors (0.68)
Research Report > Promising Solution (0.48)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.82)

Thematic Analysis with Large Language Models: does it work with languages other than English? A targeted test in Italian

De Paoli, Stefano

This paper proposes a test to perform Thematic Analysis (TA) with Large Language Model (LLM) on data which is in a different language than English. While there has been initial promising work on using pre-trained LLMs for TA on data in English, we lack any tests on whether these models can reasonably perform the same analysis with good quality in other language. In this paper a test will be proposed using an open access dataset of semi-structured interviews in Italian. The test shows that a pre-trained model can perform such a TA on the data, also using prompts in Italian. A comparative test shows the model capacity to produce themes which have a good resemblance with those produced independently by human researchers. The main implication of this study is that pre-trained LLMs may thus be suitable to support analysis in multilingual situations, so long as the language is supported by the model used.

large language model, machine learning, natural language, (19 more...)

2404.08488

Country:

Europe > Italy > Emilia-Romagna > Metropolitan City of Bologna > Bologna (0.04)
Europe > Switzerland (0.04)

Genre:

Research Report (1.00)
Personal > Interview (0.34)

Industry:

Law (0.93)
Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Alvarez, R. Michael, Morrier, Jacob

Evaluating the Quality of Answers in Political Q&A Sessions with Large Language Models

This paper presents a new approach to evaluating the quality of answers in political question-and-answer sessions. We propose to measure an answer's quality based on the degree to which it allows us to infer the initial question accurately. This conception of answer quality inherently reflects their relevance to initial questions. Drawing parallels with semantic search, we argue that this measurement approach can be operationalized by fine-tuning a large language model on the observed corpus of questions and answers without additional labeled data. We showcase our measurement approach within the context of the Question Period in the Canadian House of Commons. Our approach yields valuable insights into the correlates of the quality of answers in the Question Period. We find that answer quality varies significantly based on the party affiliation of the members of Parliament asking the questions and uncover a meaningful correlation between answer quality and the topics of the questions.

government, question and answer, question period, (14 more...)

2404.08816

Country:

Asia > China (0.05)
Asia > Myanmar (0.04)
North America > Canada > Ontario > Toronto (0.04)
(8 more...)

Genre:

Research Report (1.00)
Personal > Interview (0.84)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Law (1.00)
Health & Medicine > Therapeutic Area (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

JailbreakLens: Visual Analysis of Jailbreak Attacks Against Large Language Models

Feng, Yingchaojie, Chen, Zhizhang, Kang, Zhining, Wang, Sijia, Zhu, Minfeng, Zhang, Wei, Chen, Wei

The proliferation of large language models (LLMs) has underscored concerns regarding their security vulnerabilities, notably against jailbreak attacks, where adversaries design jailbreak prompts to circumvent safety mechanisms for potential misuse. Addressing these concerns necessitates a comprehensive analysis of jailbreak prompts to evaluate LLMs' defensive capabilities and identify potential weaknesses. However, the complexity of evaluating jailbreak performance and understanding prompt characteristics makes this analysis laborious. We collaborate with domain experts to characterize problems and propose an LLM-assisted framework to streamline the analysis process. It provides automatic jailbreak assessment to facilitate performance evaluation and support analysis of components and keywords in prompts. Based on the framework, we design JailbreakLens, a visual analysis system that enables users to explore the jailbreak performance against the target model, conduct multi-level analysis of prompt characteristics, and refine prompt instances to verify findings. Through a case study, technical evaluations, and expert interviews, we demonstrate our system's effectiveness in helping users evaluate model security and identify model weaknesses.

jailbreak prompt, keyword, template, (15 more...)

2404.08793

Country:

North America > United States (0.04)
North America > Canada > Ontario > Toronto (0.04)

Genre:

Research Report (1.00)
Personal > Interview (0.34)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Azzopardi, Leif, Dubiel, Mateusz, Halvey, Martin, Dalton, Jeffery

A Conceptual Framework for Conversational Search and Recommendation: Conceptualizing Agent-Human Interactions During the Conversational Search Process

While past work has started to tease out different actions that users and agents The conversational search task aims to enable a user to resolve perform and respond to during the conversational search process, information needs via natural language dialogue with there has been little work on formalizing these actions an agent. In this paper, we aim to develop a conceptual and decisions. Thus the goal of this paper is to develop a framework of the actions and intents of users and agents conceptual framework of different actions and intents, along explaining how these actions enable the user to explore the with the key decision points within the conversation. Our search space and resolve their information need. We outline aim is to make these tasks explicit in order to formalize the different actions and intents, before discussing key decision the research, development and evaluation of conversational points in the conversation where the agent needs to search agents. To this end, we first examine the key actions decide how to steer the conversational search process to a successful and intents identified in past work, and enumerate these and/or satisfactory conclusion. Essentially, this paper along with others that can be naturally inferred from a simulated provides a conceptualization of the conversational search process conversational context, before discussing the key decisions between an agent and user, which provides a framework that the agent needs to make in order to advance the and a starting point for research, development and evaluation conversation to a satisfactory or successful end. of conversational search agents.

agent, holiday, information, (11 more...)

2404.0863

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
North America > United States > Michigan > Washtenaw County > Ann Arbor (0.05)
Europe > Italy > Tuscany (0.05)
(9 more...)

Genre:

Research Report (0.64)
Personal > Interview (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)

WIREDApr-11-2024, 17:14:03 GMT

How Election Deniers Became Mainstream--and Are Weaponizing Tech

Election deniers are mobilizing their supporters and rolling out new tech to disrupt the November election. These groups are already organizing on hyperlocal levels, and learning to monitor polling places, target election officials, and challenge voter rolls. And though their work was once fringe, its become mainstreamed in the Republican Party. Today on WIRED Politics Lab, we focus on what these groups are doing, and what this means for voters and the election workers already facing threats and harassment. Write to us at politicslab@wired.com. Our show is produced by produced by Jake Harper. Jake Lummus is our studio engineer and Amar Lal mixed this episode. Jordan Bell is the Executive Producer of Audio Development and Chris Bannon is Global Head of Audio at Conde Nast. Also be sure to subscribe to the WIRED Politics Lab newsletter here. You can always listen to this week's podcast through the audio player on this page, but if you want to subscribe for free to get every episode, here's how: If you're on an iPhone or iPad, open the app called Podcasts, or just tap this link. Leah Feiger: Welcome to WIRED Politics Lab, a show about how tech is changing politics. Today, we're going to talk about how election deniers are mobilizing their supporters and rolling out new tech to disrupt November.

david gilbert, leah feiger, vittoria elliott, (12 more...)

WIRED

Country:

North America > United States > Arizona (0.04)
North America > United States > Washington (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Europe > Ireland > Munster > County Cork > Cork (0.04)

Genre: Personal (0.46)

Industry:

Government > Voting & Elections (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.94)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)

Sinha, Shiven, Prabhu, Ameya, Kumaraguru, Ponnurangam, Bhat, Siddharth, Bethge, Matthias

Wu's Method can Boost Symbolic AI to Rival Silver Medalists and AlphaGeometry to Outperform Gold Medalists at IMO Geometry

arXiv.org Artificial IntelligenceApr-11-2024

Proving geometric theorems constitutes a hallmark of visual reasoning combining both intuitive and logical skills. Therefore, automated theorem proving of Olympiad-level geometry problems is considered a notable milestone in human-level automated reasoning. The introduction of AlphaGeometry, a neuro-symbolic model trained with 100 million synthetic samples, marked a major breakthrough. It solved 25 of 30 International Mathematical Olympiad (IMO) problems whereas the reported baseline based on Wu's method solved only ten. In this note, we revisit the IMO-AG-30 Challenge introduced with AlphaGeometry, and find that Wu's method is surprisingly strong. Wu's method alone can solve 15 problems, and some of them are not solved by any of the other methods. This leads to two key findings: (i) Combining Wu's method with the classic synthetic methods of deductive databases and angle, ratio, and distance chasing solves 21 out of 30 methods by just using a CPU-only laptop with a time limit of 5 minutes per problem. Essentially, this classic method solves just 4 problems less than AlphaGeometry and establishes the first fully symbolic baseline strong enough to rival the performance of an IMO silver medalist. (ii) Wu's method even solves 2 of the 5 problems that AlphaGeometry failed to solve. Thus, by combining AlphaGeometry with Wu's method we set a new state-of-the-art for automated theorem proving on IMO-AG-30, solving 27 out of 30 problems, the first AI method which outperforms an IMO gold medalist.

alphageometry, baseline, geometry, (15 more...)

2404.06405

Country:

Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Personal > Honors (0.77)
Research Report > New Finding (0.67)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)

New ScientistApr-10-2024, 10:00:46 GMT

Mathematician wins Turing award for harnessing randomness

The mathematician Avi Wigderson has won the 2023 Turing award, often referred to as the Nobel prize for computing, for his work on understanding how randomness can shape and improve computer algorithms. Wigderson, who also won the prestigious Abel prize in 2021 for his mathematical contributions to computer science, was taken aback by the award. "The [Turing] committee fooled me into believing that we were going to have some conversation about collaborating," he says. "When I zoomed in, the whole committee was there and they told me. I was excited, surprised and happy."

randomness, turing award, wigderson, (9 more...)

New Scientist

Country:

North America > United States > New Jersey > Mercer County > Princeton (0.06)
Asia > Middle East > Israel (0.06)

Genre: Personal > Honors > Award (0.95)

Technology: Information Technology > Artificial Intelligence (0.53)

Wegmann, Anna, Broek, Tijs van den, Nguyen, Dong

What's Mine becomes Yours: Defining, Annotating and Detecting Context-Dependent Paraphrases in News Interview Dialogs

arXiv.org Artificial IntelligenceApr-9-2024

Best practices for high conflict conversations like counseling or customer support almost always include recommendations to paraphrase the previous speaker. Although paraphrase classification has received widespread attention in NLP, paraphrases are usually considered independent from context, and common models and datasets are not applicable to dialog settings. In this work, we investigate paraphrases in dialog (e.g., Speaker 1: "That book is mine." becomes Speaker 2: "That book is yours."). We provide an operationalization of context-dependent paraphrases, and develop a training for crowd-workers to classify paraphrases in dialog. We introduce a dataset with utterance pairs from NPR and CNN news interviews annotated for context-dependent paraphrases. To enable analyses on label variation, the dataset contains 5,581 annotations on 600 utterance pairs. We present promising results with in-context learning and with token classification models for automatic paraphrase detection in dialog.

annotation, annotator, computational linguistic, (16 more...)

2404.0667

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Singapore (0.04)
(20 more...)

Genre:

Personal > Interview (0.82)
Research Report (0.64)

Industry:

Government (1.00)
Leisure & Entertainment (0.93)
Media > News (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.67)