AITopics | Personal

Leveraging Large Language Models to Power Chatbots for Collecting User Self-Reported Data

Wei, Jing, Kim, Sungdong, Jung, Hyunhoon, Kim, Young-Ho

arXiv.org Artificial Intelligence, Sep-22-2023

Large language models (LLMs) provide a new way to build chatbots by accepting natural language prompts. Yet, it is unclear how to design prompts to power chatbots to carry on naturalistic conversations while pursuing a given goal, such as collecting self-report data from users. We explore what design factors of prompts can help steer chatbots to talk naturally and collect data reliably. To this aim, we formulated four prompt designs with different structures and personas. Through an online study (N = 48) where participants conversed with chatbots driven by different designs of prompts, we assessed how prompt designs and conversation topics affected the conversation flows and users' perceptions of chatbots. Our chatbots covered 79% of the desired information slots during conversations, and the designs of prompts and topics significantly influenced the conversation flows and the data collection performance. We discuss the opportunities and challenges of building chatbots with LLMs.

chatbot, participant, proceedings, (15 more...)

arXiv.org Artificial Intelligence

2301.05843

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > Dominican Republic (0.04)
(7 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Questionnaire & Opinion Survey (1.00)
Personal > Interview (1.00)

Industry:

Health & Medicine > Consumer Health (1.00)
Information Technology > Services (0.67)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Can large language models generate salient negative statements?

Arnaout, Hiba, Razniewski, Simon

arXiv.org Artificial Intelligence, Sep-21-2023

We examine the ability of large language models (LLMs) to generate salient (interesting) negative statements about real-world entities; an emerging research topic of the last few years. We probe the LLMs using zero- and k-shot unconstrained probes, and compare with traditional methods for negation generation, i.e., pattern-based textual extractions and knowledge-graph-based inferences, as well as crowdsourced gold statements. We measure the correctness and salience of the generated lists about subjects from different domains. Our evaluation shows that guided probes do in fact improve the quality of generated negatives, compared to the zero-shot variant. Nevertheless, using both prompts, LLMs still struggle with the notion of factuality of negatives, frequently generating many ambiguous statements, or statements with negative keywords but a positive meaning.

llm, negative statement, salience, (15 more...)

arXiv.org Artificial Intelligence

2305.16755

Country:

Europe > Germany (0.15)
Asia > Middle East > Lebanon (0.05)
North America > United States > Illinois > Cook County > Chicago (0.05)
(7 more...)

Genre:

Personal > Honors (0.93)
Research Report (0.82)

Industry:

Leisure & Entertainment > Sports > Basketball (1.00)
Information Technology (0.93)
Media (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.54)

"It's a Fair Game", or Is It? Examining How Users Navigate Disclosure Risks and Benefits When Using LLM-Based Conversational Agents

Zhang, Zhiping, Jia, Michelle, Lee, Hao-Ping (Hank), Yao, Bingsheng, Das, Sauvik, Lerner, Ada, Wang, Dakuo, Li, Tianshi

arXiv.org Artificial Intelligence, Sep-20-2023

The widespread use of Large Language Model (LLM)-based conversational agents (CAs), especially in high-stakes domains, raises many privacy concerns. Building ethical LLM-based CAs that respect user privacy requires an in-depth understanding of the privacy risks that concern users the most. However, existing research, primarily model-centered, does not provide insight into users' perspectives. To bridge this gap, we analyzed sensitive disclosures in real-world ChatGPT conversations and conducted semi-structured interviews with 19 LLM-based CA users. We found that users are constantly faced with trade-offs between privacy, utility, and convenience when using LLM-based CAs. However, users' erroneous mental models and the dark patterns in system design limited their awareness and comprehension of the privacy risks. Additionally, the human-like interactions encouraged more sensitive disclosures, which complicated users' ability to navigate the trade-offs. We discuss practical design guidelines and the needs for paradigmatic shifts to protect the privacy of LLM-based CA users.

chatgpt, information, participant, (13 more...)

arXiv.org Artificial Intelligence

2309.11653

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(4 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Questionnaire & Opinion Survey (1.00)
Personal > Interview (1.00)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Education (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.99)

A Survey on Privacy in Graph Neural Networks: Attacks, Preservation, and Applications

Zhang, Yi, Zhao, Yuying, Li, Zhaoqing, Cheng, Xueqi, Wang, Yu, Kotevska, Olivera, Yu, Philip S., Derr, Tyler

arXiv.org Artificial Intelligence, Sep-19-2023

Privacy attacks are a popular and well-developed topic in various fields such as social network analysis, healthcare, finance, and systems [88], [89], [90]. In recent years, the surge of machine learning has provided powerful tools to solve many practical problems. However, data-driven approaches also threaten users' privacy due to the associated risks of data leakage and inference [85]. Consequently, a substantial amount of work has been devoted to investigating the vulnerabilities of ML models and the risks of privacy leakage [47]. A branch of privacy research is to develop privacy attack models, which has received much attention during the past few years. However, attack models with respect to GNNs have only been explored very recently because GNN techniques are relatively new compared with CNNs/transformers in the image/natural language processing (NLP) domains, and the irregular graph structure poses unique challenges to transferring existing attack techniques that are well-established in other domains. In this section, we summarize papers that have developed attack models specifically targeting GNNs. We classify the privacy attack models on GNNs into four categories (which are visualized in Figure 4): a) model extraction attacks (MEA), b) graph structure reconstruction (GSR), c) attribute inference attacks (AIA), and d) membership inference attacks (MIA).

Figure 1: Illustrations of the four categories of privacy attack models on graphs: a) model extraction attacks (MEA); b) graph structure reconstruction (GSR); c) attribute inference attacks (AIA); and d) membership inference attacks (MIA).

graph, information, neural network, (15 more...)

arXiv.org Artificial Intelligence

2308.16375

Country:

Asia > China > Hong Kong (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > Tennessee > Davidson County > Nashville (0.04)
(10 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.46)
Personal > Honors (0.45)
Instructional Material > Course Syllabus & Notes (0.45)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Regional Government > North America Government > United States Government (0.92)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(3 more...)

Sen. Richard Blumenthal Defends His Controversial Bill Regulating Social Media for Kids

Slate, Sep-18-2023, 20:18:57 GMT

For a while now, Washington has been wrestling with two big forces shaping technology: social media and artificial intelligence. Who should do it--and how? Currently, Congress is considering a bill that would regulate how social media companies treat minors: the Kids Online Safety Act. Although it has bipartisan support, KOSA is not without controversy. Several critics have called it "government censorship." One group, the Electronic Frontier Foundation, says it is "one of the most dangerous bills in years."

algorithm, attorney general, state attorney general, (12 more...)

Slate

Country:

North America > United States > California (0.15)
North America > United States > Missouri (0.05)
Europe > France (0.05)
(7 more...)

Genre: Personal > Interview (0.69)

Industry:

Law > Statutes (1.00)
Law > Civil Rights & Constitutional Law (1.00)
Information Technology > Security & Privacy (1.00)
(2 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Towards Better Modeling with Missing Data: A Contrastive Learning-based Visual Analytics Perspective

Xie, Laixin, Ouyang, Yang, Chen, Longfei, Wu, Ziming, Li, Quan

arXiv.org Artificial Intelligence, Sep-18-2023

Missing data can pose a challenge for machine learning (ML) modeling. To address this, current approaches are categorized into feature imputation and label prediction and are primarily focused on handling missing data to enhance ML performance. These approaches rely on the observed data to estimate the missing values and therefore encounter three main shortcomings in imputation, including the need for different imputation methods for various missing data mechanisms, heavy dependence on the assumption of data distribution, and potential introduction of bias. This study proposes a Contrastive Learning (CL) framework to model observed data with missing values, where the ML model learns the similarity between an incomplete sample and its complete counterpart and the dissimilarity between other samples. Our proposed approach demonstrates the advantages of CL without requiring any imputation. To enhance interpretability, we introduce CIVis, a visual analytics system that incorporates interpretable techniques to visualize the learning process and diagnose the model status. Users can leverage their domain knowledge through interactive sampling to identify negative and positive pairs in CL. The output of CIVis is an optimized model that takes specified features and predicts downstream tasks. We provide two usage scenarios in regression and classification tasks and conduct quantitative experiments, expert interviews, and a qualitative user study to demonstrate the effectiveness of our approach. In short, this study offers a valuable contribution to addressing the challenges associated with ML modeling in the presence of missing data by providing a practical solution that achieves high predictive accuracy and model interpretability.

civis, full data, negative sample, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/TVCG.2023.3285210

2309.09744

Country:

North America > United States (0.14)
Asia > China > Shanghai > Shanghai (0.04)
Asia > China > Hubei Province > Wuhan (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre:

Research Report (1.00)
Questionnaire & Opinion Survey (1.00)
Personal > Interview (0.34)

Industry:

Banking & Finance (0.68)
Health & Medicine (0.46)

Technology:

Information Technology > Data Science > Data Quality (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Safe and Accelerated Deep Reinforcement Learning-based O-RAN Slicing: A Hybrid Transfer Learning Approach

Nagib, Ahmad M., Abou-Zeid, Hatem, Hassanein, Hossam S.

arXiv.org Artificial Intelligence, Sep-18-2023

The open radio access network (O-RAN) architecture supports intelligent network control algorithms as one of its core capabilities. Data-driven applications incorporate such algorithms to optimize radio access network (RAN) functions via RAN intelligent controllers (RICs). Deep reinforcement learning (DRL) algorithms are among the main approaches adopted in the O-RAN literature to solve dynamic radio resource management problems. However, despite the benefits introduced by the O-RAN RICs, the practical adoption of DRL algorithms in real network deployments falls behind. This is primarily due to the slow convergence and unstable performance exhibited by DRL agents upon deployment and when encountering previously unseen network conditions. In this paper, we address these challenges by proposing transfer learning (TL) as a core component of the training and deployment workflows for the DRL-based closed-loop control of O-RAN functionalities. To this end, we propose and design a hybrid TL-aided approach that leverages the advantages of both policy reuse and distillation TL methods to provide safe and accelerated convergence in DRL-based O-RAN slicing. We conduct a thorough experiment that accommodates multiple services, including real VR gaming traffic to reflect practical scenarios of O-RAN slicing. We also propose and implement policy reuse and distillation-aided DRL and non-TL-aided DRL as three separate baselines. The proposed hybrid approach shows at least 7.7% and 20.7% improvements in the average initial reward value and the percentage of converged scenarios, respectively, and a 64.6% decrease in reward variance, while maintaining fast convergence and enhancing the generalizability compared with the baselines.

agent, drl agent, expert policy, (15 more...)

arXiv.org Artificial Intelligence

2309.07265

Country:

North America > Canada > Alberta > Census Division No. 6 > Calgary Metropolitan Region > Calgary (0.14)
Africa > Middle East > Egypt > Cairo Governorate > Cairo (0.04)
North America > Canada > Ontario > Kingston (0.04)
(2 more...)

Genre:

Research Report > New Finding (0.46)
Personal > Honors (0.46)

Industry:

Leisure & Entertainment > Games > Computer Games (0.48)
Telecommunications > Networks (0.46)
Education > Educational Setting (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation Suite for Large Language Models

Leong, Wei Qi, Ngui, Jian Gang, Susanto, Yosephine, Rengarajan, Hamsawardhini, Sarveswaran, Kengatharaiyer, Tjhi, William Chandra

arXiv.org Artificial Intelligence, Sep-18-2023

The rapid development of Large Language Models (LLMs) and the emergence of novel abilities with scale have necessitated the construction of holistic, diverse and challenging benchmarks such as HELM and BIG-bench. However, at the moment, most of these benchmarks focus only on performance in English and evaluations that include Southeast Asian (SEA) languages are few in number. We therefore propose BHASA, a holistic linguistic and cultural evaluation suite for LLMs in SEA languages. It comprises three components: (1) an NLP benchmark covering eight tasks across Natural Language Understanding (NLU), Generation (NLG) and Reasoning (NLR), (2) LINDSEA, a linguistic diagnostic toolkit that spans the gamut of linguistic phenomena including syntax, semantics and pragmatics, and (3) a cultural diagnostics dataset that probes for both cultural representation and sensitivity. For this preliminary effort, we implement the NLP benchmark only for Indonesian, Vietnamese, Thai and Tamil, and we only include Indonesian and Tamil for LINDSEA and the cultural diagnostics dataset. As GPT-4 is purportedly one of the best-performing multilingual LLMs at the moment, we use it as a yardstick to gauge the capabilities of LLMs in the context of SEA languages. Our initial experiments on GPT-4 with BHASA find it lacking in various aspects of linguistic capabilities, cultural representation and sensitivity in the targeted SEA languages. BHASA is a work in progress and will continue to be improved and expanded in the future.

abstractive summarization, computational linguistics, nlp benchmark component, (14 more...)

arXiv.org Artificial Intelligence

2309.06085

Country:

North America > United States > Montana (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.13)
Asia > Singapore (0.04)
(50 more...)

Genre:

Overview (0.92)
Personal (0.92)
Research Report > New Finding (0.67)

Industry:

Health & Medicine (1.00)
Government (1.00)
Education (0.92)
Leisure & Entertainment (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Obituary That Called Late NBA Player 'Useless' Sparks Firestorm

Huffington Post - Tech news and opinion, Sep-15-2023, 14:27:47 GMT

Social media users hurled criticism at Microsoft this week for what many thought was an AI-generated obituary for NBA player Brandon Hunter on its website MSN. The controversy began after the obituary -- which had a headline that read "Brandon Hunter useless at 42" written by "Editor" -- appeared on the Microsoft-owned platform after Hunter's death on Tuesday. The obituary goes on to refer to the former Boston Celtics and Orlando Magic player having been "handed away on the age of 42" and claimed he "performed in 67 video games over two seasons and achieved a career-high of 17 factors in a recreation in opposition to the Milwaukee Bucks in 2004." The post appeared to follow a similar format to a story on TMZ Sports, Futurism noted, "albeit with altered punctuation and a use of synonyms so liberal that the result is essentially incomprehensible." You can compare both the obituary containing the error and the TMZ Sports story here.

brandon hunter, microsoft, spark firestorm, (9 more...)

Huffington Post - Tech news and opinion

Country:

North America > United States > Wisconsin > Milwaukee County > Milwaukee (0.26)
North America > United States > California > Los Angeles County > Los Angeles (0.06)

Genre: Personal > Obituary (1.00)

Industry: Leisure & Entertainment > Sports > Basketball (1.00)

Technology:

Information Technology > Artificial Intelligence (0.99)
Information Technology > Communications > Social Media (0.59)

Using Large Language Models for Knowledge Engineering (LLMKE): A Case Study on Wikidata

Zhang, Bohui, Reklos, Ioannis, Jain, Nitisha, Peñuela, Albert Meroño, Simperl, Elena

arXiv.org Artificial Intelligence, Sep-15-2023

In this work, we explore the use of Large Language Models (LLMs) for knowledge engineering tasks in the context of the ISWC 2023 LM-KBC Challenge. For this task, given subject and relation pairs sourced from Wikidata, we utilize pre-trained LLMs to produce the relevant objects in string format and link them to their respective Wikidata QIDs. We developed a pipeline using LLMs for Knowledge Engineering (LLMKE), combining knowledge probing and Wikidata entity mapping. The method achieved a macro-averaged F1-score of 0.701 across the properties, with the scores varying from 1.00 to 0.328. These results demonstrate that the knowledge of LLMs varies significantly depending on the domain and that further experimentation is required to determine the circumstances under which LLMs can be used for automatic Knowledge Base (e.g., Wikidata) completion and correction. The investigation of the results also suggests the promising contribution of LLMs in collaborative knowledge engineering. LLMKE won Track 2 of the challenge. The implementation is available at https://github.com/bohuizhang/LLMKE.

knowledge, python list, relation, (13 more...)

arXiv.org Artificial Intelligence

2309.08491

Country:

North America > Puerto Rico > Peñuelas > Peñuelas (0.04)
South America > Brazil (0.04)
North America > Canada > Ontario > Toronto (0.04)
(7 more...)

Genre:

Personal > Honors (1.00)
Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)
