AITopics | Personal

Collaborating Authors

Personal

Engadget Podcast: MoviePass founder Stacy Spikes on the MovieCrash documentary

EngadgetMay-31-2024, 11:30:40 GMT

In this episode, Cherlynn and Devindra discuss Copilot+ and the potential rise of Arm-based Windows systems, and we dive into the new Surface Pro and Surface Laptop.

artificial intelligence, devindra, machine learning, (16 more...)

Engadget

Country:

North America > United States > New York (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
Europe > United Kingdom (0.04)
(2 more...)

Genre: Personal > Interview (1.00)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Information Technology (1.00)
Banking & Finance (1.00)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

Vocabulary Attack to Hijack Large Language Model Applications

Levi, Patrick, Neumann, Christoph P.

arXiv.org Artificial IntelligenceMay-30-2024

The fast advancements in Large Language Models (LLMs) are driving an increasing number of applications. Together with the growing number of users, we also see an increasing number of attackers who try to outsmart these systems. They want the model to reveal confidential information, specific false information, or offensive behavior. To this end, they manipulate their instructions for the LLM by inserting separators or rephrasing them systematically until they reach their goal. Our approach is different. It inserts words from the model vocabulary. We find these words using an optimization procedure and embeddings from another LLM (attacker LLM). We prove our approach by goal hijacking two popular open-source LLMs from the Llama2 and the Flan-T5 families, respectively. We present two main findings. First, our approach creates inconspicuous instructions and therefore it is hard to detect. For many attack cases, we find that even a single word insertion is sufficient. Second, we demonstrate that we can conduct our attack using a different model than the target model to conduct our attack with.

application, llama2, system prompt, (15 more...)

arXiv.org Artificial Intelligence

2404.02637

Country:

Europe > Romania > Vest Development Region > Timiș County > Timișoara (0.05)
Asia > Middle East > Republic of Türkiye > Batman Province > Batman (0.04)
North America > United States > California > Los Angeles County > Los Angeles (0.04)
(3 more...)

Genre:

Research Report > New Finding (0.46)
Personal > Interview (0.46)

Industry: Media (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.74)

Add feedback

AIhub monthly digest: May 2024 – causality and natural language, AfriClimate AI, and digital twins for smart cities

AIHubMay-29-2024, 08:54:34 GMT

Welcome to our monthly digest, where you can catch up with any AIhub stories you may have missed, peruse the latest news, recap recent events, and more. This month, we learn about causality and natural language, find out about the grassroots initiative AfriClimate AI, and discuss what responsible and trustworthy AI really means. In a series of interviews, we're chatting to some of the AAAI/SIGAI Doctoral Consortium participants to find out more about their research. We caught up with Salena Torres Ashton and found out about her work focusing on causality and natural language. Salena was a professional genealogist and historian for 25 years before deciding to return to University and study for a PhD.

africlimate ai, artificial intelligence, causality and natural language, (5 more...)

AIHub

Country:

Europe > Austria > Vienna (0.06)
Asia > Japan > Honshū > Kantō > Kanagawa Prefecture > Yokohama (0.06)
Africa (0.06)

Genre: Personal > Interview (0.57)

Industry: Law > Statutes (0.33)

Technology: Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

Enhancing Security and Privacy in Federated Learning using Update Digests and Voting-Based Defense

Li, Wenjie, Fan, Kai, Zhang, Jingyuan, Li, Hui, Lim, Wei Yang Bryan, Yang, Qiang

arXiv.org Artificial IntelligenceMay-29-2024

Federated Learning (FL) is a promising privacy-preserving machine learning paradigm that allows data owners to collaboratively train models while keeping their data localized. Despite its potential, FL faces challenges related to the trustworthiness of both clients and servers, especially in the presence of curious or malicious adversaries. In this paper, we introduce a novel framework named \underline{\textbf{F}}ederated \underline{\textbf{L}}earning with \underline{\textbf{U}}pdate \underline{\textbf{D}}igest (FLUD), which addresses the critical issues of privacy preservation and resistance to Byzantine attacks within distributed learning environments. FLUD utilizes an innovative approach, the $\mathsf{LinfSample}$ method, allowing clients to compute the $l_{\infty}$ norm across sliding windows of updates as an update digest. This digest enables the server to calculate a shared distance matrix, significantly reducing the overhead associated with Secure Multi-Party Computation (SMPC) by three orders of magnitude while effectively distinguishing between benign and malicious updates. Additionally, FLUD integrates a privacy-preserving, voting-based defense mechanism that employs optimized SMPC protocols to minimize communication rounds. Our comprehensive experiments demonstrate FLUD's effectiveness in countering Byzantine adversaries while incurring low communication and runtime overhead. FLUD offers a scalable framework for secure and reliable FL in distributed environments, facilitating its application in scenarios requiring robust data management and security.

federated learning, overhead, protocol, (13 more...)

arXiv.org Artificial Intelligence

2405.18802

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > China > Hong Kong (0.04)
Asia > Singapore (0.04)
(8 more...)

Genre:

Research Report (0.83)
Personal (0.67)

Industry:

Information Technology > Security & Privacy (1.00)
Education (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models

Ahn, Jaewoo, Lee, Taehyun, Lim, Junyoung, Kim, Jin-Hwa, Yun, Sangdoo, Lee, Hwaran, Kim, Gunhee

arXiv.org Artificial IntelligenceMay-28-2024

While Large Language Models (LLMs) can serve as agents to simulate human behaviors (i.e., role-playing agents), we emphasize the importance of point-in-time role-playing. This situates characters at specific moments in the narrative progression for three main reasons: (i) enhancing users' narrative immersion, (ii) avoiding spoilers, and (iii) fostering engagement in fandom role-playing. To accurately represent characters at specific time points, agents must avoid character hallucination, where they display knowledge that contradicts their characters' identities and historical timelines. We introduce TimeChara, a new benchmark designed to evaluate point-in-time character hallucination in role-playing LLMs. Comprising 10,895 instances generated through an automated pipeline, this benchmark reveals significant hallucination issues in current state-of-the-art LLMs (e.g., GPT-4o). To counter this challenge, we propose Narrative-Experts, a method that decomposes the reasoning steps and utilizes narrative experts to reduce point-in-time character hallucinations effectively. Still, our findings with TimeChara highlight the ongoing challenges of point-in-time character hallucination, calling for further study.

event summary, hallucination, time point, (16 more...)

arXiv.org Artificial Intelligence

2405.18027

Country:

Asia > South Korea > Seoul > Seoul (0.04)
Europe > Italy (0.04)

Genre:

Personal > Interview (0.67)
Research Report > New Finding (0.66)

Industry:

Leisure & Entertainment (1.00)
Media (0.95)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.91)

Add feedback

Facilitating Multi-Role and Multi-Behavior Collaboration of Large Language Models for Online Job Seeking and Recruiting

Sun, Hongda, Lin, Hongzhan, Yan, Haiyu, Zhu, Chen, Song, Yang, Gao, Xin, Shang, Shuo, Yan, Rui

arXiv.org Artificial IntelligenceMay-28-2024

The emergence of online recruitment services has revolutionized the traditional landscape of job seeking and recruitment, necessitating the development of high-quality industrial applications to improve person-job fitting. Existing methods generally rely on modeling the latent semantics of resumes and job descriptions and learning a matching function between them. Inspired by the powerful role-playing capabilities of Large Language Models (LLMs), we propose to introduce a mock interview process between LLM-played interviewers and candidates. The mock interview conversations can provide additional evidence for candidate evaluation, thereby augmenting traditional person-job fitting based solely on resumes and job descriptions. However, characterizing these two roles in online recruitment still presents several challenges, such as developing the skills to raise interview questions, formulating appropriate answers, and evaluating two-sided fitness. To this end, we propose MockLLM, a novel applicable framework that divides the person-job matching process into two modules: mock interview generation and two-sided evaluation in handshake protocol, jointly enhancing their performance through collaborative behaviors between interviewers and candidates. We design a role-playing framework as a multi-role and multi-behavior paradigm to enable a single LLM agent to effectively behave with multiple functions for both parties. Moreover, we propose reflection memory generation and dynamic prompt modification techniques to refine the behaviors of both sides, enabling continuous optimization of the augmented additional evidence. Extensive experimental results show that MockLLM can achieve the best performance on person-job matching accompanied by high mock interview quality, envisioning its emerging application in real online recruitment in the future.

evaluation, interviewer, interviewer and candidate, (11 more...)

arXiv.org Artificial Intelligence

2405.18113

Country:

Asia > China (0.04)
North America > United States > Hawaii (0.04)

Genre:

Research Report (0.84)
Personal > Interview (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Augmented Physics: A Machine Learning-Powered Tool for Creating Interactive Physics Simulations from Static Diagrams

Gunturu, Aditya, Wen, Yi, Thundathil, Jarin, Zhang, Nandi, Kazi, Rubaiat Habib, Suzuki, Ryo

arXiv.org Artificial IntelligenceMay-28-2024

We introduce Augmented Physics, a machine learning-powered tool designed for creating interactive physics simulations from static textbook diagrams. Leveraging computer vision techniques, such as Segment Anything and OpenCV, our web-based system enables users to semi-automatically extract diagrams from physics textbooks and then generate interactive simulations based on the extracted content. These interactive diagrams are seamlessly integrated into scanned textbook pages, facilitating interactive and personalized learning experiences across various physics concepts, including gravity, optics, circuits, and kinematics. Drawing on an elicitation study with seven physics instructors, we explore four key augmentation techniques: 1) augmented experiments, 2) animated diagrams, 3) bi-directional manipulatives, and 4) parameter visualization. We evaluate our system through technical evaluation, a usability study (N=12), and expert interviews (N=12). The study findings suggest that our system can facilitate more engaging and personalized learning experiences in physics education.

diagram, proceedings, simulation, (13 more...)

arXiv.org Artificial Intelligence

2405.18614

Country:

North America > Canada > Alberta > Census Division No. 6 > Calgary Metropolitan Region > Calgary (0.15)
North America > United States > New York > New York County > New York City (0.04)
Asia > China > Hong Kong (0.04)
(6 more...)

Genre:

Questionnaire & Opinion Survey (1.00)
Instructional Material (1.00)
Personal > Interview (0.48)
Research Report > New Finding (0.34)

Industry:

Education > Educational Setting (1.00)
Education > Curriculum > Subject-Specific Education (0.90)
Education > Educational Technology > Educational Software > Computer Based Training (0.86)

Technology:

Information Technology > Human Computer Interaction > Interfaces (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

CtrlA: Adaptive Retrieval-Augmented Generation via Probe-Guided Control

Liu, Huanshuo, Zhang, Hao, Guo, Zhijiang, Dong, Kuicai, Li, Xiangyang, Lee, Yi Quan, Zhang, Cong, Liu, Yong

arXiv.org Artificial IntelligenceMay-28-2024

Retrieval-augmented generation (RAG) has emerged as a promising solution for mitigating hallucinations of large language models (LLMs) with retrieved external knowledge. Adaptive RAG enhances this approach by dynamically assessing the retrieval necessity, aiming to balance external and internal knowledge usage. However, existing adaptive RAG methods primarily realize retrieval on demand by relying on superficially verbalize-based or probability-based feedback of LLMs, or directly fine-tuning LLMs via carefully crafted datasets, resulting in unreliable retrieval necessity decisions, heavy extra costs, and sub-optimal response generation. We present the first attempts to delve into the internal states of LLMs to mitigate such issues by introducing an effective probe-guided adaptive RAG framework, termed CtrlA. Specifically, CtrlA employs an honesty probe to regulate the LLM's behavior by manipulating its representations for increased honesty, and a confidence probe to monitor the internal states of LLM and assess confidence levels, determining the retrieval necessity during generation. Experiments show that CtrlA is superior to existing adaptive RAG methods on a diverse set of tasks, the honesty control can effectively make LLMs more honest and confidence monitoring is proven to be a promising indicator of retrieval trigger. Our codes are available at https://github.com/HSLiu-Initial/CtrlA.git.

language model, llm, retrieval, (15 more...)

arXiv.org Artificial Intelligence

2405.18727

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
North America > Canada > Ontario > Toronto (0.04)
North America > United States > Michigan (0.04)
(5 more...)

Genre:

Research Report (1.00)
Personal > Obituary (0.46)

Industry:

Media > Film (0.46)
Leisure & Entertainment > Sports (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Navigating AI Fallibility: Examining People's Reactions and Perceptions of AI after Encountering Personality Misrepresentations

Wang, Qiaosi, Anyi, Chidimma L., Swain, Vedant Das, Goel, Ashok K.

arXiv.org Artificial IntelligenceMay-25-2024

Many hyper-personalized AI systems profile people's characteristics (e.g., personality traits) to provide personalized recommendations. These systems are increasingly used to facilitate interactions among people, such as providing teammate recommendations. Despite improved accuracy, such systems are not immune to errors when making inferences about people's most personal traits. These errors manifested as AI misrepresentations. However, the repercussions of such AI misrepresentations are unclear, especially on people's reactions and perceptions of the AI. We present two studies to examine how people react and perceive the AI after encountering personality misrepresentations in AI-facilitated team matching in a higher education context. Through semi-structured interviews (n=20) and a survey experiment (n=198), we pinpoint how people's existing and newly acquired AI knowledge could shape their perceptions and reactions of the AI after encountering AI misrepresentations. Specifically, we identified three rationales that people adopted through knowledge acquired from AI (mis)representations: AI works like a machine, human, and/or magic. These rationales are highly connected to people's reactions of over-trusting, rationalizing, and forgiving of AI misrepresentations. Finally, we found that people's existing AI knowledge, i.e., AI literacy, could moderate people's changes in their trust in AI after encountering AI misrepresentations, but not changes in people's social perceptions of AI. We discuss the role of people's AI knowledge when facing AI fallibility and implications for designing responsible mitigation and repair strategies.

inference, participant, sami, (13 more...)

arXiv.org Artificial Intelligence

2405.16355

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Virginia (0.04)
North America > United States > New York > New York County > New York City (0.04)
(4 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Questionnaire & Opinion Survey (1.00)
(2 more...)

Industry: Education > Educational Setting > Higher Education (0.87)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
(3 more...)

Add feedback

Are Long-LLMs A Necessity For Long-Context Tasks?

Qian, Hongjin, Liu, Zheng, Zhang, Peitian, Mao, Kelong, Zhou, Yujia, Chen, Xu, Dou, Zhicheng

arXiv.org Artificial IntelligenceMay-24-2024

The learning and deployment of long-LLMs remains a challenging problem despite recent progresses. In this work, we argue that the long-LLMs are not a necessity to solve long-context tasks, as common long-context tasks are short-context solvable, i.e. they can be solved by purely working with oracle short-contexts within the long-context tasks' inputs. On top of this argument, we propose a framework called LC-Boost (Long-Context Bootstrapper), which enables a short-LLM to address the long-context tasks in a bootstrapping manner. In our framework, the short-LLM prompts itself to reason for two critical decisions: 1) how to access to the appropriate part of context within the input, 2) how to make effective use of the accessed context. By adaptively accessing and utilizing the context based on the presented tasks, LC-Boost can serve as a general framework to handle diversified long-context processing problems. We comprehensively evaluate different types of tasks from popular long-context benchmarks, where LC-Boost is able to achieve a substantially improved performance with a much smaller consumption of resource.

lc-boost, long context, short context, (16 more...)

arXiv.org Artificial Intelligence

2405.15318

Country:

Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
North America > Dominican Republic (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
(2 more...)

Genre:

Research Report (0.64)
Personal (0.46)

Industry: Leisure & Entertainment (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback