AITopics

With the growing demand for seamless connectivity and intelligent communication, the integration of artificial intelligence (AI) and sixth-generation (6G) communication networks has emerged as a transformative paradigm. By embedding AI capabilities across various network layers, this integration enables optimized resource allocation, improved efficiency, and enhanced system robust performance, particularly in intricate and dynamic environments. This paper presents a comprehensive overview of AI and communication for 6G networks, with a focus on emphasizing their foundational principles, inherent challenges, and future research opportunities. We first review the integration of AI and communications in the context of 6G, exploring the driving factors behind incorporating AI into wireless communications, as well as the vision for the convergence of AI and 6G. The discourse then transitions to a detailed exposition of the envisioned integration of AI within 6G networks, delineated across three progressive developmental stages. The first stage, AI for Network, focuses on employing AI to augment network performance, optimize efficiency, and enhance user service experiences. The second stage, Network for AI, highlights the role of the network in facilitating and buttressing AI operations and presents key enabling technologies, such as digital twins for AI and semantic communication. In the final stage, AI as a Service, it is anticipated that future 6G networks will innately provide AI functions as services, supporting application scenarios like immersive communication and intelligent industrial robots. In addition, we conduct an in-depth analysis of the critical challenges faced by the integration of AI and communications in 6G. Finally, we outline promising future research opportunities that are expected to drive the development and refinement of AI and 6G communications.

large language model, machine learning, real time system, (28 more...)

2412.14538

Country:

Europe (1.00)
Asia > China (1.00)
North America > United States > California (0.45)
North America > United States > New York (0.28)

Genre:

Overview > Innovation (0.92)
Research Report > Promising Solution (0.92)

Industry:

Transportation > Ground > Road (1.00)
Telecommunications > Networks (1.00)
Information Technology > Security & Privacy (1.00)
(11 more...)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (1.00)
(11 more...)

OpenGU: A Comprehensive Benchmark for Graph Unlearning

Fan, Bowen, Ai, Yuming, Li, Xunkai, Guo, Zhilin, Li, Rong-Hua, Wang, Guoren

Graph Machine Learning is essential for understanding and analyzing relational data. However, privacy-sensitive applications demand the ability to efficiently remove sensitive information from trained graph neural networks (GNNs), avoiding the unnecessary time and space overhead caused by retraining models from scratch. To address this issue, Graph Unlearning (GU) has emerged as a critical solution, with the potential to support dynamic graph updates in data management systems and enable scalable unlearning in distributed data systems while ensuring privacy compliance. Unlike machine unlearning in computer vision or other fields, GU faces unique difficulties due to the non-Euclidean nature of graph data and the recursive message-passing mechanism of GNNs. Additionally, the diversity of downstream tasks and the complexity of unlearning requests further amplify these challenges. Despite the proliferation of diverse GU strategies, the absence of a benchmark providing fair comparisons for GU, and the limited flexibility in combining downstream tasks and unlearning requests, have yielded inconsistencies in evaluations, hindering the development of this domain. To fill this gap, we present OpenGU, the first GU benchmark, where 16 SOTA GU algorithms and 37 multi-domain datasets are integrated, enabling various downstream tasks with 13 GNN backbones when responding to flexible unlearning requests. Based on this unified benchmark framework, we are able to provide a comprehensive and fair evaluation for GU. Through extensive experimentation, we have drawn $8$ crucial conclusions about existing GU methods, while also gaining valuable insights into their limitations, shedding light on potential avenues for future research.

data mining, machine learning, natural language, (23 more...)

2501.02728

Country:

North America > United States (0.46)
Asia > China (0.28)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.46)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Immunology (0.93)
(3 more...)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Information Management (1.00)
Information Technology > Data Science > Data Mining (1.00)
(4 more...)

Jahanshahi, Mahmoud, Mockus, Audris

Cracks in The Stack: Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets

A critical part of creating code suggestion systems is the pre-training of Large Language Models on vast amounts of source code and natural language text, often of questionable origin or quality. This may contribute to the presence of bugs and vulnerabilities in code generated by LLMs. While efforts to identify bugs at or after code generation exist, it is preferable to pre-train or fine-tune LLMs on curated, high-quality, and compliant datasets. The need for vast amounts of training data necessitates that such curation be automated, minimizing human intervention. We propose an automated source code autocuration technique that leverages the complete version history of open-source software projects to improve the quality of training data. This approach leverages the version history of all OSS projects to identify training data samples that have been modified or have undergone changes in at least one OSS project, and pinpoint a subset of samples that include fixes for bugs or vulnerabilities. We evaluate this method using The Stack v2 dataset, and find that 17% of the code versions in the dataset have newer versions, with 17% of those representing bug fixes, including 2.36% addressing known CVEs. The deduplicated version of Stack v2 still includes blobs vulnerable to 6,947 known CVEs. Furthermore, 58% of the blobs in the dataset were never modified after creation, suggesting they likely represent software with minimal or no use. Misidentified blob origins present an additional challenge, as they lead to the inclusion of non-permissively licensed code, raising serious compliance concerns. By addressing these issues, the training of new models can avoid perpetuating buggy code patterns or license violations. We expect our results to inspire process improvements for automated data curation, with the potential to enhance the reliability of outputs generated by AI tools.

large language model, machine learning, natural language, (20 more...)

2501.02628

Country: North America > United States (0.46)

Genre: Research Report > New Finding (0.48)

Industry: Law (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Bojic, Ljubisa, Seychell, Dylan, Cabarkapa, Milan

Towards New Benchmark for AI Alignment & Sentiment Analysis in Socially Important Issues: A Comparative Study of Human and LLMs in the Context of AGI

With the expansion of neural networks, such as large language models, humanity is exponentially heading towards superintelligence. As various AI systems are increasingly integrated into the fabric of societies-through recommending values, devising creative solutions, and making decisions-it becomes critical to assess how these AI systems impact humans in the long run. This research aims to contribute towards establishing a benchmark for evaluating the sentiment of various Large Language Models in socially importan issues. The methodology adopted was a Likert scale survey. Seven LLMs, including GPT-4 and Bard, were analyzed and compared against sentiment data from three independent human sample populations. Temporal variations in sentiment were also evaluated over three consecutive days. The results highlighted a diversity in sentiment scores among LLMs, ranging from 3.32 to 4.12 out of 5. GPT-4 recorded the most positive sentiment score towards AGI, whereas Bard was leaning towards the neutral sentiment. The human samples, contrastingly, showed a lower average sentiment of 2.97. The temporal comparison revealed differences in sentiment evolution between LLMs in three days, ranging from 1.03% to 8.21%. The study's analysis outlines the prospect of potential conflicts of interest and bias possibilities in LLMs' sentiment formation. Results indicate that LLMs, akin to human cognitive processes, could potentially develop unique sentiments and subtly influence societies' perceptions towards various opinions formed within the LLMs.

large language model, machine learning, sentiment, (15 more...)

2501.02531

Country:

Europe > Serbia (0.47)
Europe > Middle East > Malta (0.14)

Genre:

Questionnaire & Opinion Survey (1.00)
Research Report > Promising Solution (0.66)
Research Report > New Finding (0.46)

Industry:

Government (1.00)
Health & Medicine (0.68)
Law (0.68)
Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

WIREDJan-4-2025, 11:30:00 GMT

Apple May Owe You 20 in a Siri Privacy Lawsuit Settlement

It may be a new year, but the hacks, scams, and dangerous people lurking online haven't gone anywhere. Just a day before the ball dropped, the United States Treasury Department said it had been hacked. Officials believe the attackers are an as-yet-unidentified Advanced Persistent Threat group linked to China's government that exploited flaws in remote tech support software made by BeyondTrust to carry out what the Treasury Department described as a "major" breach. The company told the Treasury on December 8 that the attackers stole an authentication key, which ultimately allowed them to access department computers. While the Treasury says the attackers were only able to steal "certain unclassified documents," new details have already begun to emerge, which we'll get into more below.

attacker, location data, siri privacy law suit settlement, (12 more...)

WIRED

Country:

Asia > China (0.58)
North America > United States > California (0.06)
North America > United States > Virginia (0.05)
(2 more...)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Social Media (0.76)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.43)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.43)

arXiv.org Artificial IntelligenceJan-4-2025

LLM Content Moderation and User Satisfaction: Evidence from Response Refusals in Chatbot Arena

Pasch, Stefan

LLM safety and ethical alignment are widely discussed, but the impact of content moderation on user satisfaction remains underexplored. To address this, we analyze nearly 50,000 Chatbot Arena response - pairs using a novel fine - tuned RoBER T a model, that we trained on hand - labeled data to disentangle refusals due to ethical concerns from other refusals due to technical disabilities or lack of information. Our findings reveal a significant refusal penalty on content moderation, with users choosing ethical - based refusals roughly one - fourth as often as their preferred LLM response compared to standard responses . However, the context and phrasing play critical roles: refusals on highly sensitive prompts, such as illegal content, achieve higher win rates than less sensitive ethical concerns, and longer responses closely aligned with the prompt perform better. These results emphasize the need for nuanced moderation strategies that balance ethical safeguards with user satisfaction. Moreover, we find that the refusal penalty is notably lower in evaluations using the LLM - as - a - Judge method, highlighting discrepancies be tween user and automated assessments. Trigger Warning and Disclaimer: This paper discusses content moderation in LLMs, including sensitive topics such as hate speech, harassment, and illegal activities, as part of an analysis of LLM performance and user satisfaction. The study does not endorse or promote any harmful, illegal, or unethical content, nor does it make any normative judgments about the " right amount " of content moderation.

llm, refusal, win rate, (13 more...)

2501.03266

Country:

Europe > France (0.04)
North America > Canada (0.04)
Europe > Switzerland (0.04)
(2 more...)

Genre: Research Report > New Finding (1.00)

Industry: Law (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Boutet, Antoine, Kazdam, Zakaria El, Magnana, Lucas, Zimmermann, Helain

Anonymization by Design of Language Modeling

arXiv.org Artificial IntelligenceJan-4-2025

However, these advances Johnson et al. proposed to use a neural network based on a BERT raise significant privacy concerns, especially when models architecture [15] to detect a number of identifying elements in medical specialized on sensitive data can memorize and then expose and documents. More recently, different hospitals have also explored regurgitate confidential information. This paper presents a privacyby-design the feasibility of using NLP models to automatically pseudonymize language modeling approach to address the problem text documents (i.e., hide specific direct identifiers named Personally of language models anonymization, and thus promote their sharing. Identifiable Information (PII)) from their clinical data warehouse Specifically, we propose both a Masking Language Modeling [35, 45]. In these approaches, the BERT model is fine-tuned (MLM) methodology to specialize a BERT-like language model, and with the medical reports from the hospital (in order to specialize and a Causal Language Modeling (CLM) methodology to specialize a well understand the reports generated by the local practitioners) GPT-like model that avoids the model from memorizing direct and before training a Named Entity Recognition on a set of Personally indirect identifying information present in the training data. We Identifiable Information that directly identify patients.

artificial intelligence, machine learning, natural language, (17 more...)

2501.02407

Country: Europe > France (0.28)

Genre: Research Report > Experimental Study (0.34)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine > Health Care Technology > Medical Record (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Park, Sanghyun, Maciejovsky, Boris, Puranam, Phanish

Thinking with Many Minds: Using Large Language Models for Multi-Perspective Problem-Solving

arXiv.org Artificial IntelligenceJan-4-2025

Complex problem-solving requires cognitive flexibility--the capacity to entertain multiple perspectives while preserving their distinctiveness. This flexibility replicates the "wisdom of crowds" within a single individual, allowing them to "think with many minds." While mental simulation enables imagined deliberation, cognitive constraints limit its effectiveness. We propose synthetic deliberation, a Large Language Model (LLM)-based method that simulates discourse between agents embodying diverse perspectives, as a solution. Using a custom GPT-based model, we showcase its benefits: concurrent processing of multiple viewpoints without cognitive degradation, parallel exploration of perspectives, and precise control over viewpoint synthesis. By externalizing the deliberative process and distributing cognitive labor between parallel search and integration, synthetic deliberation transcends mental simulation's limitations. This approach shows promise for strategic planning, policymaking, and conflict resolution.

deliberation, large language model, natural language, (17 more...)

2501.02348

Country:

North America > United States (0.28)
Europe > United Kingdom > England (0.28)

Genre:

Research Report > Promising Solution (0.46)
Research Report > New Finding (0.46)

Industry:

Health & Medicine (1.00)
Education (1.00)
Law (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

arXiv.org Machine LearningJan-4-2025

A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models

Cai, Yinpeng, Li, Lexin, Zhang, Linjun

Large Language Models (LLMs) are rapidly gaining enormous popularity in recent years. However, the training of LLMs has raised significant privacy and legal concerns, particularly regarding the inclusion of copyrighted materials in their training data without proper attribution or licensing, which falls under the broader issue of data misappropriation. In this article, we focus on a specific problem of data misappropriation detection, namely, to determine whether a given LLM has incorporated data generated by another LLM. To address this issue, we propose embedding watermarks into the copyrighted training data and formulating the detection of data misappropriation as a hypothesis testing problem. We develop a general statistical testing framework, construct a pivotal statistic, determine the optimal rejection threshold, and explicitly control the type I and type II errors. Furthermore, we establish the asymptotic optimality properties of the proposed tests, and demonstrate its empirical effectiveness through intensive numerical experiments.

large language model, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

2501.02441

Country: North America > United States (0.67)

Genre: Research Report (0.64)

Industry:

Information Technology > Security & Privacy (1.00)
Law (0.87)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

The GuardianJan-3-2025, 16:14:13 GMT

Apple to pay 95m to settle claims Siri listened to users' private conversations

Apple has agreed to pay 95m in cash to settle a proposed class-action lawsuit claiming that its voice-activated assistant Siri violated users' privacy, listening to them without their consent. A preliminary settlement was filed on Tuesday night in the Oakland, California, federal court, and requires approval by US district judge Jeffrey White. Voice assistants typically react when people use "hot words" such as "Hey, Siri". Two plaintiffs said their mentions of Air Jordan sneakers and Olive Garden restaurants triggered ads for those products. Another said he was served ads for a brand name surgical treatment after discussing it, he thought privately, with his doctor.

apple, private conversation, siri, (8 more...)

The Guardian

Country:

North America > United States > California > Alameda County > Oakland (0.26)
North America > United States > California > Santa Clara County > San Jose (0.06)
North America > United States > California > Santa Clara County > Cupertino (0.06)

Industry: Law > Litigation (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)