AITopics

2508.08278

Country:

Asia > China (0.46)
Oceania > Australia (0.28)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology > Security & Privacy (0.54)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.88)

The GuardianAug-12-2025, 13:57:22 GMT

In the time of tariffs, Nvidia and AMD cut unusual deals with Trump

My Spotify playlists are undergoing a British invasion this week. Donald Trump announced this week that two US chipmakers would tithe 15% of their revenue from sales in China to the US government. Paying for the license to sell to Chinese customers represents an unprecedented deal. The chipmakers Nvidia and AMD have agreed to give the US government 15% of their revenue from advanced chips sold to China in return for export licences to the key market. The arrangement will lead to Nvidia giving 15% of its revenue from Chinese sales of its H20 chips, and AMD giving 15% of revenue from Chinese sales of its MI308 chips, according to reports citing US officials.

nvidia and amd, openai, trump, (14 more...)

The Guardian

Country:

Asia > China (0.47)
Oceania > Australia > Northern Territory (0.05)
North America > United States > Michigan (0.05)
(4 more...)

Industry:

Information Technology (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.47)

WIREDAug-12-2025, 10:30:00 GMT

Apple's AI Ambitions Leave Big Questions Over Its Climate Goals

Apple's AI Ambitions Leave Big Questions Over Its Climate Goals Halfway to its 2030 net-zero goal, Apple faces slow and hold-out suppliers, a tariffs scramble, and an AI race that could profoundly impact eco-friendly ambitions. Here's a simple question: Is the current top iPhone better for the environment than the top iPhone was five years ago? Let's take the iPhone Pro series. If we're looking at recycled and renewable materials, it's an easy yes. Compare the iPhone 11 Pro, released in September 2019, with the iPhone 16 Pro, released in September 2024, and there has been good progress--from a few smaller components and packaging to now at more than 25 percent of the whole phone.

apple, emission, supplier, (13 more...)

WIRED

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > United Kingdom (0.14)
Asia > Taiwan (0.05)
(10 more...)

Industry:

Law > Environmental Law (1.00)
Information Technology > Services (1.00)
Energy > Renewable (1.00)

Technology:

Information Technology > Communications > Mobile (1.00)
Information Technology > Cloud Computing (0.95)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.48)

BBC NewsAug-12-2025, 08:12:06 GMT

Charges dropped against teen pilot detained in Antarctica

Charges against an American influencer and teen pilot who has been stranded on a remote island in the Antarctic since June have been dropped. Ethan Guo, 19, is alleged to have illegally landed his plane in Chilean territory after embarking on a solo trip to all seven continents to raise money for cancer research, according to local authorities. They accused him of providing false flight plan information to officials who detained him and opened an investigation. A judge has ordered him to leave the area, pay a $30,000 (£22,332) donation to a children's cancer foundation and is banned from re-entering Chilean territory for three years. Mr Guo made headlines last year when he began an attempt to become the youngest person to fly solo to all seven continents and collect donations for research into childhood cancer.

antarctica, island, teen pilot, (15 more...)

BBC News

Country:

Antarctica (0.46)
North America > Central America (0.16)
South America > Chile > Magallanes Region > Magallanes Province > Punta Arenas (0.06)
(13 more...)

Industry: Health & Medicine > Therapeutic Area > Oncology > Childhood Cancer (0.98)

Technology: Information Technology > Artificial Intelligence (0.36)

WSM: Decay-Free Learning Rate Schedule via Checkpoint Merging for LLM Pre-training

Tian, Changxin, Wang, Jiapeng, Zhao, Qian, Chen, Kunlong, Liu, Jia, Liu, Ziqi, Mao, Jiaxin, Zhao, Wayne Xin, Zhang, Zhiqiang, Zhou, Jun

Recent advances in learning rate (LR) scheduling have demonstrated the effectiveness of decay-free approaches that eliminate the traditional decay phase while maintaining competitive performance. Model merging techniques have emerged as particularly promising solutions in this domain. We present Warmup-Stable and Merge (WSM), a general framework that establishes a formal connection between learning rate decay and model merging. WSM provides a unified theoretical foundation for emulating various decay strategies-including cosine decay, linear decay and inverse square root decay-as principled model averaging schemes, while remaining fully compatible with diverse optimization methods. Through extensive experiments, we identify merge duration-the training window for checkpoint aggregation-as the most critical factor influencing model performance, surpassing the importance of both checkpoint interval and merge quantity. Our framework consistently outperforms the widely-adopted Warmup-Stable-Decay (WSD) approach across multiple benchmarks, achieving significant improvements of +3.5% on MATH, +2.9% on HumanEval, and +5.5% on MMLU-Pro. The performance advantages extend to supervised fine-tuning scenarios, highlighting WSM's potential for long-term model refinement.

large language model, machine learning, natural language, (20 more...)

2507.17634

Country:

Europe > Austria > Vienna (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(16 more...)

Genre: Research Report > New Finding (1.00)

Industry: Education (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Yang, Jinghan, Weng, Jiayu

Detecting Mislabeled and Corrupted Data via Pointwise Mutual Information

arXiv.org Machine LearningAug-12-2025

Deep neural networks can memorize corrupted labels, making data quality critical for model performance, yet real-world datasets are frequently compromised by both label noise and input noise. This paper proposes a mutual information-based framework for data selection under hybrid noise scenarios that quantifies statistical dependencies between inputs and labels. We compute each sample's pointwise contribution to the overall mutual information and find that lower contributions indicate noisy or mislabeled instances. Empirical validation on MNIST with different synthetic noise settings demonstrates that the method effectively filters low-quality samples. Under label corruption, training on high-MI samples improves classification accuracy by up to 15\% compared to random sampling. Furthermore, the method exhibits robustness to benign input modifications, preserving semantically valid data while filtering truly corrupted samples.

artificial intelligence, machine learning, noise, (17 more...)

arXiv.org Machine Learning

2508.07713

Country:

Oceania > New Zealand (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

arXiv.org Machine LearningAug-12-2025

FairDRL-ST: Disentangled Representation Learning for Fair Spatio-Temporal Mobility Prediction

Zhao, Sichen, Shao, Wei, Chan, Jeffrey, Xu, Ziqi, Salim, Flora

As deep spatio-temporal neural networks are increasingly utilised in urban computing contexts, the deployment of such methods can have a direct impact on users of critical urban infrastructure, such as public transport, emergency services, and traffic management systems. While many spatio-temporal methods focus on improving accuracy, fairness has recently gained attention due to growing evidence that biased predictions in spatio-temporal applications can disproportionately disadvantage certain demographic or geographic groups, thereby reinforcing existing socioeconomic inequalities and undermining the ethical deployment of AI in public services. In this paper, we propose a novel framework, FairDRL-ST, based on disentangled representation learning, to address fairness concerns in spatio-temporal prediction, with a particular focus on mobility demand forecasting. By leveraging adversarial learning and disentangled representation learning, our framework learns to separate attributes that contain sensitive information. Unlike existing methods that enforce fairness through supervised learning, which may lead to overcompensation and degraded performance, our framework achieves fairness in an unsupervised manner with minimal performance loss. We apply our framework to real-world urban mobility datasets and demonstrate its ability to close fairness gaps while delivering competitive predictive performance compared to state-of-the-art fairness-aware methods.

artificial intelligence, data mining, machine learning, (15 more...)

arXiv.org Machine Learning

2508.07518

Country:

Oceania > Australia > Victoria > Melbourne (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Asia (0.04)
(4 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Information Technology > Security & Privacy (0.35)
Transportation > Infrastructure & Services (0.34)
Health & Medicine > Health Care Providers & Services (0.34)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Openja, Moses, Arcaini, Paolo, Khomh, Foutse, Ishikawa, Fuyuki

FairFLRep: Fairness aware fault localization and repair of Deep Neural Networks

Deep neural networks (DNNs) are being utilized in various aspects of our daily lives, including high-stakes decision-making applications that impact individuals. However, these systems reflect and amplify bias from the data used during training and testing, potentially resulting in biased behavior and inaccurate decisions. For instance, having different misclassification rates between white and black sub-populations. However, effectively and efficiently identifying and correcting biased behavior in DNNs is a challenge. This paper introduces FairFLRep, an automated fairness-aware fault localization and repair technique that identifies and corrects potentially bias-inducing neurons in DNN classifiers. FairFLRep focuses on adjusting neuron weights associated with sensitive attributes, such as race or gender, that contribute to unfair decisions. By analyzing the input-output relationships within the network, FairFLRep corrects neurons responsible for disparities in predictive quality parity. We evaluate FairFLRep on four image classification datasets using two DNN classifiers, and four tabular datasets with a DNN model. The results show that FairFLRep consistently outperforms existing methods in improving fairness while preserving accuracy. An ablation study confirms the importance of considering fairness during both fault localization and repair stages. Our findings also show that FairFLRep is more efficient than the baseline approaches in repairing the network.

artificial intelligence, fairflrep, machine learning, (18 more...)

2508.08151

Country:

Europe (1.00)
Asia (1.00)
Oceania > Australia (0.67)
North America > United States > California (0.67)

Genre: Research Report > New Finding (1.00)

Industry:

Law (1.00)
Health & Medicine (1.00)
Education (0.67)
Government > Regional Government (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

BlindGuard: Safeguarding LLM-based Multi-Agent Systems under Unknown Attacks

Miao, Rui, Liu, Yixin, Wang, Yili, Shen, Xu, Tan, Yue, Dai, Yiwei, Pan, Shirui, Wang, Xin

The security of LLM-based multi-agent systems (MAS) is critically threatened by propagation vulnerability, where malicious agents can distort collective decision-making through inter-agent message interactions. While existing supervised defense methods demonstrate promising performance, they may be impractical in real-world scenarios due to their heavy reliance on labeled malicious agents to train a supervised malicious detection model. To enable practical and generalizable MAS defenses, in this paper, we propose BlindGuard, an unsupervised defense method that learns without requiring any attack-specific labels or prior knowledge of malicious behaviors. To this end, we establish a hierarchical agent encoder to capture individual, neighborhood, and global interaction patterns of each agent, providing a comprehensive understanding for malicious agent detection. Meanwhile, we design a corruption-guided detector that consists of directional noise injection and contrastive learning, allowing effective detection model training solely on normal agent behaviors. Extensive experiments show that BlindGuard effectively detects diverse attack types (i.e., prompt injection, memory poisoning, and tool attack) across MAS with various communication patterns while maintaining superior generalizability compared to supervised baselines. The code is available at: https://github.com/MR9812/BlindGuard.

artificial intelligence, deep learning, machine learning, (18 more...)

2508.08127

Country: Oceania > Australia (0.28)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Robust Anomaly Detection in O-RAN: Leveraging LLMs against Data Manipulation Attacks

Dayaratne, Thusitha, Pham, Ngoc Duy, Vo, Viet, Lai, Shangqi, Abuadbba, Sharif, Suzuki, Hajime, Yuan, Xingliang, Rudolph, Carsten

The introduction of 5G and the Open Radio Access Network (O-RAN) architecture has enabled more flexible and intelligent network deployments. However, the increased complexity and openness of these architectures also introduce novel security challenges, such as data manipulation attacks on the semi-standardised Shared Data Layer (SDL) within the O-RAN platform through malicious xApps. In particular, malicious xApps can exploit this vulnerability by introducing subtle Unicode-wise alterations (hypoglyphs) into the data that are being used by traditional machine learning (ML)-based anomaly detection methods. These Unicode-wise manipulations can potentially bypass detection and cause failures in anomaly detection systems based on traditional ML, such as AutoEncoders, which are unable to process hypoglyphed data without crashing. We investigate the use of Large Language Models (LLMs) for anomaly detection within the O-RAN architecture to address this challenge. We demonstrate that LLM-based xApps maintain robust operational performance and are capable of processing manipulated messages without crashing. While initial detection accuracy requires further improvements, our results highlight the robustness of LLMs to adversarial attacks such as hypoglyphs in input data. There is potential to use their adaptability through prompt engineering to further improve the accuracy, although this requires further research. Additionally, we show that LLMs achieve low detection latency (under 0.07 seconds), making them suitable for Near-Real-Time (Near-RT) RIC deployments.

data mining, large language model, machine learning, (18 more...)

2508.08029

Country: Oceania > Australia (0.31)

Genre: Research Report > New Finding (0.89)

Industry:

Information Technology > Security & Privacy (1.00)
Government (1.00)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)