AITopics | guangzhou

Collaborating Authors

guangzhou

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Efficient Federated Conformal Prediction with Group-Conditional Guarantees

Wen, Haifeng, Simeone, Osvaldo, Xing, Hong

arXiv.org Machine LearningMar-18-2026

Deploying trustworthy AI systems requires principled uncertainty quantification. Conformal prediction (CP) is a widely used framework for constructing prediction sets with distribution-free coverage guarantees. In many practical settings, including healthcare, finance, and mobile sensing, the calibration data required for CP are distributed across multiple clients, each with its own local data distribution. In this federated setting, data can often be partitioned into, potentially overlapping, groups, which may reflect client-specific strata or cross-cutting attributes such as demographic or semantic categories. We propose group-conditional federated conformal prediction (GC-FCP), a novel protocol that provides group-conditional coverage guarantees. GC-FCP constructs mergeable, group-stratified coresets from local calibration scores, enabling clients to communicate compact weighted summaries that support efficient aggregation and calibration at the server. Experiments on synthetic and real-world datasets validate the performance of GC-FCP compared to centralized calibration baselines.

artificial intelligence, gc-fcp, machine learning, (17 more...)

arXiv.org Machine Learning

2603.14198

Country:

North America > Canada > Ontario > Toronto (0.14)
Asia > China > Guangdong Province > Guangzhou (0.05)
Europe > United Kingdom > England > Greater London > London (0.04)
North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)

Genre: Research Report (0.40)

Industry: Health & Medicine (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.68)

Add feedback

Source Coverage and Citation Bias in LLM-based vs. Traditional Search Engines

Zhang, Peixian, Ye, Qiming, Peng, Zifan, Garimella, Kiran, Tyson, Gareth

arXiv.org Artificial IntelligenceDec-11-2025

LLM-based Search Engines (LLM-SEs) introduces a new paradigm for information seeking. Unlike Traditional Search Engines (TSEs) (e.g., Google), these systems summarize results, often providing limited citation transparency. The implications of this shift remain largely unexplored, yet raises key questions regarding trust and transparency. In this paper, we present a large-scale empirical study of LLM-SEs, analyzing 55,936 queries and the corresponding search results across six LLM-SEs and two TSEs. We confirm that LLM-SEs cites domain resources with greater diversity than TSEs. Indeed, 37% of domains are unique to LLM-SEs. However, certain risks still persist: LLM-SEs do not outperform TSEs in credibility, political neutrality and safety metrics. Finally, to understand the selection criteria of LLM-SEs, we perform a feature-based analysis to identify key factors influencing source choice. Our findings provide actionable insights for end users, website owners, and developers.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2512.09483

Country:

North America > United States (1.00)
Asia (0.68)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.93)

Industry:

Information Technology > Security & Privacy (1.00)
Media > News (0.67)
Government > Regional Government (0.67)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

PresentCoach: Dual-Agent Presentation Coaching through Exemplars and Interactive Feedback

Chen, Sirui, Zhou, Jinsong, Xu, Xinli, Yang, Xiaoyu, Guo, Litao, Chen, Ying-Cong

arXiv.org Artificial IntelligenceNov-25-2025

Effective presentation skills are essential in education, professional communication, and public speaking, yet learners often lack access to high-quality exemplars or personalized coaching. Existing AI tools typically provide isolated functionalities such as speech scoring or script generation without integrating reference modeling and interactive feedback into a cohesive learning experience. We introduce a dual-agent system that supports presentation practice through two complementary roles: the Ideal Presentation Agent and the Coach Agent. The Ideal Presentation Agent converts user-provided slides into model presentation videos by combining slide processing, visual-language analysis, narration script generation, personalized voice synthesis, and synchronized video assembly. The Coach Agent then evaluates user-recorded presentations against these exemplars, conducting multimodal speech analysis and delivering structured feedback in an Observation-Impact-Suggestion (OIS) format. To enhance the authenticity of the learning experience, the Coach Agent incorporates an Audience Agent, which simulates the perspective of a human listener and provides humanized feedback reflecting audience reactions and engagement. Together, these agents form a closed loop of observation, practice, and feedback. Implemented on a robust backend with multi-model integration, voice cloning, and error handling mechanisms, the system demonstrates how AI-driven agents can provide engaging, human-centered, and scalable support for presentation skill development in both educational and professional contexts.

large language model, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

2511.15253

Country: Asia > China > Guangdong Province (0.15)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.68)

Industry:

Education (1.00)
Health & Medicine > Therapeutic Area (0.47)
Information Technology > Security & Privacy (0.35)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)

Add feedback

PC-UNet: An Enforcing Poisson Statistics U-Net for Positron Emission Tomography Denoising

Shi, Yang, Wang, Jingchao, Lu, Liangsi, Huang, Mingxuan, He, Ruixin, Xie, Yifeng, Liu, Hanqian, Guo, Minzhe, Liang, Yangyang, Zhang, Weipeng, Li, Zimeng, Chen, Xuhang

arXiv.org Artificial IntelligenceOct-20-2025

Positron Emission Tomography (PET) is crucial in medicine, but its clinical use is limited due to high signal-to-noise ratio doses increasing radiation exposure. Lowering doses increases Poisson noise, which current denoising methods fail to handle, causing distortions and artifacts. We propose a Poisson Consistent U-Net (PC-UNet) model with a new Poisson Variance and Mean Consistency Loss (PVMC-Loss) that incorporates physical data to improve image fidelity. PVMC-Loss is statistically unbiased in variance and gradient adaptation, acting as a Generalized Method of Moments implementation, offering robustness to minor data mismatches. Tests on PET datasets show PC-UNet improves physical consistency and image fidelity, proving its ability to integrate physical information effectively.

artificial intelligence, deep learning, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2510.14995

Country: Asia > China > Guangdong Province (0.29)

Genre: Research Report (0.64)

Industry:

Health & Medicine > Nuclear Medicine (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

MonoGlass3D: Monocular 3D Glass Detection with Plane Regression and Adaptive Feature Fusion

Zhang, Kai, Zhao, Guoyang, Shi, Jianxing, Liu, Bonan, Qi, Weiqing, Ma, Jun

arXiv.org Artificial IntelligenceSep-9-2025

Detecting and localizing glass in 3D environments poses significant challenges for visual perception systems, as the optical properties of glass often hinder conventional sensors from accurately distinguishing glass surfaces. The lack of real-world datasets focused on glass objects further impedes progress in this field. To address this issue, we introduce a new dataset featuring a wide range of glass configurations with precise 3D annotations, collected from distinct real-world scenarios. On the basis of this dataset, we propose MonoGlass3D, a novel approach tailored for monocular 3D glass detection across diverse environments. To overcome the challenges posed by the ambiguous appearance and context diversity of glass, we propose an adaptive feature fusion module that empowers the network to effectively capture contextual information in varying conditions. Additionally, to exploit the distinct planar geometry of glass surfaces, we present a plane regression pipeline, which enables seamless integration of geometric properties within our framework. Extensive experiments demonstrate that our method outperforms state-of-the-art approaches in both glass segmentation and monocular glass depth estimation. Our results highlight the advantages of combining geometric and contextual cues for transparent surface understanding.

artificial intelligence, glass surface, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2509.05599

Country:

Asia > China (0.72)
North America > United States > California (0.68)

Genre:

Research Report > Promising Solution (0.68)
Overview > Innovation (0.54)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

AIhub monthly digest: August 2025 – causality and generative modelling, responsible multimodal AI, and IJCAI in Montréal and Guangzhou

AIHubAug-29-2025, 09:06:06 GMT

Welcome to our monthly digest, where you can catch up with any AIhub stories you may have missed, peruse the latest news, recap recent events, and more. This month, we dive into the world of agents, learn about responsible multimodal AI, apply generative AI to computer networks, and dig into the RoboCup@Work League. This month, Sanmay Das, Tom Dietterich, Sabine Hauert, Sarit Kraus, and Michael Littman tackled the topic of agentic AI, discussing recent developments, and lessons learned from the decades of research in the autonomous agents and multiagent systems community. The 34th International Joint Conference on Artificial Intelligence (IJCAI2025) took place in Montréal from 16-22 August, with a satellite event currently being held (from 29-31 August) in Guangzhou, China. You can find out more about the programmes of both venues here, and get a flavour of what attendees got up to in our social media round-ups: Part one Part two.

artificial intelligence, monthly digest, responsible multimodal ai, (15 more...)

AIHub

Country:

North America > Canada > Quebec > Montreal (0.62)
Asia > China > Guangdong Province > Guangzhou (0.62)
South America > Brazil > Bahia > Salvador (0.06)
North America > United States > Arkansas (0.06)

Genre: Personal (0.34)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback

Text2Weight: Bridging Natural Language and Neural Network Weight Spaces

Tian, Bowen, Chen, Wenshuo, Li, Zexi, Lai, Songning, Wu, Jiemin, Yue, Yutao

arXiv.org Artificial IntelligenceAug-20-2025

How far are we really from automatically generating neural networks? While neural network weight generation shows promise, current approaches struggle with generalization to unseen tasks and practical application exploration. To address this, we propose T2W, a diffusion transformer framework that generates task-specific weights conditioned on natural language descriptions. T2W hierarchically processes network parameters into uniform blocks, integrates text embeddings from CLIP via a prior attention mechanism, and employs adversarial training with weight-space augmentation to enhance generalization. Experiments on Cifar100, Caltech256, and TinyImageNet demonstrate T2W's ability to produce high-quality weights for unseen tasks, outperforming optimization-based initialization and enabling novel applications such as weight enhancement and text-guided model fusion. Our work bridges textual semantics with weight-space dynamics, supported by an open-source dataset of text-weight pairs, advancing the practicality of generative models in neural network parameter synthesis. Our code is available on Github.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2508.13633

Country:

Asia > China (0.48)
Europe (0.29)

Genre:

Research Report (0.50)
Overview (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Mini-Game Lifetime Value Prediction in WeChat

Chen, Aochuan, Niu, Yifan, Gao, Ziqi, Sun, Yujie, Liu, Shoujun, Chen, Gong, Liu, Yang, Li, Jia

arXiv.org Artificial IntelligenceAug-14-2025

The LifeTime Value (LTV) prediction, which endeavors to forecast the cumulative purchase contribution of a user to a particular item, remains a vital challenge that advertisers are keen to resolve. A precise LTV prediction system enhances the alignment of user interests with meticulously designed advertisements, thereby generating substantial profits for advertisers. Nonetheless, this issue is complicated by the paucity of data typically observed in real-world advertising scenarios. The purchase rate among registered users is often as critically low as 0.1%, resulting in a dataset where the majority of users make only several purchases. Consequently, there is insufficient supervisory signal for effectively training the LTV prediction model. An additional challenge emerges from the interdependencies among tasks with high correlation. It is a common practice to estimate a user's contribution to a game over a specified temporal interval. Varying the lengths of these intervals corresponds to distinct predictive tasks, which are highly correlated. For instance, predictions over a 7-day period are heavily reliant on forecasts made over a 3-day period, where exceptional cases can adversely affect the accuracy of both tasks. In order to comprehensively address the aforementioned challenges, we introduce an innovative framework denoted as Graph-Represented Pareto-Optimal LifeTime Value prediction (GRePO-LTV). Graph representation learning is initially employed to address the issue of data scarcity. Subsequently, Pareto-Optimization is utilized to manage the interdependence of prediction tasks.

data mining, machine learning, prediction, (19 more...)

arXiv.org Artificial Intelligence

2506.11037

Country:

North America (1.00)
Asia > China > Guangdong Province (0.30)

Genre: Research Report (1.00)

Industry:

Marketing (1.00)
Information Technology > Services (0.83)
Leisure & Entertainment > Games > Computer Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Data Science > Data Mining (0.95)
Information Technology > Game Theory (0.88)
(2 more...)

Add feedback

What's coming up at #IJCAI2025?

AIHubAug-13-2025, 15:40:41 GMT

The IJCAI-25 logo and theme photo (cropped). The 34rd International Joint Conference on Artificial Intelligence (IJCAI-25) will be held in Montréal, Canada from 16-22 August. The programme will feature keynote talks, tutorials, workshops, competitions, and oral and poster presentations. There will also be four special tracks, focussing on: AI for social good, AI and arts, human-centred AI, and AI enabling critical technologies. An exciting addition this year is the satellite event, to be held in Guangzhou, China, from 29-31 August.

guangzhou, ijcai2025, tutorial, (3 more...)

AIHub

Country:

North America > Canada > Quebec > Montreal (0.48)
Asia > China > Guangdong Province > Guangzhou (0.48)

Genre: Instructional Material > Course Syllabus & Notes (0.65)

Industry: Social Sector (0.65)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

A Physics-informed End-to-End Occupancy Framework for Motion Planning of Autonomous Vehicles

Shen, Shuqi, Yang, Junjie, Lu, Hongliang, Zhong, Hui, Zhang, Qiming, Zheng, Xinhu

arXiv.org Artificial IntelligenceJun-9-2025

Accurate and interpretable motion planning is essential for autonomous vehicles (AVs) navigating complex and uncertain environments. While recent end-to-end occupancy prediction methods have improved environmental understanding, they typically lack explicit physical constraints, limiting safety and generalization. In this paper, we propose a unified end-to-end framework that integrates verifiable physical rules into the occupancy learning process. Specifically, we embed artificial potential fields (APF) as physics-informed guidance during network training to ensure that predicted occupancy maps are both data-efficient and physically plausible. Our architecture combines convolutional and recurrent neural networks to capture spatial and temporal dependencies while preserving model flexibility. Experimental results demonstrate that our method improves task completion rate, safety margins, and planning efficiency across diverse driving scenarios, confirming its potential for reliable deployment in real-world AV systems.

artificial intelligence, machine learning, physical rule, (18 more...)

arXiv.org Artificial Intelligence

2505.07855

Country: Asia > China (0.32)

Genre: Research Report > New Finding (0.88)

Industry: Transportation > Ground > Road (0.31)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback