AITopics | Oceania

Oceania

Meta-Inverse Reinforcement Learning for Mean Field Games via Probabilistic Context Variables

Chen, Yang, Lin, Xiao, Yan, Bo, Zhang, Libo, Liu, Jiamou, Tan, Neset Özkan, Witbrock, Michael

arXiv.org Artificial Intelligence, Sep-5-2025

Designing suitable reward functions for numerous interacting intelligent agents is challenging in real-world applications. Inverse reinforcement learning (IRL) in mean field games (MFGs) offers a practical framework to infer reward functions from expert demonstrations. While promising, the assumption of agent homogeneity limits the capability of existing methods to handle demonstrations with heterogeneous and unknown objectives, which are common in practice. To this end, we propose a deep latent variable MFG model and an associated IRL method. Critically, our method can infer rewards from different yet structurally similar tasks without prior knowledge about underlying contexts or modifying the MFG model itself. Our experiments, conducted on simulated scenarios and a real-world spatial taxi-ride pricing problem, demonstrate the superiority of our approach over state-of-the-art IRL methods in MFGs.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

2509.03845

Country: Oceania > New Zealand (0.14)

Genre: Research Report (1.00)

Industry:

Transportation > Passenger (0.93)
Health & Medicine > Therapeutic Area > Immunology (0.68)
Transportation > Ground > Road (0.66)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Designing Gaze Analytics for ELA Instruction: A User-Centered Dashboard with Conversational AI Support

Davalos, Eduardo, Zhang, Yike, Jain, Shruti, Srivastava, Namrata, Truong, Trieu, Haque, Nafees-ul, Van, Tristan, Salas, Jorge, McFadden, Sara, Cho, Sun-Joo, Biswas, Gautam, Goodwin, Amanda

arXiv.org Artificial Intelligence, Sep-5-2025

Eye-tracking offers rich insights into student cognition and engagement, but remains underutilized in classroom-facing educational technology due to challenges in data interpretation and accessibility. In this paper, we present the iterative design and evaluation of a gaze-based learning analytics dashboard for English Language Arts (ELA), developed through five studies involving teachers and students. Guided by user-centered design and data storytelling principles, we explored how gaze data can support reflection, formative assessment, and instructional decision-making. Our findings demonstrate that gaze analytics can be approachable and pedagogically valuable when supported by familiar visualizations, layered explanations, and narrative scaffolds. We further show how a conversational agent, powered by a large language model (LLM), can lower cognitive barriers to interpreting gaze data by enabling natural language interactions with multimodal learning analytics. We conclude with design implications for future EdTech systems that aim to integrate novel data modalities in classroom contexts.

artificial intelligence, large language model, natural language, (17 more...)

arXiv.org Artificial Intelligence

2509.03741

Country:

North America > United States (1.00)
Europe (1.00)
Oceania > Australia (0.67)

Genre:

Research Report > New Finding (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (0.67)
Education > Educational Setting > Online (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.91)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.68)

The distribution of calibrated likelihood functions on the probability-likelihood Aitchison simplex

Noé, Paul-Gauthier, Nautsch, Andreas, Matrouf, Driss, Bousquet, Pierre-Michel, Bonastre, Jean-François

arXiv.org Machine Learning, Sep-4-2025

While calibration of probabilistic predictions has been widely studied, this paper instead addresses calibration of likelihood functions. This has been discussed, especially in biometrics, in cases with only two exhaustive and mutually exclusive hypotheses (classes) where likelihood functions can be written as log-likelihood-ratios (LLRs). After defining calibration for LLRs and its connection with the concept of weight-of-evidence, we present the idempotence property and its associated constraint on the distribution of the LLRs. Although these results have been known for decades, they have been limited to the binary case. Here, we extend them to cases with more than two hypotheses by using the Aitchison geometry of the simplex, which allows us to recover, in vector form, the additive form of Bayes' rule, thereby extending the LLR and the weight-of-evidence to any number of hypotheses. In particular, we extend the definition of calibration, the idempotence property, and the constraint on the distribution of likelihood functions to the multi-hypothesis, multiclass counterpart of the LLR: the isometric-log-ratio transformed likelihood function. This work is mainly conceptual, but we still provide one application to machine learning by presenting a non-linear discriminant analysis where the discriminant components form a calibrated likelihood function over the classes, thereby improving the interpretability and reliability of the method.
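
The binary-case idempotence property and the additive form of Bayes' rule mentioned in this abstract can be illustrated with a small sketch. It is not taken from the paper: it assumes, purely for illustration, that the LLR is Gaussian under each hypothesis with means `+mu` and `-mu` and a shared variance of `2 * mu`, a standard textbook case in which the LLR of the LLR equals the LLR itself. All function names are hypothetical.

```python
import math


def gaussian_logpdf(x, mean, var):
    """Log density of a univariate Gaussian with the given mean and variance."""
    return -0.5 * math.log(2 * math.pi * var) - (x - mean) ** 2 / (2 * var)


def llr_of_llr(llr, mu):
    """Log-likelihood-ratio of an observed LLR value.

    Assumption (illustrative only): the LLR is distributed N(+mu, 2*mu)
    under hypothesis H1 and N(-mu, 2*mu) under hypothesis H2.
    """
    var = 2 * mu
    return gaussian_logpdf(llr, mu, var) - gaussian_logpdf(llr, -mu, var)


def posterior_log_odds(llr, prior_log_odds):
    """Additive form of Bayes' rule for two hypotheses."""
    return llr + prior_log_odds


if __name__ == "__main__":
    # Idempotence: recomputing the LLR from the LLR value returns the same value.
    for value in (-3.0, 0.0, 1.5):
        print(value, llr_of_llr(value, mu=2.0))
```

Under the stated Gaussian assumption, the quadratic terms cancel and `llr_of_llr(l, mu)` reduces to `l` for any positive `mu`, which is the constraint on the LLR distribution in the binary case.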

artificial intelligence, likelihood function, machine learning, (19 more...)

arXiv.org Machine Learning

2509.03365

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Spain > Galicia > Madrid (0.04)
Oceania > New Zealand (0.04)
(4 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.88)

Can LLMs Lie? Investigation beyond Hallucination

Huan, Haoran, Prabhudesai, Mihir, Wu, Mengning, Jaiswal, Shantanu, Pathak, Deepak

arXiv.org Artificial Intelligence, Sep-4-2025

Large language models (LLMs) have demonstrated impressive capabilities across a variety of tasks, but their increasing autonomy in real-world applications raises concerns about their trustworthiness. While hallucinations (unintentional falsehoods) have been widely studied, the phenomenon of lying, where an LLM knowingly generates falsehoods to achieve an ulterior objective, remains underexplored. In this work, we systematically investigate the lying behavior of LLMs, differentiating it from hallucinations and testing it in practical scenarios. Through mechanistic interpretability techniques, we uncover the neural mechanisms underlying deception, employing logit lens analysis, causal interventions, and contrastive activation steering to identify and control deceptive behavior. We study real-world lying scenarios and introduce behavioral steering vectors that enable fine-grained manipulation of lying tendencies. Further, we explore the trade-offs between lying and end-task performance, establishing a Pareto frontier where dishonesty can enhance goal optimization. Our findings contribute to the broader discourse on AI ethics, shedding light on the risks and potential safeguards for deploying LLMs in high-stakes environments. Code and more illustrations are available at https://llm-liar.github.io/

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2509.03518

Country:

Europe (0.46)
Oceania > Australia (0.14)

Genre: Research Report > New Finding (0.34)

Industry: Health & Medicine > Therapeutic Area (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Lessons Learned from Deploying Adaptive Machine Learning Agents with Limited Data for Real-time Cell Culture Process Monitoring

Khuat, Thanh Tung, Peng, Johnny, Bassett, Robert, Otte, Ellen, Gabrys, Bogdan

arXiv.org Artificial Intelligence, Sep-4-2025

This study explores the deployment of three machine learning (ML) approaches for real-time prediction of glucose, lactate, and ammonium concentrations in cell culture processes, using Raman spectroscopy as input features. The research addresses challenges associated with limited data availability and process variability, providing a comparative analysis of pretrained models, just-in-time learning (JITL), and online learning algorithms. Two industrial case studies are presented to evaluate the impact of varying bioprocess conditions on model performance. The findings highlight the specific conditions under which pretrained models demonstrate superior predictive accuracy and identify scenarios where JITL or online learning approaches are more effective for adaptive process monitoring. This study also highlights the critical importance of updating the deployed models/agents with the latest offline analytical measurements during bioreactor operations to maintain the model performance against the changes in cell growth behaviours and operating conditions throughout the bioreactor run. Additionally, the study confirms the usefulness of a simple mixture-of-experts framework in achieving enhanced accuracy and robustness for real-time predictions of metabolite concentrations based on Raman spectral data. These insights contribute to the development of robust strategies for the efficient deployment of ML models in dynamic and changing biomanufacturing environments.

artificial intelligence, concentration, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2509.02606

Country: Oceania > Australia (0.28)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.34)

Toward Ubiquitous Operating Systems: Lessons from the Field

Communications of the ACM, Sep-3-2025, 15:00:40 GMT

application, operating system, ubiquitous operating system, (12 more...)

Communications of the ACM

Country:

Oceania > Australia > Victoria > Melbourne (0.04)
Asia > China (0.04)

Genre: Instructional Material > Course Syllabus & Notes (0.41)

Industry:

Health & Medicine (0.96)
Information Technology (0.95)

Technology:

Information Technology > Software (1.00)
Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Mobile (0.97)
Information Technology > Communications > Social Media (0.74)

Lawyer caught using AI-generated false citations in court case penalised in Australian first

The Guardian, Sep-3-2025, 03:44:44 GMT

A Victorian lawyer has become the first in Australia to face professional sanctions for using artificial intelligence in a court case, being stripped of his ability to practise as a principal lawyer after AI generated false citations that he had failed to verify. Guardian Australia reported in October last year that in a 19 July 2024 hearing, the anonymous solicitor representing a husband in a dispute between a married couple provided the court with a list of prior cases that had been requested by Justice Amanda Humphreys in relation to an enforcement application in the case. When Humphreys returned to her chambers, she said in a ruling that neither she nor her associates were able to identify the cases in the list. When the matter returned to court, the lawyer confirmed that the list had been prepared using legal software that utilised AI. He acknowledged he did not verify the accuracy of the information before submitting it to the court.

artificial intelligence, false citation, lawyer, (12 more...)

The Guardian

Country:

Oceania > Australia > Western Australia (0.05)
Oceania > Australia > New South Wales (0.05)

Industry:

Law > Litigation (0.72)
Government > Regional Government > Oceania Government > Australia Government (0.36)

Technology: Information Technology > Artificial Intelligence > Applied AI (1.00)

Semi-Supervised Bayesian GANs with Log-Signatures for Uncertainty-Aware Credit Card Fraud Detection

Hirnschall, David

arXiv.org Machine Learning, Sep-3-2025

We present a novel deep generative semi-supervised framework for credit card fraud detection, formulated as a time series classification task. As financial transaction data streams grow in scale and complexity, traditional methods often require large labeled datasets and struggle with time series of irregular sampling frequencies and varying sequence lengths. To address these challenges, we extend conditional Generative Adversarial Networks (GANs) for targeted data augmentation, integrate Bayesian inference to obtain predictive distributions and quantify uncertainty, and leverage log-signatures for robust feature encoding of transaction histories. We introduce a novel Wasserstein distance-based loss to align generated and real unlabeled samples while simultaneously maximizing classification accuracy on labeled data. Our approach is evaluated on the BankSim dataset, a widely used simulator for credit card transaction data, under varying proportions of labeled samples, demonstrating consistent improvements over benchmarks in both global statistical and domain-specific metrics. These findings highlight the effectiveness of GAN-driven semi-supervised learning with log-signatures for irregularly sampled time series and emphasize the importance of uncertainty-aware predictions.

artificial intelligence, machine learning, uncertainty-aware credit card fraud detection, (10 more...)

arXiv.org Machine Learning

2509.00931

Country:

Europe > Austria > Vienna (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
(11 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Law Enforcement & Public Safety > Fraud (1.00)
Banking & Finance > Credit (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

FBMS: An R Package for Flexible Bayesian Model Selection and Model Averaging

Frommlet, Florian, Lachmann, Jon, Storvik, Geir, Hubin, Aliaksandr

arXiv.org Machine Learning, Sep-3-2025

At its core, the FBMS package implements an efficient Mode Jumping Markov Chain Monte Carlo (MJMCMC) algorithm, designed to improve mixing in multi-modal posterior landscapes within Bayesian generalized linear models. In addition, it provides a genetically modified MJMCMC (GMJMCMC) algorithm that introduces nonlinear feature generation, thereby enabling the estimation of Bayesian generalized nonlinear models (BGNLMs). Within this framework, the algorithm maintains and updates populations of transformed features, computes their posterior probabilities, and evaluates the posteriors of models constructed from them. We demonstrate the effective use of FBMS for both inferential and predictive modeling in Gaussian regression, focusing on different instances of the BGNLM class of models. Furthermore, through a broad set of applications, we illustrate how the methodology can be extended to increasingly complex modeling scenarios, including other response distributions and mixed effect models.

artificial intelligence, machine learning, param, (15 more...)

arXiv.org Machine Learning

2509.00753

Country:

Africa > Zambia (0.04)
Europe > Norway > Eastern Norway > Oslo (0.04)
Oceania > Australia > Tasmania (0.04)
(2 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.67)

Industry:

Energy (0.67)
Health & Medicine > Therapeutic Area > Endocrinology (0.46)
Health & Medicine > Therapeutic Area > Oncology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Multiple LLM Agents Debate for Equitable Cultural Alignment

Ki, Dayeon, Rudinger, Rachel, Zhou, Tianyi, Carpuat, Marine

arXiv.org Artificial Intelligence, Sep-3-2025

Large Language Models (LLMs) need to adapt their predictions to diverse cultural contexts to benefit diverse communities across the world. While previous efforts have focused on single-LLM, single-turn approaches, we propose to exploit the complementary strengths of multiple LLMs to promote cultural adaptability. We introduce a Multi-Agent Debate framework, where two LLM-based agents debate over a cultural scenario and collaboratively reach a final decision. We propose two variants: one where the LLM agents exclusively debate and another where they dynamically choose between self-reflection and debate during their turns. We evaluate these approaches on 7 open-weight LLMs (and 21 LLM combinations) using the NormAd-ETI benchmark for social etiquette norms in 75 countries. Experiments show that debate improves both overall accuracy and cultural group parity over single-LLM baselines. Notably, multi-agent debate enables relatively small LLMs (7-9B) to achieve accuracies comparable to that of a much larger model (27B parameters).

accuracy, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2505.24671

Country:

South America (1.00)
North America > United States (1.00)
Europe (1.00)
(3 more...)

Genre: Research Report > New Finding (1.00)

Industry: Government > Regional Government > North America Government > United States Government (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
