AITopics

2508.02096

Country:

Asia > Japan (0.28)
Oceania > Australia > New South Wales > Sydney (0.14)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

The Guardian, Aug-6-2025, 09:58:20 GMT

Arts and media groups demand Labor take a stand against 'rampant theft' of Australian content to train AI

Arts, creative and media groups have demanded the government rule out allowing big tech companies to take Australian content to train their artificial intelligence models, with concerns such a shift would "sell out" Australian workers and lead to "rampant theft" of intellectual property. "It is not appropriate for big tech to steal the work of Australian artists, musicians, creators, news media, journalism, and use it for their own ends without paying for it," Ley said on Wednesday. In an interim report on "harnessing data and digital technology", the Productivity Commission set out proposals for how tech, including AI, could be regulated and treated in Australia, suggesting it could boost productivity by between 0.5% and 13% over the next decade, adding up to $116bn to Australia's GDP. The commission suggested several possible remedies, including expanding licensing schemes, or an exemption for "text and data mining" and expanding the existing fair dealing rules, which it said existed in other countries. The latter suggestion prompted fierce pushback from arts, creative and media companies, which raised alarm that their work could be left open for massively wealthy tech companies to use – without compensation or payment – to train AI models.

australia, australian content, rampant theft, (13 more...)

The Guardian

Country: Oceania > Australia (0.57)

Industry:

Media > News (0.76)
Law > Statutes (0.56)

Technology: Information Technology > Artificial Intelligence (1.00)

Clark, Katharine M., McNicholas, Paul D.

funOCLUST: Clustering Functional Data with Outliers

arXiv.org Machine Learning, Aug-6-2025

An extension of the OCLUST algorithm to the functional setting is proposed to address these issues. The approach leverages the OCLUST framework, creating a robust method to cluster curves and trim outliers. The methodology is evaluated on both simulated and real-world functional datasets, demonstrating strong performance in clustering and outlier identification.

artificial intelligence, data mining, machine learning, (19 more...)

2508.0011

Country:

Europe > Austria > Vienna (0.14)
North America > Canada > Ontario (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
(3 more...)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.96)
Information Technology > Data Science > Data Mining (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.69)

Daunas, Francisco, Esnaola, Iñaki, Perlaza, Samir M.

A Dual Optimization View to Empirical Risk Minimization with f-Divergence Regularization

arXiv.org Machine Learning, Aug-6-2025

The dual formulation of empirical risk minimization with f-divergence regularization (ERM-fDR) is introduced. The solution of the dual optimization problem to the ERM-fDR is connected to the notion of normalization function introduced as an implicit function. This dual approach leverages the Legendre-Fenchel transform and the implicit function theorem to provide a nonlinear ODE expression to the normalization function. Furthermore, the nonlinear ODE expression and its properties provide a computationally efficient method to calculate the normalization function of the ERM-fDR solution under a mild condition. Empirical risk minimization (ERM) [1]-[6] is often posed as an optimization problem regularized by a statistical distance between the probability measure to be optimized and a given reference measure [7]-[13].
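For orientation, a regularized ERM problem of the kind this abstract describes can be written as below. This is a sketch in assumed notation, not necessarily the paper's: $\mathsf{L}$ stands for the empirical risk of a model $\theta$, $Q$ for the reference measure, $\lambda > 0$ for the regularization factor, and $f$ for a convex function with $f(1) = 0$. The second expression is the standard definition of an f-divergence.

```latex
\min_{P \ll Q} \; \int \mathsf{L}(\theta)\,\mathrm{d}P(\theta) \;+\; \lambda\, D_f(P \,\|\, Q),
\qquad
D_f(P \,\|\, Q) \;=\; \int f\!\left(\frac{\mathrm{d}P}{\mathrm{d}Q}(\theta)\right)\mathrm{d}Q(\theta).
```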

artificial intelligence, machine learning, normalization function, (15 more...)

2508.03314

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Europe > France > Provence-Alpes-Côte d'Azur (0.04)
Oceania > French Polynesia (0.04)
(9 more...)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.56)

Pham, Ninh, Zheng, Yingtao, Phibbs, Hugo

Scalable Varied-Density Clustering via Graph Propagation

arXiv.org Artificial Intelligence, Aug-6-2025

We propose a novel perspective on varied-density clustering for high-dimensional data by framing it as a label propagation process in neighborhood graphs that adapt to local density variations. Our method formally connects density-based clustering with graph connectivity, enabling the use of efficient graph propagation techniques developed in network science. To ensure scalability, we introduce a density-aware neighborhood propagation algorithm and leverage advanced random projection methods to construct approximate neighborhood graphs. Our approach significantly reduces computational cost while preserving clustering quality. Empirically, it scales to datasets with millions of points in minutes and achieves competitive accuracy compared to existing baselines.

artificial intelligence, data mining, machine learning, (19 more...)

2508.02989

Country: Oceania > New Zealand (0.14)

Genre: Research Report (0.50)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Popular Science, Aug-5-2025, 15:30:35 GMT

13 World War II shipwrecks captured in stunning detail

Judging by newly released photos and video, the crew aboard Ocean Exploration Trust's Nautilus research vessel had an extremely productive summer trip to the South Pacific. Over 22 days, the team completed detailed archaeological surveys of more than a dozen shipwrecks sunk amid the Solomon Islands campaign during World War II. In addition to imaging four of them for the first time, experts guided remotely operated vehicles (ROVs) towards the rediscovery of two long-lost vessels: the separated bow from the USS New Orleans as well as the Imperial Japanese Naval destroyer Teruzuki. Although researchers originally spotted some of these shipwrecks more than 34 years ago, Ocean Exploration Trust president Robert Ballard explained that the most recent trip to Iron Bottom Sound provided opportunities to document their finds using a new generation of technology including high-definition survey cameras, underwater vehicles, and imaging tools aboard the EV Nautilus.

ocean exploration trust, stunning detail, world war ii shipwreck, (5 more...)

Popular Science

Country:

North America > United States > Louisiana > Orleans Parish > New Orleans (0.26)
Oceania > Solomon Islands > Guadalcanal Province > Guadalcanal Island > Honiara (0.06)
Oceania > Australia (0.06)
Asia > Japan (0.06)

Industry: Government > Military > Navy (0.74)

Technology: Information Technology > Artificial Intelligence (0.58)

Mendonça, Fábio, Mostafa, Sheikh Shanawaz, Freitas, Diogo, Morgado-Dias, Fernando, Ravelo-García, Antonio G.

Multiple Time Series Fusion Based on LSTM: An Application to CAP A Phase Classification Using EEG

arXiv.org Artificial Intelligence, Aug-5-2025

Biomedical decision making involves processing multiple signals, either from different sensors or from different channels. In both cases, information fusion plays a significant role. A deep learning based electroencephalogram channels' feature level fusion is carried out in this work for the electroencephalogram cyclic alternating pattern A phase classification. Channel selection, fusion, and classification procedures were optimized by two optimization algorithms, namely, Genetic Algorithm and Particle Swarm Optimization. The developed methodologies were evaluated by fusing the information from multiple electroencephalogram channels for patients with nocturnal frontal lobe epilepsy and patients without any neurological disorder, which was significantly more challenging when compared to other state-of-the-art works. Results showed that both optimization algorithms selected a comparable structure with similar feature level fusion, consisting of three electroencephalogram channels, which is in line with the CAP protocol to ensure multiple channels' arousals for CAP detection. Moreover, the two optimized models reached an area under the receiver operating characteristic curve of 0.82, with average accuracy ranging from 77% to 79%, a result which is in the upper range of the specialist agreement. The proposed approach is still in the upper range of the best state-of-the-art works despite a difficult dataset, and has the advantage of providing a fully automatic analysis without requiring any manual procedure. Ultimately, the models proved to be noise-resistant and resilient to multiple channel loss.

artificial intelligence, evolutionary algorithm, machine learning, (16 more...)

doi: 10.3390/ijerph191710892

2112.11218

Country:

Europe > Portugal > Madeira > Funchal (0.04)
North America > United States > California (0.04)
North America > United States > Massachusetts (0.04)
(19 more...)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

Braun, Marc, Peña, Jose M., Daoud, Adel

Flow IV: Counterfactual Inference In Nonseparable Outcome Models Using Instrumental Variables

arXiv.org Machine Learning, Aug-5-2025

To reach human level intelligence, learning algorithms need to incorporate causal reasoning. But identifying causality, and particularly counterfactual reasoning, remains an elusive task. In this paper, we make progress on this task by utilizing instrumental variables (IVs). IVs are a classic tool for mitigating bias from unobserved confounders when estimating causal effects. While IV methods have been extended to non-separable structural models at the population level, existing approaches to counterfactual prediction typically assume additive noise in the outcome. In this paper, we show that under standard IV assumptions, along with the assumptions that latent noises in treatment and outcome are strictly monotonic and jointly Gaussian, the treatment-outcome relationship becomes uniquely identifiable from observed data. This enables counterfactual inference even in non-separable models. We implement our approach by training a normalizing flow to maximize the likelihood of the observed data, demonstrating accurate recovery of the underlying outcome function. We call our method Flow IV.

artificial intelligence, assumption, machine learning, (15 more...)

2508.01321

Country:

Africa (0.14)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
(5 more...)

Genre:

Research Report > Strength High (0.46)
Research Report > New Finding (0.46)
Research Report > Experimental Study (0.46)

Industry: Government (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

arXiv.org Machine Learning, Aug-5-2025

Understanding the Essence: Delving into Annotator Prototype Learning for Multi-Class Annotation Aggregation

Chen, Ju, Feng, Jun, Zhang, Shenyu

Multi-class classification annotations have significantly advanced AI applications, with truth inference serving as a critical technique for aggregating noisy and biased annotations. Existing state-of-the-art methods typically model each annotator's expertise using a confusion matrix. However, these methods suffer from two widely recognized issues: 1) when most annotators label only a few tasks, or when classes are imbalanced, the estimated confusion matrices are unreliable, and 2) a single confusion matrix often remains inadequate for capturing each annotator's full expertise patterns across all tasks. To address these issues, we propose a novel confusion-matrix-based method, PTBCC (ProtoType learning-driven Bayesian Classifier Combination), to introduce a reliable and richer annotator estimation by prototype learning. Specifically, we assume that there exists a set $S$ of prototype confusion matrices, which capture the inherent expertise patterns of all annotators. Rather than a single confusion matrix, the expertise per annotator is extended as a Dirichlet prior distribution over these prototypes. This prototype learning-driven mechanism circumvents the data sparsity and class imbalance issues, ensuring a richer and more flexible characterization of annotators. Extensive experiments on 11 real-world datasets demonstrate that PTBCC achieves up to a 15% accuracy improvement in the best case, and a 3% higher average accuracy while reducing computational cost by over 90%.

annotator, artificial intelligence, machine learning, (15 more...)

2508.02123

Country:

Asia > China > Jiangsu Province > Nanjing (0.05)
Oceania > Australia > Victoria > Melbourne (0.04)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.89)

arXiv.org Artificial Intelligence, Aug-5-2025

FeatureCuts: Feature Selection for Large Data by Optimizing the Cutoff

Hu, Andy, Prasad, Devika, Pizzato, Luiz, Foord, Nicholas, Abrahamyan, Arman, Leontjeva, Anna, Doyle, Cooper, Jermyn, Dan

In machine learning, the process of feature selection involves finding a reduced subset of features that captures most of the information required to train an accurate and efficient model. This work presents FeatureCuts, a novel feature selection algorithm that adaptively selects the optimal feature cutoff after performing filter ranking. Evaluated on 14 publicly available datasets and one industry dataset, FeatureCuts achieved, on average, 15 percentage points more feature reduction and up to 99.6% less computation time while maintaining model performance, compared to existing state-of-the-art methods. When the selected features are used in a wrapper method such as Particle Swarm Optimization (PSO), it enables 25 percentage points more feature reduction, requires 66% less computation time, and maintains model performance when compared to PSO alone. The minimal overhead of FeatureCuts makes it scalable for large datasets typically seen in enterprise applications. Traditional machine learning methods work best when their prediction signals come from data with a small, but highly informative set of features.
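The general idea of ranking features with a filter and then choosing a cutoff can be sketched as follows. This is an illustration only, not the FeatureCuts algorithm: the absolute-correlation filter, the exhaustive scan over cutoffs, the weighted score, and the names `pearson_abs`, `rank_features`, `choose_cutoff`, and `reduction_weight` are all assumptions made for the example.

```python
from statistics import mean


def pearson_abs(xs, ys):
    """Absolute Pearson correlation; 0.0 when either input is constant."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return abs(cov) / (vx * vy) ** 0.5


def rank_features(columns, target):
    """Filter ranking: feature indices sorted by descending |correlation|."""
    scores = [pearson_abs(col, target) for col in columns]
    return sorted(range(len(columns)), key=lambda j: scores[j], reverse=True)


def choose_cutoff(ranked, evaluate, reduction_weight=0.5):
    """Scan every cutoff k and keep the top-k features with the best score.

    `evaluate(selected)` is a hypothetical callback that returns model
    performance in [0, 1] for a list of feature indices. The score is an
    assumed weighted sum of performance and the fraction of features dropped.
    """
    n = len(ranked)
    best_k, best_score = 1, float("-inf")
    for k in range(1, n + 1):
        performance = evaluate(ranked[:k])
        reduction = 1 - k / n
        score = (1 - reduction_weight) * performance + reduction_weight * reduction
        if score > best_score:
            best_k, best_score = k, score
    return ranked[:best_k]


# Toy data: feature 0 equals the target, feature 1 is weakly related,
# feature 2 is constant and therefore uninformative.
target = [0, 1, 0, 1, 1, 0]
columns = [[0, 1, 0, 1, 1, 0], [1, 1, 0, 0, 1, 0], [2, 2, 2, 2, 2, 2]]
ranked = rank_features(columns, target)
# Stand-in for training a model: perfect only when feature 0 is included.
selected = choose_cutoff(ranked, lambda sel: 1.0 if 0 in sel else 0.5)
print(ranked, selected)
```

An exhaustive scan calls `evaluate` once per candidate cutoff, which is affordable only for small feature counts; a method aimed at large data would need a cheaper search over the cutoff.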

evolutionary algorithm, machine learning, test score 0, (16 more...)

2508.00954

Country:

Oceania > Australia (0.48)
Europe (0.46)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)