ilr
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- North America > United States > California > Santa Clara County > Mountain View (0.04)
- North America > Canada > Quebec > Montreal (0.04)
Input layer regularization and automated regularization hyperparameter tuning for myelin water estimation using deep learning
Modi, Mirage, Sule, Shashank, Palumbo, Jonathan, Rozowski, Michael, Bouhrara, Mustapha, Czaja, Wojciech, Spencer, Richard G.
We propose a novel deep learning method that combines classical regularization with data augmentation for estimating the myelin water fraction (MWF) in the brain via biexponential analysis. Our aim is to design an accurate deep learning technique for the analysis of signals arising in magnetic resonance relaxometry. In particular, we study the biexponential model, one of the signal models used for MWF estimation. We greatly extend our previous work on input layer regularization (ILR) in several ways. First, we incorporate optimal regularization parameter selection, via a dedicated neural network (NN) or generalized cross validation (GCV), on a signal-by-signal (pixel-by-pixel) basis to form the augmented input signal; second, we estimate MWF itself, rather than just exponential time constants. On synthetically generated data, our proposed deep learning architecture outperformed both classical methods and a conventional multilayer perceptron. On in vivo brain data, our architecture again outperformed the comparison methods, with GCV proving somewhat superior to an NN for regularization parameter selection. Thus, ILR improves estimation of MWF within the biexponential model. In addition, classical methods such as GCV may be combined with deep learning to optimize MWF imaging in the human brain.
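For context, the per-signal GCV step can be sketched generically: for a Tikhonov-regularized linear inversion of a multi-exponential signal, GCV picks the lambda minimizing the GCV criterion built from the residual and the influence matrix A(lambda). The sketch below is a minimal, generic routine, not the authors' code; the exponential dictionary `X` and the lambda grid are illustrative assumptions.

```python
import numpy as np

def gcv_select_lambda(X, y, lambdas):
    """Pick a Tikhonov parameter by generalized cross validation (GCV).

    X: (n, p) dictionary of exponential decays exp(-t_i / T2_j) over a
    grid of time constants (an assumed discretization, not the paper's).
    Minimizes RSS(lam) / (n - tr A(lam))^2, which has the same argmin as
    the standard GCV criterion n * RSS / tr(I - A)^2.
    """
    n, p = X.shape
    XtX, Xty = X.T @ X, X.T @ y
    best_lam, best_score = None, np.inf
    for lam in lambdas:
        M = XtX + lam * np.eye(p)
        coef = np.linalg.solve(M, Xty)
        resid = y - X @ coef
        trace_A = np.trace(X @ np.linalg.solve(M, X.T))  # tr of influence matrix
        score = (resid @ resid) / (n - trace_A) ** 2
        if score < best_score:
            best_lam, best_score = lam, score
    return best_lam

# In ILR, the regularized estimate at the selected lambda would then augment
# the raw signal at the network's input (the exact augmentation follows the
# authors' construction, which is not reproduced here).
```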
- North America > United States > Maryland > Prince George's County > College Park (0.04)
- Europe > Netherlands (0.04)
Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision
Ye, Yaowen, Laidlaw, Cassidy, Steinhardt, Jacob
Language model (LM) post-training relies on two stages of human supervision: task demonstrations for supervised finetuning (SFT), followed by preference comparisons for reinforcement learning from human feedback (RLHF). As LMs become more capable, the tasks they are given become harder to supervise. Will post-training remain effective under unreliable supervision? To test this, we simulate unreliable demonstrations and comparison feedback using small LMs and time-constrained humans. We find that in the presence of unreliable supervision, SFT still retains some effectiveness, but direct preference optimization (DPO, a common RLHF algorithm) fails to improve the model beyond SFT. To address this, we propose iterative label refinement (ILR) as an alternative to RLHF. ILR improves the SFT data by using comparison feedback to decide whether human demonstrations should be replaced by model-generated alternatives, then retrains the model via SFT on the updated data. SFT+ILR outperforms SFT+DPO on several tasks with unreliable supervision (math, coding, and safe instruction-following). Our findings suggest that as LMs are used for complex tasks where human supervision is unreliable, RLHF may no longer be the best use of human comparison feedback; instead, it is better to direct feedback towards improving the training data rather than continually training the model. Our code and data are available at https://github.com/helloelwin/iterative-label-refinement.
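A minimal sketch of one ILR round, assuming hypothetical helpers `generate`, `prefer` (the comparison-feedback oracle, e.g., a time-constrained human or a small LM), and `sft_train`; the authors' actual implementation is in the linked repository.

```python
def ilr_round(model, dataset, prefer, generate, sft_train):
    """One iterative label refinement (ILR) round over SFT data.

    dataset: list of (prompt, demonstration) pairs.
    prefer(prompt, a, b) -> True if response `a` is judged better than `b`.
    """
    refined = []
    for prompt, demo in dataset:
        proposal = generate(model, prompt)  # model-written alternative
        # Replace the human demonstration only when feedback prefers the proposal.
        refined.append((prompt, proposal if prefer(prompt, proposal, demo) else demo))
    # Retrain via SFT on the refined dataset rather than continuing to train
    # the current model, per the abstract's description.
    return sft_train(refined), refined
```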
- North America > United States > Pennsylvania (0.04)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- North America > United States > California > Alameda County > Berkeley (0.04)
- Asia > China > Hong Kong (0.04)
Forget but Recall: Incremental Latent Rectification in Continual Learning
Nguyen, Nghia D., Nguyen, Hieu Trung, Li, Ang, Pham, Hoang, Nguyen, Viet Anh, Doan, Khoa D.
The intrinsic capability to continuously learn from a changing data stream is a desideratum of deep neural networks (DNNs). However, current DNNs suffer from catastrophic forgetting, which hinders the retention of past knowledge. To mitigate this issue, existing Continual Learning (CL) approaches either retain exemplars for replay, regularize learning, or allocate dedicated capacity for new tasks. This paper investigates an unexplored CL direction for incremental learning, called Incremental Latent Rectification (ILR). In a nutshell, ILR learns to propagate, and thereby rectify, the representation from the currently trained DNN backward to the representation space of an old task, where making predictive decisions is easier. This rectification process employs only a chain of small representation-mapping networks, called rectifier units. Empirical experiments on several continual learning benchmarks, including CIFAR10, CIFAR100, and Tiny ImageNet, demonstrate the effectiveness and potential of this novel CL direction compared to representative existing CL methods.
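As a rough illustration of the rectifier-unit chain, the sketch below maps the current backbone's features back toward an earlier task's representation space, where that task's classifier head still applies. The residual two-layer design and the shared feature width are assumptions for illustration, not the paper's specification.

```python
import torch
import torch.nn as nn

class RectifierUnit(nn.Module):
    """Small mapping network; unit i carries task-(i+1) features to task-i space."""
    def __init__(self, dim):
        super().__init__()
        self.map = nn.Sequential(nn.Linear(dim, dim), nn.ReLU(), nn.Linear(dim, dim))

    def forward(self, h):
        return h + self.map(h)  # residual correction of the representation

def rectify(h_current, rectifiers, upto_task):
    """Propagate the current representation backward through the chain,
    task T -> T-1 -> ... -> upto_task, before applying the old task head."""
    for unit in reversed(rectifiers[upto_task:]):
        h_current = unit(h_current)
    return h_current
```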
- Asia > China > Hong Kong (0.04)
- North America > United States > California > San Mateo County > San Mateo (0.04)
- Asia > Vietnam > Hanoi > Hanoi (0.04)
Edgewise outliers of network indexed signals
Rieser, Christopher, Ruiz-Gazen, Anne, Thomas-Agnan, Christine
We consider models for network-indexed multivariate data involving dependence between variables as well as across graph nodes. Within the framework of these models, we focus on outlier detection and introduce the concept of edgewise outliers. To this end, we first derive the distribution of certain sums of squares, in particular squared Mahalanobis distances, which can be used to set detection rules and thresholds for outlier detection. We then propose a robust version of the deterministic MCD algorithm, which we call edgewise MCD. An application to simulated data shows the benefit of taking the dependence structure into account. We also illustrate the utility of the proposed method on a real data set.
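A Mahalanobis-based edgewise detection rule can be sketched as follows. The plain difference of node signals, the nonrobust sample covariance, and the chi-square cutoff are illustrative stand-ins: the paper derives exact distributions under its network-indexed model and robustifies the covariance via its edgewise MCD.

```python
import numpy as np
from scipy.stats import chi2

def edgewise_outliers(X, edges, alpha=0.01):
    """Flag outlying edges of a network-indexed signal.

    X: (n_nodes, p) multivariate signals; edges: list of (i, j) node pairs.
    Scores each edge by the squared Mahalanobis distance of the signal
    difference across the edge, then thresholds at a chi-square quantile.
    """
    diffs = np.array([X[i] - X[j] for i, j in edges])
    cov = np.cov(diffs, rowvar=False)   # a robust (MCD-type) estimate here
    inv = np.linalg.inv(cov)            # would match the paper's approach
    d2 = np.einsum('ep,pq,eq->e', diffs, inv, diffs)  # squared Mahalanobis
    cutoff = chi2.ppf(1 - alpha, df=X.shape[1])
    return d2 > cutoff, d2
```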
- Europe > Austria > Vienna (0.14)
- Europe > France > Île-de-France > Seine-Saint-Denis (0.04)
- Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
- (11 more...)
Improving the Inference of Topic Models via Infinite Latent State Replications
Rugeles, Daniel, Hai, Zhen, Carmona, Juan Felipe, Dash, Manoranjan, Cong, Gao
In text mining, topic models are a class of probabilistic generative models for inferring latent semantic topics from a text corpus. Perhaps the most popular inference approach for topic models is collapsed Gibbs sampling (CGS), which typically samples a single topic label for each observed document-word pair. In this paper, we aim to improve CGS inference for topic models. We propose to leverage a state augmentation technique, letting the number of topic samples per document-word pair go to infinity, and develop a new inference approach, called infinite latent state replication (ILR), which generates robust soft topic assignments for each document-word pair. Experimental results on publicly available datasets show that ILR outperforms CGS for inference in existing established topic models.
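To make the infinite-replication idea concrete: averaging infinitely many replicated topic draws for a document-word pair turns the hard unit-count CGS update into a soft update by the full conditional distribution itself. The sketch below uses standard LDA count arrays and hyperparameters and is an illustration of a soft-assignment update in this spirit, not the authors' implementation.

```python
import numpy as np

def soft_cgs_update(n_dk, n_kw, n_k, d, w, p_old, alpha, beta, V):
    """One soft (infinite-replication) update for document-word pair (d, w).

    n_dk: (D, K) soft document-topic counts; n_kw: (K, V) soft topic-word
    counts; n_k: (K,) soft topic totals (all float arrays); p_old: the
    pair's previous soft assignment. Where plain CGS adds a unit count to
    one sampled topic, the soft update spreads the unit mass over all K.
    """
    n_dk[d] -= p_old; n_kw[:, w] -= p_old; n_k -= p_old   # remove old mass
    p = (n_dk[d] + alpha) * (n_kw[:, w] + beta) / (n_k + V * beta)
    p /= p.sum()                                          # full conditional
    n_dk[d] += p; n_kw[:, w] += p; n_k += p               # add soft mass
    return p
```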
- South America > Paraguay > Asunción > Asunción (0.04)
- South America > Colombia (0.04)
- North America > United States > Texas > Harris County > Houston (0.04)
- (2 more...)
- Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.69)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.51)
Refining neural network predictions using background knowledge
Daniele, Alessandro, van Krieken, Emile, Serafini, Luciano, van Harmelen, Frank
Recent work has shown that logical background knowledge can be used in learning systems to compensate for a lack of labeled training data. Many methods work by creating a loss function that encodes this knowledge. However, the logic is often discarded after training, even if it is still useful at test time. Instead, we ensure that neural network predictions satisfy the knowledge by refining the predictions with an extra computation step. We introduce differentiable refinement functions that find a corrected prediction close to the original prediction, and we study how to compute these refinement functions effectively and efficiently. Using a new algorithm called Iterative Local Refinement (ILR), we combine refinement functions to find refined predictions for logical formulas of any complexity. ILR finds refinements on complex SAT formulas in significantly fewer iterations than gradient descent and frequently finds solutions where gradient descent cannot. Finally, ILR produces competitive results on the MNIST addition task.
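To illustrate what a refinement function looks like, consider Goedel (min/max) fuzzy semantics: the minimal change that makes a conjunction reach a target truth value t lifts every conjunct below t up to t, while for a disjunction it suffices to lift only the largest disjunct. ILR composes such per-connective refinements over the formula tree; the sketch below is illustrative and is not the paper's implementation.

```python
def refine_and(values, t):
    """Minimal refinement making min(values) >= t under Goedel semantics."""
    return [max(v, t) for v in values]           # every conjunct must reach t

def refine_or(values, t):
    """Minimal refinement making max(values) >= t under Goedel semantics."""
    if max(values) >= t:
        return list(values)                      # already satisfied
    i = max(range(len(values)), key=lambda k: values[k])
    out = list(values)
    out[i] = t                                   # lift only the best disjunct
    return out

# e.g. refine_and([0.2, 0.9], 0.7) -> [0.7, 0.9]
#      refine_or([0.2, 0.5], 0.7)  -> [0.2, 0.7]
```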
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > Illinois > Champaign County > Champaign (0.04)
- (5 more...)
China confirms it's joining Russia to build a moon base by 2035
China has confirmed it's joining forces with Russia to build a research station on the moon by 2035, which will rival NASA's Lunar Gateway. Confirmation of plans to build the International Lunar Research Station (ILRS) came on Friday from officials at the China National Space Administration (CNSA), the country's space agency. Russia and China aim to complete basic infrastructure construction for the ILRS by 2035, Wu Yanhua, CNSA deputy director, told a briefing in Beijing. The ILRS will rival NASA's Lunar Gateway, which is set to play a 'vital' role in the US space agency's upcoming Artemis program. However, NASA's Lunar Gateway will only orbit the moon, while the ILRS will have both an orbiter and a base on the lunar surface, as well as multiple exploration rovers.
- North America > United States (1.00)
- Asia > Russia (0.88)
- Asia > China > Beijing > Beijing (0.26)
- (3 more...)
A causal view on compositional data
Ailer, Elisabeth, Müller, Christian L., Kilbertus, Niki
Many scientific datasets are compositional in nature. Important examples include species abundances in ecology, rock compositions in geology, topic compositions in large-scale text corpora, and sequencing count data in molecular biology. Here, we provide a causal view on compositional data in an instrumental variable setting where the composition acts as the cause. Throughout, we pay particular attention to the interpretation of compositional causes from the viewpoint of interventions and crisply articulate potential pitfalls for practitioners. Focusing on modern high-dimensional microbiome sequencing data as a timely illustrative use case, our analysis first reveals that popular one-dimensional information-theoretic summary statistics, such as diversity and richness, may be insufficient for drawing causal conclusions from ecological data. Instead, we advocate for multivariate alternatives using statistical data transformations and regression techniques that take the special structure of the compositional sample space into account. In a comparative analysis on synthetic and semi-synthetic data, we show the advantages and limitations of our proposal. We posit that our framework may provide a useful starting point for cause-effect estimation in the context of compositional data.
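The statistical data transformations alluded to here are typically log-ratio maps from the simplex to Euclidean space, such as the centered log-ratio (clr) and the isometric log-ratio (ilr). Below is a minimal sketch using pivot coordinates for the ilr, one standard orthonormal-basis choice; the paper's exact transform and regression pipeline may differ.

```python
import numpy as np

def clr(x):
    """Centered log-ratio: log parts minus their mean log."""
    logx = np.log(x)
    return logx - logx.mean(axis=-1, keepdims=True)

def ilr(x):
    """Pivot-coordinate ilr: D-part composition -> D-1 real coordinates."""
    D = x.shape[-1]
    logx = np.log(x)
    coords = []
    for i in range(D - 1):
        # log-ratio of part i against the geometric mean of the remaining parts
        scale = np.sqrt((D - 1 - i) / (D - i))
        coords.append(scale * (logx[..., i] - logx[..., i + 1:].mean(axis=-1)))
    return np.stack(coords, axis=-1)

# Regression on ilr(x) respects the compositional sample space, unlike
# regression on raw proportions, which ignores the simplex constraint.
```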
- Europe > Austria > Vienna (0.14)
- Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
- North America > United States > New York (0.04)
- (2 more...)
- Information Technology > Data Science (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.88)
- Information Technology > Geographic Information Systems (0.82)