AITopics

2410.23326

Country:

Europe (1.00)
North America > Canada > Alberta (0.14)
North America > Canada > Ontario > Toronto (0.14)
(4 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial Intelligence, Apr-15-2024

New methods for drug synergy prediction: a mini-review

Abbasi, Fatemeh, Rousu, Juho

In this mini-review, we explore the new prediction methods for drug combination synergy relying on high-throughput combinatorial screens. The fast progress of the field is evidenced by the more than thirty original machine learning methods published since 2021, a clear majority of them based on deep learning techniques. We aim to put these papers under a unifying lens by highlighting the core technologies, the data sources, the input data types and synergy scores used in the methods, as well as the prediction scenarios and evaluation protocols that the papers deal with. Our finding is that the best methods accurately solve the synergy prediction scenarios involving known drugs or cell lines, while the scenarios involving new drugs or cell lines still fall short of an accurate prediction level.

artificial intelligence, machine learning, prediction, (19 more...)

2404.02484

Country: Asia > Middle East > Iran (0.14)

Genre:

Research Report > Experimental Study (0.68)
Research Report > New Finding (0.48)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial Intelligence, Jul-4-2023

Scalable variable selection for two-view learning tasks with projection operators

Szedmak, Sandor, Huusari, Riikka, Le, Tat Hong Duong, Rousu, Juho

Vector-valued, or more generally structured output, learning tasks arising from various domains have attracted much research attention in recent years [Micchelli and Pontil, 2005, Deshwal et al., 2019, Brogat-Motte et al., 2022]. Multi-view data has been of interest in both supervised and unsupervised learning approaches [Hotelling, 1936, Xu et al., 2013, Minh et al., 2016a]. Despite many successful approaches for various multi-view and vector-valued learning settings, incorporating interpretability into these models has received less attention. While there are various feature selection and dimensionality reduction methods either for scalar-valued learning tasks, or unsupervised methods for data represented in a single view [Zebari et al., 2020, Li et al., 2017, Anette and Nokto, 2018, Bommert et al., 2020], there is a scarcity of methods suitable for data that is represented in two views or arises from a vector-valued learning task. From the point of view of interpretability, feature selection methods are especially advantageous over dimensionality reduction, since the relevant features are obtained directly as a result and not only in (linear) combinations. Recently, some feature selection methods have been proposed for structured output learning tasks.

artificial intelligence, machine learning, projection operator, (17 more...)

2307.01558

Country:

North America > United States (0.68)
Europe (0.46)

Genre: Research Report (0.64)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

arXiv.org Artificial Intelligence, Nov-16-2022

Vector-Valued Least-Squares Regression under Output Regularity Assumptions

Brogat-Motte, Luc, Rudi, Alessandro, Brouard, Céline, Rousu, Juho, d'Alché-Buc, Florence

We propose and analyse a reduced-rank method for solving least-squares regression problems with infinite dimensional output. We derive learning bounds for our method, and study under which settings statistical performance is improved in comparison to the full-rank method. Our analysis extends the interest of reduced-rank regression beyond the standard low-rank setting to more general output regularity assumptions. We illustrate our theoretical insights on synthetic least-squares problems. Then, we propose a surrogate structured prediction method derived from this reduced-rank method. We assess its benefits on three different problems: image reconstruction, multi-label classification, and metabolite identification.

artificial intelligence, estimator, machine learning, (19 more...)

2211.08958

Country:

Europe > France (0.46)
North America > United States (0.46)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.67)

arXiv.org Machine Learning, Feb-16-2022

Learning to Predict Graphs with Fused Gromov-Wasserstein Barycenters

Brogat-Motte, Luc, Flamary, Rémi, Brouard, Céline, Rousu, Juho, d'Alché-Buc, Florence

This paper introduces a novel and generic framework to solve the flagship task of supervised labeled graph prediction by leveraging Optimal Transport tools. We formulate the problem as regression with the Fused Gromov-Wasserstein (FGW) loss and propose a predictive model relying on an FGW barycenter whose weights depend on inputs. First, we introduce a non-parametric estimator based on kernel ridge regression for which theoretical results such as consistency and excess risk bound are proved. Next, we propose an interpretable parametric model where the barycenter weights are modeled with a neural network and the graphs on which the FGW barycenter is calculated are additionally learned. Numerical experiments show the strength of the method and its ability to interpolate in the labeled graph space on simulated data and on a difficult metabolic identification problem where it can reach very good performance with very little engineering.

artificial intelligence, fused gromov-wasserstein barycenter, machine learning, (1 more...)

2202.03813

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.87)
Information Technology > Modeling & Simulation (0.53)

arXiv.org Machine Learning, Nov-2-2020

Learning Output Embeddings in Structured Prediction

Brogat-Motte, Luc, Rudi, Alessandro, Brouard, Céline, Rousu, Juho, d'Alché-Buc, Florence

A powerful and flexible approach to structured prediction consists in embedding the structured objects to be predicted into a feature space of possibly infinite dimension by means of output kernels, and then solving a regression problem in this output space. A prediction in the original space is computed by solving a pre-image problem. In such an approach, the embedding, linked to the target loss, is defined prior to the learning phase. In this work, we propose to jointly learn a finite approximation of the output embedding and the regression function into the new feature space. For that purpose, we leverage a priori information on the outputs and also unexploited unsupervised output data, which are both often available in structured prediction problems. We prove that the resulting structured predictor is a consistent estimator, and derive an excess risk bound. Moreover, the novel structured prediction tool enjoys a significantly smaller computational complexity than former output kernel methods. Tested empirically on various structured prediction problems, the approach proves to be versatile and able to handle large datasets.

artificial intelligence, inductive learning, (16 more...)

2007.14703

Country: North America > United States (0.46)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.48)

arXiv.org Machine Learning, May-4-2020

A Solution for Large Scale Nonlinear Regression with High Rank and Degree at Constant Memory Complexity via Latent Tensor Reconstruction

Szedmak, Sandor, Cichonska, Anna, Julkunen, Heli, Pahikkala, Tapio, Rousu, Juho

This paper proposes a novel method for learning highly nonlinear, multivariate functions from examples. Our method takes advantage of the property that continuous functions can be approximated by polynomials, which in turn are representable by tensors. Hence the function learning problem is transformed into a tensor reconstruction problem, an inverse problem of the tensor decomposition. Our method incrementally builds up the unknown tensor from rank-one terms, which lets us control the complexity of the learned model and reduce the chance of overfitting. For learning the models, we present an efficient gradient-based algorithm that can be implemented in linear time in the sample size, order, rank of the tensor and the dimension of the input. In addition to regression, we present extensions to classification, multi-view learning and vector-valued output as well as a multi-layered formulation. The method can work in an online fashion via processing mini-batches of the data with constant memory complexity. Consequently, it can fit into systems equipped only with limited resources such as embedded systems or mobile phones. Our experiments demonstrate a favorable accuracy and running time compared to competing methods.
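The link between polynomials and rank-one tensor terms mentioned in this abstract can be illustrated with a minimal sketch. It assumes a model of the form f(x) = sum_t lambda_t * prod_d <p_t^(d), x>, which is the standard way a sum of rank-one order-D tensors acts on x; the names `ltr_predict`, `weights` and `factors` are hypothetical and this is not the authors' implementation. It only shows why evaluation can be linear in the rank, the order and the input dimension.

```python
from math import prod
from typing import Sequence

Vector = Sequence[float]


def dot(u: Vector, v: Vector) -> float:
    """Plain inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def ltr_predict(x: Vector,
                weights: Sequence[float],
                factors: Sequence[Sequence[Vector]]) -> float:
    """Evaluate a polynomial stored as a sum of rank-one tensor terms.

    factors[t][d] is the d-th factor vector of the t-th rank-one term
    (hypothetical layout). Each term contributes
    weights[t] * prod_d <factors[t][d], x>, so the cost is
    O(rank * order * dimension) and the full tensor is never built.
    """
    return sum(lam * prod(dot(p, x) for p in term)
               for lam, term in zip(weights, factors))


# Two rank-one terms of order 2 in a 2-dimensional input:
# f(x) = 2 * x1 * x2 - (x1 + x2) * (x1 - x2)
example_weights = [2.0, -1.0]
example_factors = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[1.0, 1.0], [1.0, -1.0]],
]
print(ltr_predict([1.0, 2.0], example_weights, example_factors))  # 7.0
```

Adding one more entry to `weights` and `factors` raises the rank by one, which mirrors the incremental, term-by-term model building that the abstract describes.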

artificial intelligence, neural network, polynomial, (9 more...)

2005.01538

Country: North America > United States > New York (0.14)

Genre: Research Report > Promising Solution (0.71)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

arXiv.org Machine Learning, Apr-18-2018

Bayesian Metabolic Flux Analysis reveals intracellular flux couplings

Heinonen, Markus, Osmala, Maria, Mannerström, Henrik, Wallenius, Janne, Kaski, Samuel, Rousu, Juho, Lähdesmäki, Harri

Metabolic flux balance analyses are a standard tool in analysing metabolic reaction rates compatible with measurements, steady-state and the metabolic reaction network stoichiometry. Flux analysis methods commonly place unrealistic assumptions on fluxes due to the convenience of formulating the problem as a linear programming model, and most methods ignore the notable uncertainty in flux estimates. We introduce a novel paradigm of Bayesian metabolic flux analysis that models the reactions of the whole genome-scale cellular system in probabilistic terms, and can infer the full flux vector distribution of genome-scale metabolic systems based on exchange and intracellular (e.g. 13C) flux measurements, steady-state assumptions, and target function assumptions. The Bayesian model couples all fluxes jointly in a simple truncated multivariate posterior distribution, which reveals informative flux couplings. Our model is a plug-in replacement for conventional metabolic balance methods, such as flux balance analysis (FBA). Our experiments indicate that we can characterise the genome-scale flux covariances, reveal flux couplings, and determine more intracellular unobserved fluxes in C. acetobutylicum from 13C data than flux variability analysis. The COBRA compatible software is available at github.com/markusheinonen/bamfa
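For orientation, the conventional linear programming formulation of FBA that the abstract contrasts with can be written as below. The symbols are assumptions introduced here and are not taken from the abstract: S is the stoichiometric matrix, v the flux vector, c the weights of the target function, and v_min, v_max the flux bounds.

```latex
\max_{v} \; c^{\top} v
\quad \text{subject to} \quad
S v = 0, \qquad v_{\min} \le v \le v_{\max}
```

The constraint S v = 0 encodes the steady-state assumption. A linear program of this kind returns a single optimal flux vector, whereas the Bayesian approach described above infers a distribution over flux vectors.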

constraint-based reasoning, flux, optimization problem, (20 more...)

1804.06673

Country: Europe > Finland (0.15)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.68)
Materials > Chemicals > Commodity Chemicals (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.49)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.48)
(2 more...)

arXiv.org Machine Learning, Nov-7-2017

A Tutorial on Canonical Correlation Methods

Uurtio, Viivi, Monteiro, João M., Kandola, Jaz, Shawe-Taylor, John, Fernandez-Reyes, Delmiro, Rousu, Juho

Canonical correlation analysis is a family of multivariate statistical methods for the analysis of paired sets of variables. Since its introduction, canonical correlation analysis has been extended, for instance, to extract relations between two sets of variables when the sample size is insufficient in relation to the data dimensionality, when the relations are considered to be non-linear, and when the dimensionality is too large for human interpretation. This tutorial explains the theory of canonical correlation analysis including its regularised, kernel, and sparse variants. Additionally, the deep and Bayesian CCA extensions are briefly reviewed. Together with the numerical examples, this overview provides a coherent compendium on the applicability of the variants of canonical correlation analysis. By bringing together techniques for solving the optimisation problems, evaluating the statistical significance and generalisability of the canonical correlation model, and interpreting the relations, we hope that this article can serve as a hands-on tool for applying canonical correlation methods in data analysis.
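As a reminder of what the basic method optimises, the standard linear CCA objective can be stated as below. The notation is an assumption introduced here, not taken from the abstract: C_aa and C_bb are the within-view covariance matrices, C_ab is the cross-covariance matrix, and w_a, w_b are the projection directions for the two views.

```latex
\rho \;=\; \max_{w_a,\, w_b}\;
\frac{w_a^{\top} C_{ab}\, w_b}
     {\sqrt{w_a^{\top} C_{aa}\, w_a}\;\sqrt{w_b^{\top} C_{bb}\, w_b}}
```

Regularised variants typically add a ridge term to C_aa and C_bb, which keeps the problem well posed when the sample size is small relative to the dimensionality.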

bayesian inference, relation, survey article, (19 more...)

doi: 10.1145/3136624

1711.02391

Country: Europe > United Kingdom (0.28)

Genre:

Research Report > Experimental Study (0.88)
Instructional Material > Course Syllabus & Notes (0.87)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (0.92)
Education (0.82)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
(2 more...)

arXiv.org Machine Learning, Feb-8-2016

Multi-view Kernel Completion

Bhadra, Sahely, Kaski, Samuel, Rousu, Juho

In this paper, we introduce the first method that (1) can complete kernel matrices with completely missing rows and columns as opposed to individual missing kernel values, (2) does not require any of the kernels to be complete a priori, and (3) can tackle non-linear kernels. These aspects are necessary in practical applications such as integrating legacy data sets, learning under sensor failures and learning when measurements are costly for some of the views. The proposed approach predicts missing rows by modelling both within-view and between-view relationships among kernel values. We show, both on simulated data and real world data, that the proposed method outperforms existing techniques in the restricted settings where they are available, and extends applicability to new settings.

health & medicine, kernel matrix, oncology, (17 more...)

1602.02518

Country: Europe > Finland (0.14)

Genre: Research Report (0.64)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)