Truncated Matrix Completion - An Empirical Study

arXiv.org Machine Learning

Low-rank Matrix Completion (LRMC) describes the problem where we wish to recover the missing entries of a partially observed low-rank matrix. Most existing matrix completion work deals with sampling procedures that are independent of the underlying data values. While this assumption allows the derivation of clean theoretical guarantees, it seldom holds in real-world applications. In this paper, we consider various settings where the sampling mask depends on the underlying data values, motivated by applications in sensing, sequential decision-making, and recommender systems. Through a series of experiments, we study and compare the performance of various LRMC algorithms that were originally successful for data-independent sampling patterns.
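
As background for the experiments, soft-impute is one standard LRMC baseline of the kind the paper benchmarks: it alternates singular-value shrinkage with re-imposing the observed entries. The sketch below is a hypothetical illustration assuming NumPy, with made-up parameter names; it is not code from the paper.

```python
# Minimal soft-impute sketch for low-rank matrix completion (illustrative,
# not the paper's code). M is the partially observed matrix and mask is a
# boolean array marking the observed entries.
import numpy as np

def soft_impute(M, mask, tau=1.0, n_iters=100):
    """Iterative singular-value soft-thresholding for LRMC."""
    X = np.where(mask, M, 0.0)            # start with unobserved entries at zero
    for _ in range(n_iters):
        U, s, Vt = np.linalg.svd(X, full_matrices=False)
        s = np.maximum(s - tau, 0.0)      # shrink singular values -> low rank
        Z = (U * s) @ Vt
        X = np.where(mask, M, Z)          # keep observed entries fixed
    return X
```

A data-dependent sampling pattern in the paper's sense would be one where `mask` is itself a function of the values in `M`, e.g. only entries above a threshold are observed.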


Kullback-Leibler excess risk bounds for exponential weighted aggregation in Generalized linear models

arXiv.org Machine Learning

Aggregation methods have emerged as a powerful and flexible framework in statistical learning, providing unified solutions across diverse problems such as regression, classification, and density estimation. In the context of generalized linear models (GLMs), where responses follow exponential family distributions, aggregation offers an attractive alternative to classical parametric modeling. This paper investigates the problem of sparse aggregation in GLMs, aiming to approximate the true parameter vector by a sparse linear combination of predictors. We prove that an exponential weighted aggregation scheme yields a sharp oracle inequality for the Kullback-Leibler risk with leading constant equal to one, while also attaining the minimax-optimal rate of aggregation. These results are further enhanced by establishing high-probability bounds on the excess risk.
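
Concretely, exponential weighted aggregation assigns each candidate a weight proportional to $\pi_j \exp(-r_j/\beta)$, where $r_j$ is the candidate's empirical risk, $\pi_j$ a prior weight, and $\beta$ a temperature. The sketch below is a generic illustration of this scheme; the parameter names and defaults are placeholders, not the paper's estimator or tuning.

```python
# Generic exponential weighted aggregation sketch (illustrative; beta and
# the prior are placeholders, not the paper's choices).
import numpy as np

def ewa_weights(empirical_risks, beta=1.0, prior=None):
    """Weights proportional to prior * exp(-risk / beta)."""
    risks = np.asarray(empirical_risks, dtype=float)
    prior = np.ones_like(risks) if prior is None else np.asarray(prior, dtype=float)
    logits = np.log(prior) - risks / beta
    logits -= logits.max()                # numerical stabilization
    w = np.exp(logits)
    return w / w.sum()

def aggregate(candidate_predictions, weights):
    """Convex combination of candidate predictions (rows = candidates)."""
    return weights @ np.asarray(candidate_predictions)
```

Sparse aggregation in the paper's sense corresponds to choosing priors that concentrate on sparse linear combinations of the predictors.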


DUE: A Deep Learning Framework and Library for Modeling Unknown Equations

arXiv.org Machine Learning

Equations, particularly differential equations, are fundamental for understanding natural phenomena and predicting complex dynamics across various scientific and engineering disciplines. However, the governing equations for many complex systems remain unknown due to intricate underlying mechanisms. Recent advancements in machine learning and data science offer a new paradigm for modeling unknown equations from measurement or simulation data. This paradigm shift, known as data-driven discovery or modeling, stands at the forefront of AI for science, with significant progress made in recent years. In this paper, we introduce a systematic framework for data-driven modeling of unknown equations using deep learning. This versatile framework is capable of learning unknown ODEs, PDEs, DAEs, IDEs, SDEs, reduced or partially observed systems, and non-autonomous differential equations. Based on this framework, we have developed Deep Unknown Equations (DUE), an open-source software package designed to facilitate the data-driven modeling of unknown equations using modern deep learning techniques. DUE serves as an educational tool for classroom instruction, enabling students and newcomers to gain hands-on experience with differential equations, data-driven modeling, and contemporary deep learning approaches such as FNN, ResNet, generalized ResNet, operator semigroup networks (OSG-Net), and Transformers. Additionally, DUE is a versatile and accessible toolkit for researchers across various scientific and engineering fields. It is applicable not only for learning unknown equations from data but also for surrogate modeling of known, yet complex, equations that are costly to solve using traditional numerical methods. We provide detailed descriptions of DUE and demonstrate its capabilities through diverse examples, which serve as templates that can be easily adapted for other applications.
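
At its core, the flow-map approach to learning an unknown autonomous ODE fits a residual network to consecutive snapshots so that $x_{n+1} \approx x_n + f_\theta(x_n)$. The PyTorch sketch below illustrates that idea only; it is not DUE's actual API, and all names are hypothetical.

```python
# ResNet-style flow-map learning sketch (hypothetical illustration,
# not DUE's API): learn x_{n+1} = x_n + f_theta(x_n) from snapshot pairs.
import torch
import torch.nn as nn

class ResNetFlowMap(nn.Module):
    def __init__(self, dim, width=64):
        super().__init__()
        self.f = nn.Sequential(
            nn.Linear(dim, width), nn.Tanh(),
            nn.Linear(width, width), nn.Tanh(),
            nn.Linear(width, dim),
        )

    def forward(self, x):
        return x + self.f(x)              # residual step approximates one time step

def train(model, x_now, x_next, epochs=1000, lr=1e-3):
    """Fit the one-step flow map to consecutive trajectory snapshots."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(epochs):
        opt.zero_grad()
        loss = nn.functional.mse_loss(model(x_now), x_next)
        loss.backward()
        opt.step()
    return model
```

The same template extends to the other system classes the paper lists, e.g. by conditioning $f_\theta$ on time for non-autonomous dynamics.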


$\alpha$-Flow: A Unified Framework for Continuous-State Discrete Flow Matching Models

arXiv.org Machine Learning

Recent efforts have extended the flow-matching framework to discrete generative modeling. One strand of models directly works with the continuous probabilities instead of discrete tokens, which we colloquially refer to as Continuous-State Discrete Flow Matching (CS-DFM). Existing CS-DFM models differ significantly in their representations and geometric assumptions. This work presents a unified framework for CS-DFM models, under which the existing variants can be understood as operating on different $\alpha$-representations of probabilities. Building upon the theory of information geometry, we introduce $\alpha$-Flow, a family of CS-DFM models that adheres to the canonical $\alpha$-geometry of the statistical manifold, and demonstrate its optimality in minimizing the generalized kinetic energy. Theoretically, we show that the flow matching loss for $\alpha$-flow establishes a unified variational bound for the discrete negative log-likelihood. We comprehensively evaluate different instantiations of $\alpha$-flow on various discrete generation domains to demonstrate their effectiveness in discrete generative modeling, including intermediate values whose geometries have never been explored before. $\alpha$-flow significantly outperforms its discrete-state counterpart in image and protein sequence generation and better captures the entropy in language modeling.
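
For orientation, Amari's $\alpha$-representation from information geometry maps a probability $p$ to $\frac{2}{1-\alpha}\, p^{(1-\alpha)/2}$ for $\alpha \neq 1$ and to $\log p$ at $\alpha = 1$; different $\alpha$ values recover different coordinate systems on the statistical manifold. The sketch below shows this mapping under one common sign convention, which may differ from the paper's parameterization.

```python
# Amari alpha-representation of probabilities (one common convention;
# the paper's parameterization may differ). Illustrative only.
import numpy as np

def alpha_rep(p, alpha):
    """alpha = -1 gives the mixture (identity) coordinates, alpha = +1 the
    exponential (log) coordinates, alpha = 0 the sphere embedding 2*sqrt(p)."""
    p = np.asarray(p, dtype=float)
    if alpha == 1:
        return np.log(p)
    return 2.0 / (1.0 - alpha) * p ** ((1.0 - alpha) / 2.0)
```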


Conditional Distribution Compression via the Kernel Conditional Mean Embedding

arXiv.org Machine Learning

Existing distribution compression methods, like Kernel Herding (KH), were originally developed for unlabelled data. However, no existing approach directly compresses the conditional distribution of labelled data. To address this gap, we first introduce the Average Maximum Conditional Mean Discrepancy (AMCMD), a natural metric for comparing conditional distributions. We then derive a consistent estimator for the AMCMD and establish its rate of convergence. Next, we make a key observation: in the context of distribution compression, the cost of constructing a compressed set targeting the AMCMD can be reduced from $\mathcal{O}(n^3)$ to $\mathcal{O}(n)$. Building on this, we extend the idea of KH to develop Average Conditional Kernel Herding (ACKH), a linear-time greedy algorithm that constructs a compressed set targeting the AMCMD. To better understand the advantages of directly compressing the conditional distribution rather than doing so via the joint distribution, we introduce Joint Kernel Herding (JKH), a straightforward adaptation of KH designed to compress the joint distribution of labelled data. While herding methods provide a simple and interpretable selection process, they rely on a greedy heuristic. To explore alternative optimisation strategies, we propose Joint Kernel Inducing Points (JKIP) and Average Conditional Kernel Inducing Points (ACKIP), which jointly optimise the compressed set while maintaining linear complexity. Experiments show that directly preserving conditional distributions with ACKIP outperforms both joint distribution compression (via JKH and JKIP) and the greedy selection used in ACKH. Moreover, we see that JKIP consistently outperforms JKH.
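
As a reference point, classic Kernel Herding greedily selects the candidate that best matches the empirical kernel mean embedding while penalizing similarity to points already chosen. The sketch below is an illustrative KH baseline with a Gaussian kernel, not the paper's ACKH/ACKIP code.

```python
# Greedy kernel herding sketch (the KH baseline the paper extends).
# Illustrative only; Gaussian kernel and bandwidth are assumptions.
import numpy as np

def gaussian_kernel(X, Y, sigma=1.0):
    d2 = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2 * sigma**2))

def kernel_herding(X, m, sigma=1.0):
    """Select m of the n points in X that approximate its mean embedding."""
    K = gaussian_kernel(X, X, sigma)
    mean_embed = K.mean(axis=1)           # mu(x_i) = (1/n) sum_j k(x_i, x_j)
    selected = []
    for _ in range(m):
        # match the target embedding, penalize points already chosen
        penalty = K[:, selected].sum(axis=1) if selected else 0.0
        scores = mean_embed - penalty / (len(selected) + 1)
        scores[selected] = -np.inf        # select without replacement
        selected.append(int(np.argmax(scores)))
    return X[selected]
```

ACKH replaces the target embedding here with the conditional mean embedding, so the compressed set matches the conditional rather than the joint distribution of the labelled data.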


Towards Weaker Variance Assumptions for Stochastic Optimization

arXiv.org Machine Learning

We revisit a classical assumption for analyzing stochastic gradient algorithms where the squared norm of the stochastic subgradient (or the variance for smooth problems) is allowed to grow as fast as the squared norm of the optimization variable. We contextualize this assumption in view of its inception in the 1960s, its seemingly independent appearance in the recent literature, its relationship to weakest-known variance assumptions for analyzing stochastic gradient algorithms, and its relevance in deterministic problems for non-Lipschitz nonsmooth convex optimization. We build on and extend a connection recently made between this assumption and the Halpern iteration. For convex nonsmooth, and potentially stochastic, optimization, we analyze horizon-free, anytime algorithms with last-iterate rates. For problems beyond simple constrained optimization, such as convex problems with functional constraints or regularized convex-concave min-max problems, we obtain rates for optimality measures that do not require boundedness of the feasible set.
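
The assumption in question allows the second moment to grow with the iterate, roughly $\mathbb{E}\|g(x)\|^2 \le a + b\|x\|^2$, rather than requiring a uniform bound. The Halpern iteration the paper builds on anchors every step back toward the starting point, $x_{k+1} = \lambda_k x_0 + (1-\lambda_k) T(x_k)$. Below is a generic sketch for a nonexpansive operator $T$, not the paper's algorithm.

```python
# Halpern iteration sketch: anchor each step toward the starting point x0.
# Illustrative only; T is any nonexpansive operator, not the paper's setting.
import numpy as np

def halpern(T, x0, n_iters=1000):
    x = x0.copy()
    for k in range(n_iters):
        lam = 1.0 / (k + 2)               # classical anchoring schedule
        x = lam * x0 + (1 - lam) * T(x)
    return x

# Example: T as a gradient step of a smooth convex quadratic
A = np.array([[2.0, 0.0], [0.0, 1.0]])
T = lambda x: x - 0.1 * (A @ x)           # nonexpansive for this small step size
print(halpern(T, np.array([1.0, 1.0])))  # converges toward the fixed point 0
```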


Artificial intelligence transforms patient care and reduces burnout, physician says

FOX News

With just one click, the AI technology begins transcribing the doctor's conversation with a patient. DENVER – Artificial intelligence is quietly transforming how doctors interact with patients -- and it might already be in use during your next visit to the doctor's office. Thousands of physicians across the country are using a form of AI called ambient listening, surveys show. This technology listens to conversations between doctors and patients, creates real-time transcriptions, and then compiles detailed clinical notes -- all without disrupting the flow of the appointment. Dr. Daniel Kortsch, associate chief of artificial intelligence and digital health at Denver Health, said that ambient listening technology has made a big difference since his practice began using it in fall 2024.


Netflix tests out new AI search engine for movies and TV shows powered by OpenAI

Mashable

Black Mirror may be able to draw inspiration for future episodes from the very platform it streams on. Netflix has recently rolled out access to a new AI search engine tool to some of its subscribers, according to a report from Bloomberg. The AI search engine, which is powered by ChatGPT creator OpenAI, takes Netflix's search capabilities beyond looking up movies and TV shows by title, genre, or actor. The tool allows users to search for content using other kinds of queries, such as mood. Because the feature is powered by OpenAI, users will likely be able to phrase their searches in natural language.


Is your phone secretly listening to you? Here's a simple way to find out

PCWorld

If you're a smartphone owner--and chances are that's everyone reading this--you've probably encountered an eerie but all too common scenario: one day you're talking about a random topic while your phone is next to you, and the next day you notice ads popping up related to that same topic. How do these ads know what you were talking about? Your smartphone may be the culprit. Every smartphone keeps its built-in microphone on so its virtual assistant can hear voice commands. So, could these devices also be secretly eavesdropping on your conversations in order to serve you ads? Here's everything you need to know, plus a simple test to find out.


Upgrade to Windows 11 Pro for less than a movie ticket

Mashable

TL;DR: Upgrade to Microsoft Windows 11 Pro for just $14.97 (regularly $199) and enjoy enhanced security, productivity features, and the AI-powered Copilot assistant. In the ever-evolving world of technology, keeping your operating system current is essential for optimal performance and security. For a limited time, you can upgrade to Microsoft Windows 11 Pro for just $14.97, a significant reduction from its regular price of $199. Windows 11 Pro offers a sleek and user-friendly interface designed to enhance your computing experience. Features like Snap Layouts and Virtual Desktops allow for efficient multitasking and enable you to organize your workspace with ease.