AITopics | mbox

Collaborating Authors

mbox

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

The Sample Complexity of Parameter-Free Stochastic Convex Optimization

Lawrence, Jared, Kalinsky, Ari, Bradfield, Hannah, Carmon, Yair, Hinder, Oliver

arXiv.org Artificial IntelligenceJun-16-2025

We study the sample complexity of stochastic convex optimization when problem parameters, e.g., the distance to optimality, are unknown. We pursue two strategies. First, we develop a reliable model selection method that avoids overfitting the validation set. This method allows us to generically tune the learning rate of stochastic optimization methods to match the optimal known-parameter sample complexity up to $\log\log$ factors. Second, we develop a regularization-based method that is specialized to the case that only the distance to optimality is unknown. This method provides perfect adaptability to unknown distance to optimality, demonstrating a separation between the sample and computational complexity of parameter-free stochastic convex optimization. Combining these two methods allows us to simultaneously adapt to multiple problem structures. Experiments performing few-shot learning on CIFAR-10 by fine-tuning CLIP models and prompt engineering Gemini to count shapes indicate that our reliable model selection method can help mitigate overfitting to small validation sets.

artificial intelligence, equation, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2506.11336

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > New York (0.04)
Europe > France (0.04)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

EmotioNet Challenge

#artificialintelligenceFeb-10-2020, 11:30:14 GMT

This track requires the identification of 12 action units (AUs). The AUs included in the challenge are: 1, 2, 4, 5, 6, 9, 12, 17, 20, 25, 26, 43. Training data: The EmotioNet database includes 950,000 images with annotated AUs. These were annotated with the algorithm described in [1]. You can train your system using this set.

accuracy, algorithm, emotionet challenge, (14 more...)

#artificialintelligence

Country: North America > United States > Ohio (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Tutorial #5: variational autoencoders

#artificialintelligenceJan-28-2020, 21:05:09 GMT

The goal of the variational autoencoder (VAE) is to learn a probability distribution $Pr(\mathbf{x})$ over a multi-dimensional variable $\mathbf{x}$. There are two main reasons for modelling distributions. First, we might want to draw samples (generate) from the distribution to create new plausible values of $\mathbf{x}$. Second, we might want to measure the likelihood that a new vector $\mathbf{x} {*}$ was created by this probability distribution. In fact, it turns out that the variational autoencoder is well-suited to the former task but not for the latter. It is common to talk about the variational autoencoder as if it is the model of $Pr(\mathbf{x})$. However, this is misleading; the variational autoencoder is a neural architecture that is designed to help learn the model for $Pr(\mathbf{x})$.

Add feedback

random kitchen sinks as approximation to kernel machine

#artificialintelligenceOct-7-2019, 09:09:53 GMT

The dimension of $w$ does not make sense to me. In order to approximate the kernel function with sufficient accuracy, we need to use a high number of $D$. It gives me a feeling of a high risk of overfitting the model and my model is appeared to be overfitting when I am trying to make use of it. Isn't the true approximation to the kernel machine should be I think the author is trying to fit a linear model in the feature space (as $z(x)$ is the feature map) rather than the standard kernel trick which does not need to evaluate the feature map. But I don't understand why the author do not need to compute the sample average of $K$ (or do something similar to $z$)? The implementation here is also fitting a model of $D$ parameter, no averaging step is done, which makes me quite confusing.

approximation, kernel machine, random kitchen sink, (8 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.65)

Add feedback

Chapter 29 Smoothing Introduction to Data Science

#artificialintelligenceAug-28-2019, 08:45:36 GMT

Before continuing learning about machine learning algorithms, we introduce the important concept of smoothing. Smoothing is a very powerful technique used all across data analysis. Other names given to this technique are curve fitting and low pass filtering. It is designed to detect trends in the presence of noisy data in cases in which the shape of the trend is unknown. The smoothing name comes from the fact that to accomplish this feat, we assume that the trend is smooth, as in a smooth surface.

assumption, loess, mbox, (16 more...)

#artificialintelligence

Country: North America > Puerto Rico (0.04)

Industry:

Government (0.48)
Education (0.34)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

Add feedback

Forecasting Solar Flares Using Magnetogram-based Predictors and Machine Learning

#artificialintelligenceJan-30-2018, 22:42:28 GMT

The Gini importance is returned with the randomForest function of the randomForest package in R. The higher the Gini importance of the $j$th predictor, the more important this predictor. Abbreviations for predictors used in the main text (Symbol1) and in Figures 7 and 8 (Symbol2).

artificial intelligence, forecasting solar flare, magnetogram-based predictor and machine learning, (5 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

Lessons from Bayesian disease diagnosis: Don't over-interpret the Bayes factor, VERSION 2

#artificialintelligenceDec-5-2016, 02:45:18 GMT

This revision has corrected derivations, new R/JAGS code, and new diagrams.] Overview "Captain, the prior probability of this character dying and leaving the show is infinitesimal." A primary example of Bayes' rule is for disease diagnosis (or illicit drug screening). The example is invoked routinely to explain the importance of prior probabilities. Here's one version of it: Suppose a diagnostic test has a 97% detection rate and a 5% false alarm rate.

artificial intelligence, bayesian inference, machine learning, (18 more...)

#artificialintelligence

Industry: Health & Medicine (0.75)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.87)

Add feedback

Automating Quantified Multimodal Logics in Simple Type Theory -- A Case Study

Benzmueller, Christoph

arXiv.org Artificial IntelligenceMay-27-2009

This paper presents a case study in quantified multimodal logics. An interesting aspect of this case study is that off the shelf theorem provers and model generators for simple type theory, that is, classical higher-order logic, are employed to automate problems in quantified multimodal logics, that is, nonclassical logics. This is enabled by our recent embedding of normal quantified multimodal logics in simple type theory [8, 10], which is sound and complete [10]. Interestingly, not only reasoning within various nonclassical logics can be automated this way but also reasoning about them. For example, the equivalence between different properties of accessibility relations and their associated multimodal axioms can be proved automatically.

artificial intelligence, logic & formal reasoning, mbox, (15 more...)

arXiv.org Artificial Intelligence

0905.4369

Country: Europe > Germany (0.47)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)

Add feedback