AITopics | bouchard

The softmax representation of probabilities for categorical variables plays a prominent role in modern machine learning with numerous applications in areas such as large scale classification, neural language modeling and recommendation systems. However, softmax estimation is very expensive for large scale inference because of the high cost associated with computing the normalizing constant. Here, we introduce an efficient approximation to softmax probabilities which takes the form of a rigorous lower bound on the exact probability. This bound is expressed as a product over pairwise probabilities and it leads to scalable estimation based on stochastic optimization. It allows us to perform doubly stochastic estimation by subsampling both training instances and class labels. We show that the new bound has interesting theoretical properties and we demonstrate its use in classification problems.

Add feedback

Unsupervised Protoform Reconstruction through Parsimonious Rule-guided Heuristics and Evolutionary Search

Kpoglu, Promise Dodzi

arXiv.org Artificial IntelligenceJun-13-2025

We propose an unsupervised method for the reconstruction of protoforms i.e., ancestral word forms from which modern language forms are derived. While prior work has primarily relied on probabilistic models of phonological edits to infer protoforms from cognate sets, such approaches are limited by their p redominantly data - driven nature. In contrast, our model integrates data - driven inference with rule - based heuristics within an evolutionary optimization framework. This hybrid approach leverages on both statistical patterns and linguistically motivat ed constraints to guide the reconstruction process. We evaluate our method on the task of reconstructing Latin protoforms using a dataset of cognates from five Romance languages. Experimental results demonstrate substantial improvements over established ba selines across both character - level accuracy and phonological plausibility metrics. Keywords: protoform reconstruction, historical linguistics, evolutionary algorithms, phonological modeling, rule - based inference .

artificial intelligence, evolutionary algorithm, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2506.10614

Country:

Europe (0.46)
North America (0.28)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

Add feedback

LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases

Bouchard, Dylan, Chauhan, Mohit Singh, Skarbrevik, David, Bajaj, Viren, Ahmad, Zeya

arXiv.org Artificial IntelligenceJan-6-2025

Large Language Models (LLMs) have been observed to exhibit bias in numerous ways, potentially creating or worsening outcomes for specific groups identified by protected attributes such as sex, race, sexual orientation, or age. To help address this gap, we introduce LangFair, an open-source Python package that aims to equip LLM practitioners with the tools to evaluate bias and fairness risks relevant to their specific use cases. The package offers functionality to easily generate evaluation datasets, comprised of LLM responses to use-case-specific prompts, and subsequently calculate applicable metrics for the practitioner's use case. To guide in metric selection, LangFair offers an actionable decision framework.

artificial intelligence, large language model, natural language, (15 more...)

arXiv.org Artificial Intelligence

2501.03112

Country:

North America > United States > New York > New York County > New York City (0.05)
South America > Colombia > Meta Department > Villavicencio (0.04)
North America > United States > Washington > King County > Seattle (0.04)
(3 more...)

Genre: Research Report (0.44)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

One-vs-Each Approximation to Softmax for Scalable Estimation of Probabilities

Neural Information Processing SystemsMar-12-2024, 13:43:10 GMT

The softmax representation of probabilities for categorical variables plays a prominent role in modern machine learning with numerous applications in areas such as large scale classification, neural language modeling and recommendation systems. However, softmax estimation is very expensive for large scale inference because of the high cost associated with computing the normalizing constant. Here, we introduce an efficient approximation to softmax probabilities which takes the form of a rigorous lower bound on the exact probability. This bound is expressed as a product over pairwise probabilities and it leads to scalable estimation based on stochastic optimization. It allows us to perform doubly stochastic estimation by subsampling both training instances and class labels. We show that the new bound has interesting theoretical properties and we demonstrate its use in classification problems.

bouchard, likelihood, probability, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > New York (0.04)
North America > United States > New Jersey > Hudson County > Secaucus (0.04)
North America > United States > Maryland > Baltimore (0.04)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.48)

Add feedback

The Fisher-Rao geometry of CES distributions

Bouchard, Florent, Breloy, Arnaud, Collas, Antoine, Renaux, Alexandre, Ginolhac, Guillaume

arXiv.org Machine LearningOct-2-2023

When dealing with a parametric statistical model, a Riemannian manifold can naturally appear by endowing the parameter space with the Fisher information metric. The geometry induced on the parameters by this metric is then referred to as the Fisher-Rao information geometry. Interestingly, this yields a point of view that allows for leveragingmany tools from differential geometry. After a brief introduction about these concepts, we will present some practical uses of these geometric tools in the framework of elliptical distributions. This second part of the exposition is divided into three main axes: Riemannian optimization for covariance matrix estimation, Intrinsic Cram\'er-Rao bounds, and classification using Riemannian distances.

artificial intelligence, geometry, machine learning, (17 more...)

arXiv.org Machine Learning

2310.01032

Country:

North America > United States > Indiana > Hamilton County > Fishers (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > France > Île-de-France > Hauts-de-Seine > Nanterre (0.04)
(2 more...)

Genre: Research Report (0.64)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.67)

Add feedback

louisfb01/start-machine-learning-in-2020

#artificialintelligenceMar-29-2021, 01:05:53 GMT

Here is a list of awesome courses available on YouTube that you should definitely follow and are 100% free.

machine learning, online course, youtube playlist, (9 more...)

#artificialintelligence

Industry: Education > Educational Setting > Online (0.37)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.34)

Add feedback

Reconsidering Analytical Variational Bounds for Output Layers of Deep Networks

Sakhi, Otmane, Bonner, Stephen, Rohde, David, Vasile, Flavian

arXiv.org Machine LearningOct-3-2019

The combination of the re-parameterization trick with the use of variational auto-encoders has caused a sensation in Bayesian deep learning, allowing the training of realistic generative models of images and has considerably increased our ability to use scalable latent variable models. The re-parameterization trick is necessary for models in which no analytical variational bound is available and allows noisy gradients to be computed for arbitrary models. However, for certain standard output layers of a neural network, analytical bounds are available and the variational auto-encoder may be used both without the re-parameterization trick or the need for any Monte Carlo approximation. In this work, we show that using Jaakola and Jordan bound, we can produce a binary classification layer that allows a Bayesian output layer to be trained, using the standard stochastic gradient descent algorithm. We further demonstrate that a latent variable model utilizing the Bouchard bound for multi-class classification allows for fast training of a fully probabilistic latent factor model, even when the number of classes is very large.

algorithm, approximation, variational auto-encoder, (12 more...)

arXiv.org Machine Learning

1910.00877

Country:

Asia > Middle East > Jordan (0.27)
North America > Canada > Alberta > Census Division No. 15 > Improvement District No. 9 > Banff (0.04)

Genre: Research Report (0.55)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Add feedback

Rise of the robot

#artificialintelligenceJun-7-2017, 16:57:38 GMT

A four-part look at how robots are changing the way we work. About 180 robots here are doing work that humans used to do at a GE Aviation plant that makes parts for jet engines. But they haven't replaced the humans. Indeed, the opposite is true. Since a new, automated section of the plant ramped up at the start of the decade, the number of people working here has risen to more than 900 from 600. "A machine is not replacing three jobs," said Eric Bouchard, senior operations manager at the Bromont plant.

artificial intelligence, canada, robot, (16 more...)

#artificialintelligence

Country:

North America > Canada > Ontario > Toronto (0.15)
North America > Canada > Quebec (0.05)

Genre: Research Report (0.71)

Industry:

Government (0.70)
Banking & Finance (0.49)
Professional Services (0.48)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.40)

Add feedback

The charity that wants video game karts in every hospital

EngadgetFeb-27-2017, 17:15:04 GMT

In many ways, Jonathan Watson is like other 11-year-olds. He does his homework, dreams of becoming a doctor and plays video games when he can. Depending on the day, his favorite is either Minecraft or The Elder Scrolls V: Skyrim. Unlike most kids his age, though, Jonathan is at the hospital every three weeks for blood transfusions -- a procedure that can take up to six hours at a time. When I visited him at Mott Children's Hospital in Ann Arbor, Michigan, he wasn't slaying dragons or building a pixelated fortress; he was replaying the opening levels of Rayman Legends on a kart that had just been wheeled in.

artificial intelligence, hospital, kart, (15 more...)

Engadget

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.25)
North America > United States > Texas (0.05)
North America > United States > Ohio (0.05)

Industry:

Leisure & Entertainment > Games > Computer Games (1.00)
Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Games > Computer Games (0.55)

Add feedback

One-vs-Each Approximation to Softmax for Scalable Estimation of Probabilities

AUEB, Michalis Titsias RC

Neural Information Processing SystemsDec-31-2016

The softmax representation of probabilities for categorical variables plays a prominent role in modern machine learning with numerous applications in areas such as large scale classification, neural language modeling and recommendation systems. However, softmax estimation is very expensive for large scale inference because of the high cost associated with computing the normalizing constant. Here, we introduce an efficient approximation to softmax probabilities which takes the form of a rigorous lower bound on the exact probability. This bound is expressed as a product over pairwise probabilities and it leads to scalable estimation based on stochastic optimization. It allows us to perform doubly stochastic estimation by subsampling both training instances and class labels. We show that the new bound has interesting theoretical properties and we demonstrate its use in classification problems.

artificial intelligence, bayesian inference, machine learning, (18 more...)

Neural Information Processing Systems

Country: