AITopics | Mohri, Christopher

Collaborating Authors

Mohri, Christopher

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Cardinality-Aware Set Prediction and Top-$k$ Classification

Cortes, Corinna, Mao, Anqi, Mohri, Christopher, Mohri, Mehryar, Zhong, Yutao

arXiv.org Machine LearningJul-9-2024

We present a detailed study of cardinality-aware top-$k$ classification, a novel approach that aims to learn an accurate top-$k$ set predictor while maintaining a low cardinality. We introduce a new target loss function tailored to this setting that accounts for both the classification error and the cardinality of the set predicted. To optimize this loss function, we propose two families of surrogate losses: cost-sensitive comp-sum losses and cost-sensitive constrained losses. Minimizing these loss functions leads to new cardinality-aware algorithms that we describe in detail in the case of both top-$k$ and threshold-based classifiers. We establish $H$-consistency bounds for our cardinality-aware surrogate loss functions, thereby providing a strong theoretical foundation for our algorithms. We report the results of extensive experiments on CIFAR-10, CIFAR-100, ImageNet, and SVHN datasets demonstrating the effectiveness and benefits of our cardinality-aware algorithms.

artificial intelligence, conditional regret, machine learning, (18 more...)

arXiv.org Machine Learning

2407.0714

Country:

North America > United States (0.28)
North America > Canada > Ontario > Toronto (0.14)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.92)

Add feedback

Learning to Reject with a Fixed Predictor: Application to Decontextualization

Mohri, Christopher, Andor, Daniel, Choi, Eunsol, Collins, Michael

arXiv.org Artificial IntelligenceJan-31-2023

Large language models, often trained with billions of parameters, have achieved impressive performance in recent years (Raffel et al., 2019) and are used in a wide variety of natural language generation tasks. However, their output is sometimes undesirable, with hallucinated content (Maynez et al., 2020; Filippova, 2020), and much work remains to fully understand their properties. In many applications, such as healthcare, question-answering systems, or customer service, incorrect predictions are particularly costly and must be avoided. This motivates the design of algorithms for large language models and other NLP tasks that achieve high precision on a large fraction of the input set, while abstaining on the rest. How can we devise such accurate models that allow a reject option?

machine learning, question answering, rejection loss, (17 more...)

arXiv.org Artificial Intelligence

2301.09044

Country:

North America > United States (0.46)
Europe > Spain (0.28)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.54)
(2 more...)

Add feedback

Online Learning Algorithms for Statistical Arbitrage

Mohri, Christopher

arXiv.org Machine LearningOct-31-2018

Arbitrage is the risk-free method of making profit from exploiting price differences in different markets. For example, if one stock is trading at a higher price in one market than another, one could buy the stock for the lower price on one market and sell it for the higher price on the other, thereby making profit without taking risks. These pricing disparities have become increasingly hard to capitalize on as they only appear for very short periods of time with the advancements in technology and highfrequency trading. Only those who can recognize and take advantage of arbitrage opportunities first can benefit, turning it into a winner-takes-all situation. This has made it difficult to make consistent profit from price discrepancies, as one needs to recognize them quickly and be the first to leverage them.

algorithm, computer based training, educational technology, (21 more...)

arXiv.org Machine Learning

1811.002

Country: North America > United States (0.14)

Genre: Research Report (0.50)

Industry:

Banking & Finance > Trading (1.00)
Education > Educational Setting > Online (0.44)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.85)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.44)

Add feedback