Appendix A Source codes

Neural Information Processing Systems

Specifically, we average the scores over 100 episodes evaluated on confounded environments for each random seed. We use the Adam optimizer with a learning rate of 3e-4. Note that the other regularization baselines are based on BC. In particular, OREO achieves a mean HNS of 114.9%. Figure 9 compares OREO to CCIL with environment interaction on 6 confounded Atari environments. We also investigate the possibility of applying OREO to other IL methods.



HiPerRAG: High-Performance Retrieval Augmented Generation for Scientific Insights

Gokdemir, Ozan, Siebenschuh, Carlo, Brace, Alexander, Wells, Azton, Hsu, Brian, Hippe, Kyle, Setty, Priyanka V., Ajith, Aswathy, Pauloski, J. Gregory, Sastry, Varuni, Foreman, Sam, Zheng, Huihuo, Ma, Heng, Kale, Bharat, Chia, Nicholas, Gibbs, Thomas, Papka, Michael E., Brettin, Thomas, Alexander, Francis J., Anandkumar, Anima, Foster, Ian, Stevens, Rick, Vishwanath, Venkatram, Ramanathan, Arvind

arXiv.org Artificial Intelligence

The volume of scientific literature is growing exponentially, leading to underutilized discoveries, duplicated efforts, and limited cross-disciplinary collaboration. Retrieval Augmented Generation (RAG) offers a way to assist scientists by improving the factuality of Large Language Models (LLMs) in processing this influx of information. However, scaling RAG to handle millions of articles introduces significant challenges, including the high computational costs associated with parsing documents and embedding scientific knowledge, as well as the algorithmic complexity of aligning these representations with the nuanced semantics of scientific content. To address these issues, we introduce HiPerRAG, a RAG workflow powered by high performance computing (HPC) to index and retrieve knowledge from more than 3.6 million scientific articles. At its core are Oreo, a high-throughput model for multimodal document parsing, and ColTrast, a query-aware encoder fine-tuning algorithm that enhances retrieval accuracy by using contrastive learning and late-interaction techniques. HiPerRAG delivers robust performance on existing scientific question answering benchmarks and two new benchmarks introduced in this work, achieving 90% accuracy on SciQ and 76% on PubMedQA, outperforming both domain-specific models like PubMedGPT and commercial LLMs such as GPT-4. Scaling to thousands of GPUs on the Polaris, Sunspot, and Frontier supercomputers, HiPerRAG delivers million-document-scale RAG workflows for unifying scientific knowledge and fostering interdisciplinary innovation.
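The abstract does not detail ColTrast's late-interaction scoring, but late interaction is commonly implemented ColBERT-style: each query token embedding is matched against its most similar document token embedding, and these per-token maxima are summed. A minimal sketch of that scoring rule (function name and array shapes are illustrative, not from the paper):

```python
import numpy as np

def maxsim_score(query_embs: np.ndarray, doc_embs: np.ndarray) -> float:
    """ColBERT-style late-interaction score: for each query token
    embedding, take its maximum cosine similarity over all document
    token embeddings, then sum over query tokens."""
    # L2-normalize rows so dot products equal cosine similarities
    q = query_embs / np.linalg.norm(query_embs, axis=1, keepdims=True)
    d = doc_embs / np.linalg.norm(doc_embs, axis=1, keepdims=True)
    sim = q @ d.T                        # (num_query_tokens, num_doc_tokens)
    return float(sim.max(axis=1).sum())  # MaxSim per query token, summed
```

Because each token keeps its own embedding rather than being pooled into one vector, this score rewards documents that cover every part of the query, which pairs naturally with the contrastive fine-tuning the abstract describes.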


Oreo: A Plug-in Context Reconstructor to Enhance Retrieval-Augmented Generation

Li, Sha, Ramakrishnan, Naren

arXiv.org Artificial Intelligence

Despite the remarkable capabilities of Large Language Models (LLMs) in various NLP tasks, they remain vulnerable to hallucinations due to their limited parametric knowledge and lack of domain-specific expertise. Retrieval-Augmented Generation (RAG) addresses this challenge by incorporating external document retrieval to augment the knowledge base of LLMs. In this approach, RAG retrieves document chunks from an external corpus in response to a query, which are then used as context for the downstream language model to generate an answer. However, these retrieved knowledge sources often include irrelevant or erroneous information, undermining the effectiveness of RAG in downstream tasks. To overcome this limitation, we introduce a compact, efficient, and pluggable module designed to refine external knowledge sources before feeding them to the generator. The module reconstructs retrieved content by extracting the most relevant and supportive information and reorganising it into a concise, query-specific format. Through a three-stage training paradigm - comprising supervised fine-tuning, contrastive multi-task learning, and reinforcement learning-based alignment - it prioritises critical knowledge and aligns it with the generator's preferences. This method enables LLMs to produce outputs that are more accurate, reliable, and contextually appropriate.
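The pipeline the abstract describes places the reconstructor between retrieval and generation: retrieved chunks are distilled into a concise, query-specific context before reaching the generator. A toy stand-in for that stage (the learned module is replaced here by a simple word-overlap ranker; all names are illustrative):

```python
def refine_context(query: str, chunks: list[str], top_k: int = 2) -> str:
    """Toy stand-in for a learned context reconstructor: rank retrieved
    chunks by word overlap with the query and keep the top_k, yielding
    a concise, query-specific context for the downstream generator."""
    q_words = set(query.lower().split())
    scored = sorted(
        chunks,
        key=lambda c: len(q_words & set(c.lower().split())),
        reverse=True,
    )
    return " ".join(scored[:top_k])
```

In the paper's design this refinement is learned (supervised fine-tuning, contrastive multi-task learning, and RL-based alignment) rather than heuristic, but the plug-in position in the RAG pipeline is the same.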


Offline Reinforcement Learning for LLM Multi-Step Reasoning

Wang, Huaijie, Hao, Shibo, Dong, Hanze, Zhang, Shenao, Bao, Yilin, Yang, Ziran, Wu, Yi

arXiv.org Artificial Intelligence

Improving the multi-step reasoning ability of large language models (LLMs) with offline reinforcement learning (RL) is essential for quickly adapting them to complex tasks. While Direct Preference Optimization (DPO) has shown promise in aligning LLMs with human preferences, it is less suitable for multi-step reasoning tasks because (1) DPO relies on paired preference data, which is not readily available for multi-step reasoning tasks, and (2) it treats all tokens uniformly, making it ineffective for credit assignment in multi-step reasoning tasks, which often come with sparse rewards. In this work, we propose OREO (Offline Reasoning Optimization), an offline RL method for enhancing LLM multi-step reasoning. Building on insights from previous work on maximum entropy reinforcement learning, it jointly learns a policy model and value function by optimizing the soft Bellman equation. We show in principle that it reduces the need to collect pairwise data and enables better credit assignment. Empirically, OREO surpasses existing offline learning methods on multi-step reasoning benchmarks, including mathematical reasoning tasks (GSM8K, MATH) and embodied agent control (ALFWorld). The approach can be extended to a multi-iteration framework when additional resources are available. Furthermore, the learned value function can be leveraged to guide the tree search for free, which can further boost performance during test time.
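For reference, the soft Bellman equation from maximum-entropy RL that the abstract alludes to takes the following standard form (OREO's exact objective may differ in detail; see the paper):

```latex
Q_{\mathrm{soft}}(s_t, a_t) = r(s_t, a_t) + \gamma \, \mathbb{E}_{s_{t+1}}\left[ V_{\mathrm{soft}}(s_{t+1}) \right],
\qquad
V_{\mathrm{soft}}(s_t) = \alpha \log \sum_{a'} \exp\!\left( \frac{Q_{\mathrm{soft}}(s_t, a')}{\alpha} \right),
```

where $\alpha$ is the entropy temperature. Jointly fitting a policy and value function to satisfy this consistency condition is what lets the method assign credit at the token level without paired preference data.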


Nine AI Chatbots You Can Play With Right Now

The Atlantic - Technology

If you believe in the multibillion-dollar valuations, the prognostications from some of tech's most notable figures, and the simple magic of getting a computer to do your job for you, then you might say we're at the start of the chatbot era. Last November, OpenAI released ChatGPT into the unsuspecting world: It became the fastest-growing consumer app in history and immediately seemed to reconfigure how people think of conversational programs. Chatbots have existed for decades, but they haven't seemed especially intelligent--nothing like the poetry-writing, email-summarizing machines that have sprouted up recently. OpenAI has defined the moment, but there are plenty of competitors, including major players such as Google and Meta and lesser-known start-ups such as Anthropic. This cheat sheet tracks some of the most notable chatbot contenders through a few metrics: Can you actually use them? Do they contain glaring flaws?


I Have Questions for ChatGPT

The New Yorker

ChatGPT enables users to ask questions or tell a story, and the bot will respond with relevant, natural-sounding answers and topics. A friend gifted me a fancy designer bucket hat that she swore she didn't want anymore. Then we had a misunderstanding, and she ghosted my birthday party. And put a potato in her tailpipe. And slept with her ex.


Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning

Park, Jongjin, Seo, Younggyo, Liu, Chang, Zhao, Li, Qin, Tao, Shin, Jinwoo, Liu, Tie-Yan

arXiv.org Artificial Intelligence

Behavioral cloning has proven to be effective for learning sequential decision-making policies from expert demonstrations. However, behavioral cloning often suffers from the causal confusion problem where a policy relies on the noticeable effect of expert actions due to the strong correlation but not the cause we desire. This paper presents Object-aware REgularizatiOn (OREO), a simple technique that regularizes an imitation policy in an object-aware manner. Our main idea is to encourage a policy to uniformly attend to all semantic objects, in order to prevent the policy from exploiting nuisance variables strongly correlated with expert actions. To this end, we introduce a two-stage approach: (a) we extract semantic objects from images by utilizing discrete codes from a vector-quantized variational autoencoder, and (b) we randomly drop the units that share the same discrete code together, i.e., masking out semantic objects. Our experiments demonstrate that OREO significantly improves the performance of behavioral cloning, outperforming various other regularization and causality-based methods on a variety of Atari environments and a self-driving CARLA environment. We also show that our method even outperforms inverse reinforcement learning methods trained with a considerable amount of environment interaction.
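The second stage of the method, dropping all feature units that share a VQ-VAE discrete code so that whole semantic objects are masked together, can be sketched as follows (shapes and the function name are illustrative; the paper applies this inside the policy network):

```python
import numpy as np

def object_aware_dropout(features, codes, drop_prob=0.5, rng=None):
    """Sketch of OREO-style regularization. `features` is an (H, W, C)
    feature map; `codes` is an (H, W) grid of discrete VQ-VAE code
    indices. Each unique code is dropped with probability `drop_prob`,
    and every spatial unit carrying a dropped code is zeroed together,
    hiding entire semantic objects from the policy."""
    if rng is None:
        rng = np.random.default_rng()
    unique_codes = np.unique(codes)
    dropped = unique_codes[rng.random(unique_codes.size) < drop_prob]
    mask = ~np.isin(codes, dropped)    # False wherever the unit's code was dropped
    return features * mask[..., None]  # broadcast the (H, W) mask over channels
```

Compared with standard dropout, which zeroes units independently, grouping the mask by discrete code prevents the policy from recovering a masked object from its neighboring units, which is the mechanism the abstract credits for reducing causal confusion.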


How Voice Will Capture Connected Commerce PYMNTS.com

#artificialintelligence

A massive sea change has happened in the world of retail, albeit so subtly and swiftly that most consumers probably never even felt it happen. Shopping went from being a discrete, defined, daily (or weekly or monthly) activity to something that has become like the background noise of modern interaction. As recently as the turn of the century, "going shopping" meant exactly that for over 90 percent of consumers: getting in a car and physically going someplace to make purchases. Flash forward to the closing months of the second decade of the 21st century, and shopping is not so much a thing that consumers go do as something that happens in the background of everything else customers are already doing. According to the 2019 edition of the PYMNTS How We Will Pay Study, the average consumer does about 12 activities over the course of a day, and makes a purchase during about four of them.