Transfer of Deep Reactive Policies for MDP Planning
Aniket (Nick) Bajpai, Sankalp Garg, Mausam
Domain-independent probabilistic planners take as input an MDP description in a factored representation language such as PPDDL or RDDL, and exploit the specifics of the representation for faster planning. Traditional algorithms operate on each problem instance independently, and good methods for transferring experience from policies of other instances of a domain to a new instance do not exist. Recently, researchers have begun exploring the use of deep reactive policies, trained via deep reinforcement learning (RL), for MDP planning domains. One advantage of deep reactive policies is that they are more amenable to transfer learning. In this paper, we present the first domain-independent transfer algorithm for MDP planning domains expressed in an RDDL representation. Our architecture exploits the symbolic state configuration and transition function of the domain (available via RDDL) to learn a shared embedding space for states and state-action pairs across all problem instances of a domain. We then learn an RL agent in the embedding space, making near zero-shot transfer possible, i.e., without much training on the new instance, and without using the domain simulator at all. Experiments on three benchmark domains underscore the value of our transfer algorithm. Compared against planning from scratch and a state-of-the-art RL transfer algorithm, our transfer solution has significantly superior learning curves.
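The core idea is that a policy trained in an instance-independent embedding space can be reused on new instances of the same domain. The following is a minimal sketch of that idea only, not the authors' architecture (which builds the embedding from the RDDL state configuration and transition function); all module names, dimensions, and the mean-pooling choice are illustrative assumptions.

```python
# Sketch: a policy whose input is a fixed-size embedding shared across
# instances of one domain. Instance-specific states of varying size are
# pooled into a common space, so the policy head transfers across instances.
import torch
import torch.nn as nn

class SharedEmbedder(nn.Module):
    """Maps per-object state features (num_objects varies per instance)
    into one fixed-size embedding via mean pooling."""
    def __init__(self, obj_feat_dim: int, embed_dim: int):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(obj_feat_dim, embed_dim), nn.ReLU())

    def forward(self, obj_feats: torch.Tensor) -> torch.Tensor:
        # obj_feats: (num_objects, obj_feat_dim); pooling over objects
        # makes the output size independent of the instance size.
        return self.enc(obj_feats).mean(dim=0)

class ReactivePolicy(nn.Module):
    """Policy head trained in the shared embedding space; because its input
    dimension is instance-independent, the same weights can be applied to a
    new, larger instance without retraining."""
    def __init__(self, embed_dim: int, num_action_templates: int):
        super().__init__()
        self.embed = SharedEmbedder(obj_feat_dim=8, embed_dim=embed_dim)
        self.head = nn.Linear(embed_dim, num_action_templates)

    def forward(self, obj_feats: torch.Tensor) -> torch.Tensor:
        return torch.softmax(self.head(self.embed(obj_feats)), dim=-1)

policy = ReactivePolicy(embed_dim=32, num_action_templates=4)
probs_small = policy(torch.randn(10, 8))  # an instance with 10 objects
probs_large = policy(torch.randn(25, 8))  # a larger instance, same policy
```

Because the policy never sees instance-specific dimensions, applying it to an unseen instance requires no simulator interaction, which is the sense in which transfer can be near zero-shot.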
Hot methane seeps could support life beneath Antarctica's ice sheet
Microbes living beneath Antarctica's ice sheet may survive on methane generated by geothermal heat rising from deep below Earth's surface. The discovery could have implications for assessing the potential for life to survive on icy worlds beyond Earth. "These could be hotspots for microbes that are adapted to live in these areas," says Gavin Piccione at Brown University in Rhode Island. We already know that there is methane beneath Antarctica's ice sheet.
Brown University student angers non-faculty employees by asking 'what do you do all day,' faces punishment
Alex Shieh is a student at Brown University. He is making waves and facing charges for asking the school's non-faculty employees what they do all day. A sophomore at Brown University is facing the school's wrath after he sent a DOGE-like email to non-faculty employees asking them what they do all day to try to figure out why the elite school's tuition has gotten so expensive. "The inspiration for this is the rising cost of tuition," Alex Shieh told Fox News Digital in an interview. "Next year, it's set to be $93,064 to go to Brown," Shieh said of the Ivy League university.
Anthropic can now track the bizarre inner workings of a large language model
It's no secret that large language models work in mysterious ways. Few, if any, mass-market technologies have ever been so little understood. That makes figuring out what makes them tick one of the biggest open challenges in science. Shedding some light on how these models work would expose their weaknesses, revealing why they make stuff up and can be tricked into going off the rails. It would help resolve deep disputes about exactly what these models can and can't do.
Directional Pruning of Deep Neural Networks
In light of the fact that stochastic gradient descent (SGD) often finds a flat minimum valley in the training loss, we propose a novel directional pruning method which searches for a sparse minimizer in or close to that flat region. The proposed pruning method requires neither retraining nor expert knowledge of the sparsity level.
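As a toy stand-in for that idea only (this is not the paper's algorithm): one crude proxy for "directions in which the loss is flat" is the first-order sensitivity |w * dL/dw|, and a sketch can simply zero out the least sensitive weights.

```python
# Toy sketch: prune weights along directions where the loss surface is
# locally flat, approximated by first-order sensitivity |w * dL/dw|.
# Assumes `loss` was computed from a trained torch model.
import torch

def flat_direction_prune(model, loss, sparsity=0.5):
    loss.backward()
    # Per-weight sensitivity; small values ~ locally flat directions.
    sens = torch.cat([(p * p.grad).abs().flatten()
                      for p in model.parameters() if p.grad is not None])
    threshold = torch.quantile(sens, sparsity)
    with torch.no_grad():
        for p in model.parameters():
            if p.grad is None:
                continue
            mask = (p * p.grad).abs() > threshold
            p.mul_(mask)  # zero the least sensitive weights in place
```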
Learning to Navigate Wikipedia by Taking Random Walks
Kenneth Marino, John Schultz
A fundamental ability of an intelligent web-based agent is seeking out and acquiring new information. Internet search engines reliably find the correct vicinity, but the top results may be a few links away from the desired target. A complementary approach is navigation via hyperlinks, employing a policy that comprehends local content and selects a link that moves it closer to the target. In this paper, we show that behavioral cloning of randomly sampled trajectories is sufficient to learn an effective link-selection policy. We demonstrate the approach on a graph version of Wikipedia with 38M nodes and 387M edges. The model is able to efficiently navigate between nodes 5 and 20 steps apart 96% and 92% of the time, respectively. We then use the resulting embeddings and policy in downstream fact-verification and question-answering tasks where, in combination with basic TF-IDF search and ranking methods, they achieve results competitive with state-of-the-art methods.
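A minimal sketch of the behavioral-cloning recipe, under assumptions of our own (random synthetic graph, small sizes, dot-product scorer; not the paper's model): sample random walks, treat each walk's endpoint as the navigation target, and train a policy to predict the walk's actual next hop from the current node and the target.

```python
# Behavioral cloning of random walks on a toy graph (illustrative only).
import random
import torch
import torch.nn as nn

num_nodes, dim = 100, 32
adj = {v: random.sample(range(num_nodes), 5) for v in range(num_nodes)}

emb = nn.Embedding(num_nodes, dim)
query = nn.Linear(2 * dim, dim)  # builds a query from (current, target)
opt = torch.optim.Adam(list(emb.parameters()) + list(query.parameters()), lr=1e-3)

for step in range(1000):
    # Sample a short random walk; its endpoint serves as the target.
    walk = [random.randrange(num_nodes)]
    for _ in range(5):
        walk.append(random.choice(adj[walk[-1]]))
    cur, target = walk[0], walk[-1]
    nbrs = adj[cur]
    label = nbrs.index(walk[1])  # clone the walk's actual next hop
    q = query(torch.cat([emb(torch.tensor(cur)), emb(torch.tensor(target))]))
    logits = emb(torch.tensor(nbrs)) @ q  # score each neighbor
    loss = nn.functional.cross_entropy(logits.unsqueeze(0),
                                       torch.tensor([label]))
    opt.zero_grad(); loss.backward(); opt.step()
```

The appeal of this recipe is that the training data is free: random walks need no human navigation traces, yet cloning them still teaches the scorer which neighbor lies in the direction of a given target.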
Learning Object Placement Programs for Indoor Scene Synthesis with Iterative Self Training
Chang, Adrian, Wang, Kai, Li, Yuanbo, Savva, Manolis, Chang, Angel X., Ritchie, Daniel
Data-driven, autoregressive indoor scene synthesis systems generate indoor scenes automatically by suggesting and then placing objects one at a time. Empirical observations show that current systems tend to produce incomplete next-object location distributions. We introduce a system which addresses this problem. We design a Domain Specific Language (DSL) that specifies functional constraints. Programs from our language take as input a partial scene and an object to place. Upon execution they predict possible object placements. We design a generative model which writes these programs automatically. Available 3D scene datasets do not contain programs to train on, so we build upon previous work in unsupervised program induction to introduce a new program bootstrapping algorithm. To quantify our empirical observations, we introduce a new evaluation procedure which captures how well a system models per-object location distributions. We ask human annotators to label all the possible places an object can go in a scene and show that our system produces per-object location distributions more consistent with human annotators. Our system also generates indoor scenes of comparable quality to previous systems, and while previous systems degrade in performance when training data is sparse, ours does not degrade to the same degree.
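To make the "program over functional constraints" idea concrete, here is a toy illustration (not the paper's DSL; the scene representation, grid discretisation, and constraint set are all invented for this sketch): a program is a conjunction of constraints, and executing it intersects their feasibility masks to yield possible placements.

```python
# Toy placement-DSL sketch: constraints map a partial scene to a boolean
# feasibility mask over a discretised floor plan; a program intersects them.
import numpy as np

GRID = 32  # candidate positions on a GRID x GRID floor plan

def near(scene, anchor_type, radius=4):
    """Feasible within `radius` cells of any object of `anchor_type`."""
    mask = np.zeros((GRID, GRID), dtype=bool)
    ys, xs = np.mgrid[0:GRID, 0:GRID]
    for obj in scene:
        if obj["type"] == anchor_type:
            mask |= (ys - obj["y"]) ** 2 + (xs - obj["x"]) ** 2 <= radius ** 2
    return mask

def against_wall(scene, margin=2):
    """Feasible only within `margin` cells of the room boundary."""
    mask = np.zeros((GRID, GRID), dtype=bool)
    mask[:margin, :] = True
    mask[-margin:, :] = True
    mask[:, :margin] = True
    mask[:, -margin:] = True
    return mask

def execute(program, scene):
    """Run a program (a list of constraints) on a partial scene, producing
    the predicted set of possible placements for the next object."""
    mask = np.ones((GRID, GRID), dtype=bool)
    for constraint in program:
        mask &= constraint(scene)
    return mask

scene = [{"type": "bed", "x": 16, "y": 16}]
nightstand_program = [lambda s: near(s, "bed"), against_wall]
placements = execute(nightstand_program, scene)  # boolean feasibility map
```

In the paper's setting, such programs are not hand-written: a generative model proposes them, and the bootstrapping loop self-trains it from scene data that contains no programs at all.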
Handwritten Text Recognition: A Survey
Garrido-Munoz, Carlos, Rios-Vila, Antonio, Calvo-Zaragoza, Jorge
Handwritten Text Recognition (HTR) has become an essential field within pattern recognition and machine learning, with applications spanning historical document preservation to modern data entry and accessibility solutions. The complexity of HTR lies in the high variability of handwriting, which makes it challenging to develop robust recognition systems. This survey examines the evolution of HTR models, tracing their progression from early heuristic-based approaches to contemporary state-of-the-art neural models, which leverage deep learning techniques. The scope of the field has also expanded, with models initially capable of recognizing only word-level content progressing to recent end-to-end document-level approaches. Our paper categorizes existing work into two primary levels of recognition: (1) up to line-level, encompassing word and line recognition, and (2) beyond line-level, addressing paragraph- and document-level challenges. We provide a unified framework that examines research methodologies, recent advances in benchmarking, key datasets in the field, and a discussion of the results reported in the literature. Finally, we identify pressing research challenges and outline promising future directions, aiming to equip researchers and practitioners with a roadmap for advancing the field.
Cross-Encoder Rediscovers a Semantic Variant of BM25
Lu, Meng, Chen, Catherine, Eickhoff, Carsten
Neural Ranking Models (NRMs) have rapidly advanced state-of-the-art performance on information retrieval tasks. In this work, we investigate a Cross-Encoder variant of MiniLM to determine which relevance features it computes and where they are stored. We find that it employs a semantic variant of the traditional BM25 in an interpretable manner, featuring localized components: (1) Transformer attention heads that compute soft term frequency while controlling for term saturation and document length effects, and (2) a low-rank component of its embedding matrix that encodes inverse document frequency information for the vocabulary. This suggests that the Cross-Encoder uses the same fundamental mechanisms as BM25, but further leverages their capacity to capture semantics for improved retrieval performance. This granular understanding lays the groundwork for model editing to enhance model transparency, address safety concerns, and improve scalability in training and real-world applications.
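For reference, the classical BM25 score whose components the paper localises inside the network (this is the standard Okapi/Lucene-style formula, not code from the paper): the saturating term-frequency factor is what the attention heads are found to compute softly, and the IDF term is what the low-rank embedding component encodes.

```python
# Classical BM25 scoring, for comparison with the abstract's components.
import math
from collections import Counter

def bm25(query_terms, doc_terms, doc_freq, num_docs, avgdl, k1=1.2, b=0.75):
    tf = Counter(doc_terms)
    score = 0.0
    for t in query_terms:
        if t not in tf:
            continue
        # Inverse document frequency (the low-rank embedding component's role).
        idf = math.log((num_docs - doc_freq[t] + 0.5) / (doc_freq[t] + 0.5) + 1)
        # Saturating term frequency with document-length normalisation
        # (the role the paper attributes to specific attention heads).
        norm_tf = (tf[t] * (k1 + 1)
                   / (tf[t] + k1 * (1 - b + b * len(doc_terms) / avgdl)))
        score += idf * norm_tf
    return score
```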