AITopics

2502.02339

Country:

Asia > China > Beijing > Beijing (0.04)
North America > United States > Florida > Miami-Dade County > Miami (0.04)
Asia > Thailand > Bangkok > Bangkok (0.04)
(7 more...)

Genre: Research Report > New Finding (0.93)

Industry:

Information Technology (0.46)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
(2 more...)

arXiv.org Artificial IntelligenceFeb-7-2025

Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation

Eger, Steffen, Cao, Yong, D'Souza, Jennifer, Geiger, Andreas, Greisinger, Christian, Gross, Stephanie, Hou, Yufang, Krenn, Brigitte, Lauscher, Anne, Li, Yizhi, Lin, Chenghua, Moosavi, Nafise Sadat, Zhao, Wei, Miller, Tristan

With the advent of large multimodal language models, science is now at a threshold of an AI-based technological transformation. Recently, a plethora of new AI models and tools has been proposed, promising to empower researchers and academics worldwide to conduct their research more effectively and efficiently. This includes all aspects of the research cycle, especially (1) searching for relevant literature; (2) generating research ideas and conducting experimentation; generating (3) text-based and (4) multimodal content (e.g., scientific figures and diagrams); and (5) AI-based automatic peer review. In this survey, we provide an in-depth overview over these exciting recent developments, which promise to fundamentally alter the scientific research process for good. Our survey covers the five aspects outlined above, indicating relevant datasets, methods and results (including evaluation) as well as limitations and scope for future research. Ethical concerns regarding shortcomings of these tools and potential for misuse (fake science, plagiarism, harms to research integrity) take a particularly prominent place in our discussion. We hope that our survey will not only become a reference guide for newcomers to the field but also a catalyst for new AI-based initiatives in the area of "AI4Science".

information retrieval, large language model, machine learning, (22 more...)

2502.05151

Country:

Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.14)
Europe > Austria > Vienna (0.14)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
(44 more...)

Genre:

Research Report > Experimental Study (1.00)
Overview (1.00)
Research Report > New Finding (0.93)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Government (1.00)
Law (0.92)
(5 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
(9 more...)

Neural Information Processing SystemsFeb-6-2025, 21:16:08 GMT

Review for NeurIPS paper: RandAugment: Practical Automated Data Augmentation with a Reduced Search Space

Weaknesses: The paper only shows proxy task complicated search space may not work as well as using a simple search task without much approximation. It doesn't really tell us what happens if a complicated search space can be efficiently explored on the real task. In this sense, this paper is only a reflection of current practice, without providing a clear direction forward. In fact, the simplification of this paper (reducing the search space to number of op to apply, and the shared magnitude of ops) seems like an over-kill. By doing that, it misses an opportunity to answer some interesting question, such as: "Does assigning a different magnitude to different ops useful at all in auto data augmentation"?

neurips paper, practical automated data augmentation, reduced search space, (5 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)

Neural Information Processing SystemsFeb-6-2025, 21:16:00 GMT

Review for NeurIPS paper: RandAugment: Practical Automated Data Augmentation with a Reduced Search Space

This paper got mixed reviews. The original ratings are 6,5,5,6. On the positive side, reviewers think the paper solves an important problem. Data augmentation is recognized to be an important step for improving machine learning model performance. However, existing auto data augmentation methods are typically very costly.

artificial intelligence, machine learning, practical automated data augmentation, (7 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.40)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.40)

arXiv.org Artificial IntelligenceFeb-6-2025

Iterate to Accelerate: A Unified Framework for Iterative Reasoning and Feedback Convergence

Fein-Ashley, Jacob

Iterative methods lie at the heart of numerous optimization and reasoning algorithms, ranging from classical mirror descent and dynamic programming to modern deep learning architectures that exhibit chain-of-thought reasoning. Traditional acceleration techniques, such as Nesterov's momentum, have shown that carefully designed iterative schemes can significantly improve convergence rates in convex settings. However, many practical applications operate in non-Euclidean spaces and are subject to state-dependent perturbations or even adversarial disturbances, motivating the need for a more general analysis. In this work, we develop a comprehensive framework that unifies a wide class of iterative reasoning processes using the language of Bregman divergences.

architecture, artificial intelligence, machine learning, (15 more...)

2502.03787

Country:

Europe > Russia (0.04)
Asia > Russia (0.04)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.90)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.67)

Costinescu, Andrei, Burschka, Darius

Adaptation of Task Goal States from Prior Knowledge

arXiv.org Artificial IntelligenceFeb-6-2025

This paper presents a framework to define a task with freedom and variability in its goal state. A robot could use this to observe the execution of a task and target a different goal from the observed one; a goal that is still compatible with the task description but would be easier for the robot to execute. We define the model of an environment state and an environment variation, and present experiments on how to interactively create the variation from a single task demonstration and how to use this variation to create an execution plan for bringing any environment into the goal state.

artificial intelligence, goal state, variation, (15 more...)

2502.03918

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre:

Workflow (0.51)
Research Report (0.41)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.88)

Su, Guinan, Geiping, Jonas

Fine, I'll Merge It Myself: A Multi-Fidelity Framework for Automated Model Merging

arXiv.org Artificial IntelligenceFeb-6-2025

Reasoning capabilities represent a critical frontier for large language models (LLMs), but developing them requires extensive proprietary datasets and computational resources. One way to efficiently supplement capabilities with is by model merging, which offers a promising alternative by combining multiple models without retraining. However, current merging approaches rely on manually-designed strategies for merging hyperparameters, limiting the exploration of potential model combinations and requiring significant human effort. We propose an Automated Model Merging Framework that enables fine-grained exploration of merging strategies while reducing costs through multi-fidelity approximations. We support both single and multi-objective optimization and introduce two novel search spaces: layerwise fusion (LFS) and depth-wise integration (DIS). Evaluating across a number of benchmarks, we find that the search autonomously finds 1) Merges that further boost single-objective performance, even on tasks the model has already been finetuned on, and 2) Merges that optimize multi-objective frontiers across tasks. Effective merges are found with limited compute, e.g. within less than 500 search steps.

large language model, machine learning, natural language, (17 more...)

2502.0403

Country:

Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.04)
Europe > Italy > Lazio > Rome (0.04)
Asia > China (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)

Neural Information Processing SystemsFeb-5-2025, 04:07:44 GMT

Review for NeurIPS paper: Sub-linear Regret Bounds for Bayesian Optimisation in Unknown Search Spaces

Additional Feedback: Algorithm 2. X_t is never defined. I assumed that X_t is defined by Equation 2 like Algorithm 1. Authors mentioned the same computational budget for acquisition function optimization. What is the optimizer though? Constrained optimization of the acquisition function inside H_t (Equation 3) does not seem trivial. It isn't mentioned anywhere how the acquisition funciton was optimized.

bayesian optimisation, sub-linear regret bound, unknown search space, (3 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.61)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.47)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.47)

Neural Information Processing SystemsFeb-5-2025, 04:07:37 GMT

Review for NeurIPS paper: Sub-linear Regret Bounds for Bayesian Optimisation in Unknown Search Spaces

The paper has been discussed after the rebuttal that the reviewers found useful and actionable (e.g., concerns about the confidence bound). The paper is recommended for acceptance. All reviewers have acknowledged that the paper is well motivated, well written and establishes a nice interplay between theory and a practical problem of interest.

bayesian optimisation, sub-linear regret bound, unknown search space, (2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.40)
Information Technology > Artificial Intelligence > Machine Learning (0.40)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.40)

arXiv.org Artificial IntelligenceFeb-5-2025

The Logical Implication Steering Method for Conditional Interventions on Transformer Generation

Kalajdzievski, Damjan

The field of mechanistic interpretability in pre-trained transformer models has demonstrated substantial evidence supporting the ''linear representation hypothesis'', which is the idea that high level concepts are encoded as vectors in the space of activations of a model. Studies also show that model generation behavior can be steered toward a given concept by adding the concept's vector to the corresponding activations. We show how to leverage these properties to build a form of logical implication into models, enabling transparent and interpretable adjustments that induce a chosen generation behavior in response to the presence of any given concept. Our method, Logical Implication Model Steering (LIMS), unlocks new hand engineered reasoning capabilities by integrating neuro-symbolic logic into pre-trained transformer models.

conditional intervention, machine learning, natural language, (16 more...)

2502.03618

Country:

South America > Colombia > Meta Department > Villavicencio (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
(2 more...)

Genre: Research Report > New Finding (0.88)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)