AITopics

2512.09097

Country: North America > United States (0.28)

Genre: Research Report (1.00)

Industry:

Automobiles & Trucks (0.48)
Transportation (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.49)

arXiv.org Artificial IntelligenceDec-4-2025

TRACED: Transition-aware Regret Approximation with Co-learnability for Environment Design

Cho, Geonwoo, Im, Jaegyun, Lee, Jihwan, Yi, Hojun, Kim, Sejin, Kim, Sundong

Generalizing deep reinforcement learning agents to unseen environments remains a significant challenge. One promising solution is Unsupervised Environment Design (UED), a co-evolutionary framework in which a teacher adaptively generates tasks with high learning potential, while a student learns a robust policy from this evolving curriculum. Existing UED methods typically measure learning potential via regret, the gap between optimal and current performance, approximated solely by value-function loss. Building on these approaches, we introduce the transition-prediction error as an additional term in our regret approximation. To capture how training on one task affects performance on others, we further propose a lightweight metric called Co-Learnability. By combining these two measures, we present Transition-aware Regret Approximation with Co-learnability for Environment Design (TRACED). Empirical evaluations show that TRACED produces curricula that improve zero-shot generalization over strong baselines across multiple benchmarks. Ablation studies confirm that the transition-prediction error drives rapid complexity ramp-up and that Co-Learnability delivers additional gains when paired with the transition-prediction error. These results demonstrate how refined regret approximation and explicit modeling of task relationships can be leveraged for sample-efficient curriculum design in UED. Project Page: https://geonwoo.me/traced/

machine learning, natural language, reinforcement learning, (18 more...)

2506.19997

Genre: Research Report > New Finding (1.00)

Industry: Education > Curriculum (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Ramos-Torres, Abraham, Montoya, Laura N.

Evaluating Investment Risks in LATAM AI Startups: Ranking of Investment Potential and Framework for Valuation

arXiv.org Artificial IntelligenceSep-17-2024

The growth of the tech startup ecosystem in Latin America (LATAM) is driven by innovative entrepreneurs addressing market needs across various sectors. However, these startups encounter unique challenges and risks that require specific management approaches. This paper explores a case study with the Total Addressable Market (TAM), Serviceable Available Market (SAM), and Serviceable Obtainable Market (SOM) metrics within the context of the online food delivery industry in LATAM, serving as a model for valuing startups using the Discounted Cash Flow (DCF) method. By analyzing key emerging powers such as Argentina, Colombia, Uruguay, Costa Rica, Panama, and Ecuador, the study highlights the potential and profitability of AI-driven startups in the region through the development of a ranking of emerging powers in Latin America for tech startup investment. The paper also examines the political, economic, and competitive risks faced by startups and offers strategic insights on mitigating these risks to maximize investment returns. Furthermore, the research underscores the value of diversifying investment portfolios with startups in emerging markets, emphasizing the opportunities for substantial growth and returns despite inherent risks.

artificial intelligence, cloud computing, machine learning, (15 more...)

2410.03552

Country:

North America > Central America (0.52)
South America > Argentina (0.27)
South America > Colombia (0.26)
(25 more...)

Genre: Research Report (1.00)

Industry:

Law (1.00)
Government (1.00)
Banking & Finance > Trading (1.00)
(3 more...)

Technology:

Information Technology > Cloud Computing (1.00)
Information Technology > Artificial Intelligence > Applied AI (0.68)
Information Technology > Artificial Intelligence > Machine Learning (0.68)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.46)

Matthews, Michael, Beukman, Michael, Ellis, Benjamin, Samvelyan, Mikayel, Jackson, Matthew, Coward, Samuel, Foerster, Jakob

Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning

arXiv.org Artificial IntelligenceJun-3-2024

Benchmarks play a crucial role in the development and analysis of reinforcement learning (RL) algorithms. We identify that existing benchmarks used for research into open-ended learning fall into one of two categories. Either they are too slow for meaningful research to be performed without enormous computational resources, like Crafter, NetHack and Minecraft, or they are not complex enough to pose a significant challenge, like Minigrid and Procgen. To remedy this, we first present Craftax-Classic: a ground-up rewrite of Crafter in JAX that runs up to 250x faster than the Python-native original. A run of PPO using 1 billion environment interactions finishes in under an hour using only a single GPU and averages 90% of the optimal reward. To provide a more compelling challenge we present the main Craftax benchmark, a significant extension of the Crafter mechanics with elements inspired from NetHack. Solving Craftax requires deep exploration, long term planning and memory, as well as continual adaptation to novel situations as more of the world is discovered. We show that existing methods including global and episodic exploration, as well as unsupervised environment design fail to make material progress on the benchmark. We believe that Craftax can for the first time allow researchers to experiment in a complex, open-ended environment with limited computational resources.

craftax, lightning-fast benchmark, timestep, (14 more...)

2402.16801

Country:

Europe > Austria > Vienna (0.14)
Europe > Sweden > Skåne County > Malmö (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.64)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

arXiv.org Artificial IntelligenceApr-1-2024

PhysORD: A Neuro-Symbolic Approach for Physics-infused Motion Prediction in Off-road Driving

Zhao, Zhipeng, Li, Bowen, Du, Yi, Fu, Taimeng, Wang, Chen

Motion prediction is critical for autonomous off-road driving, however, it presents significantly more challenges than on-road driving because of the complex interaction between the vehicle and the terrain. Traditional physics-based approaches encounter difficulties in accurately modeling dynamic systems and external disturbance. In contrast, data-driven neural networks require extensive datasets and struggle with explicitly capturing the fundamental physical laws, which can easily lead to poor generalization. By merging the advantages of both methods, neuro-symbolic approaches present a promising direction. These methods embed physical laws into neural models, potentially significantly improving generalization capabilities. However, no prior works were evaluated in real-world settings for off-road driving. To bridge this gap, we present PhysORD, a neural-symbolic approach integrating the conservation law, i.e., the Euler-Lagrange equation, into data-driven neural models for motion prediction in off-road driving. Our experiments showed that PhysORD can accurately predict vehicle motion and tolerate external disturbance by modeling uncertainties. It outperforms existing methods both in accuracy and efficiency and demonstrates data-efficient learning and generalization ability in long-term prediction.

neural network, physord, prediction, (17 more...)

2404.01596

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
North America > United States > New York > Erie County > Buffalo (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Automobiles & Trucks (0.95)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.90)

Parker-Holder, Jack, Jiang, Minqi, Dennis, Michael, Samvelyan, Mikayel, Foerster, Jakob, Grefenstette, Edward, Rocktäschel, Tim

Evolving Curricula with Regret-Based Environment Design

arXiv.org Artificial IntelligenceSep-30-2023

It remains a significant challenge to train generally capable agents with reinforcement learning (RL). A promising avenue for improving the robustness of RL agents is through the use of curricula. One such class of methods frames environment design as a game between a student and a teacher, using regret-based objectives to produce environment instantiations (or levels) at the frontier of the student agent's capabilities. These methods benefit from their generality, with theoretical guarantees at equilibrium, yet they often struggle to find effective levels in challenging design spaces. By contrast, evolutionary approaches seek to incrementally alter environment complexity, resulting in potentially open-ended learning, but often rely on domain-specific heuristics and vast amounts of computational resources. In this paper we propose to harness the power of evolution in a principled, regret-based curriculum. Our approach, which we call Adversarially Compounding Complexity by Editing Levels (ACCEL), seeks to constantly produce levels at the frontier of an agent's capabilities, resulting in curricula that start simple but become increasingly complex. ACCEL maintains the theoretical benefits of prior regret-based methods, while providing significant empirical gains in a diverse set of environments. An interactive version of the paper is available at accelagent.github.io.

accel, agent, curriculum, (14 more...)

2203.01302

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
Africa > Ethiopia > Addis Ababa > Addis Ababa (0.04)
North America > United States > New York > New York County > New York City (0.04)
(6 more...)

Genre: Research Report (1.00)

Industry:

Education (1.00)
Leisure & Entertainment > Games (0.92)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(3 more...)

#artificialintelligenceAug-30-2022, 02:15:35 GMT

Accel.AI

Accel.AI was founded in September of 2016, our mission is to drive artificial intelligence for social impact initiatives. We focus on integrating AI and social impact through research, consulting, and workshops, on ethical AI development and applied AI engineering. Our target audience includes underrepresented groups, tech companies, including startups and large corporations, governments, and educating individuals experiencing job loss due to automation. We work with companies, professionals, and students around the world.

accel

Industry: Social Sector (0.71)

Technology: Information Technology > Artificial Intelligence (1.00)

#artificialintelligenceAug-3-2022, 10:55:31 GMT

Practical Principles for AI Ethics -- Accel.AI

Principles of AI are a top-down approach to ethics for artificial intelligence (AI). Recently, we have been seeing lists of principles for AI ethics popping up everywhere. They are very useful, not only for AI and its impact but also on a larger social level. Because of AI, people are thinking about ethics in a whole new way: How do we define and digest ethics in order to codify it? Previously I have written an analysis of top-down and bottom-up approaches to ethics for AI, and then we explored the bottom-up method of reinforcement learning for teaching AI ethics.

ai ethics, artificial intelligence, ethics, (5 more...)

Technology: Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)

#artificialintelligenceApr-12-2022, 10:20:48 GMT

Deep tech start-up Spyne raises $7 mn led by Accel

Spyne a deep tech start-up helping businesses and marketplaces create high-quality product images and videos at scale with AI has raised $7 million in their latest funding round. Led by Accel, the funding round also saw the participation from other marquee investors including Storm Ventures, Smile Group, Pentathlon Ventures, Core91, and prominent founders/CXOs from leading Internet companies. The fresh capital will be invested in acquiring the right talent, bolstering global expansion, including in the US market, and setting up a state-of-the-art computer vision lab for deeper R&D in the space. The brand also intends to expand its technological horizons into the field of AR / VR to build products for metaverse and omniverse. Founded in 2018 by Sanjay Kumar and Deepti Prasad, Spyne develops 100 per cent automatic, industry-first AI image processing products to help large e-commerce marketplaces in the automotive, fashion, and retail industry enhance the visual value of the images without a physical studio.

accel, deep tech start-up spyne raise, product image and video, (3 more...)

Country:

North America > United States (0.27)
Asia > India (0.07)

Industry:

Information Technology (0.69)
Media > News (0.40)

Technology: Information Technology > Artificial Intelligence (1.00)

#artificialintelligenceFeb-3-2022, 23:05:53 GMT

Machine Learning Algorithms Cheat Sheet -- Accel.AI

Machine Learning can be divided into three different types of learning: Unsupervised Learning, Supervised Learning, and Semi-supervised Learning. Unsupervised learning uses information data that is not labeled, that way the machine should work with no guidance according to patterns, similarities, and differences. On the other hand, supervised learning has a presence of a "teacher", who is in charge of training the machine by labeling the data to work with. Next, the machine receives some examples that allow it to produce a correct outcome. But there's a hybrid approach for these types of learning, this Semi-supervised learning works with both labeled and unlabeled data. This method uses a tiny data set of labeled data to train and label the rest of the data with corresponding predictions, finally giving a solution to the problem.

dimensionality, information, machine learning algorithm cheat sheet, (2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (1.00)