copper
Reflective Multi-Agent Collaboration based on Large Language Models
Benefiting from the powerful language expression and planning capabilities of Large Language Models (LLMs), LLM-based autonomous agents have achieved promising performance in various downstream tasks. Recently, based on the development of single-agent systems, researchers propose to construct LLM-based multi-agent systems to tackle more complicated tasks. In this paper, we propose a novel framework, named COPPER, to enhance the collaborative capabilities of LLM-based agents with the self-reflection mechanism. To improve the quality of reflections, we propose to fine-tune a shared reflector, which automatically tunes the prompts of actor models using our counterfactual PPO mechanism. On the one hand, we propose counterfactual rewards to assess the contribution of a single agent's reflection within the system, alleviating the credit assignment problem. On the other hand, we propose to train a shared reflector, which enables the reflector to generate personalized reflections according to agent roles, while reducing the computational resource requirements and improving training stability. We conduct experiments on three datasets to evaluate the performance of our model in multi-hop question answering, mathematics, and chess scenarios. Experimental results show that COPPER possesses stronger reflection capabilities and exhibits excellent generalization performance across different actor models.
- Asia > China (0.04)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
- (4 more...)
- Education (0.68)
- Information Technology (0.46)
The Download: gene-edited babies, and cleaning up copper
Plus: the FDA's drug regulator has resigned A West Coast biotech entrepreneur says he's secured $30 million to form a public-benefit company to study how to safely create genetically edited babies, marking the largest known investment into the taboo technology. The new company, called Preventive, is being formed to research so-called "heritable genome editing," in which the DNA of embryos would be modified by correcting harmful mutations or installing beneficial genes. The goal would be to prevent disease. Creating genetically edited humans remains controversial. The first scientist to do it, in China, was imprisoned for three years. The procedure remains illegal in many countries, including the US, and doubts surround its usefulness as a form of medicine.
- Asia > China (0.51)
- Asia > South Korea (0.30)
- North America > United States > New York (0.05)
- North America > United States > Massachusetts (0.05)
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
- Government > Regional Government (0.69)
- Materials > Metals & Mining > Copper (0.49)
- Information Technology > Communications > Social Media (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.49)
- Asia > China (0.04)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
- (4 more...)
- Media > Film (1.00)
- Leisure & Entertainment (1.00)
- Education (0.68)
- Information Technology > Communications (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
Is chlorophyll actually good for you?
Is chlorophyll actually good for you? Some benefits are rooted in science, but others are just leafy lore. Thanks to social media, chlorophyll water is now an internet sensation, promising everything from clearer skin to better breath. But does the green liquid live up to the hype? Breakthroughs, discoveries, and DIY tips sent every weekday.
- North America > United States > Ohio (0.04)
- North America > United States > New York > Kings County > New York City (0.04)
- North America > United States > California (0.04)
- Asia > China (0.04)
- Research Report > Experimental Study (0.70)
- Research Report > New Finding (0.68)
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
- Health & Medicine > Therapeutic Area > Oncology (0.95)
- Health & Medicine > Consumer Health (0.70)
Reflective Multi-Agent Collaboration based on Large Language Models
Benefiting from the powerful language expression and planning capabilities of Large Language Models (LLMs), LLM-based autonomous agents have achieved promising performance in various downstream tasks. Recently, based on the development of single-agent systems, researchers propose to construct LLM-based multi-agent systems to tackle more complicated tasks. In this paper, we propose a novel framework, named COPPER, to enhance the collaborative capabilities of LLM-based agents with the self-reflection mechanism. To improve the quality of reflections, we propose to fine-tune a shared reflector, which automatically tunes the prompts of actor models using our counterfactual PPO mechanism. On the one hand, we propose counterfactual rewards to assess the contribution of a single agent's reflection within the system, alleviating the credit assignment problem.
Leveraging Large Language Models for Active Merchant Non-player Characters
Kim, Byungjun, Kim, Minju, Seo, Dayeon, Kim, Bugeun
We highlight two significant issues leading to the passivity of current merchant non-player characters (NPCs): pricing and communication. While immersive interactions have been a focus, negotiations between merchant NPCs and players on item prices have not received sufficient attention. First, we define passive pricing as the limited ability of merchants to modify predefined item prices. Second, passive communication means that merchants can only interact with players in a scripted manner. To tackle these issues and create an active merchant NPC, we propose a merchant framework based on large language models (LLMs), called MART, which consists of an appraiser module and a negotiator module. We conducted two experiments to guide game developers in selecting appropriate implementations by comparing different training methods and LLM sizes. Our findings indicate that finetuning methods, such as supervised finetuning (SFT) and knowledge distillation (KD), are effective in using smaller LLMs to implement active merchant NPCs. Additionally, we found three irregular cases arising from the responses of LLMs. We expect our findings to guide developers in using LLMs for developing active merchant NPCs.
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > Florida > Miami-Dade County > Miami (0.04)
- North America > United States > Illinois > Cook County > Chicago (0.04)
- (7 more...)
Metal Price Spike Prediction via a Neurosymbolic Ensemble Approach
Lee, Nathaniel, Ngu, Noel, Sahdev, Harshdeep Singh, Motaganahall, Pramod, Chowdhury, Al Mehdi Saadat, Xi, Bowen, Shakarian, Paulo
Predicting price spikes in critical metals such as Cobalt, Copper, Magnesium, and Nickel is crucial for mitigating economic risks associated with global trends like the energy transition and reshoring of manufacturing. While traditional models have focused on regression-based approaches, our work introduces a neurosymbolic ensemble framework that integrates multiple neural models with symbolic error detection and correction rules. This framework is designed to enhance predictive accuracy by correcting individual model errors and offering interpretability through rule-based explanations. We show that our method provides up to 6.42% improvement in precision, 29.41% increase in recall at 13.24% increase in F1 over the best performing neural models. Further, our method, as it is based on logical rules, has the benefit of affording an explanation as to which combination of neural models directly contribute to a given prediction.
- Materials > Metals & Mining (1.00)
- Banking & Finance (1.00)
Dominion: A New Frontier for AI Research
Halawi, Danny, Sarmasi, Aron, Saltzen, Siena, McCoy, Joshua
Games have long played a role in AI research, both as a test-bed, and as a moving goal-post, constantly driving innovation. From the heyday of chess agents, when Deep Blue beat Gary Kasparov, to more recent advances, like AlphaGo's dark horse ascent to fame, games have both assisted AI research and provided something to aim for. As the AIs got better, the games they were applied to also got more complex. New game mechanics, such as the fog of war in StarCraft and the stochasticity of Poker, pushed researchers to adapt their methods to ever greater generality. In this paper, we argue that the deck-building strategy game Dominion [1] deserves to join the ranks of AI benchmark games, providing an RL-based bot in service of that benchmark. Dominion has all of the abovementioned elements, but it also incorporates a mechanic that is not present in other popular RL benchmarks: every game is played with a different set of cards. Since each dominion card has a specific rule printed on it, and the set of 10 cards for a game are randomly picked from among hundreds of cards, no two games of Dominion can be approached the same way. Thus a key part of playing Dominion is adapting one's inductive bias of how to play to the specific cards on the table.
- Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
- North America > United States > Oklahoma (0.04)
- North America > United States > California > Yolo County > Davis (0.04)
- (3 more...)
- Information Technology > Artificial Intelligence > Machine Learning (1.00)
- Information Technology > Artificial Intelligence > Games > Chess (0.54)
Caught trying to steal copper piping, he fled across freeway and was run down, police say
A 32-year-old man died Monday while he was attempting to cross a freeway in Orange County after apparently trying to steal copper piping from a local business. The crash occurred around 11:35 a.m. The man, identified as Alberto Huizar, 32, was pronounced dead at the scene, according to the Orange County Sheriff's Department. Huizar appeared to be homeless, authorities said. First responders arrived to find a woman pinned underneath a self-driving car with'multiple traumatic injuries.'
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
- Transportation > Ground > Road (0.47)