ramo
Data-adaptive Safety Rules for Training Reward Models
Li, Xiaomin, Gao, Mingye, Zhang, Zhiwei, Fan, Jingxuan, Li, Weiyu
Reinforcement Learning from Human Feedback (RLHF) is commonly employed to tailor models to human preferences, especially to improve the safety of outputs from large language models (LLMs). Traditionally, this method depends on selecting preferred responses from pairs. However, due to the variability in human opinions and the challenges in directly comparing two responses, there is an increasing trend towards fine-grained annotation approaches that evaluate responses using multiple targeted metrics or rules. The challenge lies in efficiently choosing and applying these rules to handle the diverse range of preference data. In this paper, we propose a dynamic method that adaptively selects the most important rules for each response pair. We introduce a mathematical framework that utilizes the maximum discrepancy across paired responses and demonstrate theoretically that this approach maximizes the mutual information between the rule-based annotations and the underlying true preferences. We then train an 8B reward model using this adaptively labeled preference dataset and assess its efficacy using RewardBench. As of January 25, 2025, our model achieved the highest safety performance on the leaderboard, surpassing various larger models.
- North America > United States > Virginia (0.04)
- North America > United States > Pennsylvania (0.04)
- North America > United States > Massachusetts (0.04)
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
- Government (1.00)
- Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.68)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.98)
RAMO: Retrieval-Augmented Generation for Enhancing MOOCs Recommendations
Massive Open Online Courses (MOOCs) have significantly enhanced educational accessibility by offering a wide variety of courses and breaking down traditional barriers related to geography, finance, and time. However, students often face difficulties navigating the vast selection of courses, especially when exploring new fields of study. Driven by this challenge, researchers have been exploring course recommender systems to offer tailored guidance that aligns with individual learning preferences and career aspirations. These systems face particular challenges in effectively addressing the ``cold start'' problem for new users. Recent advancements in recommender systems suggest integrating large language models (LLMs) into the recommendation process to enhance personalized recommendations and address the ``cold start'' problem. Motivated by these advancements, our study introduces RAMO (Retrieval-Augmented Generation for MOOCs), a system specifically designed to overcome the ``cold start'' challenges of traditional course recommender systems. The RAMO system leverages the capabilities of LLMs, along with Retrieval-Augmented Generation (RAG)-facilitated contextual understanding, to provide course recommendations through a conversational interface, aiming to enhance the e-learning experience.
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
- Europe > France > Île-de-France > Paris > Paris (0.04)
- Instructional Material > Online (1.00)
- Instructional Material > Course Syllabus & Notes (1.00)
- Education > Educational Technology > Educational Software > Computer Based Training (1.00)
- Education > Educational Setting > Online (1.00)
- Information Technology > Enterprise Applications > Human Resources > Learning Management (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- (2 more...)
- North America > United States > California > Los Angeles County > Los Angeles (0.15)
- North America > United States > New York > Bronx County > New York City (0.05)
Love is in the A.I.r: Bronx mom, 36, marries virtual husband 'Eren'
While artificial intelligence is stoking fears around the world - the technology has given one New York woman the love of her life. Rosanna Ramos, a petite, active 36-year-old from the Bronx, 'married' Eren Kartal this year - virtually of course - after creating him on an online AI companion site in 2022. Their relationship developed slowly initially, but Ms Ramos fell for Eren. 'He didn't come with baggage,' she said. Eren'works' as a medical professional and enjoys writing as a hobby, things he's told Rosanna as they got to know each other Rosanna claims to be pregnant with Eren's child'I could tell him stuff, and he wouldn't be like, "Oh, no, you can't say stuff like that. Oh no, you're not allowed to feel that way," you know, and then start arguing with me,' Ramos said.
AI Weekly: UN recommendations point to need for AI ethics guidelines
The U.N.'s Educational, Scientific, and Cultural Organization (UNESCO) this week approved a series of recommendations for AI ethics, which aim to recognize that AI can "be of great service" but also raise "fundamental … concerns." UNESCO's 193 member countries, including Russia and China, agreed to conduct AI impact assessments and place "strong enforcement mechanisms and remedial actions" to protect human rights. "The world needs rules for artificial intelligence to benefit humanity. The recommendation[s] on the ethics of AI is a major answer," UNESCO chief Audrey Azoulay said in a press release. "It sets the first global normative framework while giving States the responsibility to apply it at their level. UNESCO will support its … member states in its implementation and ask them to report regularly on their progress and practices."
sbp-env: Sampling-based Motion Planners' Testing Environment
Sampling-based motion planners' testing environment (sbp-env) is a full feature framework to quickly test different sampling-based algorithms for motion planning. sbp-env focuses on the flexibility of tinkering with different aspects of the framework, and had divided the main planning components into two categories (i) samplers and (ii) planners. The focus of motion planning research had been mainly on (i) improving the sampling efficiency (with methods such as heuristic or learned distribution) and (ii) the algorithmic aspect of the planner using different routines to build a connected graph. Therefore, by separating the two components one can quickly swap out different components to test novel ideas.
All dressed up with nowhere to go: Cosplaying in the pandemic
It took Michelle Anderson a month to create her E3 2019 outfit. It took her another hour to put it on. She wore a wig with red Afro puffs, an army-green tactical vest and fake bloodstained bandage. She completed the look with medical gloves and a mask looped around her neck, then took one last look in the mirror before she headed out the door. She was dressed as Lifeline, a playable combat medic from the video game "Apex Legends."
- Health & Medicine (1.00)
- Leisure & Entertainment > Games > Computer Games (0.92)
- Information Technology > Communications > Social Media (0.98)
- Information Technology > Artificial Intelligence > Games (0.61)
Drone video captures dolphins sharing fish and getting frisky in Mexico
It turns out humans are not the only creatures that use food as foreplay. Researchers in southwestern Mexico have recorded a group of rough-toothed dolphins sharing a meal and getting frisky. A drone camera caught two dolphins passing a piece of fish back and forth in what may be the first video of the conduct. The repast seemed to inspire some amorous behavior, as well, with two males initiating sexual encounters with another member of the pod. Rough-toothed dolphins spend up to 80 percent of their time in the ocean depths, making them extremely difficult to study.
- North America > Mexico (0.62)
- South America > Bolivia > Potosí Department > Tomás Frías Province > Potosí (0.05)
- Pacific Ocean (0.05)
- (3 more...)
A Formal Critique of the Value of the Colombian P\'aramo
ESF thus beckons the valuation of ecosystem services (VES) as a means to signalling nature's contribution to the (re)production of value (Barbier et al., 2009; Villa et al., 2009; Fisher et al., 2010; Gómez-Baggethun et al., 2016); for value is the central category of modern capitalist societies, and the valorisation of value -- i.e., economic growth sublimated into economic development -- their driving force (see, e.g., Mankiw (2016) and Holden et al. (2017)). VES is, in this sense, inscribed in an interpretive approach to modern capitalist praxis, not only invoking assumptions that are instrumentally validated in a retroactive manner, but also taking for granted precisely those historical and material conditions which VES is meant to interpret and, in doing so, reproduce. Overlooking the historical basis of ESF and VES has important practical consequences. When VES practitioners elicit value, a moment or specific field of the social praxis embodied in the valorisation of value is inaugurated, allowing value to mediate other social constructs built around the idea of nature. Since the patterns of actions that make up the capitalist social praxis are presupposed within this new ambit, value takes on a transhistorical quality that justifies its allencompassing and unreflective usage (see, e.g., Badura et al. (2016) and Gómez-Baggethun and Martín-López (2015)).
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- South America > Colombia > Bogotá D.C. > Bogotá (0.04)
- North America > United States > Montana (0.04)
- (8 more...)
- Law (0.95)
- Food & Agriculture > Agriculture (0.93)
- Leisure & Entertainment (0.92)
- Banking & Finance (0.87)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.46)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.45)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.45)
Building a 3D Printer That Self-Corrects With AI
Most 3D-printed objects are prototypes or one-off creations, in large part because 3D printing is more finicky than traditional manufacturing. Because the process works by adding layers of material atop each other, subtle changes in temperatures or material quality can result in imperfections and hours of lost work. Inkbit, a Boston-area 3D printing company, is using machine vision and artificial intelligence to help its equipment course correct. Javier Ramos, co-founder and director of hardware at Inkbit, said Inkbit's machine vision technology instantly scans the objects it prints, relying on AI to correct for any mistakes made. He imagines a future where Inkbit's tech is used on every factory floor, printing out millions of products more cheaply -- and faster -- than traditional manufacturing processes ever could.