Goto

Collaborating Authors

 broadway


AMaPO: Adaptive Margin-attached Preference Optimization for Language Model Alignment

Deng, Ruibo, Feng, Duanyu, Lei, Wenqiang

arXiv.org Artificial Intelligence

Offline preference optimization offers a simpler and more stable alternative to RLHF for aligning language models. However, their effectiveness is critically dependent on ranking accuracy, a metric where further gains are highly im-pactful. This limitation arises from a fundamental problem that we identify and formalize as the Overfitting-Underfitting Dilemma: current margin designs cause models to apply excessive, wasteful gradients to correctly ranked samples (over-fitting) while providing insufficient corrective signals for misranked ones (underfitting). To resolve this dilemma, we propose Adaptive Margin-attached Preference Optimization (AMaPO), a simple yet principled algorithm. AMaPO employs an instance-wise adaptive margin, refined by Z-normalization and exponential scaling, which dynamically reallocates learning effort by amplifying gradients for mis-ranked samples and suppressing them for correct ones. Extensive experiments on widely used benchmarks demonstrate that AMaPO not only achieves better ranking accuracy and superior downstream alignment performance, but targeted analysis also confirms that it successfully mitigates the core overfitting and underfitting issues.


RMoA: Optimizing Mixture-of-Agents through Diversity Maximization and Residual Compensation

Xie, Zhentao, Han, Chengcheng, Shi, Jinxin, Cui, Wenjun, Zhao, Xin, Wu, Xingjiao, Zhao, Jiabao

arXiv.org Artificial Intelligence

Although multi-agent systems based on large language models show strong capabilities on multiple tasks, they are still limited by high computational overhead, information loss, and robustness. Inspired by ResNet's residual learning, we propose Residual Mixture-of-Agents (RMoA), integrating residual connections to optimize efficiency and reliability. To maximize information utilization from model responses while minimizing computational costs, we innovatively design an embedding-based diversity selection mechanism that greedily selects responses via vector similarity. Furthermore, to mitigate iterative information degradation, we introduce a Residual Extraction Agent to preserve cross-layer incremental information by capturing inter-layer response differences, coupled with a Residual Aggregation Agent for hierarchical information integration. Additionally, we propose an adaptive termination mechanism that dynamically halts processing based on residual convergence, further improving inference efficiency. RMoA achieves state-of-the-art performance on the benchmarks of across alignment, mathematical reasoning, code generation, and multitasking understanding, while significantly reducing computational overhead. Code is available at https://github.com/mindhunter01/RMoA.


Clifton's is reopening (again), this time in a changed downtown

Los Angeles Times

Andrew Meieran is about to reopen the doors of one of L.A.'s legendary restaurants in a bid to once again make it an offbeat dining and entertainment destination. Meieran is the proprietor of Clifton's Republic, the kitschy, forest-themed restaurant on Broadway in downtown's Historic Core that for nearly a century served up comfort food such as pot roast, mashed potatoes and Jell-O. The five-story restaurant and bar complex has been closed for the last year after a burst water pipe caused a flood that destroyed the kitchen and collapsed the ceilings on three floors. Clifton's is scheduled to reopen next month after extensive repairs and renovations. Among the changes patrons will find is a basement venue several years in the making that Meieran said is "dedicated to innovation and the magic of experiences" with "entertainment, cocktails and culinary offerings." Andrew Meieran has ambitious vision for Clifton's Cafeteria Meieran is keeping details under wraps for now, but he has demonstrated a knack for creating provocative entertainment and dining venues through an obsessive attention to offbeat details, as well as a willingness to spend more money than most real estate developers to realize his vision and preserve the historic integrity of his projects.


On the Efficacy of Eviction Policy for Key-Value Constrained Generative Language Model Inference

Ren, Siyu, Zhu, Kenny Q.

arXiv.org Artificial Intelligence

Despite the recent success associated with Large Language Models~(LLMs), they are notably cost-prohibitive to deploy in resource-constrained environments due to their excessive memory and computational demands. In addition to model parameters, the key-value cache is also stored in GPU memory, growing linearly with batch size and sequence length. As a remedy, recent works have proposed various eviction policies for maintaining the overhead of key-value cache under a given budget. This paper embarks on the efficacy of existing eviction policies in terms of \textit{importance score calculation} and \textit{eviction scope construction}. We identify the deficiency of prior policies in these two aspects and introduce RoCo, a \underline{r}\underline{o}bust \underline{c}ache \underline{o}mission policy based on temporal attention scores and robustness measures. Extensive experimentation spanning prefilling and auto-regressive decoding stages validates the superiority of RoCo. Finally, we release EasyKV, a versatile software package dedicated to user-friendly key-value constrained generative inference. Code available at \url{https://github.com/DRSY/EasyKV}.


Sr. Machine Learning Engineer at Ask Media Group - United States - Remote

#artificialintelligence

Sr. Machine Learning Engineer at Ask Media Group, LLC: Build innovative solutions to predict and prescribe strategies that bring a competitive advantage to our business products. Build and productionize models (test and deploy an application model to production), fine tune models for better effectiveness, and build for scalability and stability. A Master's degree or foreign equivalent in Computer Science, Computer Engineering, or a related discipline followed by 3 years of experience in a machine learning- or data science engineering-occupation. To apply, please mail cover letter and resume, referencing Req.


Coming up from the USA TODAY Network: Vote on our NFL Power Rankings, Springsteen on Broadway, more

USATODAY - Tech Top Stories

Cast your vote for our NFL Power Rankings, via our Sports section: Our football experts rank the NFL teams from 1-32 every Tuesday, and now we want you to do the same. Check out their rankings, then submit your own and compare! 10 a.m. Dive back into the world of The Da Vinci Code and see what we thought of the fifth installment in the series. See our NCAA Bowl predictions, via our Sports section: USA TODAY Sports' Erick Smith projects the match-ups for every NCAA football bowl game. Google Pixel and Home 2 reveal, via Facebook Live: We'll be in San Francisco where the new iPhone rival Pixel 2 and Amazon Echo rival Google Home 2 will be introduced.


Micro drones swarm above Metallica

Robohub

Metallica's European WorldWired tour, which opened to an ecstatic crowd of 15,000 in Copenhagen's sold-out Royal Arena this Saturday, features a swarm of micro drones flying above the band. Shortly after the band breaks into their hit single "Moth Into Flame", dozens of micro drones start emerging from the stage, forming a large rotating circle above the stage. As the music builds, more and more drones emerge and join the formation, creating increasingly complex patterns, culminating in a choreography of three interlocking rings that rotate in position. This show's debut marks the world's first autonomous drone swarm performance in a major touring act. Unlike previous drone shows, this performance features indoor drones, flying above performers and right next to throngs of concert viewers in a live event setting.


He's Lin-Manuel's right-hand man: the 'Hamilton' arranger who hasn't let hearing loss derail the dream

Los Angeles Times

Alex Lacamoire has hearing loss. But the Tony-winning music director of "Hamilton" wants you to know, he's no Beethoven. He's heard that you can see teeth marks on the wood inside Beethoven's piano "because he would bite it to try to be able to hear the vibrations," Lacamoire said. My hearing is not that bad." When he was 2, growing up near Los Angeles' Koreatown, Lacamoire would sit in front of the stereo and stare into the speaker, drawn to music like a drug. When he was 3, his mother observed him sitting too close to the TV, following the characters on "Sesame Street" with his eyes. "I noticed that when I called him, he'd run away like he wasn't paying attention," Maria Lacamoire said. She took him for a hearing test, where it was discovered that he had mild hearing loss. "I think I was a little bit too young for it to really understand," Lacamoire said. "All I remember is, like, oh wow, they're putting this weird goop in my ear to mold me [for hearing aids] and then I walked away and I had these little apparatuses behind my ears." When he was 6, the school district recommended that Lacamoire attend a special class that combined sign-language instruction along with spoken language. "That was devastating for me," his mother said, "because I didn't notice any other problem with him, because he was very smart." She appealed the decision, and Lacamoire was given an IQ test. He not only joined a mainstream class at Commonwealth Avenue Elementary School but also skipped the first grade. "Alex was the most outstanding student I ever had," said his second-grade teacher, Dorothy Chapman, who taught at Commonwealth for 25 years and retired in 2002. Children with hearing loss, especially when that loss is identified late, often lag behind their peers because they've absorbed less vocabulary and less information. Chapman said the charming little 6-year-old would finish his assignments in five minutes, whereas it took his classmates 20, so she would give him third-grade work. "I've just always been drawn to design, whether it's uniformity or harmony -- and by harmony I mean symmetry and balance and those kinds of things," Lacamoire said. He found beauty and design in the piano, and starting lessons at age 4. After his family moved to Miami when he was 9, he attended an arts high school and then the New World School of the Arts. For Lacamoire, music was "as fluid to me as writing down words.


Indoor drones make history on Broadway

Robohub

For the first time on Broadway human and drone performances fuse to create a new form of artistic expression. The magic happened in Cirque du Soleil's first musical on Broadway: 'Paramour' at the Lyric Theatre. The show is themed on the Golden Age of Hollywood and follows the life of a poet who is forced to choose between love and art. The contributions of the technology firm Verity Studios include the choreography of the drone show segment, the frame and lighting design of the drone costumes, and all underlying drone technologies. The system was operated by the show's automation team, with Verity Studios providing maintenance services twice per year.


A Protest Musical for the Trump Era

The New Yorker

Five actors gathered in a room on Lafayette Street, in downtown Manhattan, to start rehearsing a new work for the Public Theatre, "Joan of Arc: Into the Fire." Written by David Byrne, formerly of the Talking Heads, the show recast the enduring, improbable story of Joan--a teen-age girl in medieval France who experienced divine visions, led an army to defeat an occupying power, and was burned at the stake for heresy--as a rock musical that spoke to the current political moment. It was early January, and, that morning, U.S. intelligence officials had arrived at Trump Tower to brief the President-elect, Donald Trump, on the findings of an investigation into the recent election, in which they had concluded that President Vladimir Putin, of Russia, had acted to insure the defeat of Hillary Clinton. Inauguration Day was looming, and the rehearsal room had a troubled mood that reflected more than the ordinary anxieties of creating a show. The actors arranged four tables into a rectangle and sat down with Alex Timbers, the director of "Joan of Arc." Timbers, who is thirty-eight, is tall and fine-featured. He wore a denim shirt and black jeans that hung off his lanky, slightly hunched frame. His hair is dark and thick, and he frequently runs a hand through it, like a Romantic poet on deadline. Despite the air of disquiet, Timbers, who talks like a cool high-school teacher--lots of vocal fry, the repeated use of "awesome"--addressed the cast with rousing enthusiasm. He explained that, though the show had been in development for two years, it remained a work in progress. "I don't think anything is sacred--we are going to be building this together," Timbers said to the actors, all of whom were men except for Jo Lampert, a thirty-one-year-old newcomer, who was to play Joan. Timbers presented a scale model of the stage design, which had been conceived by Chris Barreca. When built, the set would be black and austere, and filled with enormous L.E.D. screens. A staircase extended from wing to wing, and at center stage there was a vertiginous platform. The set was on a turntable, and as it revolved it represented everything from a cathedral to a prison tower.