Goto

Collaborating Authors

 marconi


Marconi: Prefix Caching for the Era of Hybrid LLMs

arXiv.org Artificial Intelligence

Hybrid models that combine the language modeling capabilities of Attention layers with the efficiency of Recurrent layers (e.g., State Space Models) have gained traction in practically supporting long contexts in Large Language Model serving. Yet, the unique properties of these models complicate the usage of complementary efficiency optimizations such as prefix caching that skip redundant computations across requests. Most notably, their use of in-place state updates for recurrent layers precludes rolling back cache entries for partial sequence overlaps, and instead mandates only exact-match cache hits; the effect is a deluge of (large) cache entries per sequence, most of which yield minimal reuse opportunities. We present Marconi, the first system that supports efficient prefix caching with Hybrid LLMs. Key to Marconi are its novel admission and eviction policies that more judiciously assess potential cache entries based not only on recency, but also on (1) forecasts of their reuse likelihood across a taxonomy of different hit scenarios, and (2) the compute savings that hits deliver relative to memory footprints. Across diverse workloads and Hybrid models, Marconi achieves up to 34.4$\times$ higher token hit rates (71.1% or 617 ms lower TTFT) compared to state-of-the-art prefix caching systems.


How artificial intelligence can save journalism

#artificialintelligence

The economic fallout from the COVID-19 pandemic has caused an unprecedented crisis in journalism that could decimate media organizations around the world. The future of journalism -- and its survival -- could lie in artificial intelligence (AI). AI refers "to intelligent machines that learn from experience and perform tasks like humans," according to Francesco Marconi, a professor of journalism at Columbia University in New York, who has just published a book on the subject: Newsmakers, Artificial Intelligence and the Future of Journalism. Marconi was head of the media lab at the Wall Street Journal and the Associated Press, one of the largest news organizations in the world. His thesis is clear and incontrovertible: the journalism world is not keeping pace with the evolution of new technologies.


How APIs Can Save AI Research Labs: Lessons From OpenAI

#artificialintelligence

"It is not a dream, it is a simple feat of scientific engineering, only expensive -- blind, faint-hearted, doubting world!" Discovering a new medicine is a billion-dollar research endeavour. At least, it can draw in the money as the results are kind of self-explanatory; life-saving. But, in case of AI, which is usually riddled by speculations and scepticism, it is an uphill task for the researchers to sell their idea or to churn profits to keep fueling their AI labs. For example, OpenAI, which started as a non-profit research lab, changed its stance when it partnered with Microsoft. A year later, they have announced that they are making all their exotic deep learning innovations available to the public through an API that comes with a price tag.


How artificial intelligence can save journalism

#artificialintelligence

The economic fallout from the COVID-19 pandemic has caused an unprecedented crisis in journalism that could decimate media organizations around the world. The future of journalism -- and its survival -- could lie in artificial intelligence (AI). AI refers "to intelligent machines that learn from experience and perform tasks like humans," according to Francesco Marconi, a professor of journalism at Columbia University in New York, who has just published a book on the subject: Newsmakers, Artificial Intelligence and the Future of Journalism. Marconi was head of the media lab at the Wall Street Journal and the Associated Press, one of the largest news organizations in the world. His thesis is clear and incontrovertible: the journalism world is not keeping pace with the evolution of new technologies.


Can Artificial Intelligence Save Journalism?

#artificialintelligence

We are at a crossroads. A crossroads that will in large part determine the future of journalism. The Covid-19 pandemic has caused an unprecedented crisis that could decimate certain media organizations. One possible solution has been proposed: artificial intelligence (AI). AI refers "to intelligent machines that learn from experience and perform tasks like humans," according to Francesco Marconi, a professor of journalism at Columbia University in New York who has just published a book on the subject: Newsmakers, Artificial Intelligence and the Future of Journalism.


Newsmakers Columbia University Press

#artificialintelligence

Will the use of artificial intelligence (AI), algorithms, and smart machines be the end of journalism as we know it--or its savior? In Newsmakers, Francesco Marconi, who has led the development of the Associated Press and Wall Street Journal's use of AI in journalism, offers a new perspective on the potential of these technologies. He explains how reporters, editors, and newsrooms of all sizes can take advantage of the possibilities they provide to develop new ways of telling stories and connecting with readers. Marconi analyzes the challenges and opportunities of AI through case studies ranging from financial publications using algorithms to write earnings reports to investigative reporters analyzing large data sets to outlets determining the distribution of news on social media. Newsmakers contends that AI can augment--not automate--the industry, allowing journalists to break more news more quickly while simultaneously freeing up their time for deeper analysis.


Associated Press: Future of Journalism Will Be Augmented Thanks to AI

#artificialintelligence

It hasn't been an easy couple of years for algorithms. Increasingly populated with content decried as'fake news', 'clickbait', today's highly personalized social media feeds are coming under increasing fire for being filter bubbles, culminating in Mark Zuckerberg's highly public apology to Facebook's users at the start of this month. These shifts in media mirror concurrent developments in spheres as diverse as customer service and support, financial trading, healthcare, and more. Today, though, tech is fighting back – thanks to AI. After partnering with Automated Insights in 2014 – a natural language generation start-up – the Associated Press became one of the earliest adopters of AI within the media space.


When IBM First Got People Worried About The Impact Of AI On Jobs

#artificialintelligence

Chess enthusiasts watch World Chess champion Garry Kasparov on a television monitor as he holds his head in his hands at the start of the sixth and final match 11 May 1997 against IBM's Deep Blue computer in New York. Kasparov lost this match in just 19 moves giving overall victory to Deep Blue with a score of 2.5-3.5 (STAN HONDA/AFP/Getty Images) This week's milestones in the history of technology include the invention of the integrated circuit, the first singing telegram, and the first widely-publicized triumph of the machines over humans. Jack Kilby of Texas Instruments (TI) files for a patent on the integrated circuit. For this invention he received the 2000 Nobel Prize for Physics. The notion of an integrated circuit was there.


Powering the golden age of audio

#artificialintelligence

Audio, the spoken word, is humanity's primary means of sentient communication: the sounds a fetus hears in utero; a lover's whisper; a marriage proposal… all leave deep imprints on our hearts and minds. We use sound to accentuate and transmit our emotions; our aural ability is a primary sense that is deeply connected to emotion. In fact, much research indicates that hearing is the most important of the five senses. We detect harmful and dangerous sounds with our ears -- if a fire alarm rings in the middle of the night, we depend on our hearing to alert us of impending danger. While historically sight has been the most valued sense, audio has been catching up.