AITopics | chiron

chiron

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Hierarchical Autoscaling for Large Language Model Serving with Chiron

Patke, Archit, Reddy, Dhemath, Jha, Saurabh, Narayanaswami, Chandra, Kalbarczyk, Zbigniew, Iyer, Ravishankar

arXiv.org Artificial Intelligence, Jan-14-2025

Large language model (LLM) serving is becoming an increasingly important workload for cloud providers. Based on performance SLO requirements, LLM inference requests can be divided into (a) interactive requests that have tight SLOs on the order of seconds, and (b) batch requests that have relaxed SLOs on the order of minutes to hours. SLO attainment can degrade depending on arrival rates, multiplexing, and configuration parameters, which necessitates resource autoscaling of serving instances and their batch sizes. However, previous autoscalers for LLM serving do not consider request SLOs, leading to unnecessary scaling and resource under-utilization. To address these limitations, we introduce Chiron, an autoscaler that uses the idea of hierarchical backpressure estimated using queue size, utilization, and SLOs. Our experiments show that Chiron achieves up to 90% higher SLO attainment and improves GPU efficiency by up to 70% compared to existing solutions.

chiron, large language model, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2501.0809

Country: North America > United States (0.46)

Genre: Research Report (0.40)

Industry: Information Technology > Services (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

CHIRON: Rich Character Representations in Long-Form Narratives

Gurung, Alexander, Lapata, Mirella

arXiv.org Artificial Intelligence, Jun-26-2024

Characters are integral to long-form narratives, but are poorly understood by existing story analysis and generation systems. While prior work has simplified characters via graph-based methods and brief character descriptions, we aim to better tackle the problem of representing complex characters by taking inspiration from advice given to professional writers. We propose CHIRON, a new 'character sheet' based representation that organizes and filters textual information about characters. We construct CHIRON sheets in two steps: a Generation Module that prompts an LLM for character information via question-answering and a Validation Module that uses automated reasoning and a domain-specific entailment model to eliminate false facts about a character. We validate CHIRON via the downstream task of masked-character prediction, where our experiments show CHIRON is better and more flexible than comparable summary-based baselines. We also show that metrics derived from CHIRON can be used to automatically infer character-centricity in stories, and that these metrics align with human judgments.

computational linguistic, information, snippet, (15 more...)

arXiv.org Artificial Intelligence

2406.1019

Country:

North America > United States > New York (0.05)
Europe > Bulgaria > Sofia City Province > Sofia (0.04)
Asia > Middle East > Jordan (0.04)
(13 more...)

Genre: Research Report (0.64)

Industry:

Leisure & Entertainment (1.00)
Media > Film (0.92)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Top AI Research Advances For Machine Learning Infrastructure

#artificialintelligence, Nov-17-2019, 13:07:31 GMT

As deep learning models become increasingly popular in real-world business applications and training datasets grow very large, machine learning (ML) infrastructure is becoming a critical issue in many companies. To help you stay aware of the latest research advances in ML infrastructure, we've summarized some of the most important research papers recently introduced in this area. As you read these summaries, you will be able to learn from the experience of the leading tech companies, including Google, Microsoft, and LinkedIn. The papers we've selected cover data labeling and data validation frameworks, different approaches to distributed training of ML models, a novel approach to tracking ML model performance in production, and more.

deployment, parallelization strategy, training data, (14 more...)

#artificialintelligence

Country: North America > United States > Arizona (0.04)

Genre:

Research Report > Promising Solution (0.49)
Overview > Innovation (0.35)

Industry:

Information Technology (1.00)
Government > Military (0.69)
Government > Regional Government > North America Government > United States Government (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

How Researchers Are Building Models To Safeguard Private Data In Machine Learning

#artificialintelligence, Jul-19-2018, 19:05:50 GMT

Machine learning applications are increasingly permeating the tech ecosystem, and the data that goes into ML systems is being drawn from all sorts of sources, regardless of its sensitivity. ML algorithms have no notion of sensitivity: they treat data as a way to establish and learn patterns, rather than considering who the data is about. Miscreants might take advantage of this and circumvent the ML systems themselves, which can have devastating effects. If that happens, the purpose of ML is defeated. To counter this and establish a safe and secure ML environment, researchers are working towards building privacy into ML models.

artificial intelligence, machine learning, privacy, (15 more...)

#artificialintelligence

Country:

North America > United States > Pennsylvania (0.05)
North America > United States > Michigan (0.05)
North America > United States > California (0.05)

Genre: Research Report (0.31)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.31)

Bloodhound engineers reveal it has only been tested virtually - until now

Daily Mail - Science & tech, Jul-23-2016, 02:11:39 GMT

It was a staggering feat, a car that went faster than the speed of sound. Two decades on, that record remains unchallenged. Back in 2007, a small team of British engineers headed up by Richard Noble and Andy Green decided to have a pop at the world land speed record once more. A rocket scientist was brought in to design the largest hybrid rocket system ever developed in the UK, a structural engineer was brought in to design the car's internal structure and I was invited to join the team along with Ron Ayers to ensure that this car would, indeed, remain a car and stay firmly planted on the ground.

artificial intelligence, computer model, south africa, (12 more...)

Daily Mail - Science & tech

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.08)
Africa > South Africa (0.07)
Europe > United Kingdom > England > Cornwall > Newquay (0.05)
North America > United States > Nevada (0.05)

Industry:

Transportation (0.32)
Aerospace & Defense (0.31)

Technology: Information Technology > Artificial Intelligence (0.52)
