Goto

Collaborating Authors

 justitia


Justitia: Fair and Efficient Scheduling for LLM Applications

Yang, Mingyan, Wang, Guanjie, Luo, Manqi, Liu, Yifei, Chen, Chen, Zhao, Han, Feng, Yu, Chen, Quan, Guo, Minyi

arXiv.org Artificial Intelligence

In the era of Large Language Models (LLMs), it has been popular to launch a series of LLM inferences -- we call an LLM application -- to better solve real-world problems. When serving those applications in shared GPU servers, the schedulers are expected to attain fast application completions with guaranteed worst-case performance. However, mainstream LLM schedulers fail to behave well for LLM applications -- due to head-of-line blocking or over-constrained resource allocation. In this paper, we propose to serve LLM applications in a fair and also efficient manner. To this end, we design Justitia, a novel scheduler with three key techniques. First, given that memory is prevalently a bottleneck for mainstream inference frameworks like vLLM, Justitia models the service cost of LLM applications in a memory-centric manner. Meanwhile, it uses a simple neural network model to conduct light-weight and also accurate demand prediction. Moreover, Justitia adopts a virtual-time based fair queuing algorithm to reduce the overall performance with guaranteed worst-case delay. We have implemented Justitia atop vLLM, and experimental results involving diverse LLM applications show that it can substantially enhance the scheduling efficiency with fairness preserved.


MBR Explorer: UAE Plans Space Mission to Explore Asteroid Belt

NYT > Middle East

The seventh asteroid, Justitia, is the most intriguing. About 30 miles wide, Justitia is very reddish, an unusual color for an asteroid. Indeed, it looks more like one of the small icy worlds found in the Kuiper belt, circling the sun beyond the orbit of Neptune. That has led planetary scientists to speculate that Justitia formed in the outer reaches of the solar system and then was scattered inward by the shifting orbits of the giant planets, eventually joining the asteroid belt. If that is true, a visit to Justitia would provide a close-up study of a Kuiper belt object without the long trip to the solar system's distant reaches.


Justitia ex Machina: The Case for Automating Morals

#artificialintelligence

This piece was a finalist for the inaugural Gradient Prize. Machine Learning is a powerful technique to automatically learn models from data that have recently been the driving force behind several impressive technological leaps such as self-driving cars, robust speech recognition, and, arguably, better-than-human image recognition. We rely on these machine learning models daily; they influence our lives in ways we did not expect, and they are only going to become even more ubiquitous. Consider a couple of example machine learning models: 1) Detecting cats in images 2) Deciding which ads to show you online 3) Predicting which areas will suffer crime, and 4) Predicting how likely a criminal is to re-offend. The first two seem harmless enough.