AITopics | spad

Collaborating Authors

spad

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

SPAD: Specialized Prefill and Decode Hardware for Disaggregated LLM Inference

Zhang, Hengrui, Patel, Pratyush, Ning, August, Wentzlaff, David

arXiv.org Artificial IntelligenceOct-10-2025

Large Language Models (LLMs) have gained popularity in recent years, driving up the demand for inference. LLM inference is composed of two phases with distinct characteristics: a compute-bound prefill phase followed by a memory-bound decode phase. To efficiently serve LLMs, prior work proposes prefill-decode disaggregation to run each phase on separate hardware. However, existing hardware poorly matches the different requirements of each phase. Current datacenter GPUs and TPUs follow a more-is-better design philosophy that maximizes compute and memory resources, causing memory bandwidth underutilization in the prefill phase and compute underutilization in the decode phase. Such underutilization directly translates into increased serving costs. This paper proposes SPAD (Specialized Prefill and Decode hardware), adopting a less-is-more methodology to design specialized chips tailored to the distinct characteristics of prefill and decode phases. The proposed Prefill Chips have larger systolic arrays and use cost-effective GDDR memory, whereas the proposed Decode Chips retain high memory bandwidth but reduce compute capacity. Compared to modeled H100s, simulations show that the proposed Prefill Chips deliver 8% higher prefill performance on average at 52% lower hardware cost, while the proposed Decode Chips achieve 97% of the decode performance with 28% lower TDP. End-to-end simulations on production traces show that SPAD reduces hardware cost by 19%-41% and TDP by 2%-17% compared to modeled baseline clusters while offering the same performance. Even when models and workloads change, SPAD can reallocate either type of chip to run either phase and still achieve 11%-43% lower hardware costs, demonstrating the longevity of the SPAD design.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2510.08544

Country:

North America > United States > California (0.67)
North America > United States > New York > New York County > New York City (0.14)

Genre: Research Report (0.82)

Industry: Information Technology > Services (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Ranging Performance Analysis in Automotive DToF Lidars

Guo, Xiao

arXiv.org Artificial IntelligenceJan-23-2025

In recent years, achieving full autonomy in driving has emerged as a paramount objective for both the industry and academia. Among various perception technologies, Lidar (Light detection and ranging) stands out for its high-precision and high-resolution capabilities based on the principle of light propagation and coupling ranging module and imaging module. Lidar is a sophisticated system that integrates multiple technologies such as optics, mechanics, circuits, and algorithms. Therefore, there are various feasible Lidar schemes to meet the needs of autonomous driving in different scenarios. The ranging performance of Lidar is a key factor that determines the overall performance of autonomous driving systems. As such, it is necessary to conduct a systematic analysis of the ranging performance of different Lidar schemes. In this paper, we present the ranging performance analysis methods corresponding to different optical designs, device selec-tions and measurement mechanisms. By using these methods, we compare the ranging perfor-mance of several typical commercial Lidars. Our findings provide a reference framework for de-signing Lidars with various trade-offs between cost and performance, and offer insights into the advancement towards improving Lidar schemes.

artificial intelligence, lidar, sipm, (17 more...)

arXiv.org Artificial Intelligence

2501.17884

Country:

North America > United States > California > Alameda County > Berkeley (0.14)
Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)
Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report > New Finding (0.48)

Industry:

Automobiles & Trucks (1.00)
Transportation > Ground > Road (0.54)
Information Technology > Robotics & Automation (0.54)

Technology:

Information Technology > Sensing and Signal Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.87)

Add feedback

Improved histogram-based anomaly detector with the extended principal component features

Aryal, Sunil, Baniya, Arbind Agrahari, Santosh, KC

arXiv.org Machine LearningSep-27-2019

In this era of big data, databases are growing rapidly in terms of the number of records. Fast automatic detection of anomalous records in these massive databases is a challenging task. Traditional distance based anomaly detectors are not applicable in these massive datasets. Recently, a simple but extremely fast anomaly detector using one-dimensional histograms has been introduced. The anomaly score of a data instance is computed as the product of the probability mass of histograms in each dimensions where it falls into. It is shown to produce competitive results compared to many state-of-the-art methods in many datasets. Because it assumes data features are independent of each other, it results in poor detection accuracy when there is correlation between features. To address this issue, we propose to increase the feature size by adding more features based on principal components. Our results show that using the original input features together with principal components improves the detection accuracy of histogram-based anomaly detector significantly without compromising much in terms of run-time.

anomaly, dataset, spad, (13 more...)

arXiv.org Machine Learning

1909.12702

Country:

North America > United States > South Dakota > Clay County > Vermillion (0.14)
Asia (0.04)
Oceania > Australia > Victoria (0.04)
North America > United States > California > Orange County > Irvine (0.04)

Genre: Research Report > New Finding (0.86)

Industry:

Health & Medicine (0.69)
Information Technology > Security & Privacy (0.69)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Add feedback

Wanna See Around Corners? Better Get Yourself a Laser

WIREDMar-5-2018, 17:23:11 GMT

You can't see the bunny, but the picosecond laser certainly can. In a lab at Stanford, engineers have set up a weird contraption, hiding a toy bunny behind a T-shaped wall. And their complex system of computation and rapidly firing lasers can see around that corner. At least that's the idea behind this technique, which uses the flight paths of the photons in lasers to calculate the shape and position of hidden objects--be they bunnies or passing pedestrians. This system deploys the same very, very precise timing that drives the laser-spewing lidar on a self-driving car.

artificial intelligence, laser, photon, (13 more...)

WIRED

Industry: Automobiles & Trucks (0.39)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.63)

Add feedback