AITopics

arXiv.org Artificial Intelligence, Jul-16-2024

Chip Placement with Diffusion

Lee, Vint, Deng, Chun, Elzeiny, Leena, Abbeel, Pieter, Wawrzynek, John

Macro placement is a vital step in digital circuit design that defines the physical location of large collections of components, known as macros, on a 2-dimensional chip. The physical layout obtained during placement determines key performance metrics of the chip, such as power consumption, area, and performance. Existing learning-based methods typically fall short because of their reliance on reinforcement learning, which is slow and limits the flexibility of the agent by casting placement as a sequential process. Instead, we use a powerful diffusion model to place all components simultaneously. To enable such models to train at scale, we propose a novel architecture for the denoising model, as well as an algorithm to generate large synthetic datasets for pre-training. We empirically show that our model can tackle the placement task, and achieve competitive performance on placement benchmarks compared to state-of-the-art methods.

artificial intelligence, machine learning, placement, (15 more...)

2407.12282

Country: North America > United States > Hawaii (0.14)

Genre: Research Report > New Finding (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)

arXiv.org Artificial Intelligence, May-27-2020

ProTuner: Tuning Programs with Monte Carlo Tree Search

Haj-Ali, Ameer, Genc, Hasan, Huang, Qijing, Moses, William, Wawrzynek, John, Asanović, Krste, Stoica, Ion

We explore applying the Monte Carlo Tree Search (MCTS) algorithm in a notoriously difficult task: tuning programs for high-performance deep learning and image processing. We build our framework on top of Halide and show that MCTS can outperform the state-of-the-art beam-search algorithm. Unlike beam search, which is guided by greedy intermediate performance comparisons between partial and less meaningful schedules, MCTS compares complete schedules and looks ahead before making any intermediate scheduling decision. We further explore modifications to the standard MCTS algorithm as well as combining real execution time measurements with the cost model. Our results show that MCTS can outperform beam search on a suite of 16 real benchmarks.

artificial intelligence, cost model, planning & scheduling, (18 more...)

2005.13685

Country:

North America > United States (0.14)
Europe > Estonia (0.14)

Genre: Research Report > New Finding (0.54)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)

Neural Information Processing Systems, Dec-31-1997

A Micropower Analog VLSI HMM State Decoder for Wordspotting

Lazzaro, John, Wawrzynek, John, Lippmann, Richard P.

We describe the implementation of a hidden Markov model state decoding system, a component for a wordspotting speech recognition system. The key specification for this state decoder design is microwatt power dissipation; this requirement led to a continuous-time, analog circuit implementation. We characterize the operation of a 10-word (81 state) state decoder test chip.

artificial intelligence, likelihood, speech recognition, (17 more...)

Country: North America > United States > Massachusetts > Middlesex County (0.14)

Industry:

Government > Military (0.69)
Government > Regional Government (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.90)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.56)

SPERT-II: A Vector Microprocessor System and its Application to Large Problems in Backpropagation Training

Wawrzynek, John, Asanovic, Krste, Kingsbury, Brian, Beck, James, Johnson, David, Morgan, Nelson

We report on our development of a high-performance system for neural network and other signal processing applications. We have designed and implemented a vector microprocessor and packaged it as an attached processor for a conventional workstation.

artificial intelligence, neural network, operation, (15 more...)

Country: North America > United States > California (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Backpropagation (0.42)

Silicon Models for Auditory Scene Analysis

Lazzaro, John, Wawrzynek, John

We are developing special-purpose, low-power analog-to-digital converters for speech and music applications, that feature analog circuit models of biological audition to process the audio signal before conversion. This paper describes our most recent converter design, and a working system that uses several copies of the chip to compute multiple representations of sound from an analog input. This multi-representation system demonstrates the plausibility of inexpensively implementing an auditory scene analysis approach to sound processing.

1. INTRODUCTION

The visual system computes multiple representations of the retinal image, such as motion, orientation, and stereopsis, as an early step in scene analysis. Likewise, the auditory brainstem computes secondary representations of sound, emphasizing properties such as binaural disparity, periodicity, and temporal onsets. Recent research in auditory scene analysis involves using computational models of these auditory brainstem representations in engineering applications. Computation is a major limitation in auditory scene analysis research: the complete auditory processing system described in (Brown and Cooke, 1994) operates at approximately 4000 times real time, running under UNIX on a Sun SPARCstation 1. Standard approaches to hardware acceleration for signal processing algorithms could be used to ease this computational burden in a research environment; a variety of parallel, fixed-point hardware products would work well on these algorithms.

health & medicine, neural network, representation, (19 more...)

Industry: Health & Medicine > Diagnostic Medicine (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.76)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.76)
