VeriGen: A Large Language Model for Verilog Code Generation

Thakur, Shailja, Ahmad, Baleegh, Pearce, Hammond, Tan, Benjamin, Dolan-Gavitt, Brendan, Karri, Ramesh, Garg, Siddharth

Jul-27-2023–arXiv.org Artificial Intelligence

In this study, we explore the capability of Large Language Models (LLMs) to automate hardware design by generating high-quality Verilog code, a common language for designing and modeling digital systems. We fine-tune pre-existing LLMs on Verilog datasets compiled from GitHub and Verilog textbooks. We evaluate the functional correctness of the generated Verilog code using a specially designed test suite, featuring a custom problem set and testing benches. Here, our fine-tuned open-source CodeGen-16B model outperforms the commercial state-of-the-art GPT-3.5-turbo model with a 1.1% overall increase. Upon testing with a more diverse and complex problem set, we find that the fine-tuned model shows competitive performance against state-of-the-art gpt-3.5-turbo, excelling in certain scenarios. Notably, it demonstrates a 41% improvement in generating syntactically correct Verilog code across various problem categories compared to its pre-trained counterpart, highlighting the potential of smaller, in-house LLMs in hardware design automation.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

Jul-27-2023

arXiv.org PDF

Add feedback

Country:
- Europe
  - Iceland (0.04)
  - Italy > Calabria
    - Catanzaro Province > Catanzaro (0.04)
- North America
  - Canada > Alberta
    - Census Division No. 6 > Calgary Metropolitan Region > Calgary (0.14)
  - United States
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - New York > New York County
      - New York City (0.04)
- Oceania > Australia
  - New South Wales > Sydney (0.04)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Information Technology (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
  - Natural Language > Large Language Model (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found