HPC-GPT: Integrating Large Language Model for High-Performance Computing

Ding, Xianzhong, Chen, Le, Emani, Murali, Liao, Chunhua, Lin, Pei-Hung, Vanderbruggen, Tristan, Xie, Zhen, Cerpa, Alberto E., Du, Wan

Oct-2-2023–arXiv.org Artificial Intelligence

Large Language Models (LLMs), including the LLaMA model, have exhibited their efficacy across various general-domain natural language processing (NLP) tasks. However, their performance in high-performance computing (HPC) domain tasks has been less than optimal due to the specialized expertise required to interpret the model responses. In response to this challenge, we propose HPC-GPT, a novel LLaMA-based model that has been supervised fine-tuning using generated QA (Question-Answer) instances for the HPC domain. To evaluate its effectiveness, we concentrate on two HPC tasks: managing AI models and datasets for HPC, and data race detection. By employing HPC-GPT, we demonstrate comparable performance with existing methods on both tasks, exemplifying its excellence in HPC-related scenarios. Our experiments on open-source benchmarks yield extensive results, underscoring HPC-GPT's potential to bridge the performance gap between LLMs and HPC-specific tasks. With HPC-GPT, we aim to pave the way for LLMs to excel in HPC domains, simplifying the utilization of language models in complex computing applications.

application, dataset, hpc-gpt, (13 more...)

arXiv.org Artificial Intelligence

Oct-2-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York
    - Broome County > Binghamton (0.04)
    - New York County > New York City (0.04)
  - Iowa > Story County
    - Ames (0.04)
  - Illinois > Cook County
    - Lemont (0.04)
  - Colorado > Denver County
    - Denver (0.05)
  - California
    - Merced County > Merced (0.14)
    - Alameda County > Livermore (0.04)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)