Narrow Transformer: StarCoder-Based Java-LM For Desktop

Kamalkumar Rathinasamy, Balaji A J, Ankush Kumar, Gagan Gayari, Harshini K, Rajab Ali Mondal, Sreenivasa Raghavan K S, Swayam Singh

arXiv.org Artificial Intelligence 

State-of-the-art code models, capable of understanding and generating code in numerous programming languages, are revolutionizing the way enterprises approach software development and offer a significant boost in productivity. However, the one-size-fits-all approach of these generic multilingual code models often falls short of the nuanced requirements of project-level coding tasks in an enterprise, which tend to be language-specific. This has led to the development of Narrow Transformers (NTs): specialized models further trained on a particular programming language, offering a more efficient solution for enterprises. NTs are designed to optimize performance for a specific programming language while balancing the trade-offs between model size, inference cost, and operational throughput. As demand for tailored solutions grows, we can expect a surge in NT development, providing the precision and efficiency that enterprise projects require. In practice, however, the substantial economic cost of training and fine-tuning large code models renders language-model experiments prohibitively expensive for most researchers and organizations.
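To make the NT idea concrete, the sketch below shows one plausible way to derive a Java-specific model from a StarCoder checkpoint: continued causal-language-modeling training on Java-only code. The base-model name, the dataset (the Java subset of a small Stack sample), and every hyperparameter here are illustrative assumptions, not the authors' actual training recipe.

```python
# Hypothetical sketch: continued pretraining of a StarCoder variant on Java code.
# All names and hyperparameters are assumptions for illustration only.
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)
from datasets import load_dataset

BASE_MODEL = "bigcode/starcoderbase-1b"  # assumed small StarCoder checkpoint

tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)
tokenizer.pad_token = tokenizer.eos_token  # StarCoder tokenizers ship no pad token
model = AutoModelForCausalLM.from_pretrained(BASE_MODEL)

# Java-only corpus: the "data/java" slice of a small Stack sample (assumed dataset).
dataset = load_dataset("bigcode/the-stack-smol", data_dir="data/java", split="train")

def tokenize(batch):
    return tokenizer(batch["content"], truncation=True, max_length=1024)

tokenized = dataset.map(tokenize, batched=True, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="java-nt",
        per_device_train_batch_size=4,
        gradient_accumulation_steps=8,
        learning_rate=5e-5,
        num_train_epochs=1,
    ),
    train_dataset=tokenized,
    # mlm=False makes the collator produce next-token labels for causal LM training.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```

Even a stripped-down run like this requires substantial GPU time on a realistically sized corpus, which is exactly the economic barrier the abstract highlights as motivation for smaller, language-specific models.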
