Sabi\'a-2: A New Generation of Portuguese Large Language Models

Almeida, Thales Sales, Abonizio, Hugo, Nogueira, Rodrigo, Pires, Ramon

Mar-26-2024–arXiv.org Artificial Intelligence

We introduce Sabi\'a-2, a family of large language models trained on Portuguese texts. The models are evaluated on a diverse range of exams, including entry-level tests for Brazilian universities, professional certification exams, and graduate-level exams for various disciplines such as accounting, economics, engineering, law and medicine. Our results reveal that our best model so far, Sabi\'a-2 Medium, matches or surpasses GPT-4's performance in 23 out of 64 exams and outperforms GPT-3.5 in 58 out of 64 exams. Notably, specialization has a significant impact on a model's performance without the need to increase its size, allowing us to offer Sabi\'a-2 Medium at a price per token that is 10 times cheaper than GPT-4. Finally, we identified that math and coding are key abilities that need improvement.

benchmark, exam, language model, (13 more...)

arXiv.org Artificial Intelligence

Mar-26-2024

arXiv.org PDF

Add feedback

Country:
- North America > Central America (0.04)
- Europe > Switzerland (0.04)
- South America > Brazil
  - São Paulo (0.05)
  - Pernambuco (0.04)
  - Alagoas > Maceió (0.04)
  - Rio de Janeiro > Rio de Janeiro (0.04)
  - Minas Gerais > Belo Horizonte (0.04)
- Asia
  - Taiwan (0.04)
  - Southeast Asia (0.04)

Genre:
- Research Report > New Finding (0.88)

Industry:
- Education > Educational Setting (0.93)
- Law (0.93)
- Health & Medicine (0.93)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found