AITopics | llama3 instruct

Collaborating Authors

llama3 instruct

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Fine-Tuning LLMs on Small Medical Datasets: Text Classification and Normalization Effectiveness on Cardiology reports and Discharge records

Losch, Noah, Plagwitz, Lucas, Büscher, Antonius, Varghese, Julian

arXiv.org Artificial IntelligenceMar-27-2025

We investigate the effectiveness of fine-tuning large language models (LLMs) on small medical datasets for text classification and named entity recognition tasks. Using a German cardiology report dataset and the i2b2 Smoking Challenge dataset, we demonstrate that fine-tuning small LLMs locally on limited training data can improve performance achieving comparable results to larger models. Our experiments show that fine-tuning improves performance on both tasks, with notable gains observed with as few as 200-300 training examples. Overall, the study highlights the potential of task-specific fine-tuning of LLMs for automating clinical workflows and efficiently extracting structured data from unstructured medical text.

large language model, llama3 instruct, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2503.21349

Country: Europe > Germany > North Rhine-Westphalia > Münster Region > Münster (0.06)

Genre: Research Report > New Finding (0.69)

Industry: Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.87)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Add feedback

ALMA: Alignment with Minimal Annotation

Yasunaga, Michihiro, Shamis, Leonid, Zhou, Chunting, Cohen, Andrew, Weston, Jason, Zettlemoyer, Luke, Ghazvininejad, Marjan

arXiv.org Artificial IntelligenceDec-5-2024

Recent approaches to large language model (LLM) alignment typically require millions of human annotations or rely on external aligned models for synthetic data generation. This paper introduces ALMA: Alignment with Minimal Annotation, demonstrating that effective alignment can be achieved using only 9,000 labeled examples--less than 1% of conventional approaches. ALMA generates large amounts of high-quality synthetic alignment data through new techniques: diverse prompt synthesis via few-shot learning, diverse response generation with multiple model checkpoints, and judge (reward model) enhancement through score aggregation and self-distillation. Using only a pretrained Llama3 base model, 5,000 SFT examples, and 4,000 judge annotations, ALMA achieves performance close to Llama3-Instruct across diverse alignment benchmarks (e.g., 0.1% difference on AlpacaEval 2.0 score). These results are achieved with a multiround, self-bootstrapped data synthesis and training recipe that continues to improve for 10 rounds, surpassing the typical 3-round ceiling of previous methods. These results suggest that base models already possess sufficient knowledge for effective alignment, and that synthetic data generation methods can expose it. Synthesize prompts ( 3.1) Base model (e.g. Sample diverse & many responses per prompt. Starting with only a pretrained base LLM (Llama3 Base) and minimal seed data (9k samples--less than 1% of conventional approaches), we align the model to achieve performance close to Llama3 Instruct (left panel). This is achieved through our new alignment techniques (right panel) that enhance each of the four key components in alignment: prompt synthesis ( 3.1), response synthesis ( 3.2), judge ( 3.3), and model training ( 3.4).

base model, data synthesis, synthesis, (12 more...)

arXiv.org Artificial Intelligence

2412.04305

Country: Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models

Yu, Dian, Peng, Baolin, Tian, Ye, Song, Linfeng, Mi, Haitao, Yu, Dong

arXiv.org Artificial IntelligenceAug-28-2024

There is a growing trend of teaching large language models (LLMs) to solve mathematical problems through coding. Existing studies primarily focus on prompting powerful, closed-source models to generate seed training data followed by in-domain data augmentation, equipping LLMs with considerable capabilities for code-aided mathematical reasoning. However, continually training these models on augmented data derived from a few datasets such as GSM8K may impair their generalization abilities and restrict their effectiveness to a narrow range of question types. Conversely, the potential of improving such LLMs by leveraging large-scale, expert-written, diverse math question-answer pairs remains unexplored. To utilize these resources and tackle unique challenges such as code response assessment, we propose a novel paradigm that uses a code-based critic model to guide steps including question-code data construction, quality control, and complementary evaluation. We also explore different alignment algorithms with self-generated instruction/preference data to foster continuous improvement. Experiments across both in-domain (up to +5.7%) and out-of-domain (+4.4%) benchmarks in English and Chinese demonstrate the effectiveness of the proposed paradigm.

arxiv preprint arxiv, critic model, reference answer, (15 more...)

arXiv.org Artificial Intelligence

2408.15565

Country:

Asia > China > Guangxi Province > Nanning (0.04)
North America > United States > Washington > King County > Bellevue (0.04)

Genre: Research Report > New Finding (0.67)

Industry:

Education > Educational Setting > K-12 Education (0.46)
Education > Curriculum > Subject-Specific Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

Add feedback