gorilla



Gorilla: Large Language Model Connected with Massive APIs

Neural Information Processing Systems

Large Language Models (LLMs) have seen an impressive wave of advances, with models now excelling in a variety of tasks, such as mathematical reasoning and program synthesis. However, their potential to effectively use tools via API calls remains unfulfilled. This is a challenging task even for today's state-of-the-art LLMs such as GPT-4, largely due to their unawareness of what APIs are available and how to use them in a frequently updated tool set. We develop Gorilla, a finetuned LLaMA model that surpasses the performance of GPT-4 on writing API calls. Trained with the novel Retriever Aware Training (RAT), when combined with a document retriever, Gorilla demonstrates a strong capability to adapt to test-time document changes, allowing flexible user updates or version changes. It also substantially mitigates the issue of hallucination, commonly encountered when prompting LLMs directly. To evaluate the model's ability, we introduce APIBench, a comprehensive dataset consisting of HuggingFace, TorchHub, and TensorHub APIs. The successful integration of the retrieval system with Gorilla demonstrates the potential for LLMs to use tools more accurately, keep up with frequently updated documentation, and consequently increase the reliability and applicability of their outputs. Gorilla's code, model, data, and demo are available at: https://gorilla.cs.berkeley.edu
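The abstract's core recipe is to pair the model with a document retriever so that up-to-date API documentation is placed in the prompt at inference time. A minimal sketch of that pattern is below; the token-overlap scoring and prompt template are illustrative assumptions (Gorilla itself uses trained retrievers such as BM25 or dense retrieval), not the paper's implementation.

```python
# Hypothetical sketch of retriever-aware prompting: fetch the API document
# most relevant to the user query and prepend it to the instruction, so a
# generator can adapt when documentation is updated.

def retrieve_api_doc(query: str, docs: list[str]) -> str:
    """Return the doc sharing the most whitespace tokens with the query."""
    q_tokens = set(query.lower().split())
    return max(docs, key=lambda d: len(q_tokens & set(d.lower().split())))

def build_prompt(query: str, docs: list[str]) -> str:
    """Prepend the retrieved documentation to the user instruction."""
    doc = retrieve_api_doc(query, docs)
    return f"Use this API documentation:\n{doc}\n\nTask: {query}"

docs = [
    "torch.hub.load(repo, model): loads a model from TorchHub",
    "transformers.pipeline(task): builds a HuggingFace inference pipeline",
]
prompt = build_prompt("load a pretrained model from TorchHub", docs)
```

Because the documentation travels with the prompt rather than living in the model's weights, swapping in a new doc string is all that is needed when an API changes.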



Female mountain gorillas wield a lot of power

Popular Science

Whether it's King Kong climbing the Empire State Building or Donkey Kong throwing barrels at unsuspecting Italian plumbers, gorillas in popular culture are symbols of male power. This interpretation by filmmakers and video game creators has some truth to it. Silverback males rule gorilla troops, and occupy a place of power they only vacate after combat or death. The first studies on gorilla behavior began in the 1950s, through the pioneering fieldwork of George Schaller and Dian Fossey.




RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs

Belavadi, Vibha, Vatsa, Tushar, Sultania, Dewang, Suresha, Suhas, Verma, Ishita, Chen, Cheng, King, Tracy Holloway, Friedrich, Michael

arXiv.org Artificial Intelligence

This paper addresses fine-tuning Large Language Models (LLMs) for function calling tasks when real user interaction data is unavailable. In digital content creation tools, where users express their needs through natural language queries that must be mapped to API calls, the lack of real-world task-specific data and privacy constraints for training on it necessitate synthetic data generation. Existing approaches to synthetic data generation fall short in diversity and complexity, failing to replicate real-world data distributions and leading to suboptimal performance after LLM fine-tuning. We present a novel router-based architecture that leverages domain resources like content metadata and structured knowledge graphs, along with text-to-text and vision-to-text language models to generate high-quality synthetic training data. Our architecture's flexible routing mechanism enables synthetic data generation that matches observed real-world distributions, addressing a fundamental limitation of traditional approaches. Evaluation on a comprehensive set of real user queries demonstrates significant improvements in both function classification accuracy and API parameter selection. Models fine-tuned with our synthetic data consistently outperform traditional approaches, establishing new benchmarks for function calling tasks.
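The key mechanism described is a router that dispatches each seed item to a text-to-text or vision-to-text generator while keeping the output mix close to an observed real-world distribution. The sketch below is a simplified, assumed version of that idea: the routing rule (presence of an image field) and the target-ratio subsampling are illustration choices, not the paper's architecture.

```python
# Toy router for synthetic-data generation: route each seed by modality,
# then subsample so roughly target_ratio of outputs come from vision seeds,
# mimicking a desired real-world distribution.

def route(seed: dict) -> str:
    """Pick a generator based on the seed's available modality."""
    return "vision_to_text" if "image" in seed else "text_to_text"

def generate_batch(seeds: list[dict], target_ratio: float = 0.3) -> list[dict]:
    """Keep all text seeds and cap vision seeds at target_ratio of the batch."""
    routed = [(route(s), s) for s in seeds]
    vision = [s for r, s in routed if r == "vision_to_text"]
    text = [s for r, s in routed if r == "text_to_text"]
    n_vision = int(target_ratio * len(seeds))
    return vision[:n_vision] + text
```

In a real pipeline each routed seed would then be expanded by the corresponding generator model; here the routing and distribution matching are the point.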


A Benchmark of French ASR Systems Based on Error Severity

Tholly, Antoine, Wottawa, Jane, Rouvier, Mickael, Dufour, Richard

arXiv.org Artificial Intelligence

Automatic Speech Recognition (ASR) transcription errors are commonly assessed using metrics that compare them with a reference transcription, such as Word Error Rate (WER), which measures spelling deviations from the reference, or semantic score-based metrics. However, these approaches often overlook what is understandable to humans when interpreting transcription errors. To address this limitation, a new evaluation is proposed that categorizes errors into four levels of severity, further divided into subtypes, based on objective linguistic criteria, contextual patterns, and the use of content words as the unit of analysis. This metric is applied to a benchmark of 10 state-of-the-art ASR systems on the French language, encompassing both HMM-based and end-to-end models. Our findings reveal the strengths and weaknesses of each system, identifying those that provide the most comfortable reading experience for users.
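For reference, the WER baseline that the severity metric is contrasted with is just the word-level edit distance between hypothesis and reference, normalized by the reference length. A minimal sketch:

```python
# Standard Word Error Rate: Levenshtein distance over words, divided by
# the number of reference words.

def wer(reference: str, hypothesis: str) -> float:
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = edit distance between ref[:i] and hyp[:j]
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i
    for j in range(len(hyp) + 1):
        dp[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            sub = dp[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            dp[i][j] = min(sub, dp[i - 1][j] + 1, dp[i][j - 1] + 1)
    return dp[len(ref)][len(hyp)] / len(ref)
```

Note how WER treats every substitution identically, e.g. `wer("le chat dort", "le chien dort")` scores one substitution regardless of whether "chien" confuses a reader; that uniformity is exactly what the paper's severity levels refine.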


Controlling Language and Diffusion Models by Transporting Activations

Rodriguez, Pau, Blaas, Arno, Klein, Michal, Zappella, Luca, Apostoloff, Nicholas, Cuturi, Marco, Suau, Xavier

arXiv.org Artificial Intelligence

The increasing capabilities of large generative models and their ever more widespread deployment have raised concerns about their reliability, safety, and potential misuse. To address these issues, recent works have proposed to control model generation by steering model activations in order to effectively induce or prevent the emergence of concepts or behaviors in the generated output. In this paper we introduce Activation Transport (AcT), a general framework to steer activations guided by optimal transport theory that generalizes many previous activation-steering works. AcT is modality-agnostic and provides fine-grained control over the model behavior with negligible computational overhead, while minimally impacting model abilities. We experimentally show the effectiveness and versatility of our approach by addressing key challenges in large language models (LLMs) and text-to-image diffusion models (T2Is). For LLMs, we show that AcT can effectively mitigate toxicity, induce arbitrary concepts, and increase their truthfulness. In T2Is, we show how AcT enables fine-grained style control and concept negation.
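AcT is presented as a generalization of earlier activation-steering work. The simplest member of that family, which the sketch below implements, is mean-shift steering: add the difference between mean activations on target and source examples to a hidden state. This is a baseline AcT subsumes, not AcT's optimal-transport map itself, and the arrays here merely stand in for a model layer's hidden states.

```python
# Mean-shift activation steering: shift a hidden state along the direction
# from the source-concept mean to the target-concept mean.
import numpy as np

def mean_shift_vector(src_acts: np.ndarray, tgt_acts: np.ndarray) -> np.ndarray:
    """Steering direction: target mean minus source mean (per feature)."""
    return tgt_acts.mean(axis=0) - src_acts.mean(axis=0)

def steer(h: np.ndarray, direction: np.ndarray, strength: float = 1.0) -> np.ndarray:
    """Apply the steering direction to a single hidden state."""
    return h + strength * direction
```

The `strength` knob is what gives fine-grained control: 0 leaves the model untouched, 1 fully applies the shift, and intermediate values interpolate.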


Redefining in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation

Feng, Fu, Xie, Yucheng, Yang, Xu, Wang, Jing, Geng, Xin

arXiv.org Artificial Intelligence

Given the challenge that diffusion models face in directly generating creative content, existing methods typically rely on synthesizing reference prompts or images to achieve creative effects. For instance, to combine "Lettuce" and "Mantis" creatively, ConceptLab [43] merges tokens representing these concepts into a new composite token, while BASS [22] uses predefined sampling rules to search for creative outcomes from a large pool of candidate images; this search at each generation leads to high computational costs and limited practicality for online applications. In contrast, "a blue banana" can be generated directly without additional training, due to its clear and concrete semantics. Inspired by this, we ask: can we awaken the creativity of diffusion models by enhancing their semantic understanding of "creative"? To achieve this, we propose CreTok, which redefines "creative" as a new specialized token, enabling direct concept combinations without requiring additional training. Specifically, CreTok builds on the definition of "creativity" from the TP2O task [22] for combinatorial object generation: we redefine the abstract term "creative" within our proposed CangJie dataset, paired with an adaptive prompt (e.g., "A photo of a mixture"). This meta-creativity significantly reduces both time and computational complexity compared to state-of-the-art (SOTA) creative generation methods such as ConceptLab [43] (4s vs. 120s per image, a 30x speedup) and BASS [22] (4s vs. 2400s per image, a 600x speedup). We compare against text-to-image (T2I) models and creative generation methods in terms of computational complexity, human preference ratings, text-image alignment, and other key metrics. Further evaluations using GPT-4o [1] and user studies indicate superior performance of CreTok in terms of integration, originality, and aesthetics, underscoring its effectiveness in fostering combinatorial and human-like creativity, a critical yet underexplored aspect of AI research [28, 29]. Our contributions are as follows: (1) We propose CreTok, a method designed to enhance models' meta-ability by enabling an enhanced understanding of abstract and ambiguous adjectives (e.g., "creative" or "beautiful") through their redefinition as new tokens.
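The mechanism underneath "redefining a word as a new token" is adding a learnable embedding to the vocabulary and optimizing it. The toy sketch below only illustrates that shape of solution: the midpoint initialization and the interpolation-style refinement are invented for illustration and are not CreTok's actual training objective.

```python
# Toy illustration of learning a new token embedding: start between two
# concept embeddings, then iteratively pull the embedding toward a target
# combination embedding.
import numpy as np

def init_new_token(concept_a: np.ndarray, concept_b: np.ndarray) -> np.ndarray:
    """Initialize the new token at the midpoint of two concept embeddings."""
    return (concept_a + concept_b) / 2.0

def refine_token(token: np.ndarray, target: np.ndarray,
                 lr: float = 0.1, steps: int = 50) -> np.ndarray:
    """Move the token embedding toward a target embedding step by step."""
    for _ in range(steps):
        token = token + lr * (target - token)
    return token
```

Once such an embedding is learned, it can be dropped into any prompt like an ordinary word, which is why no extra training is needed per generation.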


Applying RLAIF for Code Generation with API-usage in Lightweight LLMs

Dutta, Sujan, Mahinder, Sayantan, Anantha, Raviteja, Bandyopadhyay, Bortik

arXiv.org Artificial Intelligence

Reinforcement Learning from AI Feedback (RLAIF) has demonstrated significant potential across various domains, including mitigating harm in LLM outputs, enhancing text summarization, and mathematical reasoning. This paper introduces an RLAIF framework for improving the code generation abilities of lightweight (<1B parameters) LLMs. We specifically focus on code generation tasks that require writing appropriate API calls, which is challenging due to the well-known issue of hallucination in LLMs. Our framework extracts AI feedback from a larger LLM (e.g., GPT-3.5) through a specialized prompting strategy and uses this data to train a reward model that better aligns smaller LLMs. We run our experiments on the Gorilla dataset and meticulously assess the quality of the model-generated code across various metrics, including AST, ROUGE, and Code-BLEU, and develop a pipeline to compute its executability rate accurately. Our approach significantly enhances the fine-tuned LLM baseline's performance, achieving a 4.5% improvement in executability rate. Notably, a smaller LLM (780M parameters) trained with RLAIF surpasses a much larger fine-tuned baseline with 7B parameters, achieving a 1.0% higher code executability rate.
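The headline metric here, executability rate, is the fraction of generated snippets that actually run. A deliberately simplified sketch of such a pipeline is below; a production version would sandbox execution and install the APIs under test, whereas this version only handles self-contained snippets.

```python
# Minimal executability-rate check: attempt to compile and execute each
# generated snippet in an empty namespace and count the ones that succeed.

def executability_rate(snippets: list[str]) -> float:
    ok = 0
    for code in snippets:
        try:
            exec(compile(code, "<generated>", "exec"), {})
            ok += 1
        except Exception:
            # Syntax errors, missing imports, and runtime errors all count
            # as non-executable.
            pass
    return ok / len(snippets)
```

This is also why hallucinated API calls hurt so directly: a snippet importing a nonexistent module fails at execution time and drags the rate down, which is the behavior the reward model is trained to penalize.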