HyperCLOVA
KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Application
Lee, Hwaran, Hong, Seokhee, Park, Joonsuk, Kim, Takyoung, Kim, Gunhee, Ha, Jung-Woo
Large language models (LLMs) learn not only natural text generation abilities but also social biases against different demographic groups from real-world data. This poses a critical risk when deploying LLM-based applications. Existing research and resources are not readily applicable in South Korea due to differences in language and culture, both of which significantly affect the biases and targeted demographic groups. This limitation calls for localized social bias datasets to ensure the safe and effective deployment of LLMs. To this end, we present KoSBi, a new social bias dataset of 34k pairs of contexts and sentences in Korean, covering 72 demographic groups in 15 categories. We find that, through filtering-based moderation, social biases in generated content can be reduced by 16.47%p on average for HyperCLOVA (30B and 82B) and GPT-3.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
- (11 more...)
- Government (0.67)
- Law (0.47)
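The filtering-based moderation the KoSBi abstract mentions can be pictured as a post-generation filter: a safety classifier scores each candidate generation, and anything scoring above a threshold is dropped. The sketch below is a minimal illustration only — the keyword scorer stands in for the real classifier, which the paper trains on the KoSBi dataset itself; the marker vocabulary and threshold are placeholders.

```python
# Minimal sketch of filtering-based moderation: score each generated
# sentence with a safety classifier and keep only those below a
# threshold. safety_score() is a stand-in keyword scorer, NOT the
# classifier trained in the paper.

UNSAFE_THRESHOLD = 0.5  # placeholder cutoff

def safety_score(sentence: str) -> float:
    """Stand-in for a trained bias/safety classifier.

    Returns a crude unsafe-ness score in [0, 1] based on a
    placeholder marker vocabulary.
    """
    unsafe_markers = {"stereotype", "slur"}  # placeholder vocabulary
    hits = sum(1 for w in sentence.lower().split() if w in unsafe_markers)
    return min(1.0, hits / 2)

def moderate(candidates: list[str]) -> list[str]:
    """Keep only generations whose unsafe score is below the threshold."""
    return [s for s in candidates if safety_score(s) < UNSAFE_THRESHOLD]

outputs = [
    "A neutral sentence about a demographic group.",
    "A sentence repeating a slur and a stereotype .",
]
print(moderate(outputs))  # only the first sentence survives the filter
```

The design point is that moderation sits outside the model: the generator is unchanged, and safety is enforced by rejecting outputs, which is what makes the reported 16.47%p reduction possible without retraining.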
SQuARe: A Large-Scale Dataset of Sensitive Questions and Acceptable Responses Created Through Human-Machine Collaboration
Lee, Hwaran, Hong, Seokhee, Park, Joonsuk, Kim, Takyoung, Cha, Meeyoung, Choi, Yejin, Kim, Byoung Pil, Kim, Gunhee, Lee, Eun-Ju, Lim, Yong, Oh, Alice, Park, Sangchul, Ha, Jung-Woo
The potential social harms that large language models pose, such as generating offensive content and reinforcing biases, are steeply rising. Existing works focus on coping with this concern while interacting with ill-intentioned users, such as those who explicitly make hate speech or elicit harmful responses. However, discussions on sensitive issues can become toxic even if the users are well-intentioned. For safer models in such scenarios, we present the Sensitive Questions and Acceptable Responses (SQuARe) dataset, a large-scale Korean dataset of 49k sensitive questions with 42k acceptable and 46k non-acceptable responses. The dataset was constructed leveraging HyperCLOVA in a human-in-the-loop manner based on real news headlines. Experiments show that acceptable response generation significantly improves for HyperCLOVA and GPT-3, demonstrating the efficacy of this dataset.
- Asia > China (0.04)
- North America > Dominican Republic (0.04)
- Asia > South Korea > Seoul > Seoul (0.04)
- (9 more...)
- Government (1.00)
- Education (1.00)
- Media > News (0.67)
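The human-in-the-loop construction the SQuARe abstract describes can be sketched as a pipeline: an LLM (HyperCLOVA in the paper) drafts sensitive questions from real news headlines and candidate responses, and human annotators label each response acceptable or not. Every function body below is a placeholder stub — the names and shapes are assumptions for illustration, not the authors' implementation.

```python
# Schematic sketch (stubs only) of a human-machine collaboration
# pipeline: headline -> LLM-drafted questions -> LLM-drafted responses
# -> human acceptability labels.

def draft_questions(headline: str, n: int = 3) -> list[str]:
    """Stub: an LLM would draft n sensitive questions from a headline."""
    return [f"[question {i} about: {headline}]" for i in range(n)]

def draft_responses(question: str, n: int = 2) -> list[str]:
    """Stub: an LLM would draft n candidate responses to a question."""
    return [f"[response {i} to: {question}]" for i in range(n)]

def human_label(response: str) -> bool:
    """Stub: an annotator would judge acceptability here."""
    return True  # placeholder label

def build_dataset(headlines: list[str]) -> list[dict]:
    """Run the pipeline and collect labeled (question, response) pairs."""
    records = []
    for headline in headlines:
        for question in draft_questions(headline):
            for response in draft_responses(question):
                records.append({
                    "question": question,
                    "response": response,
                    "acceptable": human_label(response),
                })
    return records

data = build_dataset(["Example news headline"])
print(len(data))  # 3 questions x 2 responses = 6 labeled records
```

Seeding the loop with real headlines is what keeps the sensitive questions grounded in actual public discourse rather than synthetic prompts.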
Tourist Guidance Robot Based on HyperCLOVA
Yamazaki, Takato, Yoshikawa, Katsumasa, Kawamoto, Toshiki, Ohagi, Masaya, Mizumoto, Tomoya, Ichimura, Shuta, Kida, Yusuke, Sato, Toshinori
This paper describes our system submitted to Dialogue Robot Competition 2022. Our proposed system is a combined model of rule-based and generation-based dialog systems. The system utilizes HyperCLOVA, a Japanese foundation model, not only to generate responses but also to summarize, search for information, and more. We also used our original speech recognition system, which was fine-tuned for this dialog task. As a result, our system ranked second in the preliminary round and moved on to the finals.
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.18)
- North America > Dominican Republic (0.04)
- Research Report (0.64)
- Personal > Interview (0.46)
- Information Technology > Artificial Intelligence > Robots (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.67)
- Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.56)
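A combined rule-based and generation-based dialog system, as the tourist-guidance abstract describes, typically routes each user turn through scripted rules first and falls back to free-form generation when no rule matches. The sketch below is an assumed minimal architecture, not the authors' implementation; the rule patterns, canned replies, and the LLM stub (standing in for HyperCLOVA) are all hypothetical.

```python
# Minimal sketch of rule-based-first dialog routing with a
# generation-based fallback. RULES and generate_with_llm() are
# placeholders, not the competition system's actual contents.
import re

RULES = [  # (pattern, canned reply) pairs for the rule-based layer
    (re.compile(r"\bopening hours\b", re.I), "The site is open 9:00-17:00."),
    (re.compile(r"\bticket\b", re.I), "Tickets are sold at the main gate."),
]

def generate_with_llm(utterance: str) -> str:
    """Stub for a foundation-model call (HyperCLOVA in the paper)."""
    return f"[generated reply to: {utterance}]"

def respond(utterance: str) -> str:
    """Try scripted rules first; fall back to free-form generation."""
    for pattern, reply in RULES:
        if pattern.search(utterance):
            return reply
    return generate_with_llm(utterance)

print(respond("What are the opening hours?"))
print(respond("Tell me about this shrine."))
```

The split gives predictable turns (hours, tickets) deterministic answers while open-ended questions still get fluent generated replies, which is the usual motivation for hybrid dialog designs.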
Naver trained a 'GPT-3-like' Korean language model
Naver, the Seongnam, South Korea-based company that operates the eponymous search engine, this week announced that it trained one of the largest AI language models of its kind, called HyperCLOVA. Naver claims that the system learned 6,500 times more Korean data than OpenAI's GPT-3 and contains 204 billion parameters, the parts of the machine learning model learned from historical training data. For the better part of a year, OpenAI's GPT-3 has remained among the largest AI language models ever created. Via an API, people have used it to automatically write emails and articles, summarize text, compose poetry and recipes, create website layouts, and generate code for deep learning in Python. But GPT-3 has key limitations, chief among them that it's only available in English.
- Asia > South Korea (0.25)
- North America > United States > California (0.05)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.49)