
AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions

Hatanpää, Väinö, Ku, Eugene, Stock, Jason, Emani, Murali, Foreman, Sam, Jung, Chunyong, Madireddy, Sandeep, Nguyen, Tung, Sastry, Varuni, Sinurat, Ray A. O., Wheeler, Sam, Zheng, Huihuo, Arcomano, Troy, Vishwanath, Venkatram, Kotamarthi, Rao

arXiv.org Artificial Intelligence

Generative machine learning offers new opportunities to better understand complex Earth system dynamics. Recent diffusion-based methods address spectral biases and improve ensemble calibration in weather forecasting compared to deterministic methods, yet have so far proven difficult to scale stably at high resolutions. We introduce AERIS, a 1.3 to 80B parameter pixel-level Swin diffusion transformer to address this gap, and SWiPe, a generalizable technique that composes window parallelism with sequence and pipeline parallelism to shard window-based transformers without added communication cost or increased global batch size. On Aurora (10,080 nodes), AERIS sustains 10.21 ExaFLOPS (mixed precision) and a peak performance of 11.21 ExaFLOPS with $1 \times 1$ patch size on the 0.25° ERA5 dataset, achieving 95.5% weak scaling efficiency and 81.6% strong scaling efficiency. AERIS outperforms the IFS ENS and remains stable on seasonal scales to 90 days, highlighting the potential of billion-parameter diffusion models for weather and climate prediction.
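The abstract quotes weak and strong scaling efficiencies without defining them. A minimal sketch of the conventional HPC definitions follows; the formulas are the standard ones (the paper may define them differently), and all numeric values below are illustrative, not measurements from AERIS:

```python
def strong_scaling_efficiency(t_base, n_base, t_scaled, n_scaled):
    """Strong scaling: total problem size is fixed, node count grows.
    Ideal runtime shrinks linearly, so efficiency = achieved / ideal speedup."""
    speedup = t_base / t_scaled
    ideal_speedup = n_scaled / n_base
    return speedup / ideal_speedup

def weak_scaling_efficiency(t_base, t_scaled):
    """Weak scaling: problem size grows with node count.
    Ideal runtime stays constant, so efficiency = t_base / t_scaled."""
    return t_base / t_scaled

# Hypothetical runs: 256 -> 1024 nodes on a fixed problem (strong scaling),
# and a per-node-constant workload whose step time drifts slightly (weak scaling).
print(strong_scaling_efficiency(t_base=100.0, n_base=256,
                                t_scaled=30.0, n_scaled=1024))  # 0.833...
print(weak_scaling_efficiency(t_base=100.0, t_scaled=104.7))    # ~0.955
```

An efficiency near 1.0 means the extra nodes are almost fully utilized; values like the paper's 95.5% (weak) and 81.6% (strong) at 10,080 nodes are unusually high at that scale.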


Open-source Swiss language model to be released this summer

AIHub

This summer, EPFL and ETH Zurich will release a large language model (LLM) developed on public infrastructure. Trained on the "Alps" supercomputer at the Swiss National Supercomputing Centre (CSCS), the new LLM marks a milestone in open-source AI and multilingual excellence. Earlier this month in Geneva, around 50 leading global initiatives and organisations dedicated to open-source LLMs and trustworthy AI convened at the International Open-Source LLM Builders Summit. Hosted by the AI centres of EPFL and ETH Zurich, the event marked a significant step in building a vibrant and collaborative international ecosystem for open foundation models. Open LLMs are increasingly viewed as credible alternatives to commercial systems, most of which are developed behind closed doors in the United States or China.


Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Nakamura, Taishi, Mishra, Mayank, Tedeschi, Simone, Chai, Yekun, Stillerman, Jason T, Friedrich, Felix, Yadav, Prateek, Laud, Tanmay, Chien, Vu Minh, Zhuo, Terry Yue, Misra, Diganta, Bogin, Ben, Vu, Xuan-Son, Karpinska, Marzena, Dantuluri, Arnav Varma, Kusa, Wojciech, Furlanello, Tommaso, Yokota, Rio, Muennighoff, Niklas, Pai, Suhas, Adewumi, Tosin, Laippala, Veronika, Yao, Xiaozhe, Junior, Adalberto, Ariyak, Alpay, Drozd, Aleksandr, Clive, Jordan, Gupta, Kshitij, Chen, Liangyu, Sun, Qi, Tsui, Ken, Persaud, Noah, Fahmy, Nour, Chen, Tianlong, Bansal, Mohit, Monti, Nicolo, Dang, Tai, Luo, Ziyang, Bui, Tien-Tung, Navigli, Roberto, Mehta, Virendra, Blumberg, Matthew, May, Victor, Nguyen, Huu, Pyysalo, Sampo

arXiv.org Artificial Intelligence

Pretrained language models underpin several AI applications, but their high computational cost for training limits accessibility. Initiatives such as BLOOM and StarCoder aim to democratize access to pretrained models for collaborative community development. However, existing open models face several challenges: limited multilingual capability; catastrophic forgetting under continual pretraining, while pretraining from scratch is computationally expensive; and compliance with AI safety and development laws. This paper presents Aurora-M, a 15B parameter multilingual open-source model trained on English, Finnish, Hindi, Japanese, Vietnamese, and code. Continually pretrained from StarCoderPlus on 435 billion additional tokens, Aurora-M surpasses 2 trillion tokens in total training token count. It is the first open-source multilingual model fine-tuned on human-reviewed safety instructions, thus aligning its development not only with conventional red-teaming considerations, but also with the specific concerns articulated in the Biden-Harris Executive Order on the Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence. Aurora-M is rigorously evaluated across various tasks and languages, demonstrating robustness against catastrophic forgetting and outperforming alternatives in multilingual settings, particularly in safety evaluations. To promote responsible open-source LLM development, Aurora-M and its variants are released at https://huggingface.co/collections/aurora-m/aurora-m-models-65fdfdff62471e09812f5407 .


Eva-KELLM: A New Benchmark for Evaluating Knowledge Editing of LLMs

Wu, Suhang, Peng, Minlong, Chen, Yue, Su, Jinsong, Sun, Mingming

arXiv.org Artificial Intelligence

Large language models (LLMs) possess a wealth of knowledge encoded in their parameters. However, this knowledge may become outdated or unsuitable over time. As a result, there has been a growing interest in knowledge editing for LLMs and evaluating its effectiveness. Existing studies primarily focus on knowledge editing using factual triplets, which not only incur high costs for collection but also struggle to express complex facts. Furthermore, these studies are often limited in their evaluation perspectives. In this paper, we propose Eva-KELLM, a new benchmark for evaluating knowledge editing of LLMs. This benchmark includes an evaluation framework and a corresponding dataset. Under our framework, we first ask the LLM to perform knowledge editing using raw documents, which provides a more convenient and universal approach compared to using factual triplets. We then evaluate the updated LLM from multiple perspectives. In addition to assessing the effectiveness of knowledge editing and the retention of unrelated knowledge from conventional studies, we further test the LLM's ability in two aspects: 1) Reasoning with the altered knowledge, aiming for the LLM to genuinely learn the altered knowledge instead of simply memorizing it. 2) Cross-lingual knowledge transfer, where the LLM updated with raw documents in one language should be capable of handling queries from another language. To facilitate further research, we construct and release the corresponding dataset. Using this benchmark, we investigate the effectiveness of several commonly-used knowledge editing methods. Experimental results indicate that the current methods for knowledge editing using raw documents are not effective in yielding satisfactory results, particularly when it comes to reasoning with altered knowledge and cross-lingual knowledge transfer.


Europe's fastest supercomputer is now connected to a quantum computer

New Scientist

A quantum computer has been connected to Europe's fastest supercomputer. It may be a step towards a new type of computing that combines traditional and quantum machines to quickly solve complex problems. The promise of quantum computers is that they will eventually complete calculations that are impossible for the most powerful conventional computers. Though many researchers are working to perfect quantum computers, some suggest that existing, imperfect quantum computers could be more useful if connected to traditional supercomputers.


US's Frontier supercomputer becomes the fastest in the world

Daily Mail - Science & tech

A supercomputer in the US called 'Frontier' has become the fastest in the world, beating its closest rival in Japan. Frontier, based at the US Department of Energy's Oak Ridge National Laboratory in Tennessee, is the first to achieve a level of computing known as 'exascale'. Exascale refers to a system that can perform at least one quintillion operations per second – a billion billion calculations, or 1 followed by 18 zeroes. This makes Frontier more than twice as powerful as the Fugaku supercomputer in Japan, which was deemed the world's fastest supercomputer back in June 2020. Frontier will allow scientists to develop technologies for the US's energy, economic and national security, said Oak Ridge National Laboratory, and solve computational problems that were impossible to do just five years ago.
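The "quintillion" figure in the article is just unit arithmetic: one exaFLOP/s is 10^18 floating-point operations per second, i.e. a billion billion. A trivial check, with a hypothetical 1 GFLOP/s laptop as the comparison point (that laptop speed is an assumption for illustration, not from the article):

```python
# Exascale: at least 10**18 operations per second.
EXA = 10**18
BILLION = 10**9

# "a billion billion calculations, or 1 followed by 18 zeroes"
assert EXA == BILLION * BILLION
assert len(str(EXA)) - 1 == 18  # the digit 1 followed by 18 zeros

# One second of exascale work, replayed on a hypothetical 1 GFLOP/s laptop,
# would take 10**9 seconds -- roughly 31.7 years.
seconds = EXA / BILLION
years = seconds / (365.25 * 24 * 3600)
print(round(years, 1))  # 31.7
```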


A Comic Walks Into a VR Comedy Club…

WSJ.com: WSJD - Technology

Samantha Gilweit, who is used to performing improv comedy in front of large crowds, went with her best opening bit, the one about drunken princesses that never fails. She delivered the punchline, and…dead silence. Her mind raced with how to recover. A second later, smiley-face emoji appeared over the heads of the animated robots and digitally rendered humanoids that stood in for the audience, followed by a gush of hearts. That's how "you knew you were killing it," the 31-year-old said.