AITopics | chuxin

Collaborating Authors

chuxin

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

ChuXin: 1.6B Technical Report

Zhuang, Xiaomin, Jiang, Yufan, He, Qiaozhi, Wu, Zhihua

arXiv.org Artificial IntelligenceMay-8-2024

Unlike the majority of works that only opensourced the model weights and architecture, we have made everything needed to train a model available, including the training data, the training process, and the evaluation code. Our goal is to empower and strengthen the open research community, fostering transparency and enabling a new wave of innovation in the field of language modeling. Furthermore, we extend the context length to 1M tokens through lightweight continual pretraining and demonstrate strong needlein-a-haystack retrieval performance. Countless models have been opensourced on AI communities like HuggingFace to facilitate their use by researchers (Bai et al., 2023; Singer et al., 2024; Zhang et al., 2024). These models can broadly be divided into two categories: 1) Open source model weights and data sources, which constitute the vast majority.

arxiv preprint arxiv, chuxin, language model, (14 more...)

arXiv.org Artificial Intelligence

2405.04828

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.89)
Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback