The Zamba2 Suite: Technical Report
Glorioso, Paolo, Anthony, Quentin, Tokpanov, Yury, Golubeva, Anna, Shyam, Vasudev, Whittington, James, Pilault, Jonathan, Millidge, Beren
In this technical report, we present the Zamba2 series -- a suite of 1.2B, 2.7B, and 7.4B parameter hybrid Mamba2-transformer models that achieve state-of-the-art performance against the leading open-weights models of their class while offering substantial gains in inference latency, throughput, and memory efficiency. The Zamba2 series builds upon our initial work with Zamba1-7B, optimizing its architecture, its training and annealing datasets, and training for up to three trillion tokens. We provide open-source weights for all models of the Zamba2 series, as well as instruction-tuned variants that are strongly competitive against comparable instruct-tuned models of their class. We additionally open-source the pretraining dataset, which we call Zyda-2, used to train the Zamba2 series of models. The models and datasets used in this work are openly available at https://huggingface.co/Zyphra
Real or Robotic? Assessing Whether LLMs Accurately Simulate Qualities of Human Responses in Dialogue
Ivey, Jonathan, Kumar, Shivani, Liu, Jiayu, Shen, Hua, Rakshit, Sushrita, Raju, Rohan, Zhang, Haotian, Ananthasubramaniam, Aparna, Kim, Junghwan, Yi, Bowen, Wright, Dustin, Israeli, Abraham, Møller, Anders Giovanni, Zhang, Lechen, Jurgens, David
Studying and building datasets for dialogue tasks is both expensive and time-consuming due to the need to recruit, train, and collect data from study participants. In response, much recent work has sought to use large language models (LLMs) to simulate both human-human and human-LLM interactions, as they have been shown to generate convincingly human-like text in many settings. However, to what extent do LLM-based simulations actually reflect human dialogues? In this work, we answer this question by generating a large-scale dataset of 100,000 paired LLM-LLM and human-LLM dialogues from the WildChat dataset and quantifying how well the LLM simulations align with their human counterparts. Overall, we find relatively low alignment between simulations and human interactions, demonstrating a systematic divergence along multiple textual properties, including style and content. Further, in comparisons of English, Chinese, and Russian dialogues, we find that models perform similarly across all three. Our results suggest that LLMs generally perform better when the human interlocutor writes in a way that is more similar to the LLM's own style.
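One simple way to quantify how well simulated turns align with human ones is to compare per-turn lexical feature vectors. The sketch below is a minimal illustration using bag-of-words cosine similarity, not the paper's actual alignment metrics; the example dialogue turns are invented:

```python
from collections import Counter
import math


def style_vector(text):
    """Crude stylistic fingerprint: bag-of-words counts (lowercased)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def alignment(human_turns, simulated_turns):
    """Mean per-turn similarity between paired human and simulated turns."""
    sims = [cosine(style_vector(h), style_vector(s))
            for h, s in zip(human_turns, simulated_turns)]
    return sum(sims) / len(sims)


# Toy paired dialogue: human turns and their LLM-simulated counterparts.
human = ["hey can you help me fix this bug?", "thanks, that worked!"]
simulated = ["Hello, could you assist me with this bug?",
             "Thank you, that resolved it."]
score = alignment(human, simulated)  # a low score indicates divergence
```

A real study would use much richer features (style, content, register) than raw word counts, but the pairing-then-averaging structure is the same.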
Zamba: A Compact 7B SSM Hybrid Model
Glorioso, Paolo, Anthony, Quentin, Tokpanov, Yury, Whittington, James, Pilault, Jonathan, Ibrahim, Adam, Millidge, Beren
In this technical report, we present Zamba, a novel 7B SSM-transformer hybrid model which achieves competitive performance against leading open-weight models at a comparable scale. Zamba is trained on 1T tokens from openly available datasets and is the best non-transformer model at this scale. Zamba pioneers a unique architecture combining a Mamba backbone with a single shared attention module, thus obtaining the benefits of attention at minimal parameter cost. Due to its architecture, Zamba is significantly faster at inference than comparable transformer models and requires substantially less memory for generation of long sequences. Zamba is pretrained in two phases: the first phase is based on existing web datasets, while the second one consists of annealing the model over high-quality instruct and synthetic datasets, and is characterized by a rapid learning rate decay. We open-source the weights and all checkpoints for Zamba, across both phase 1 and the annealing phase.
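The shared-attention idea can be illustrated with a structural sketch. The code below is a hypothetical, non-numerical mock-up in plain Python (no real attention or SSM math), showing only how a single attention module can be reused at multiple depths of a Mamba-style stack so that attention adds roughly one module's worth of parameters:

```python
class SharedAttention:
    """One attention block whose parameters are reused at every call site
    (placeholder: a real implementation would compute self-attention)."""
    def __init__(self):
        self.calls = 0  # track how often the shared weights are applied

    def __call__(self, x):
        self.calls += 1
        return x


class MambaBlock:
    """Placeholder for a Mamba (SSM) block."""
    def __call__(self, x):
        return x


class HybridModel:
    """A Mamba backbone interleaved with a single shared attention module."""
    def __init__(self, n_layers, attn_every):
        self.layers = [MambaBlock() for _ in range(n_layers)]
        self.shared_attn = SharedAttention()  # one set of attention weights
        self.attn_every = attn_every

    def forward(self, x):
        for i, layer in enumerate(self.layers):
            if i % self.attn_every == 0:
                x = self.shared_attn(x)  # same module reused at each site
            x = layer(x)
        return x


model = HybridModel(n_layers=6, attn_every=2)
out = model.forward("token embeddings")  # placeholder input
```

With six layers and attention applied every second layer, the single shared module is invoked three times during a forward pass while contributing only one module's worth of parameters.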
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Lee, Jinhyuk, Dai, Zhuyun, Ren, Xiaoqi, Chen, Blair, Cer, Daniel, Cole, Jeremy R., Hui, Kai, Boratko, Michael, Kapadia, Rajvi, Ding, Wen, Luan, Yi, Duddu, Sai Meher Karthik, Abrego, Gustavo Hernandez, Shi, Weiqiang, Gupta, Nithi, Kusupati, Aditya, Jain, Prateek, Jonnalagadda, Siddhartha Reddy, Chang, Ming-Wei, Naim, Iftekhar
Text embedding models represent natural language as dense vectors, positioning semantically similar text near each other within the embedding space (Gao et al., 2021; Le and Mikolov, 2014; Reimers and Gurevych, 2019). These embeddings are commonly used for a wide range of downstream tasks including document retrieval, sentence similarity, classification, and clustering (Muennighoff et al., 2023). Instead of building separate embedding models for each downstream task, recent efforts seek to create a single embedding model supporting many tasks. Developing such general-purpose text embedding models presents a challenge: they require large amounts of training data to comprehensively cover desired domains and skills. Recent embedding efforts have therefore focused on using extensive collections of training examples (Li et al., 2023; Wang et al., 2022).
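The nearest-neighbor behavior described above can be shown with a toy retrieval example. The vectors below are hypothetical hand-written embeddings (a real model such as Gecko would produce much higher-dimensional ones); documents are ranked by cosine similarity to the query embedding:

```python
import math


def cosine(u, v):
    """Cosine similarity between two dense vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u))
                  * math.sqrt(sum(b * b for b in v)))


# Hypothetical pre-computed 3-d embeddings; the values are invented
# purely for illustration.
docs = {
    "doc_sports": [0.9, 0.1, 0.0],
    "doc_cooking": [0.1, 0.9, 0.1],
    "doc_finance": [0.0, 0.2, 0.9],
}
query = [0.8, 0.2, 0.1]  # embedding of, say, a sports-related query

# Rank documents by similarity to the query in the embedding space.
ranked = sorted(docs, key=lambda d: cosine(query, docs[d]), reverse=True)
```

Because semantically related texts land near each other, the sports document ranks first for the sports-like query; the same ranking mechanism underlies retrieval, clustering, and similarity tasks.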
MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection
Piot, Paloma, Martín-Rodilla, Patricia, Parapar, Javier
Hate speech represents a pervasive and detrimental form of online discourse, often manifested through an array of slurs, from hateful tweets to defamatory posts. As such speech proliferates, it connects people globally and poses significant social, psychological, and occasionally physical threats to targeted individuals and communities. Current computational linguistic approaches for tackling this phenomenon rely on labelled social media datasets for training. To unify these efforts, our study addresses the critical need for a comprehensive meta-collection, advocating for an extensive dataset to help counteract this problem effectively. We scrutinized over 60 datasets, selectively integrating the pertinent ones into MetaHate. This paper offers a detailed examination of existing collections, highlighting their strengths and limitations. Our findings contribute to a deeper understanding of the existing datasets, paving the way for training more robust and adaptable models. These enhanced models are essential for effectively combating the dynamic and complex nature of hate speech in the digital realm.
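Building a meta-collection of this kind ultimately comes down to mapping each source dataset's label scheme onto one shared scheme. The sketch below is a hypothetical illustration of that normalization step, not MetaHate's actual schema; the example texts are neutral placeholders:

```python
def normalize(example, label_map):
    """Map a dataset-specific label onto a shared binary scheme
    (1 = hate speech, 0 = not hate speech)."""
    return {"text": example["text"], "label": label_map[example["label"]]}


# Two toy source datasets with incompatible label schemes (placeholder text).
dataset_a = [{"text": "example post A", "label": "hateful"},
             {"text": "example post B", "label": "none"}]
dataset_b = [{"text": "example post C", "label": 1}]

# Integrate both into one meta-collection under the shared scheme.
meta = ([normalize(ex, {"hateful": 1, "none": 0}) for ex in dataset_a]
        + [normalize(ex, {0: 0, 1: 1}) for ex in dataset_b])
```

Each source dataset contributes its own `label_map`, so adding a new collection only requires specifying how its labels project onto the unified scheme.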
Contributed: The power of AI in surgery
Artificial intelligence (AI), defined as algorithms that enable machines to perform cognitive functions (such as problem solving and decision-making), has been changing the face of healthcare for some time now through Machine Learning (ML) and Natural Language Processing (NLP). Its adoption in surgery, however, took longer than in other medical specialties, mainly because of limited information regarding the possibilities of computational implementation in practical surgery. Thanks to rapid recent developments, AI is currently perceived as a supplement to, not a replacement for, the skill of a human surgeon. And although the potential of the surgeon-patient-computer relationship is a long way from being fully explored, the use of AI in surgery is already driving significant changes for doctors and patients alike. For example, surgical planning and navigation have improved consistently through computed tomography (CT), ultrasound, and magnetic resonance imaging (MRI), while minimally invasive surgery (MIS), combined with robotic assistance, has resulted in decreased surgical trauma and improved patient recovery. Preoperative planning is the stage in which surgeons plan the surgical intervention based on the patient's medical records and imaging.
Channeling AI into Government Citizen Engagement (Contributed)
In recent years, the proliferation of digital technologies has created multiple customer service channels and touchpoints through which citizens can access online government services. Unfortunately, user experience is often overlooked in the design and deployment of these new digital services. Citizens' expectations of service are shaped not only by their interactions with government agencies, but also by their everyday digital experiences. For example, a recent Accenture survey of over 5,000 citizens from five countries found that as they encounter more user-friendly AI solutions in their daily lives, expectations for government use of these technologies increase. In this changing environment, the need for a convenient and seamless customer experience across all engagement channels has never been more pressing.