AITopics

Industry: Information Technology > Services (1.00)

Technology:

Information Technology > Cloud Computing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.30)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.30)

Diversity Preference-Aware Link Recommendation for Online Social Networks

Yin, Kexin, Fang, Xiao, Chen, Bintong, Sheng, Olivia

Link recommendation, which recommends links to connect unlinked online social network users, is a fundamental social network analytics problem with ample business implications. Existing link recommendation methods tend to recommend similar friends to a user but overlook the user's diversity preference, although social psychology theories suggest the criticality of diversity preference to link recommendation performance. In recommender systems, a field related to link recommendation, a number of diversification methods have been proposed to improve the diversity of recommended items. Nevertheless, diversity preference is distinct from diversity studied by diversification methods. To address these research gaps, we define and operationalize the concept of diversity preference for link recommendation and propose a new link recommendation problem: the diversity preference-aware link recommendation problem. We then analyze key properties of the new link recommendation problem and develop a novel link recommendation method to solve the problem. Using two large-scale online social network data sets, we conduct extensive empirical evaluations to demonstrate the superior performance of our method over representative diversification methods adapted for link recommendation as well as state-of-the-art link recommendation methods.

artificial intelligence, diversity preference, machine learning, (18 more...)

2205.10689

Country:

North America > United States > Delaware > New Castle County > Newark (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
North America > United States > New York (0.04)

Genre:

Overview (0.92)
Research Report > Experimental Study (0.68)
Research Report > New Finding (0.46)

Industry: Information Technology > Services (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

A GPU-specialized Inference Parameter Server for Large-Scale Deep Recommendation Models

Wei, Yingcan, Langer, Matthias, Yu, Fan, Lee, Minseok, Liu, Kingsley, Shi, Jerry, Wang, Joey

Recommendation systems are of crucial importance for a variety of modern apps and web services, such as news feeds, social networks, e-commerce, search, etc. To achieve peak prediction accuracy, modern recommendation models combine deep learning with terabyte-scale embedding tables to obtain a fine-grained representation of the underlying data. Traditional inference serving architectures require deploying the whole model to standalone servers, which is infeasible at such massive scale. In this paper, we provide insights into the intriguing and challenging inference domain of online recommendation systems. We propose the HugeCTR Hierarchical Parameter Server (HPS), an industry-leading distributed recommendation inference framework, that combines a high-performance GPU embedding cache with an hierarchical storage architecture, to realize low-latency retrieval of embeddings for online model inference tasks. Among other things, HPS features (1) a redundant hierarchical storage system, (2) a novel high-bandwidth cache to accelerate parallel embedding lookup on NVIDIA GPUs, (3) online training support and (4) light-weight APIs for easy integration into existing large-scale recommendation workflows. To demonstrate its capabilities, we conduct extensive studies using both synthetically engineered and public datasets. We show that our HPS can dramatically reduce end-to-end inference latency, achieving 5~62x speedup (depending on the batch size) over CPU baseline implementations for popular recommendation models. Through multi-GPU concurrent deployment, the HPS can also greatly increase the inference QPS.

artificial intelligence, cache, machine learning, (18 more...)

doi: 10.1145/3523227.3546765

2210.08804

Country:

North America > United States > Washington > King County > Seattle (0.05)
North America > United States > New York > New York County > New York City (0.05)
Asia > China > Shanghai > Shanghai (0.04)
(11 more...)

Genre: Research Report (0.50)

Industry:

Information Technology > Services (0.68)
Education > Educational Setting > Online (0.68)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.86)

Knowledge Distillation Transfer Sets and their Impact on Downstream NLU Tasks

Peris, Charith, Tan, Lizhen, Gueudre, Thomas, Gojayev, Turan, Wei, Pan, Oz, Gokmen

Teacher-student knowledge distillation is a popular technique for compressing today's prevailing large language models into manageable sizes that fit low-latency downstream applications. Both the teacher and the choice of transfer set used for distillation are crucial ingredients in creating a high quality student. Yet, the generic corpora used to pretrain the teacher and the corpora associated with the downstream target domain are often significantly different, which raises a natural question: should the student be distilled over the generic corpora, so as to learn from high-quality teacher predictions, or over the downstream task corpora to align with finetuning? Our study investigates this trade-off using Domain Classification (DC) and Intent Classification/Named Entity Recognition (ICNER) as downstream tasks. We distill several multilingual students from a larger multilingual LM with varying proportions of generic and task-specific datasets, and report their performance after finetuning on DC and ICNER. We observe significant improvements across tasks and test sets when only task-specific corpora is used. We also report on how the impact of adding task-specific data to the transfer set correlates with the similarity between generic and task-specific data. Our results clearly indicate that, while distillation from a generic LM benefits downstream tasks, students learn better using target domain data even if it comes at the price of noisier teacher predictions. In other words, target domain data still trumps teacher knowledge.

artificial intelligence, chatbot, natural language, (20 more...)

2210.04834

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Italy > Piedmont > Turin Province > Turin (0.04)
Europe > Germany > Berlin (0.04)
Asia > India > Bihar > Patna (0.04)

Genre: Research Report > New Finding (0.66)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.93)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.86)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.68)

Disentangling Confidence Score Distribution for Out-of-Domain Intent Detection with Energy-Based Learning

Wu, Yanan, Zeng, Zhiyuan, He, Keqing, Mou, Yutao, Wang, Pei, Yan, Yuanmeng, Xu, Weiran

Detecting Out-of-Domain (OOD) or unknown intents from user queries is essential in a task-oriented dialog system. Traditional softmax-based confidence scores are susceptible to the overconfidence issue. In this paper, we propose a simple but strong energy-based score function to detect OOD where the energy scores of OOD samples are higher than IND samples. Further, given a small set of labeled OOD samples, we introduce an energy-based margin objective for supervised OOD detection to explicitly distinguish OOD samples from INDs. Comprehensive experiments and analysis prove our method helps disentangle confidence score distributions of IND and OOD data.\footnote{Our code is available at \url{https://github.com/pris-nlp/EMNLP2022-energy_for_OOD/}.}

detection, machine learning, natural language, (19 more...)

2210.0883

Country:

Asia > China > Beijing > Beijing (0.05)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
Asia > India (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.54)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.49)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.46)

Combrink, Herkulaas MvE, Marivate, Vukosi, Rosman, Benjamin

A Framework for Undergraduate Data Collection Strategies for Student Support Recommendation Systems in Higher Education

arXiv.org Artificial IntelligenceOct-16-2022

Understanding which student support strategies mitigate dropout and improve student retention is an important part of modern higher educational research. One of the largest challenges institutions of higher learning currently face is the scalability of student support. Part of this is due to the shortage of staff addressing the needs of students, and the subsequent referral pathways associated to provide timeous student support strategies. This is further complicated by the difficulty of these referrals, especially as students are often faced with a combination of administrative, academic, social, and socio-economic challenges. A possible solution to this problem can be a combination of student outcome predictions and applying algorithmic recommender systems within the context of higher education. While much effort and detail has gone into the expansion of explaining algorithmic decision making in this context, there is still a need to develop data collection strategies Therefore, the purpose of this paper is to outline a data collection framework specific to recommender systems within this context in order to reduce collection biases, understand student characteristics, and find an ideal way to infer optimal influences on the student journey. If confirmation biases, challenges in data sparsity and the type of information to collect from students are not addressed, it will have detrimental effects on attempts to assess and evaluate the effects of these systems within higher education.

artificial intelligence, collection strategy, recommender system, (15 more...)

2210.10657

Country:

North America > United States > New York (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
Africa > South Africa > Gauteng > Pretoria (0.04)
(5 more...)

Genre:

Research Report (0.82)
Instructional Material (0.68)

Industry: Education > Educational Setting > Higher Education (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)

#artificialintelligenceOct-15-2022, 18:35:12 GMT

How is Artificial Intelligence ruling the Medical Industry?

As technology evolves, especially in the field of Artificial Intelligence, its dominance is pervading in every industry rapidly like never before and is becoming increasingly popular and valuable daily. With the use of AI, numerous autonomous applications are created to make human life and task easy. In addition, researchers are working tirelessly around the clock to make this technology more potent in the Medical Industry to provide better medical treatments, accurate diagnoses, elevate service deliveries, and more. Machine learning models are used to implement AI in the medical sector, enabling the search of healthcare data for better patient experiences and improved health results. A massive amount of machine-learning datasets and several algorithms with extraordinary decision-making capabilities are used by artificial intelligence to provide hands-on solutions based on the user requirements in every sector.

artificial intelligence, health, medical industry, (13 more...)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Diagnostic Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.32)

#artificialintelligenceOct-15-2022, 18:15:17 GMT

La veille de la cybersécurité

Over the last decade, Artificial intelligence (AI) has become embedded in every aspect of our society and lives. From chatbots and virtual assistants like Siri and Alexa to automated industrial machinery and self-driving cars, it's hard to ignore its impact. Today, the technology most commonly used to achieve AI is machine learning – advanced software algorithms designed to carry out one specific task, such as answering questions, translating languages or navigating a journey – and become increasingly good at it as they are exposed to more and more data. Worldwide, spending by governments and business on AI technology will top $500 billion in 2023, according to IDC research. But how will it be used, and what impact will it have?

veille

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.62)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.62)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.62)

#artificialintelligenceOct-15-2022, 15:49:12 GMT

Question Answering Over Biological Knowledge Graph via Amazon Alexa

Structured and unstructured data and facts about drugs, genes, protein, viruses, and their mechanism are spread across a huge number of scientific articles. These articles are a large-scale knowledge source and can have a huge impact on disseminating knowledge about the mechanisms of certain biological processes. A knowledge graph (KG) can be constructed by integrating such facts and data and be used for data integration, exploration, and federated queries. However, exploration and querying large-scale KGs is tedious for certain groups of users due to a lack of knowledge about underlying data assets or semantic technologies. A question-answering (QA) system allows the answer of natural language questions over KGs automatically using triples contained in a KG.

amazon alexa, biological knowledge graph, mechanism

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.65)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.65)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.59)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.45)

#artificialintelligenceOct-14-2022, 14:25:05 GMT

The Future Of A.I As Personal Assistance- Jarvis Is Coming

Artificial intelligence is coming to your phone and the future of AI is personal assistance. This means that AI will be able to help with your daily tasks, from making suggestions on what you should wear or buy, to answering questions about history or science. The use cases for this technology are endless but especially interesting when it comes to personal assistants and virtual assistants. In order for AI assistants to be useful, they need to make decisions that are measurable and explainable. They also have to be able to learn from their mistakes and improve over time.

ai assistant, make decision, personal assistance-jarvis, (5 more...)

Industry: Leisure & Entertainment (0.33)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > The Future (0.91)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.91)