Goto

Collaborating Authors

 Personal Assistant Systems


Digital Transformation Examples for Business Success - DATAVERSITY

#artificialintelligence

Today, most businesses are embracing digital transformation to meet ever-increasing customer expectations and to remain competitive and relevant in the world economy. While many organizations still struggle to adopt new technologies โ€“ often due to the lack of a tailored digital strategy and the reluctance of many to change the way of business operations that have been continuing for decades โ€“ digital transformation examples abound across all industries. According to Statista, worldwide digital transformation spending is projected to reach $2.8 trillion in 2025. Although digital transformation undeniably makes businesses, regardless of their size and kind, more viable and easily accessible to customers, becoming a digital-first organization is not as simple as it seems. Digital transformation involves simple processes like the integration of communication tools as well as complex ones such as the migration of entire business operations to a cloud-based platform.


Diversity Preference-Aware Link Recommendation for Online Social Networks

arXiv.org Artificial Intelligence

Link recommendation, which recommends links to connect unlinked online social network users, is a fundamental social network analytics problem with ample business implications. Existing link recommendation methods tend to recommend similar friends to a user but overlook the user's diversity preference, although social psychology theories suggest the criticality of diversity preference to link recommendation performance. In recommender systems, a field related to link recommendation, a number of diversification methods have been proposed to improve the diversity of recommended items. Nevertheless, diversity preference is distinct from diversity studied by diversification methods. To address these research gaps, we define and operationalize the concept of diversity preference for link recommendation and propose a new link recommendation problem: the diversity preference-aware link recommendation problem. We then analyze key properties of the new link recommendation problem and develop a novel link recommendation method to solve the problem. Using two large-scale online social network data sets, we conduct extensive empirical evaluations to demonstrate the superior performance of our method over representative diversification methods adapted for link recommendation as well as state-of-the-art link recommendation methods.


A GPU-specialized Inference Parameter Server for Large-Scale Deep Recommendation Models

arXiv.org Artificial Intelligence

Recommendation systems are of crucial importance for a variety of modern apps and web services, such as news feeds, social networks, e-commerce, search, etc. To achieve peak prediction accuracy, modern recommendation models combine deep learning with terabyte-scale embedding tables to obtain a fine-grained representation of the underlying data. Traditional inference serving architectures require deploying the whole model to standalone servers, which is infeasible at such massive scale. In this paper, we provide insights into the intriguing and challenging inference domain of online recommendation systems. We propose the HugeCTR Hierarchical Parameter Server (HPS), an industry-leading distributed recommendation inference framework, that combines a high-performance GPU embedding cache with an hierarchical storage architecture, to realize low-latency retrieval of embeddings for online model inference tasks. Among other things, HPS features (1) a redundant hierarchical storage system, (2) a novel high-bandwidth cache to accelerate parallel embedding lookup on NVIDIA GPUs, (3) online training support and (4) light-weight APIs for easy integration into existing large-scale recommendation workflows. To demonstrate its capabilities, we conduct extensive studies using both synthetically engineered and public datasets. We show that our HPS can dramatically reduce end-to-end inference latency, achieving 5~62x speedup (depending on the batch size) over CPU baseline implementations for popular recommendation models. Through multi-GPU concurrent deployment, the HPS can also greatly increase the inference QPS.


Knowledge Distillation Transfer Sets and their Impact on Downstream NLU Tasks

arXiv.org Artificial Intelligence

Teacher-student knowledge distillation is a popular technique for compressing today's prevailing large language models into manageable sizes that fit low-latency downstream applications. Both the teacher and the choice of transfer set used for distillation are crucial ingredients in creating a high quality student. Yet, the generic corpora used to pretrain the teacher and the corpora associated with the downstream target domain are often significantly different, which raises a natural question: should the student be distilled over the generic corpora, so as to learn from high-quality teacher predictions, or over the downstream task corpora to align with finetuning? Our study investigates this trade-off using Domain Classification (DC) and Intent Classification/Named Entity Recognition (ICNER) as downstream tasks. We distill several multilingual students from a larger multilingual LM with varying proportions of generic and task-specific datasets, and report their performance after finetuning on DC and ICNER. We observe significant improvements across tasks and test sets when only task-specific corpora is used. We also report on how the impact of adding task-specific data to the transfer set correlates with the similarity between generic and task-specific data. Our results clearly indicate that, while distillation from a generic LM benefits downstream tasks, students learn better using target domain data even if it comes at the price of noisier teacher predictions. In other words, target domain data still trumps teacher knowledge.


Disentangling Confidence Score Distribution for Out-of-Domain Intent Detection with Energy-Based Learning

arXiv.org Artificial Intelligence

Detecting Out-of-Domain (OOD) or unknown intents from user queries is essential in a task-oriented dialog system. Traditional softmax-based confidence scores are susceptible to the overconfidence issue. In this paper, we propose a simple but strong energy-based score function to detect OOD where the energy scores of OOD samples are higher than IND samples. Further, given a small set of labeled OOD samples, we introduce an energy-based margin objective for supervised OOD detection to explicitly distinguish OOD samples from INDs. Comprehensive experiments and analysis prove our method helps disentangle confidence score distributions of IND and OOD data.\footnote{Our code is available at \url{https://github.com/pris-nlp/EMNLP2022-energy_for_OOD/}.}


A Framework for Undergraduate Data Collection Strategies for Student Support Recommendation Systems in Higher Education

arXiv.org Artificial Intelligence

Understanding which student support strategies mitigate dropout and improve student retention is an important part of modern higher educational research. One of the largest challenges institutions of higher learning currently face is the scalability of student support. Part of this is due to the shortage of staff addressing the needs of students, and the subsequent referral pathways associated to provide timeous student support strategies. This is further complicated by the difficulty of these referrals, especially as students are often faced with a combination of administrative, academic, social, and socio-economic challenges. A possible solution to this problem can be a combination of student outcome predictions and applying algorithmic recommender systems within the context of higher education. While much effort and detail has gone into the expansion of explaining algorithmic decision making in this context, there is still a need to develop data collection strategies Therefore, the purpose of this paper is to outline a data collection framework specific to recommender systems within this context in order to reduce collection biases, understand student characteristics, and find an ideal way to infer optimal influences on the student journey. If confirmation biases, challenges in data sparsity and the type of information to collect from students are not addressed, it will have detrimental effects on attempts to assess and evaluate the effects of these systems within higher education.


How is Artificial Intelligence ruling the Medical Industry?

#artificialintelligence

As technology evolves, especially in the field of Artificial Intelligence, its dominance is pervading in every industry rapidly like never before and is becoming increasingly popular and valuable daily. With the use of AI, numerous autonomous applications are created to make human life and task easy. In addition, researchers are working tirelessly around the clock to make this technology more potent in the Medical Industry to provide better medical treatments, accurate diagnoses, elevate service deliveries, and more. Machine learning models are used to implement AI in the medical sector, enabling the search of healthcare data for better patient experiences and improved health results. A massive amount of machine-learning datasets and several algorithms with extraordinary decision-making capabilities are used by artificial intelligence to provide hands-on solutions based on the user requirements in every sector.


La veille de la cybersรฉcuritรฉ

#artificialintelligence

Over the last decade, Artificial intelligence (AI) has become embedded in every aspect of our society and lives. From chatbots and virtual assistants like Siri and Alexa to automated industrial machinery and self-driving cars, it's hard to ignore its impact. Today, the technology most commonly used to achieve AI is machine learning โ€“ advanced software algorithms designed to carry out one specific task, such as answering questions, translating languages or navigating a journey โ€“ and become increasingly good at it as they are exposed to more and more data. Worldwide, spending by governments and business on AI technology will top $500 billion in 2023, according to IDC research. But how will it be used, and what impact will it have?


Question Answering Over Biological Knowledge Graph via Amazon Alexa

#artificialintelligence

Structured and unstructured data and facts about drugs, genes, protein, viruses, and their mechanism are spread across a huge number of scientific articles. These articles are a large-scale knowledge source and can have a huge impact on disseminating knowledge about the mechanisms of certain biological processes. A knowledge graph (KG) can be constructed by integrating such facts and data and be used for data integration, exploration, and federated queries. However, exploration and querying large-scale KGs is tedious for certain groups of users due to a lack of knowledge about underlying data assets or semantic technologies. A question-answering (QA) system allows the answer of natural language questions over KGs automatically using triples contained in a KG.


The Future Of A.I As Personal Assistance- Jarvis Is Coming

#artificialintelligence

Artificial intelligence is coming to your phone and the future of AI is personal assistance. This means that AI will be able to help with your daily tasks, from making suggestions on what you should wear or buy, to answering questions about history or science. The use cases for this technology are endless but especially interesting when it comes to personal assistants and virtual assistants. In order for AI assistants to be useful, they need to make decisions that are measurable and explainable. They also have to be able to learn from their mistakes and improve over time.