AITopics | Overview

Collaborating Authors

Overview

Open World Learning Graph Convolution for Latency Estimation in Routing Networks

Jin, Yifei, Daoutis, Marios, Girdzijauskas, Sarunas, Gionis, Aristides

arXiv.org Artificial IntelligenceJul-8-2022

Accurate routing network status estimation is a key component in Software Defined Networking. However, existing deep-learning-based methods for modeling network routing are not able to extrapolate towards unseen feature distributions. Nor are they able to handle scaled and drifted network attributes in test sets that include open-world inputs. To deal with these challenges, we propose a novel approach for modeling network routing, using Graph Neural Networks. Our method can also be used for network-latency estimation. Supported by a domain-knowledge-assisted graph formulation, our model shares a stable performance across different network sizes and configurations of routing networks, while at the same time being able to extrapolate towards unseen sizes, configurations, and user behavior. We show that our model outperforms most conventional deep-learning-based models, in terms of prediction accuracy, computational resources, inference speed, as well as ability to generalize towards open-world input.

formulation, node, od pair, (14 more...)

arXiv.org Artificial Intelligence

2207.14643

Country:

Europe > Austria > Salzburg > Salzburg (0.05)
Europe > Sweden > Stockholm > Stockholm (0.05)

Genre:

Research Report > Promising Solution (0.48)
Overview > Innovation (0.34)

Industry:

Telecommunications (0.46)
Information Technology (0.46)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

One for All: Simultaneous Metric and Preference Learning over Multiple Users

Canal, Gregory, Mason, Blake, Vinayak, Ramya Korlakai, Nowak, Robert

arXiv.org Artificial IntelligenceJul-7-2022

This paper investigates simultaneous preference and metric learning from a crowd of respondents. A set of items represented by $d$-dimensional feature vectors and paired comparisons of the form ``item $i$ is preferable to item $j$'' made by each user is given. Our model jointly learns a distance metric that characterizes the crowd's general measure of item similarities along with a latent ideal point for each user reflecting their individual preferences. This model has the flexibility to capture individual preferences, while enjoying a metric learning sample cost that is amortized over the crowd. We first study this problem in a noiseless, continuous response setting (i.e., responses equal to differences of item distances) to understand the fundamental limits of learning. Next, we establish prediction error guarantees for noisy, binary measurements such as may be collected from human respondents, and show how the sample complexity improves when the underlying metric is low-rank. Finally, we establish recovery guarantees under assumptions on the response distribution. We demonstrate the performance of our model on both simulated data and on a dataset of color preference judgements across a large number of users.

probability, proposition 2, selection matrix, (13 more...)

arXiv.org Artificial Intelligence

2207.03609

Country:

North America > United States > Wisconsin > Dane County > Madison (0.14)
North America > United States > Texas > Harris County > Houston (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report (1.00)
Overview (0.85)

Industry: Education (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Word Embedding for Social Sciences: An Interdisciplinary Survey

Matsui, Akira, Ferrara, Emilio

arXiv.org Artificial IntelligenceJul-7-2022

To extract essential information from complex data, computer scientists have been developing machine learning models that learn low-dimensional representation mode. From such advances in machine learning research, not only computer scientists but also social scientists have benefited and advanced their research because human behavior or social phenomena lies in complex data. To document this emerging trend, we survey the recent studies that apply word embedding techniques to human behavior mining, building a taxonomy to illustrate the methods and procedures used in the surveyed papers and highlight the recent emerging trends applying word embedding models to non-textual human behavior data. This survey conducts a simple experiment to warn that common similarity measurements used in the literature could yield different results even if they return consistent results at an aggregate level.

arXiv.org Artificial Intelligence

2207.03086

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.28)
Asia > Japan > Honshū > Kantō > Kanagawa Prefecture > Yokohama (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
(2 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Government (0.93)
Education (0.68)
Media > News (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.95)

Add feedback

Towards Knowledge-based Mining of Mental Disorder Patterns from Textual Data

Shahabikargar, Maryam

arXiv.org Artificial IntelligenceJul-7-2022

Mental health disorders may cause severe consequences on all the countries' economies and health. For example, the impacts of the COVID-19 pandemic, such as isolation and travel ban, can make us feel depressed. Identifying early signs of mental health disorders is vital. For example, depression may increase an individual's risk of suicide. The state-of-the-art research in identifying mental disorder patterns from textual data, uses hand-labelled training sets, especially when a domain expert's knowledge is required to analyse various symptoms. This task could be time-consuming and expensive. To address this challenge, in this paper, we study and analyse the various clinical and non-clinical approaches to identifying mental health disorders. We leverage the domain knowledge and expertise in cognitive science to build a domain-specific Knowledge Base (KB) for the mental health disorder concepts and patterns. We present a weaker form of supervision by facilitating the generating of training data from a domain-specific Knowledge Base (KB). We adopt a typical scenario for analysing social media to identify major depressive disorder symptoms from the textual content generated by social users. We use this scenario to evaluate how our knowledge-based approach significantly improves the quality of results.

depression, disorder, symptom, (15 more...)

arXiv.org Artificial Intelligence

2207.06254

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Oceania > New Zealand (0.04)
South America > Argentina > Patagonia > Río Negro Province > Viedma (0.04)
(8 more...)

Genre:

Research Report > New Finding (1.00)
Questionnaire & Opinion Survey (1.00)
Overview (0.92)
Research Report > Experimental Study (0.92)

Industry: Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (1.00)

Technology:

Information Technology > Knowledge Management > Knowledge Engineering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A Comprehensive Framework for Learning Declarative Action Models

Aineto, Diego | Jiménez, Sergio (Universitat Politècnica de València) | Onaindia, Eva (Universitat Politècnica de València)

Journal of Artificial Intelligence ResearchJul-7-2022

A declarative action model is a compact representation of the state transitions of dynamic systems that generalizes over world objects. The specification of declarative action models is often a complex hand-crafted task. In this paper we formulate declarative action models via state constraints, and present the learning of such models as a combinatorial search. The comprehensive framework presented here allows us to connect the learning of declarative action models to well-known problem solving tasks. In addition, our framework allows us to characterize the existing work in the literature according to four dimensions: (1) the target action models, in terms of the state transitions they define; (2) the available learning examples; (3) the functions used to guide the learning process, and to evaluate the quality of the learned action models; (4) the learning algorithm. Last, the paper lists relevant successful applications of the learning of declarative actions models and discusses some open challenges with the aim of encouraging future research work.

action model, artificial intelligence, declarative action model, (16 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.13073

AI Access Foundation

13073

Journal of Artificial Intelligence Research

Country:

Africa > Senegal > Louga Region > Louga (0.04)
Europe > France (0.04)
Africa > South Sudan > Equatoria > Central Equatoria > Juba (0.04)
(3 more...)

Genre:

Overview (0.67)
Workflow (0.67)

Industry: Leisure & Entertainment > Games (0.67)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
(6 more...)

Add feedback

Sparse Weight Activation Training- Reduce memory and training time in Machine Learning

#artificialintelligenceJul-6-2022, 14:02:51 GMT

A little bit ago, I covered Google AI's pathways architecture, calling it a revolution in Machine Learning. One of the standouts in Google's novel approach was the implementation of sparse activation in their training architecture. I liked this idea so much that I decided to explore this in a lot more depth. That's where I came across Sparse Weight Activation Training (SWAT), by some researchers at the Department of Electrical And Computer Engineering, University of British Columbia. And the paper definitely has me excited.

activation, machine learning, weight activation training-reduce memory, (11 more...)

#artificialintelligence

Country: North America > Canada > British Columbia (0.25)

Genre:

Research Report > Promising Solution (0.55)
Overview (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.31)

Add feedback

Roadmap for Data Science 2022

#artificialintelligenceJul-6-2022, 05:10:55 GMT

This article will help you strengthen your plan by providing you with a learning framework, resources, and project ideas to aid in the development of a robust portfolio of work demonstrating data science ability. Just a note: I created this roadmap based on my own data science experience. This roadmap can be customised to fit any topic or field of study that interests you. Also, because Python is my preferred programming language, this was built with it in mind. What is the purpose of a learning roadmap?

learning, regression, roadmap, (13 more...)

#artificialintelligence

Genre: Overview (0.40)

Industry: Information Technology > Services (0.71)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.97)
Information Technology > Software > Programming Languages (0.93)

Add feedback

Brainish: Formalizing A Multimodal Language for Intelligence and Consciousness

Liang, Paul Pu

arXiv.org Artificial IntelligenceJul-6-2022

Having a rich multimodal inner language is an important component of human intelligence that enables several necessary core cognitive functions such as multimodal prediction, translation, and generation. Building upon the Conscious Turing Machine (CTM), a machine model for consciousness proposed by Blum and Blum (2021), we describe the desiderata of a multimodal language called Brainish, comprising words, images, audio, and sensations combined in representations that the CTM's processors use to communicate with each other. We define the syntax and semantics of Brainish before operationalizing this language through the lens of multimodal artificial intelligence, a vibrant research area studying the computational tools necessary for processing and relating information from heterogeneous signals. Our general framework for learning Brainish involves designing (1) unimodal encoders to segment and represent unimodal data, (2) a coordinated representation space that relates and composes unimodal features to derive holistic meaning across multimodal inputs, and (3) decoders to map multimodal representations into predictions (for fusion) or raw data (for translation or generation). Through discussing how Brainish is crucial for communication and coordination in order to achieve consciousness in the CTM, and by implementing a simple version of Brainish and evaluating its capability of demonstrating intelligence on multimodal prediction and retrieval tasks on several real-world image, text, and audio datasets, we argue that such an inner language will be important for advances in machine models of intelligence and consciousness.

artificial intelligence, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2205.00001

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > New York > New York County > New York City (0.04)
(2 more...)

Genre:

Research Report (0.64)
Overview (0.46)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(4 more...)

Add feedback

Tensor networks in machine learning

Sengupta, Richik, Adhikary, Soumik, Oseledets, Ivan, Biamonte, Jacob

arXiv.org Artificial IntelligenceJul-6-2022

A tensor network is a type of decomposition used to express and approximate large arrays of data. A given data-set, quantum state or higher dimensional multi-linear map is factored and approximated by a composition of smaller multi-linear maps. This is reminiscent to how a Boolean function might be decomposed into a gate array: this represents a special case of tensor decomposition, in which the tensor entries are replaced by 0, 1 and the factorisation becomes exact. The collection of associated techniques are called, tensor network methods: the subject developed independently in several distinct fields of study, which have more recently become interrelated through the language of tensor networks. The tantamount questions in the field relate to expressability of tensor networks and the reduction of computational overheads. A merger of tensor networks with machine learning is natural. On the one hand, machine learning can aid in determining a factorization of a tensor network approximating a data set. On the other hand, a given tensor network structure can be viewed as a machine learning model. Herein the tensor network parameters are adjusted to learn or classify a data-set. In this survey we recover the basics of tensor networks and explain the ongoing effort to develop the theory of tensor networks in machine learning.

artificial intelligence, machine learning, representation, (13 more...)

arXiv.org Artificial Intelligence

2207.02851

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Russia (0.04)
Asia > Russia (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Overview (0.86)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

A Survey on Hyperlink Prediction

Chen, Can, Liu, Yang-Yu

arXiv.org Artificial IntelligenceJul-6-2022

As a natural extension of link prediction on graphs, hyperlink prediction aims for the inference of missing hyperlinks in hypergraphs, where a hyperlink can connect more than two nodes. Hyperlink prediction has applications in a wide range of systems, from chemical reaction networks, social communication networks, to protein-protein interaction networks. In this paper, we provide a systematic and comprehensive survey on hyperlink prediction. We propose a new taxonomy to classify existing hyperlink prediction methods into four categories: similarity-based, probability-based, matrix optimization-based, and deep learning-based methods. To compare the performance of methods from different categories, we perform a benchmark study on various hypergraph applications using representative methods from each category. Notably, deep learning-based methods prevail over other methods in hyperlink prediction.

artificial intelligence, hyperlink, machine learning, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/TNNLS.2023.3286280

2207.02911

Country:

North America > United States > Massachusetts > Suffolk County > Boston (0.04)
Asia > China (0.04)
Africa > Senegal > Kolda Region > Kolda (0.04)

Genre:

Research Report (0.64)
Overview (0.48)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback