AITopics

Transfer learning proves to be effective for leveraging labeled data in the source domain to build an accurate classifier in the target domain. The basic assumption behind transfer learning is that the involved domains share some common latent factors. Previous methods usually explore these latent factors by optimizing two separate objective functions, i.e., either maximizing the empirical likelihood, or preserving the geometric structure. Actually, these two objective functions are complementary to each other and optimizing them simultaneously can make the solution smoother and further improve the accuracy of the final model. In this paper, we propose a novel approach called Graph co-regularized Transfer Learning (GTL) for this purpose, which integrates the two objective functions seamlessly into one unified optimization problem. Thereafter, we present an iterative algorithm for the optimization problem with rigorous analysis on convergence and complexity. Our empirical study on two open data sets validates that GTL can consistently improve the classification accuracy compared to the state-of-the-art transfer learning methods.

artificial intelligence, geometric structure, optimization problem, (15 more...)

Twenty-Sixth AAAI Conference on Artificial Intelligence

Country:

Asia > China (0.28)
North America > United States (0.28)

Genre: Research Report (0.49)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.86)

An Efficient Simulation-Based Approach to Ambulance Fleet Allocation and Dynamic Redeployment

Yue, Yisong (Carnegie Mellon University) | Marla, Lavanya (Carnegie Mellon University) | Krishnan, Ramayya (Carnegie Mellon University)

We present an efficient approach to ambulance fleet allocation and dynamic redeployment, where the goal is to position an entire fleet of ambulances to base locations to maximize the service level (or utility) of the Emergency Medical Services (EMS) system. We take a simulation-based approach, where the utility of an allocation is measured by directly simulating emergency requests. In both the static and dynamic settings, this modeling approach leads to an exponentially large action space (with respect to the number of ambulances). Futhermore, the utility of any particular allocation can only be measured via a seemingly “black box” simulator. Despite this complexity, we show that embedding our simulator within a simple and efficient greedy allocation algorithm produces good solutions. We derive data-driven performance guarantees which yield small optimality gap. Given its efficiency, we can repeatedly employ this approach in real-time for dynamic repositioning. We conduct simulation experiments based on real usage data of an EMS system from a large Asian city, and demonstrate significant improvement in the system’s service levels using static allocations and redeployment policies discovered by our approach.

ambulance, health & medicine, optimization problem, (19 more...)

Twenty-Sixth AAAI Conference on Artificial Intelligence

Country:

Asia (0.48)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)

Industry:

Health & Medicine (0.66)
Transportation (0.66)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)

Optimal Proportional Cake Cutting with Connected Pieces

Bei, Xiaohui (Tsinghua University) | Chen, Ning (Nanyang Technological University) | Hua, Xia (Nanyang Technological University) | Tao, Biaoshuai (Nanyang Technological University) | Yang, Endong (Nanyang Technological University)

We consider the classic cake cutting problem where one allocates a divisible cake to n participating agents. Among all valid divisions, fairness and efficiency (a.k.a. ~social welfare) are the most critical criteria to satisfy and optimize, respectively. We study computational complexity of computing an efficiency optimal division given the conditions that the allocation satisfies proportional fairness and assigns each agent a connected piece. For linear valuation functions, we give a polynomial time approximation scheme to compute an efficiency optimal allocation. On the other hand, we show that the problem is NP-hard to approximate within a factor of Ω 1/√ n for general piecewise constant functions, and is NP-hard to compute for normalized functions.

agent, allocation, artificial intelligence, (17 more...)

Twenty-Sixth AAAI Conference on Artificial Intelligence

Country:

Asia > Singapore (0.14)
Asia > China (0.14)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.50)

Zhu, Yin (Hong Kong University of Science and Technology (HKUST)) | Wang, Xiao (Renren Inc.) | Zhong, Erheng (Hong Kong University of Science and Technology (HKUST)) | Liu, Nathan N. (Hong Kong University of Science and Technology (HKUST)) | Li, He (Renren Inc.) | Yang, Qiang (Hong Kong University of Science and Technology (HKUST))

Discovering Spammers in Social Networks

As the popularity of the social media increases, as evidenced in Twitter, Facebook and China's Renren, spamming activities also picked up in numbers and variety. On social network sites, spammers often disguise themselves by creating fake accounts and hijacking normal users' accounts for personal gains. Different from the spammers in traditional systems such as SMS and email, spammers in social media behave like normal users and they continue to change their spamming strategies to fool anti spamming systems. However, due to the privacy and resource concerns, many social media websites cannot fully monitor all the contents of users, making many of the previous approaches, such as topology-based and content-classification-based methods, infeasible to use. In this paper, we propose a novel method for spammer detection in social networks that exploits both social activities as well as users' social relations in an innovative and highly scalable manner. The proposed method detects spammers following collective activities based on users' social actions and relations. We have empirically tested our method on data from Renren.com, which is the largest social network in China, and demonstrated that our new method can improve the detection performance significantly.

optimization problem, spammer, survey article, (19 more...)

Twenty-Sixth AAAI Conference on Artificial Intelligence

Country: Asia > China (0.55)

Genre: Research Report (0.34)

Industry: Information Technology > Services (1.00)

Technology:

Information Technology > Security & Privacy > Spam Filtering (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Levy, Omer (Technion - Israel Institute of Technology) | Markovitch, Shaul (Technion - Israel Institute of Technology)

Teaching Machines to Learn by Metaphors

Humans have an uncanny ability to learn new concepts with very few examples. Cognitive theories have suggested that this is done by utilizing prior experience of related tasks. We propose to emulate this process in machines, by transforming new problems into old ones. These transformations are called metaphors. Obviously, the learner is not given a metaphor, but must acquire one through a learning process. We show that learning metaphors yield better results than existing transfer learning methods. Moreover, we argue that metaphors give a qualitative assessment of task relatedness.

artificial intelligence, machine learning, metaphor, (17 more...)

Twenty-Sixth AAAI Conference on Artificial Intelligence

Country:

Asia > Middle East > Israel (0.14)
North America > United States (0.14)
Europe > Czechia (0.14)

Genre: Research Report (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.52)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Document Summarization Based on Data Reconstruction

Document summarization is of great value to many real world applications, such as snippets generation for search results and news headlines generation. Traditionally, document summarization is implemented by extracting sentences that cover the main topics of a document with a minimum redundancy. In this paper, we take a different perspective from data reconstruction and propose a novel framework named Document Summarization based on Data Reconstruction (DSDR). Specifically, our approach generates a summary which consist of those sentences that can best reconstruct the original document. To model the relationship among sentences, we introduce two objective functions: (1) linear reconstruction, which approximates the document by linear combinations of the selected sentences; (2) nonnegative linear reconstruction, which allows only additive, not subtractive, linear combinations. In this framework, the reconstruction error becomes a natural criterion for measuring the quality of the summary. For each objective function, we develop an efficient algorithm to solve the corresponding optimization problem. Extensive experiments on summarization benchmark data sets DUC 2006 and DUC 2007 demonstrate the effectiveness of our proposed approach.

information management, optimization problem, summarization, (18 more...)

Twenty-Sixth AAAI Conference on Artificial Intelligence

Country: Asia > China (0.15)

Genre: Research Report (0.94)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)

Emoticon Smoothed Language Models for Twitter Sentiment Analysis

Liu, Kun-Lin (Shanghai Jiao Tong University) | Li, Wu-Jun (Shanghai Jiao Tong University) | Guo, Minyi (Shanghai Jiao Tong University)

Twitter sentiment analysis (TSA) has become a hot research topic in recent years. The goal of this task is to discover the attitude or opinion of the tweets, which is typically formulated as a machine learning based text classification problem. Some methods use manually labeled data to train fully supervised models, while others use some noisy labels, such as emoticons and hashtags, for model training. In general, we can only get a limited number of training data for the fully supervised models because it is very labor-intensive and time-consuming to manually label the tweets. As for the models with noisy labels, it is hard for them to achieve satisfactory performance due to the noise in the labels although it is easy to get a large amount of data for training. Hence, the best strategy is to utilize both manually labeled data and noisy labeled data for training. However, how to seamlessly integrate these two different kinds of data into the same learning framework is still a challenge. In this paper, we present a novel model, called emoticon smoothed language model (ESLAM), to handle this challenge. The basic idea is to train a language model based on the manually labeled data, and then use the noisy emoticon data for smoothing. Experiments on real data sets demonstrate that ESLAM can effectively integrate both kinds of data to outperform those methods using only one of them.

artificial intelligence, social media, tweet, (20 more...)

Twenty-Sixth AAAI Conference on Artificial Intelligence

Country: Asia > China (0.14)

Genre: Research Report (0.48)

Industry: Information Technology > Services (0.65)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.86)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.86)

Opinion Target Extraction Using a Shallow Semantic Parsing Framework

Li, Shoushan (Soochow University) | Wang, Rongyang (Soochow University) | Zhou, Guodong (Soochow University)

In this paper, we present a simplified shallow semantic parsing approach to extracting opinion targets. This is done by formulating opinion target extraction (OTE) as a shallow semantic parsing problem with the opinion expression as the predicate and the corresponding targets as its arguments. In principle, our parsing approach to OTE differs from the state-of-the-art sequence labeling one in two aspects. First, we model OTE from parse tree level, where abundant structured syntactic information is available for use, instead of word sequence level, where only lexical information is available. Second, we focus on determining whether a constituent, rather than a word, is an opinion target or not, via a simplified shallow semantic parsing framework. Evaluation on two datasets shows that structured syntactic information plays a critical role in capturing the domination relationship between an opinion expression and its targets. It also shows that our parsing approach much outperforms the state-of-the-art sequence labeling one.

artificial intelligence, natural language, opinion target, (16 more...)

Twenty-Sixth AAAI Conference on Artificial Intelligence

Country: Asia (0.14)

Genre: Research Report (0.68)

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)

Ghasemi, Alireza (Ecole Polytechnique Federale de Lausanne (EPFL)) | Rabiee, Hamid R. (Sharif University of Technology) | Manzuri, Mohammad Taghi (Sharif University of Technology) | Rohban, Mohammad Hossein (Sharif University of Technology)

A Bayesian Approach to the Data Description Problem

In this paper, we address the problem of data description using a Bayesian framework. The goal of data description is to draw a boundary around objects of a certain class of interest to discriminate that class from the rest of the feature space. Data description is also known as one-class learning and has a wide range of applications. The proposed approach uses a Bayesian framework to precisely compute the class boundary and therefore can utilize domain information in form of prior knowledge in the framework. It can also operate in the kernel space and therefore recognize arbitrary boundary shapes. Moreover, the proposed method can utilize unlabeled data in order to improve accuracy of discrimination. We evaluate our method using various real-world datasets and compare it with other state of the art approaches of data description. Experiments show promising results and improved performance over other data description and one-class learning algorithms.

algorithm, artificial intelligence, bayesian inference, (18 more...)

Twenty-Sixth AAAI Conference on Artificial Intelligence

Country:

Europe > Switzerland (0.14)
Asia > Middle East > Iran (0.14)

Genre: Overview (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

FLP Semantics Without Circular Justifications for General Logic Programs

Shen, Yi-Dong (Chinese Academy of Sciences) | Wang, Kewen (Griffith University)

The FLP semantics presented by (Faber, Leone, and Pfeifer 2004) has been widely used to define answer sets, called FLP answer sets, for different types of logic programs such as logic programs with aggregates, description logic programs (dl-programs), Hex programs, and logic programs with first-order formulas (general logic programs). However, it was recently observed that the FLP semantics may produce unintuitive answer sets with circular justifications caused by self-supporting loops. In this paper, we address the circular justification problem for general logic programs by enhancing the FLP semantics with a level mapping formalism. In particular, we extend the Gelfond-Lifschitz three step definition of the standard answer set semantics from normal logic programs to general logic programs and define for general logic programs the first FLP semantics that is free of circular justifications. We call this FLP semantics the well-justified FLP semantics. This method naturally extends to general logic programs with additional constraints like aggregates, thus providing a unifying framework for defining the well-justified FLP semantics for various types of logic programs. When this method is applied to normal logic programs with aggregates, the well-justified FLP semantics agrees with the conditional satisfaction based semantics defined by (Son, Pontelli, and Tu 2007); and when applied to dl-programs, the semantics agrees with the strongly well-supported semantics defined by (Shen 2011).

artificial intelligence, logic program, logic programming, (16 more...)

Twenty-Sixth AAAI Conference on Artificial Intelligence

Country:

Oceania > Australia (0.14)
Asia > China (0.14)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)