AITopics | Media

Collaborating Authors

Media

Wikipedia-Based Distributional Semantics for Entity Relatedness

Aggarwal, Nitish (National University of Ireland, Galway) | Buitelaar, Paul (National University of Ireland, Galway)

AAAI ConferencesNov-1-2014

Wikipedia provides an enormous amount of background knowledge to reason about the semantic relatedness between two entities. We propose Wikipedia-based Distributional Semantics for Entity Relatedness (DiSER), which represents the semantics of an entity by its distribution in the high dimensional concept space derived from Wikipedia. DiSER measures the semantic relatedness between two entities by quantifying the distance between the corresponding high-dimensional vectors. DiSER builds the model by taking the annotated entities only, therefore it improves over existing approaches, which do not distinguish between an entity and its surface form. We evaluate the approach on a benchmark that contains the relative entity relatedness scores for 420 entity pairs. Our approach improves the accuracy by 12% on state of the art methods for computing entity relatedness. We also show an evaluation of DiSER in the Entity Disambiguation task on a dataset of 50 sentences with highly ambiguous entity mentions. It shows an improvement of 10% in precision over the best performing methods. In order to provide the resource that can be used to find out all the related entities for a given entity, a graph is constructed, where the nodes represent Wikipedia entities and the relatedness scores are reflected by the edges. Wikipedia contains more than 4.1 millions entities, which required efficient computation of the relatedness scores between the corresponding 17 trillions of entity-pairs.

AAAI Conferences

2014 AAAI Fall Symposium Series

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(6 more...)

Genre: Research Report > Promising Solution (0.34)

Industry:

Leisure & Entertainment (0.94)
Information Technology (0.94)
Media (0.69)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

Add feedback

Mining Large-Scale Knowledge Graphs to Discover Inference Paths for Query Expansion in NLIDB

Yeh, Peter Z. (Nuance Communications) | Ratnaparkhi, Adwait (Nuance Communications)

AAAI ConferencesNov-1-2014

In this paper, we present an approach to mine large-scale knowledge graphs to discover inference paths for query expansion in NLIDB (Natural Language Interface to Databases). Addressing this problem is important in order for NLIDB applications to effectively handle relevant concepts in the domain of interest that do not correspond to any structured fields in the target database. We also present preliminary observations on the performance of our approach applied to Freebase, and conclude with discussions on next steps to further evaluate and extend our approach.

artificial intelligence, information retrieval query processing, natural language, (18 more...)

AAAI Conferences

2014 AAAI Fall Symposium Series

Country: North America > United States > California > Santa Clara County > Sunnyvale (0.04)

Industry:

Leisure & Entertainment (0.69)
Media > Film (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.64)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval > Query Processing (0.64)

Add feedback

Post It or Not: Viewership Based Posting of Crowdsourced Tasks

Manohar, Pallavi (Xerox Research Centre India) | Chander, Deepthi (Xerox Research Centre India) | Celis, Elisa (Ecole Polytechnique Fédérale de Lausanne (EPFL)) | Dasgupta, Koustuv (Xerox Research Centre India) | Bhattacharya, Sakyajit (Xerox Research Centre India)

AAAI ConferencesOct-31-2014

We propose an online scheduling algorithm for posting crowdsourcing tasks which maximizes a novel metric called task viewership. This metric is computed using stochastic model based on coverage process and it measures the likelihood that a task is viewed by multiple crowd workers, which is correlated to the likelihood that it will be selected and completed.

artificial intelligence, machine learning, platform, (19 more...)

AAAI Conferences

Second AAAI Conference on Human Computation and Crowdsourcing

Country:

Europe > Switzerland > Vaud > Lausanne (0.05)
Asia > India (0.05)

Industry:

Media > Television (0.73)
Leisure & Entertainment (0.73)

Technology:

Information Technology > Communications > Social Media > Crowdsourcing (0.91)
Information Technology > Artificial Intelligence > Machine Learning (0.71)

Add feedback

Adapting Collaborative Filtering to Personalized Audio Production

Kim, Bongjun (Northwestern University) | Pardo, Bryan (Northwestern University)

AAAI ConferencesOct-31-2014

Recommending media objects to users typically requires users to rate existing media objects so as to understand their preferences. The number of ratings required to produce good suggestions can be reduced through collaborative filtering. Collaborative filtering is more difficult when prior users have not rated the same set of media objects as the current user or each other. In this work, we describe an approach to applying prior user data in a way that does not require users to rate the same media objects and that does not require imputation (estimation) of prior user ratings of objects they have not rated. This approach is applied to the problem of finding good equalizer settings for music audio and is shown to greatly reduce the number of ratings the current user must make to find a good equalization setting.

artificial intelligence, machine learning, social media, (17 more...)

AAAI Conferences

Second AAAI Conference on Human Computation and Crowdsourcing

Country:

South America > Brazil > Paraná > Curitiba (0.05)
North America > United States > New York > New York County > New York City (0.05)
North America > United States > Illinois > Cook County > Evanston (0.05)

Industry:

Information Technology (0.69)
Media (0.49)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.99)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.93)

Add feedback

A Crowd of Your Own: Crowdsourcing for On-Demand Personalization

Organisciak, Peter (University of Illinois at Urbana-Champaign) | Teevan, Jaime (Microsoft Research) | Dumais, Susan (Microsoft Research) | Miller, Robert C. (MIT CSAIL) | Kalai, Adam Tauman (Microsoft Research)

AAAI ConferencesOct-31-2014

Personalization is a way for computers to support people’s diverse interests and needs by providing content tailored to the individual. While strides have been made in algorithmic approaches to personalization, most require access to a significant amount of data. However, even when data is limited online crowds can be used to infer an individual’s personal preferences. Aided by the diversity of tastes among online crowds and their ability to understand others, we show that crowdsourcing is an effective on-demand tool for personalization. Unlike typical crowdsourcing approaches that seek a ground truth, we present and evaluate two crowdsourcing approaches designed to capture personal preferences. The first, taste-matching , identifies workers with similar taste to the requester and uses their taste to infer the requester’s taste. The second, taste-grokking , asks workers to explicitly predict the requester’s taste based on training examples. These techniques are evaluated on two subjective tasks, personalized image recommendation and tailored textual summaries. Taste-matching and taste-grokking both show improvement over the use of generic workers, and have different benefits and drawbacks depending on the complexity of the task and the variability of the taste space.

artificial intelligence, machine learning, social media, (18 more...)

AAAI Conferences

Second AAAI Conference on Human Computation and Crowdsourcing

Country:

Europe > Austria > Vienna (0.14)
North America > United States > New York > New York County > New York City (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)

Genre: Research Report > New Finding (0.68)

Industry:

Leisure & Entertainment (0.68)
Consumer Products & Services (0.68)
Media > Film (0.46)

Technology:

Information Technology > Communications > Social Media > Crowdsourcing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.95)

Add feedback

A Latent Source Model for Online Collaborative Filtering

Bresler, Guy, Chen, George H., Shah, Devavrat

arXiv.org Machine LearningOct-31-2014

Despite the prevalence of collaborative filtering in recommendation systems, there has been little theoretical development on why and how well it works, especially in the "online" setting, where items are recommended to users over time. We address this theoretical gap by introducing a model for online recommendation systems, cast item recommendation under the model as a learning problem, and analyze the performance of a cosine-similarity collaborative filtering method. In our model, each of $n$ users either likes or dislikes each of $m$ items. We assume there to be $k$ types of users, and all the users of a given type share a common string of probabilities determining the chance of liking each item. At each time step, we recommend an item to each user, where a key distinction from related bandit literature is that once a user consumes an item (e.g., watches a movie), then that item cannot be recommended to the same user again. The goal is to maximize the number of likable items recommended to users over time. Our main result establishes that after nearly $\log(km)$ initial learning time steps, a simple collaborative filtering algorithm achieves essentially optimal performance without knowing $k$. The algorithm has an exploitation step that uses cosine similarity and two types of exploration steps, one to explore the space of items (standard in the literature) and the other to explore similarity between users (novel to this work).

artificial intelligence, neighbor, probability, (16 more...)

arXiv.org Machine Learning

1411.6591

Country: North America > United States > Massachusetts (0.28)

Genre: Research Report (0.40)

Industry: Media > Film (0.67)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)

Add feedback

Latent Feature Based FM Model For Rating Prediction

Liu, Xudong, Zhang, Bin, Zhang, Ting, Liu, Chang

arXiv.org Machine LearningOct-29-2014

Rating Prediction is a basic problem in Recommender System, and one of the most widely used method is Factorization Machines(FM). However, traditional matrix factorization methods fail to utilize the benefit of implicit feedback, which has been proved to be important in Rating Prediction problem. In this work, we consider a specific situation, movie rating prediction, where we assume that a user's watching history has a big influence on his/her rating behavior on an item. We introduce two models, Latent Dirichlet Allocation(LDA) and word2vec, both of which perform state-of-the-art results in training latent features. Based on that, we propose two feature based models. One is the Topic-based FM Model which provides the implicit feedback to the matrix factorization, the other is the Vector-based FM Model which exploits the order info of a user's watching history resulting in better performance. Empirical results on three datasets demonstrate that our method performs better than the baseline model and confirm that Vector-based FM Model usually works better as it contains the order info.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Machine Learning

1410.8034

Country: Asia > China (0.29)

Genre: Research Report (0.50)

Industry: Media > Film (0.31)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.90)

Add feedback

Regularizing Recurrent Networks - On Injected Noise and Norm-based Methods

Ognawala, Saahil, Bayer, Justin

arXiv.org Machine LearningOct-21-2014

Advancements in parallel processing have lead to a surge in multilayer perceptrons' (MLP) applications and deep learning in the past decades. Recurrent Neural Networks (RNNs) give additional representational power to feedforward MLPs by providing a way to treat sequential data. However, RNNs are hard to train using conventional error backpropagation methods because of the difficulty in relating inputs over many time-steps. Regularization approaches from MLP sphere, like dropout and noisy weight training, have been insufficiently applied and tested on simple RNNs. Moreover, solutions have been proposed to improve convergence in RNNs but not enough to improve the long term dependency remembering capabilities thereof. In this study, we aim to empirically evaluate the remembering and generalization ability of RNNs on polyphonic musical datasets. The models are trained with injected noise, random dropout, norm-based regularizers and their respective performances compared to well-initialized plain RNNs and advanced regularization methods like fast-dropout. We conclude with evidence that training with noise does not improve performance as conjectured by a few works in RNN optimization before ours.

artificial intelligence, machine learning, noise, (17 more...)

arXiv.org Machine Learning

1410.5684

Genre: Research Report > New Finding (0.48)

Industry:

Media > Music (0.93)
Leisure & Entertainment (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Sparsity Based Poisson Denoising with Dictionary Learning

Giryes, Raja, Elad, Michael

arXiv.org Machine LearningOct-14-2014

The problem of Poisson denoising appears in various imaging applications, such as low-light photography, medical imaging and microscopy. In cases of high SNR, several transformations exist so as to convert the Poisson noise into an additive i.i.d. Gaussian noise, for which many effective algorithms are available. However, in a low SNR regime, these transformations are significantly less accurate, and a strategy that relies directly on the true noise statistics is required. A recent work by Salmon et al. took this route, proposing a patch-based exponential image representation model based on GMM (Gaussian mixture model), leading to state-of-the-art results. In this paper, we propose to harness sparse-representation modeling to the image patches, adopting the same exponential idea. Our scheme uses a greedy pursuit with boot-strapping based stopping condition and dictionary learning within the denoising process. The reconstruction performance of the proposed scheme is competitive with leading methods in high SNR, and achieving state-of-the-art results in cases of low SNR.

algorithm, artificial intelligence, machine learning, (18 more...)

arXiv.org Machine Learning

doi: 10.1109/TIP.2014.2362057

1309.4306

Genre: Research Report (0.64)

Industry:

Health & Medicine (0.54)
Media > Photography (0.54)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

Matrix Completion and Low-Rank SVD via Fast Alternating Least Squares

Hastie, Trevor, Mazumder, Rahul, Lee, Jason, Zadeh, Reza

arXiv.org Machine LearningOct-9-2014

The matrix-completion problem has attracted a lot of attention, largely as a result of the celebrated Netflix competition. Two popular approaches for solving the problem are nuclear-norm-regularized matrix approximation (Candes and Tao, 2009, Mazumder, Hastie and Tibshirani, 2010), and maximum-margin matrix factorization (Srebro, Rennie and Jaakkola, 2005). These two procedures are in some cases solving equivalent problems, but with quite different algorithms. In this article we bring the two approaches together, leading to an efficient algorithm for large matrix factorization and completion that outperforms both of these. We develop a software package "softImpute" in R for implementing our approaches, and a distributed version for very large matrices using the "Spark" cluster programming environment.

algorithm, artificial intelligence, machine learning, (16 more...)

arXiv.org Machine Learning

1410.2596

Country: North America > United States (0.28)

Genre:

Workflow (0.47)
Research Report (0.40)

Industry:

Media (0.55)
Leisure & Entertainment (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback