AITopics

Li, Ping, Moore, Joshua, Konig, Christian

b-Bit Minwise Hashing for Large-Scale Linear SVM

arXiv.org Machine Learning May-22-2011

In this paper, we propose to (seamlessly) integrate b-bit minwise hashing with linear SVM to substantially improve the training (and testing) efficiency using much smaller memory, with essentially no loss of accuracy. Theoretically, we prove that the resemblance matrix, the minwise hashing matrix, and the b-bit minwise hashing matrix are all positive definite matrices (kernels). Interestingly, our proof for the positive definiteness of the b-bit minwise hashing kernel naturally suggests a simple strategy to integrate b-bit hashing with linear SVM. Our technique is particularly useful when the data cannot fit in memory, which is an increasingly critical issue in large-scale machine learning. Our preliminary experimental results on a publicly available webspam dataset (350K samples and 16 million dimensions) verified the effectiveness of our algorithm. For example, the training time was reduced to merely a few seconds. In addition, our technique can be easily extended to many other linear and nonlinear machine learning applications such as logistic regression.
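
As a rough illustration of the idea in this abstract, the sketch below computes b-bit minwise signatures for sets of feature indices and expands each signature into a sparse binary vector that a linear SVM could consume. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names (`make_hashes`, `bbit_signature`, `expand`, `sparse_dot`) are hypothetical, random permutations are approximated by linear hash functions modulo a prime, and the one-hot expansion into k blocks of 2^b coordinates is one reading of the "simple strategy" the abstract mentions.

```python
import random

# Assumption: feature indices are non-negative integers smaller than PRIME.
PRIME = 2_147_483_647  # the Mersenne prime 2**31 - 1


def make_hashes(k, seed=0):
    """Draw k linear hash functions x -> (a*x + c) mod PRIME.

    These stand in for k random permutations of the feature space.
    """
    rng = random.Random(seed)
    return [(rng.randrange(1, PRIME), rng.randrange(0, PRIME)) for _ in range(k)]


def bbit_signature(features, hashes, b):
    """Keep only the lowest b bits of each of the k minimum hash values.

    `features` must be a non-empty collection of feature indices.
    """
    mask = (1 << b) - 1
    return [min((a * x + c) % PRIME for x in features) & mask for a, c in hashes]


def expand(signature, b):
    """Map a length-k signature to the positions of the ones in a binary
    vector of length k * 2**b: block j has a single one at offset signature[j]."""
    width = 1 << b
    return [j * width + v for j, v in enumerate(signature)]


def sparse_dot(idx_a, idx_b):
    """Inner product of two binary vectors given by the indices of their ones."""
    return len(set(idx_a) & set(idx_b))


if __name__ == "__main__":
    hashes = make_hashes(k=200, seed=1)
    s1, s2 = set(range(0, 100)), set(range(50, 150))
    v1 = expand(bbit_signature(s1, hashes, b=2), b=2)
    v2 = expand(bbit_signature(s2, hashes, b=2), b=2)
    # Fraction of the k blocks in which the two b-bit values agree.
    print(sparse_dot(v1, v2) / len(hashes))
```

The inner product of two expanded vectors counts the signature positions whose lowest b bits agree, so each example is stored as k small integers while the learner still sees an ordinary sparse linear feature vector.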

artificial intelligence, machine learning, webspam, (15 more...)

1105.4385

Country:

Europe (1.00)
North America > Canada (0.94)
North America > United States > California > Santa Clara County (0.46)
North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Research Report (1.00)

Industry: Information Technology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)

Gast, Nicolas, Gaujal, Bruno, Boudec, Jean-Yves Le

Mean field for Markov Decision Processes: from Discrete to Continuous Optimization

arXiv.org Artificial Intelligence May-19-2011

We study the convergence of Markov Decision Processes made of a large number of objects to optimization problems on ordinary differential equations (ODEs). We show that the optimal reward of such a Markov Decision Process, satisfying a Bellman equation, converges to the solution of a continuous Hamilton-Jacobi-Bellman (HJB) equation based on the mean field approximation of the Markov Decision Process. We give bounds on the difference of the rewards, and a constructive algorithm for deriving an approximating solution to the Markov Decision Process from a solution of the HJB equations. Three examples, pertaining respectively to investment strategies, population dynamics control and scheduling in queues, are developed. They are used to illustrate and justify the construction of the controlled ODE and to show the gain obtained by solving a continuous HJB equation rather than a large discrete Bellman equation.

action function, artificial intelligence, optimization problem, (15 more...)

1004.2342

Country: Europe > France (0.28)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Quijano-Sánchez, Lara (Universidad Complutense de Madrid) | Recio-Garcia, Juan A. (Universidad Complutense de Madrid) | Díaz-Agudo, Belén (Universidad Complutense de Madrid) | Jimenez-Diaz, Guillermo (Universidad Complutense de Madrid)

Happy Movie: A Group Recommender Application in Facebook

In this paper we introduce our recommender Happy Movie, a Facebook application for movie recommendation to groups. This system exploits information about the social relationships and behaviour of the users to provide better recommendations. Our previous work has shown that social factors improve the recommendation results. However, obtaining the social information required users to fill in many questionnaires, so we have moved to a social network environment where this information is readily available.

artificial intelligence, recommendation, social media, (13 more...)

Country: Europe > Spain (0.30)

Industry: Information Technology > Services (0.36)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.87)

Building Integrated Opinion Delivery Environment

Galitsky, Boris (University of Girona) | Rose, Josep Lluis de la (Universitat de Girona) | Dobrocsi, Gabor (University of Miskolc)

We introduce a search engine and information retrieval system for providing access to opinion data. Generalization of syntactic parse trees, a natural language technology, is introduced as a similarity measure between the subjects of textual opinions so that they can be linked on the fly. An information extraction algorithm for automatic summarization of web pages in the format of Google sponsored links is presented. We outline the usability of the implemented system, the integrated opinion delivery environment (IODE).

artificial intelligence, banking & finance, generalization, (17 more...)

Country:

Europe > Spain (0.14)
Europe > Hungary (0.14)
North America > United States (0.14)
Europe > Switzerland (0.14)

Industry:

Banking & Finance (0.69)
Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.90)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.55)

Paradesi, Sharon Myrtle (Massachusetts Institute of Technology)

Geotagging Tweets Using Their Content

Harnessing rich but unstructured information on social networks in real time and showing it to a relevant audience based on its geographic location is a major challenge. The system developed, TwitterTagger, geotags tweets and shows them to users based on their current physical location. Experimental validation shows a performance improvement of three orders by TwitterTagger compared to that of the baseline model.

artificial intelligence, social media, tweet, (18 more...)

Country: North America > United States > Massachusetts (0.16)

Industry: Information Technology > Services (0.50)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.97)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.30)

Supporting End-User Authoring of Alternate Reality Games with Cross-Location Compatibility

Hajarnis, Sanjeet (Georgia Institute of Technology) | Barve, Chinmay (Georgia Institute of Technology) | Karnik, Devika (Georgia Institute of Technology) | Riedl, Mark (Georgia Institute of Technology)

A typical ARG consists of a Puppet Master who designs the game and informs players of the unfolding of the story. With the advent of smart-phones with GPS, ARGs progressively make use of the actual world as the [...]

[...] issues that have historically prevented ARGs from mainstream adoption. A generic game engine running on a geo-location enabled mobile device enables users to play any game modeled as a dependency graph of game content.

artificial intelligence, computer game, storyline, (15 more...)

Industry:

Information Technology (0.70)
Leisure & Entertainment > Games > Computer Games (0.36)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.76)
Information Technology > Communications > Mobile (0.56)

Lunga, Dalton, Kirshner, Sergey

Generating Similar Graphs From Spherical Features

arXiv.org Machine Learning May-18-2011

We propose a novel model for generating graphs similar to a given example graph. Unlike standard approaches that compute features of graphs in Euclidean space, our approach obtains features on a surface of a hypersphere. We then utilize a von Mises-Fisher distribution, an exponential family distribution on the surface of a hypersphere, to define a model over possible feature values. While our approach bears similarity to a popular exponential random graph model (ERGM), unlike ERGMs, it does not suffer from degeneracy, a situation when a significant probability mass is placed on unrealistic graphs. We propose a parameter estimation approach for our model, and a procedure for drawing samples from the distribution. We evaluate the performance of our approach both on the small domain of all 8-node graphs as well as larger real-world social networks.

artificial intelligence, graph, machine learning, (17 more...)

1105.2965

Country: North America > United States > Indiana > Tippecanoe County (0.14)

Genre:

Research Report > New Finding (0.46)
Research Report > Promising Solution (0.34)

Industry: Information Technology (0.35)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.91)

Acar, Evrim, Kolda, Tamara G., Dunlavy, Daniel M.

All-at-once Optimization for Coupled Matrix and Tensor Factorizations

arXiv.org Machine Learning May-17-2011

Joint analysis of data from multiple sources has the potential to improve our understanding of the underlying structures in complex data sets. For instance, in restaurant recommendation systems, recommendations can be based on rating histories of customers. In addition to rating histories, customers' social networks (e.g., Facebook friendships) and restaurant categories information (e.g., Thai or Italian) can also be used to make better recommendations. The task of fusing data, however, is challenging since data sets can be incomplete and heterogeneous, i.e., data consist of both matrices, e.g., the person by person social network matrix or the restaurant by category matrix, and higher-order tensors, e.g., the "ratings" tensor of the form restaurant by meal by person. In this paper, we are particularly interested in fusing data sets with the goal of capturing their underlying latent structures. We formulate this problem as a coupled matrix and tensor factorization (CMTF) problem where heterogeneous data sets are modeled by fitting outer-product models to higher-order tensors and matrices in a coupled manner. Unlike traditional approaches solving this problem using alternating algorithms, we propose an all-at-once optimization approach called CMTF-OPT (CMTF-OPTimization), which is a gradient-based optimization approach for joint analysis of matrices and higher-order tensors. We also extend the algorithm to handle coupled incomplete data sets. Using numerical experiments, we demonstrate that the proposed all-at-once approach is more accurate than the alternating least squares approach.
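
To make the coupled formulation in this abstract concrete, one common way to write such an objective is given below. This is a sketch under assumptions the abstract does not state: a single third-order tensor and a single matrix that share their first mode, squared-error loss with equal weights, and the symbol names and the number of components R, all chosen here for illustration.

```latex
% Assumed setting: a tensor X of size I x J x K and a matrix Y of size I x M
% share their first mode. A (I x R), B (J x R), C (K x R) and V (M x R) are the
% factor matrices; a_r, b_r, c_r denote the r-th columns of A, B and C.
\[
  f(\mathbf{A},\mathbf{B},\mathbf{C},\mathbf{V})
  = \frac{1}{2}\,\bigl\| \mathcal{X} - [\![\mathbf{A},\mathbf{B},\mathbf{C}]\!] \bigr\|^{2}
  + \frac{1}{2}\,\bigl\| \mathbf{Y} - \mathbf{A}\mathbf{V}^{\mathsf{T}} \bigr\|^{2},
  \qquad
  [\![\mathbf{A},\mathbf{B},\mathbf{C}]\!]
  = \sum_{r=1}^{R} \mathbf{a}_r \circ \mathbf{b}_r \circ \mathbf{c}_r .
\]
```

Under this reading, the shared factor matrix A is what couples the two data sets. An all-at-once method would minimize f over A, B, C and V simultaneously using the gradient with respect to all of them, whereas an alternating scheme updates one factor matrix at a time while holding the others fixed.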

health & medicine, matrix, optimization problem, (21 more...)

1105.3422

Country:

North America > United States (1.00)
Europe > United Kingdom > England (0.14)

Genre: Research Report > Experimental Study (0.46)

Industry:

Health & Medicine > Diagnostic Medicine (0.93)
Information Technology (0.68)
Energy (0.68)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.86)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Choi, David S., Wolfe, Patrick J., Airoldi, Edoardo M.

Stochastic blockmodels with growing number of classes

arXiv.org Machine Learning Apr-30-2011

We present asymptotic and finite-sample results on the use of stochastic blockmodels for the analysis of network data. We show that the fraction of misclassified network nodes converges in probability to zero under maximum likelihood fitting when the number of classes is allowed to grow as the root of the network size and the average network degree grows at least poly-logarithmically in this size. We also establish finite-sample confidence bounds on maximum-likelihood blockmodel parameter estimates from data comprising independent Bernoulli random variates; these results hold uniformly over class assignment. We provide simulations verifying the conditions sufficient for our results, and conclude by fitting a logit parameterization of a stochastic blockmodel with covariates to a network data example comprising a collection of Facebook profiles, resulting in block estimates that reveal residual structure.
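
For readers unfamiliar with the model, the sketch below samples an undirected network from a stochastic blockmodel with independent Bernoulli edges and computes block edge densities for a given class assignment. It is a minimal sketch, not the paper's procedure: the function names `sample_sbm` and `block_densities` are hypothetical, the class assignment is taken as known rather than fitted by maximum likelihood, and no covariates or logit parameterization are included.

```python
import random


def sample_sbm(classes, theta, seed=0):
    """Sample a symmetric 0/1 adjacency matrix with an empty diagonal.

    classes[i] is the class label of node i, and theta[a][b] is the edge
    probability between a node of class a and a node of class b.
    """
    rng = random.Random(seed)
    n = len(classes)
    adj = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            # Each node pair is an independent Bernoulli draw.
            if rng.random() < theta[classes[i]][classes[j]]:
                adj[i][j] = adj[j][i] = 1
    return adj


def block_densities(adj, classes, k):
    """Observed edge fraction for every pair of classes (None if no node pairs).

    For a fixed class assignment, this fraction is the maximum-likelihood
    estimate of the corresponding Bernoulli edge probability.
    """
    edges = [[0] * k for _ in range(k)]
    pairs = [[0] * k for _ in range(k)]
    n = len(classes)
    for i in range(n):
        for j in range(i + 1, n):
            a, b = classes[i], classes[j]
            # Update both (a, b) and (b, a); a set avoids double counting when a == b.
            for r, c in {(a, b), (b, a)}:
                pairs[r][c] += 1
                edges[r][c] += adj[i][j]
    return [[edges[r][c] / pairs[r][c] if pairs[r][c] else None
             for c in range(k)] for r in range(k)]


if __name__ == "__main__":
    classes = [0] * 30 + [1] * 30
    theta = [[0.6, 0.1], [0.1, 0.5]]
    adj = sample_sbm(classes, theta, seed=42)
    print(block_densities(adj, classes, k=2))
```

With the assignment fixed, the printed densities should lie close to the entries of `theta` for blocks of this size; the harder problem the abstract addresses is recovering the assignment itself as the number of classes grows.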

bayesian inference, blockmodel, social media, (19 more...)

doi: 10.1093/biomet/asr053

1011.4644

Country: North America > United States > Rhode Island (0.14)

Genre: Research Report > New Finding (0.34)

Industry: Information Technology > Services (0.35)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.69)