AITopics

In this paper, we outline a system for evaluating the performance of scientific research across a number of outcome metrics (e.g. publications, sales, new hires). Our system is designed to classify research performance into a number of metrics, evaluate each metric’s performance using only data on other metrics, and to cast predictions of future performance by metric. This study shows how data mining techniques can be used to provide a predictive analytic approach to the management of resources for scientific research.

metric, performance class, research project, (15 more...)

Country:

North America > United States > Washington > Benton County > Richland (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre: Research Report (0.47)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.47)

Quantitative Comparison of Linear and Non-linear Dimensionality Reduction Techniques for Solar Image Archives

Banda, Juan M. (Montana State University) | Angryk, Rafal A. (Montana State University) | Martens, Petrus C. (Montana State University)

This work investigates the applicability of several dimensionality reduction techniques for large scale solar data analysis. Using the first solar domain-specific benchmark dataset that contains images of multiple types of phenomena, we investigate linear and non-linear dimensionality reduction methods in order to reduce our storage costs and maintain an accurate representation of our data in a new vector space. We present a comparative analysis between several dimensionality reduction methods and different numbers of target dimensions by utilizing different classifiers in order to determine the percentage of dimensionality reduction that can be achieved on solar data with said methods, and to discover the method that is the most effective for solar images.

dataset, dimensionality reduction method, reduction method, (13 more...)

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Montana > Gallatin County > Bozeman (0.04)
North America > United States > District of Columbia > Washington (0.04)
(3 more...)

Genre: Overview (0.47)

Industry:

Energy > Renewable > Solar (0.34)
Energy > Power Industry (0.34)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Dimensionality Reduction (1.00)

Komisin, Michael C. (University of North Carolina Wilmington) | Guinn, Curry I. (University of North Carolina Wilmington)

Identifying Personality Types Using Document Classification Methods

Are the words that people use indicative of their personality type preferences? In this paper, it is hypothesized that word-usage is not independent of personality type, as measured by the Myers-Briggs Type Indicator (MBTI) personality assessment tool. In-class writing samples were taken from 40 graduate students along with the MBTI. The experiment utilizes naïve Bayes classifiers and Support Vector Machines (SVMs) in an attempt to guess an individual’s personality type based on their word-choice. Classification is also attempted using emotional, social, cognitive, and psychological dimensions elicited by the analysis software, Linguistic Inquiry and Word Count (LIWC). The classifiers are evaluated with 40 distinct trials (leave-one-out cross validation), and parameters are chosen using leave-one-out cross validation of each trial’s training set. The experiment showed that the naïve Bayes classifiers (word-based and LIWC-based) outperformed the SVMs when guessing Sensing-Intuition (S-N) and Thinking-Feeling (T-F).

clarity score, classifier, dichotomy, (14 more...)

Country:

North America > United States > North Carolina > New Hanover County > Wilmington (0.04)
North America > United States > New York (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Education > Educational Setting > Higher Education (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Proper Noun Semantic Clustering Using Bag-of-Vectors

Ebadat, Ali Reza (INRIA-INSA) | Claveau, Vincent (IRISA-CNRS) | Sébillot, Pascale (IRISA-INS)

In this paper, we propose a model for semantic clustering of entities extracted from a text, and we apply it to a Proper Noun classification task.This model is based on a new method to compute the similarity between the entities.Indeed, the classical way of calculating similarity is to build a feature vector or Bag-of-Features for each entity and then use classical similarity functions like Cosine.In practice, the features are contextual, such as words around the different occurrences of each entity. Here, we propose to use an alternative representation for entities, called Bag-of-Vectors, or Bag-of-Bags-of-Features.In this new model, each entity is not defined as a unique vector but as a set of vectors, in which each vector is built based on the contextual features of one occurrence of the entity.In order to use Bag-of-Vectors for clustering, we introduce new versions of classical similarity functions such as Cosine and Scalar Products. Experimentally, we show that the Bag-of-Vectors representation always improve the clustering results compared to classical Bag-of-Features representations.

computational linguistic, representation, similarity function, (12 more...)

Country:

North America > United States (0.15)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > France (0.04)
Europe > Czechia > Prague (0.04)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.99)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.69)

Syntagmatic, Paradigmatic, and Automatic N-Gram Approaches to Assessing Essay Quality

Crossley, Scott (Georgia State University) | Cai, Zhiqiang (University of Memphis) | McNamara, Danielle S. (Arizona State University)

Computational indices related to n-gram production were developed in order to assess the potential for n-gram indices to predict human scores of essay quality. A regression analyses was conducted on a corpus of 313 argumentative essays. The analyses demonstrated that a variety of n-gram indices were highly correlated to essay quality, but were also highly correlated to the number of words in the text (although many of the n-gram indices were stronger predictors of writing quality than the number of words in a text). A second regression analysis was conducted on a corpus of 88 argumentative essays that were controlled for text length differences. This analysis demonstrated that n-gram indices were still strong predictors of essay quality when text length was not a factor.

bigram, correlation, frequency, (15 more...)

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
North America > United States > New Jersey > Bergen County > Mahwah (0.04)
North America > United States > Mississippi (0.04)
(3 more...)

Genre: Research Report > New Finding (0.87)

Industry:

Education > Educational Setting (1.00)
Education > Assessment & Standards (0.69)
Education > Educational Technology > Educational Software > Computer-Aided Assessment (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.69)

Calix, Ricardo A. (Purdue University Calumet)

Emotion Expression 3-D Synthesis From Predicted Emotion Magnitudes

Many studies have been conducted on how to detect emotion classes or magnitudes from multimedia information such as text, audio, and images. However, the methods that can use predicted emotion classes and magnitudes to render emotion expressions in Embodied Conversational Agents (ECA) are still unclear. This paper proposes a computer graphics methodology that uses predicted non-linear regression values to render facial expressions using mesh morphing techniques. Results of the rendering technique are presented and discussed.

expression, magnitude, mesh, (16 more...)

Country:

North America > United States > New York > New York County > New York City (0.05)
North America > United States > California > San Francisco County > San Francisco (0.05)
North America > United States > New Jersey (0.04)
(7 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Schuh, Michael A. (Montana State University) | Angryk, Rafal (Montana State University) | Sheppard, John (Montana State University and The Johns Hopkins University)

Evolving Kernel Functions with Particle Swarms and Genetic Programming

The Support Vector Machine has gained significant popularity over recent years as a kernel-based supervised learning technique. However, choosing the appropriate kernel function and its associated parameters is not a trivial task. The kernel is often chosen from several widely-used and general-purpose functions, and the parameters are then empirically tuned for the best results on a specific data set. This paper explores the use of Particle Swarm Optimization and Genetic Programming as evolutionary approaches to evolve effective kernel functions for a given dataset. Rather than using expert knowledge, we evolve kernel functions without human-guided knowledge or intuition. Our results show consistently better SVM performance with evolved kernels over a variety of traditional kernels on several datasets.

dataset, kernel, kernel function, (16 more...)

Country:

North America > United States > New York > New York County > New York City (0.05)
South America > Paraguay > Asunción > Asunción (0.04)
Oceania > New Zealand > North Island > Waikato (0.04)
(7 more...)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.71)

arXiv.org Artificial IntelligenceMay-19-2012

Efficient Methods for Unsupervised Learning of Probabilistic Models

Sohl-Dickstein, Jascha

Interpreting neural spike trains, compressing video, identifying features in DNA microarrays, and recognizing particles in high energy physics all rely upon the ability to find and model complex structure in a high dimensional space. Despite their great promise, high dimensional probabilistic models are frequently computationally intractable to work with in practice. In this thesis I develop solutions to overcome this intractability, primarily in the context of energy based models. A common cause of intractability is that model distributions cannot be analytically normalized. Probabilities can only be computed up to a constant, making training exceedingly difficult. To solve this problem I propose'minimum probability flow learning', a variational technique for parameter estimation in such models.

artificial intelligence, machine learning, objective function, (14 more...)

arXiv.org Artificial Intelligence

1205.4295

Country:

North America > United States (0.92)
Asia (0.92)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.67)
Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
(2 more...)

Challenges and Opportunities in Applied Machine Learning

Brodley, Carla E. (Tufts University) | Rebbapragada, Umaa (Jet Propulsion Laboratory) | Small, Kevin (Tufts Medical Center) | Wallace, Byron (Tufts University)

AI MagazineMay-14-2012

Machine learning research is often conducted in vitro, divorced from motivating practical applications. A researcher might develop a new method for the general task of classification, then assess its utility by comparing its performance (such as accuracy or AUC) to that of existing classification models on publicly available datasets. In terms of advancing machine learning as an academic discipline, this approach has thus far proven quite fruitful. However, it is our view that the most interesting open problems in machine learning are those that arise during its application to real-world problems. We illustrate this point by reviewing two of our interdisciplinary collaborations, both of which have posed unique machine learning problems, providing fertile ground for novel research.

artificial intelligence, brodley, machine learning, (17 more...)

AI Magazine

Country: North America > United States > California (0.93)

Genre: Research Report > New Finding (0.93)

Industry:

Health & Medicine > Therapeutic Area (0.93)
Education (0.87)
Information Technology > Security & Privacy (0.68)
Health & Medicine > Diagnostic Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.93)

Durut, Matthieu, Patra, Benoît, Rossi, Fabrice

A Discussion on Parallelization Schemes for Stochastic Vector Quantization Algorithms

arXiv.org Machine LearningMay-10-2012

This paper studies parallelization schemes for stochastic Vector Quantization algorithms in order to obtain time speed-ups using distributed resources. We show that the most intuitive parallelization scheme does not lead to better performances than the sequential algorithm. Another distributed scheme is therefore introduced which obtains the expected speed-ups. Then, it is improved to fit implementation on distributed architectures where communications are slow and inter-machines synchronization too costly. The schemes are tested with simulated distributed architectures and, for the last one, with Microsoft Windows Azure platform obtaining speed-ups up to 32 Virtual Machines.

algorithm, artificial intelligence, machine learning, (15 more...)

arXiv.org Machine Learning

1205.2282

Country: Europe > France (0.17)

Genre: Research Report (0.84)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.51)