AITopics

1202.3706

Country:

North America > United States (1.00)
Europe (0.93)
North America > Canada > Ontario > Toronto (0.15)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

arXiv.org Machine LearningFeb-11-2012

Regularized Tensor Factorizations and Higher-Order Principal Components Analysis

Allen, Genevera I.

High-dimensional tensors or multi-way data are becoming prevalent in areas such as biomedical imaging, chemometrics, networking and bibliometrics. Traditional approaches to finding lower dimensional representations of tensor data include flattening the data and applying matrix factorizations such as principal components analysis (PCA) or employing tensor decompositions such as the CANDECOMP / PARAFAC (CP) and Tucker decompositions. The former can lose important structure in the data, while the latter Higher-Order PCA (HOPCA) methods can be problematic in high-dimensions with many irrelevant features. We introduce frameworks for sparse tensor factorizations or Sparse HOPCA based on heuristic algorithmic approaches and by solving penalized optimization problems related to the CP decomposition. Extensions of these approaches lead to methods for general regularized tensor factorizations, multi-way Functional HOPCA and generalizations of HOPCA for structured data. We illustrate the utility of our methods for dimension reduction, feature selection, and signal recovery on simulated data and multi-dimensional microarrays and functional MRIs.

artificial intelligence, decomposition, machine learning, (15 more...)

1202.2476

Country: North America > United States (0.28)

Genre: Research Report (0.64)

Industry:

Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (0.88)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

arXiv.org Artificial IntelligenceFeb-11-2012

Unfair items detection in educational measurement

Bakman, Yefim

Measurement professionals cannot come to an agreement on the definition of the term 'item fairness'. In this paper a continuous measure of item unfairness is proposed. The more the unfairness measure deviates from zero, the less fair the item is. If the measure exceeds the cutoff value, the item is identified as definitely unfair. The new approach can identify unfair items that would not be identified with conventional procedures. The results are in accord with experts' judgments on the item qualities. Since no assumptions about scores distributions and/or correlations are assumed, the method is applicable to any educational test. Its performance is illustrated through application to scores of a real test.

artificial intelligence, machine learning, unfair item, (17 more...)

1205.338

Country: North America > United States (0.28)

Genre:

Instructional Material (0.46)
Research Report (0.40)

Industry: Education (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)

Zhang, Cun-Hui, Zhang, Tong

A General Theory of Concave Regularization for High Dimensional Sparse Estimation Problems

arXiv.org Machine LearningFeb-10-2012

Concave regularization methods provide natural procedures for sparse recovery. However, they are difficult to analyze in the high dimensional setting. Only recently a few sparse recovery results have been established for some specific local solutions obtained via specialized numerical procedures. Still, the fundamental relationship between these solutions such as whether they are identical or their relationship to the global minimizer of the underlying nonconvex formulation is unknown. The current paper fills this conceptual gap by presenting a general theoretical framework showing that under appropriate conditions, the global solution of nonconvex regularization leads to desirable recovery performance; moreover, under suitable conditions, the global solution corresponds to the unique sparse local solution, which can be obtained via different numerical procedures. Under this unified framework, we present an overview of existing results and discuss their connections. The unified view of this work leads to a more satisfactory treatment of concave high dimensional sparse estimation procedures, and serves as guideline for developing further numerical procedures for concave regularization.

artificial intelligence, local solution, machine learning, (20 more...)

1108.4988

Country:

Europe (0.28)
North America > United States (0.28)

Genre:

Overview (0.68)
Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

Jayabrabu, R., Saravanan, V., Vivekanandan, K.

A framework: Cluster detection and multidimensional visualization of automated data mining using intelligent agents

arXiv.org Artificial IntelligenceFeb-9-2012

Data Mining techniques plays a vital role like extraction of required knowledge, finding unsuspected information to make strategic decision in a novel way which in term understandable by domain experts. A generalized frame work is proposed by considering non - domain experts during mining process for better understanding, making better decision and better finding new patters in case of selecting suitable data mining techniques based on the user profile by means of intelligent agents.

artificial intelligence, data mining, machine learning, (14 more...)

1202.1945

Country:

Europe (0.93)
North America > United States (0.47)

Genre: Research Report (0.50)

Industry:

Information Technology > Security & Privacy (1.00)
Materials > Metals & Mining (0.76)
Education (0.69)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.48)

Ugtakhbayar, N., Battulga, D., Sodbileg, Sh.

Classification of artificial intelligence ids for smurf attack

arXiv.org Artificial IntelligenceFeb-8-2012

Many methods have been developed to secure the network infrastructure and communication over the Internet. Intrusion detection is a relatively new addition to such techniques. Intrusion detection systems (IDS) are used to find out if someone has intrusion into or is trying to get it the network. One big problem is amount of Intrusion which is increasing day by day. We need to know about network attack information using IDS, then analysing the effect. Due to the nature of IDSs which are solely signature based, every new intrusion cannot be detected; so it is important to introduce artificial intelligence (AI) methods / techniques in IDS. Introduction of AI necessitates the importance of normalization in intrusions. This work is focused on classification of AI based IDS techniques which will help better design intrusion detection systems in the future. We have also proposed a support vector machine for IDS to detect Smurf attack with much reliable accuracy.

artificial intelligence, machine learning, packet, (14 more...)

1202.1886

Country: North America > United States (0.29)

Genre: Research Report (0.50)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.61)

Kiraly, Franz J., von Buenau, Paul, Meinecke, Frank C., Blythe, Duncan A. J., Mueller, Klaus-Robert

Algebraic Geometric Comparison of Probability Distributions

arXiv.org Machine LearningFeb-7-2012

We propose a novel algebraic algorithmic framework for dealing with probability distributions represented by their cumulants such as the mean and covariance matrix. As an example, we consider the unsupervised learning problem of finding the subspace on which several probability distributions agree. Instead of minimizing an objective function involving the estimated cumulants, we show that by treating the cumulants as elements of the polynomial ring we can directly solve the problem, at a lower computational cost and with higher accuracy. Moreover, the algebraic viewpoint on probability distributions allows us to invoke the theory of algebraic geometry, which we demonstrate in a compact proof for an identifiability criterion.

artificial intelligence, machine learning, polynomial, (16 more...)

1108.1483

Country:

North America > United States (1.00)
Europe (0.93)

Genre: Research Report (0.49)

Industry:

Health & Medicine (0.67)
Education (0.48)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.92)

Vu, Vincent Q., Lei, Jing

Minimax Rates of Estimation for Sparse PCA in High Dimensions

arXiv.org Machine LearningFeb-5-2012

We study sparse principal components analysis in the high-dimensional setting, where $p$ (the number of variables) can be much larger than $n$ (the number of observations). We prove optimal, non-asymptotic lower and upper bounds on the minimax estimation error for the leading eigenvector when it belongs to an $\ell_q$ ball for $q \in [0,1]$. Our bounds are sharp in $p$ and $n$ for all $q \in [0, 1]$ over a wide class of distributions. The upper bound is obtained by analyzing the performance of $\ell_q$-constrained PCA. In particular, our results provide convergence rates for $\ell_1$-constrained PCA.

artificial intelligence, assumption 2, machine learning, (14 more...)

1202.0786

Genre: Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.89)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.62)

Zhou, Mingyuan, Hannah, Lauren, Dunson, David, Carin, Lawrence

Beta-Negative Binomial Process and Poisson Factor Analysis

arXiv.org Machine LearningFeb-4-2012

A beta-negative binomial (BNB) process is proposed, leading to a beta-gamma-Poisson process, which may be viewed as a "multi-scoop" generalization of the beta-Bernoulli process. The BNB process is augmented into a beta-gamma-gamma-Poisson hierarchical structure, and applied as a nonparametric Bayesian prior for an infinite Poisson factor analysis model. A finite approximation for the beta process Levy random measure is constructed for convenient implementation. Efficient MCMC computations are performed with data augmentation and marginalization techniques. Encouraging results are shown on document count matrix factorization.

beta-negative binomial process, machine learning, negative binomial distribution, (13 more...)

1112.3605

Country: North America > United States (0.68)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

arXiv.org Machine LearningFeb-3-2012

A Reconstruction Error Formulation for Semi-Supervised Multi-task and Multi-view Learning

Qian, Buyue, Wang, Xiang, Davidson, Ian

A significant challenge to make learning techniques more sui table for general purpose use is to move beyond i) complete supervision, ii) low dimensional data, iii) a single t ask and single view per instance. Solving these challenges a llows working with "Big Data" problems that are typically high dim ensional with multiple (but possibly incomplete) labeling s and views. While other work has addressed each of these probl ems separately, in this paper we show how to address them together, namelysemi-supervised dimension reduction for multi-task and multi-view learning (SSDR-MML), which performs optimization for dimension reduction and label inference in semi-supervised setting. The proposed framework is designed to handle both multi-task and multi-view learning settings, and can be easily adapted to many useful applications. Inform ation obtained from all tasks and views is combined via reconstruction errors in a linear fashion that can be efficiently solvedusing an alternating optimization scheme. Our formulation has a number of advantages. W e explicitly model the information combining mechanism as a data structure (a weight/nearest-nei ghbor matrix) which allows investigating fundamental ques tions in multi-task and multi-view learning. W e address one such question by presenting a general measure to quantify the success of simultaneous learning of multiple tasks or from multiple views. W e show that our SSDR-MML approach can outperform many state-of-the-art baseline methods and demonstrate the effectiveness of connecting dimension reduction and learning.

artificial intelligence, machine learning, optimization problem, (16 more...)

1202.0855

Country: North America > United States > California (0.93)

Genre: Research Report (0.63)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)