Collaborating Authors

 Schweitzer, Haim


Improving the Accuracy of Principal Component Analysis by the Maximum Entropy Method

arXiv.org Machine Learning

Classical Principal Component Analysis (PCA) approximates data in terms of projections on a small number of orthogonal vectors. There are simple procedures to efficiently compute various functions of the data from the PCA approximation. The most important function is arguably the Euclidean distance between data items, which can be used, for example, to solve the approximate nearest neighbor problem. We use random variables to model the inherent uncertainty in such approximations, and apply the Maximum Entropy Method to infer the underlying probability distribution. We propose using the expected values of distances between these random variables as improved estimates of the distance. We show analytically and experimentally that in most cases results obtained by our method are more accurate than those obtained by the classical approach. This improves the accuracy of a classical technique that has been used with little change for over 100 years.
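
As a rough illustration of the setup, here is a minimal numpy sketch. It is not the paper's exact estimator: it assumes a zero-mean Gaussian (maximum-entropy) model of the discarded residuals and charges every item the same average discarded variance.

    import numpy as np

    def pca_fit(X, k):
        """Rank-k PCA of the rows of X: mean, top-k directions, and the
        total variance discarded by the truncation."""
        mu = X.mean(axis=0)
        U, s, Vt = np.linalg.svd(X - mu, full_matrices=False)
        var = s ** 2 / (len(X) - 1)            # variance per direction
        return mu, Vt[:k], var[k:].sum()       # mean, basis, lost variance

    def expected_sq_distance(x, y, mu, V, lost_var):
        """Expected squared distance when each item's unseen residual is
        modeled as an independent zero-mean Gaussian carrying the average
        discarded variance: E||x - y||^2 = ||Vx - Vy||^2 + 2 * lost_var."""
        px, py = V @ (x - mu), V @ (y - mu)
        return np.sum((px - py) ** 2) + 2.0 * lost_var

The classical estimate is the first term alone; the additive correction is what a maximum-entropy residual model contributes in this simplified form.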


Solving Generalized Column Subset Selection With Heuristic Search

AAAI Conferences

We address the problem of approximating a matrix by the linear combination of a column-sparse matrix and a low-rank matrix. Two variants of a heuristic search algorithm are described. The first produces an optimal solution but may be slow, as these problems are believed to be NP-hard. The second is much faster, but only guarantees a suboptimal solution. The quality of the approximation and the optimality criterion can be specified in terms of unitarily invariant norms.
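
The optimality criterion can be made concrete with a small cost function a search procedure might minimize. This is an illustrative sketch under the Frobenius norm (one member of the unitarily invariant family the paper allows), not the authors' code:

    import numpy as np

    def css_plus_lowrank_error(A, cols, r):
        """Error of approximating A by a combination of the columns in
        `cols` plus a rank-r matrix: project A away from span(A[:, cols]),
        then drop the best rank-r part of the remainder (Eckart-Young)."""
        Q, _ = np.linalg.qr(A[:, list(cols)])
        R = A - Q @ (Q.T @ A)                   # part outside the columns
        s = np.linalg.svd(R, compute_uv=False)
        return np.sum(s[r:] ** 2)               # mass no rank-r term covers

A heuristic search over candidate column sets can then compare states by this value.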


Cleaning the Null Space: A Privacy Mechanism for Predictors

AAAI Conferences

In a standard machine learning or regression setting, feature values are used to predict some desired information. The privacy challenge considered here is to prevent an adversary from using available feature values to predict confidential information that one wishes to keep secret. We show that this can sometimes be achieved with almost no effect on the quality of predicting desired information. We describe two algorithms aimed at providing such privacy when the predictors have a linear operator in the first stage. The desired effect can be achieved by zeroing out feature components in the approximate null space of the linear operator.
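
A minimal numpy sketch of the core mechanism (assuming the first-stage operator W is known; the paper's two algorithms address the setting in more generality):

    import numpy as np

    def clean_null_space(X, W, tol=1e-8):
        """Zero out, in each feature vector (row of X), the components
        lying in the approximate null space of W. The predictor's first
        stage W @ x barely changes, but those components are no longer
        available to an adversary."""
        _, s, Vt = np.linalg.svd(W, full_matrices=True)
        keep = np.zeros(Vt.shape[0], dtype=bool)
        keep[: len(s)] = s > tol * s.max()      # directions W responds to
        P = Vt[keep].T @ Vt[keep]               # projector onto row space
        return X @ P                            # null-space part removed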


Enhancing the Privacy of Predictors

AAAI Conferences

The privacy challenge considered here is to prevent an adversary from using available feature values to predict confidential information. We propose an algorithm providing such privacy for predictors that have a linear operator in the first stage. Privacy is achieved by zeroing out feature components in the approximate null space of the linear operator. We show that this has little effect on predicting the desired information.
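
A quick numerical check of that claim, reusing clean_null_space from the sketch above (a toy example with a random operator, not the paper's experiments):

    import numpy as np

    rng = np.random.default_rng(1)
    W = rng.standard_normal((5, 20))    # first-stage linear operator
    X = rng.standard_normal((100, 20))  # feature vectors as rows
    Xc = clean_null_space(X, W)
    print(np.allclose(X @ W.T, Xc @ W.T))   # True: predictions unchanged
    print(np.linalg.norm(X - Xc) > 1.0)     # True: null-space dims removed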


Weighted A* Algorithms for Unsupervised Feature Selection with Provable Bounds on Suboptimality

AAAI Conferences

Identifying a small number of features that can represent the data is believed to be NP-hard. Previous approaches exploit the algebraic structure of the data or rely on randomization. We propose an algorithm based on ideas similar to the Weighted A* algorithm in heuristic search. Our experiments show this new algorithm to be more accurate than the current state of the art.
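
For context, recall the standard Weighted A* guarantee for an admissible heuristic h (a textbook result, stated here as background rather than quoted from the paper):

    f(n) = g(n) + w \cdot h(n), \qquad w \ge 1
    \quad\Longrightarrow\quad
    C(\text{solution returned}) \le w \cdot C^{*}

where C^{*} is the optimal cost; the weight w is the adjustable knob behind bounds of this kind.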


Unsupervised Feature Selection by Heuristic Search with Provable Bounds on Suboptimality

AAAI Conferences

Identifying a small number of features that can represent the data is a known problem that comes up in areas such as machine learning, knowledge representation, data mining, and numerical linear algebra. Computing an optimal solution is believed to be NP-hard, and there is extensive work on approximation algorithms. Classic approaches exploit the algebraic structure of the underlying matrix, while more recent approaches use randomization. An entirely different approach that uses the A* heuristic search algorithm to find an optimal solution was recently proposed. Not surprisingly, it is limited to effectively selecting only a small number of features. We propose a similar approach related to the Weighted A* algorithm. This gives algorithms that are not guaranteed to find an optimal solution but run much faster than the A* approach, enabling effective selection of many features from large datasets. We demonstrate experimentally that these new algorithms are more accurate than the current state of the art while still being practical. Furthermore, they come with an adjustable guarantee on how different their error may be from the smallest possible (optimal) error. Their accuracy can always be increased at the expense of a longer running time.
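
The flavor of such a search can be sketched in a few lines (an illustrative simplification, not the paper's algorithm; the optimistic bound below comes from Eckart-Young rather than the authors' heuristic):

    import heapq
    import itertools
    import numpy as np

    def residual(A, cols):
        """Residual of A after projecting onto its chosen columns."""
        if not cols:
            return A
        Q, _ = np.linalg.qr(A[:, sorted(cols)])
        return A - Q @ (Q.T @ A)

    def select_columns(A, k, w=1.0):
        """Weighted A*-style selection of k columns. For a partial set S,
        no completion can beat the best rank-(k - |S|) approximation of
        the residual, which gives an optimistic bound on the final error.
        w = 1 is plain A* (optimal); w > 1 inflates the promised
        improvement, making the search greedier and much faster."""
        n = A.shape[1]
        tie = itertools.count()           # tie-breaker for the heap

        def priority(cols):
            R = residual(A, cols)
            err = np.sum(R ** 2)                      # error right now
            s = np.linalg.svd(R, compute_uv=False)
            bound = np.sum(s[k - len(cols):] ** 2)    # optimistic finish
            return err - w * (err - bound)

        heap = [(priority(frozenset()), next(tie), frozenset())]
        seen = {frozenset()}
        while heap:
            _, _, cols = heapq.heappop(heap)
            if len(cols) == k:
                return sorted(cols)
            for j in range(n):
                child = cols | {j}
                if child not in seen:
                    seen.add(child)
                    heapq.heappush(heap, (priority(child), next(tie), child))

With w = 1 the priority of a goal state equals its true error, and the priority of a partial state lower-bounds any completion, so the first goal popped is optimal; larger w trades that guarantee for speed, which is exactly the adjustable accuracy/time knob described above.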


Optimal Column Subset Selection by A-Star Search

AAAI Conferences

Approximating a matrix by a small subset of its columns is a known problem in numerical linear algebra. Algorithms that address this problem have been used in areas which include, among others, sparse approximation, unsupervised feature selection, data mining, and knowledge representation. Such algorithms have been investigated since the 1960s, with recent results that use randomization. The problem is believed to be NP-hard, and to the best of our knowledge there are no previously published algorithms aimed at computing optimal solutions. We show how to model the problem as a graph search, and propose a heuristic based on eigenvalues of related matrices. Applying the A* search strategy with this heuristic is guaranteed to find the optimal solution. Experimental results on common datasets show that the proposed algorithm can effectively select columns from moderate-size matrices, typically improving the run time of exhaustive search by orders of magnitude. We also show how to combine the proposed algorithm with other non-optimal (but much faster) algorithms in a "two-stage" framework, which is guaranteed to improve the accuracy of the other algorithms.
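
The two-stage idea can be illustrated with the pieces above: a fast, possibly suboptimal selector supplies an incumbent error, and the optimal search then discards any state whose optimistic bound is already worse. A hedged sketch (reusing residual from the earlier fragment; any fast selector would do for stage one):

    import numpy as np

    def greedy_columns(A, k):
        """Stage one: cheap greedy pick by largest remaining column
        energy, deflating after each choice."""
        cols, R = [], A.copy()
        for _ in range(k):
            j = int(np.argmax(np.sum(R ** 2, axis=0)))
            cols.append(j)
            q = R[:, j] / np.linalg.norm(R[:, j])
            R -= np.outer(q, q @ R)               # remove chosen direction
        return cols

    incumbent = np.sum(residual(A, greedy_columns(A, k)) ** 2)
    # Stage two: run the A* search, pruning any state whose optimistic
    # bound exceeds `incumbent`.

Since the search never returns anything worse than the incumbent, the combination can only match or improve the fast algorithm's accuracy, as the abstract states.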


Pass-efficient unsupervised feature selection

Neural Information Processing Systems

The goal of unsupervised feature selection is to identify a small number of important features that can represent the data. We propose a new algorithm, a modification of the classical pivoted QR algorithm of Businger and Golub, that requires a small number of passes over the data. The improvements are based on two ideas: keeping track of multiple features in each pass, and skipping calculations that can be shown not to affect the final selection. Our algorithm selects the exact same features as the classical pivoted QR algorithm, and has the same favorable numerical stability. We describe experiments on real-world datasets which sometimes show improvements of several orders of magnitude over the classical algorithm. These results appear to be competitive with recently proposed randomized algorithms in terms of pass efficiency and run time. On the other hand, the randomized algorithms may produce better features, at the cost of a small probability of failure.
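
The in-memory baseline the algorithm matches is a one-liner with scipy (a sketch of classical pivoted QR selection, not the pass-efficient variant itself):

    import numpy as np
    from scipy.linalg import qr

    def pivoted_qr_features(X, k):
        """Businger-Golub column selection: QR with column pivoting on the
        feature matrix (features as columns); the first k pivots are the
        selected features. The paper's algorithm returns these exact same
        columns while making far fewer passes over the data."""
        _, _, piv = qr(X, mode='economic', pivoting=True)
        return piv[:k]

    X = np.random.default_rng(0).standard_normal((1000, 500))
    print(sorted(pivoted_qr_features(X, 10)))   # indices of 10 features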