AITopics

We investigate weakly-supervised image parsing, i.e., assigning class labels to image regions by using image-level labels only. Existing studies pay main attention to the formulation of the weakly-supervised learning problem, i.e., how to propagate class labels from images to regions given an affinity graph of regions. Notably, however, the affinity graph of regions, which is generally constructed in relatively simpler settings in existing methods, is of crucial importance to the parsing performance due to the fact that the weakly-supervised parsing problem cannot be solved within a single image, and that the affinity graph enables label propagation among multiple images. In order to embed more semantics into the affinity graph, we propose novel criteria by exploiting the weak supervision information carefully, and develop two graphs: L1 semantic graph and k-NN semantic graph. Experimental results demonstrate that the proposed semantic graphs not only capture more semantic relevance, but also perform significantly better than conventional graphs in image parsing.

artificial intelligence, machine learning, natural language, (20 more...)

Twenty-Eighth AAAI Conference on Artificial Intelligence

Country:

Asia > Middle East > Jordan (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Education (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)

A Generalized Genetic Algorithm-Based Solver for Very Large Jigsaw Puzzles of Complex Types

Sholomon, Dror (Bar-Ilan University) | David, Omid E. (Bar-Ilan University) | Netanyahu, Nathan S. (Bar-Ilan University)

In this paper we introduce new types of square-piece jigsaw puzzles, where in addition to the unknown location and orientation of each piece, a piece might also need to be flipped. These puzzles, which are associated with a number of real world problems, are considerably harder, from a computational standpoint. Specifically, we present a novel generalized genetic algorithm (GA)-based solver that can handle puzzle pieces of unknown location and orientation (Type 2 puzzles) and (two-sided) puzzle pieces of unknown location, orientation, and face (Type 4 puzzles). To the best of our knowledge, our solver provides a new state-of-the-art, solving previously attempted puzzles faster and far more accurately, handling puzzle sizes that have never been attempted before, and assembling the newly introduced two-sided puzzles automatically and effectively. This paper also presents, among other results, the most extensive set of experimental results, compiled as of yet, on Type 2 puzzles.

evolutionary algorithm, machine learning, puzzle, (18 more...)

Twenty-Eighth AAAI Conference on Artificial Intelligence

Country:

North America > United States > Maryland > Prince George's County > College Park (0.14)
North America > United States > Michigan > Washtenaw County > Ann Arbor (0.04)
North America > United States > Massachusetts (0.04)
Asia > Middle East > Israel (0.04)

Industry: Health & Medicine (0.33)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

On Hair Recognition in the Wild by Machine

Roth, Joseph (Michigan State University) | Liu, Xiaoming (Michigan State University)

We present an algorithm for identity verification using only information from the hair. Face recognition in the wild (i.e., unconstrained settings) is highly useful in a variety of applications, but performance suffers due to many factors, e.g., obscured face, lighting variation, extreme pose angle, and expression. It is well known that humans utilize hair for identification under many of these scenarios due to either the consistent hair appearance of the same subject or obvious hair discrepancy of different subjects, but little work exists to replicate this intelligence artificially. We propose a learned hair matcher using shape, color, and texture features derived from localized patches through an AdaBoost technique with abstaining weak classifiers when features are not present in the given location. The proposed hair matcher achieves 71.53% accuracy on the LFW View 2 dataset. Hair also reduces the error of a Commercial Off-The-Shelf (COTS) face matcher through simple score-level fusion by 5.7%.

artificial intelligence, machine learning, recognition, (18 more...)

Twenty-Eighth AAAI Conference on Artificial Intelligence

Country:

Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.04)
North America > United States > New York (0.04)
North America > United States > Michigan > Ingham County > Lansing (0.04)
(3 more...)

Genre: Research Report (0.68)

Industry:

Information Technology > Security & Privacy (0.48)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Vision > Face Recognition (0.93)

Learning Low-Rank Representations with Classwise Block-Diagonal Structure for Robust Face Recognition

Li, Yong (Chinese Academy of Sciences) | Liu, Jing (Chinese Academy of Sciences) | Li, Zechao (Nanjing University of Science and Technology) | Zhang, Yangmuzi (University of Maryland, College Park) | Lu, Hanqing (Chinese Academy of Sciences) | Ma, Songde (Chinese Academy of Sciences)

Face recognition has been widely studied due to its importance in various applications. However, the case that both training images and testing images are corrupted is not well addressed. Motivated by the success of low-rank matrix recovery, we propose a novel semi-supervised low-rank matrix recovery algorithm for robust face recognition. The proposed method can learn robust discriminative representations for both training images and testing images simultaneously by exploiting the classwise block-diagonal structure. Specifically, low-rank matrix approximation can handle the possible contamination of data. Moreover, the classwise block-diagonal structure is exploited to promote discrimination of representations for robust recognition. The above issues are formulated into a unified objective function and we design an efficient optimization procedure based on augmented Lagrange multiplier method to solve it. Extensive experiments on three public databases are performed to validate the effectiveness of our approach. The strong identification capability of representations with block-diagonal structure is verified.

artificial intelligence, machine learning, training image, (14 more...)

Twenty-Eighth AAAI Conference on Artificial Intelligence

Country:

North America > United States > Maryland > Prince George's County > College Park (0.04)
Asia > China > Jiangsu Province > Nanjing (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision > Face Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Uncorrelated Multi-View Discrimination Dictionary Learning for Recognition

Jing, Xiao-Yuan (Wuhan University) | Hu, Rui-Min (Wuhan University) | Wu, Fei (Nanjing University of Posts and Telecommunications) | Chen, Xi-Lin (Chinese Academy of Sciences) | Liu, Qian (Nanjing University of Posts and Telecommunications) | Yao, Yong-Fang (Nanjing University of Posts and Telecommunications)

Dictionary learning (DL) has now become an important feature learning technique that owns state-of-the-art recognition performance. Due to sparse characteristic of data in real-world applications, DL uses a set of learned dictionary bases to represent the linear decomposition of a data point. Fisher discrimination DL (FDDL) is a representative supervised DL method, which constructs a structured dictionary whose atoms correspond to the class labels. Recent years have witnessed a growing interest in multi-view (more than two views) feature learning techniques. Although some multi-view (or multi-modal) DL methods have been presented, there still exists much room for improvement. How to enhance the total discriminability of dictionaries and reduce their redundancy is a crucial research topic. To boost the performance of multi-view DL technique, we propose an uncorrelated multi-view discrimination DL (UMDDL) approach for recognition. By making dictionary atoms correspond to the class labels such that the obtained reconstruction error is discriminative, UMDDL aims to jointly learn multiple dictionaries with totally favorable discriminative power. Furthermore, we design the uncorrelated constraint for multi-view DL, so as to reduce the redundancy among dictionaries learned from different views. Experiments on several public datasets demonstrate the effectiveness of the proposed approach.

artificial intelligence, machine learning, umd 2, (15 more...)

Twenty-Eighth AAAI Conference on Artificial Intelligence

Country:

Asia > China > Hubei Province > Wuhan (0.04)
North America > United States > Ohio > Franklin County > Columbus (0.04)
North America > United States > Massachusetts (0.04)
Asia > China > Jiangsu Province > Nanjing (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.94)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Jiang, Ziheng (Beijing Institute of Technology) | Guo, Ping (Beijing Institute of Technology) | Peng, Lihong (National University of Defense Technology)

Locality-Constrained Low-Rank Coding for Image Classification

Low-rank coding (LRC), originated from matrix decomposition, is recently introduced into image classification. Following the standard bag-of-words (BOW) pipeline, when coding the data matrix in the sense of low-rankness incorporates contextual information into the traditional BOW model, this can capture the dependency relationship among neighbor patches. It differs from the traditional sparse coding paradigms which encode patches independently. Current LRC-based methods use l_1 norm to increase the discrimination and sparseness of the learned codes. However, such methods fail to consider the local manifold structure between dataspace and dictionary space. To solve this problem, we propose a locality-constrained low-rank coding (LCLR) algorithm for image representations. By using the geometric structure information as a regularization term,we can obtain more discriminative representations. In addition, we present a fast and stable online algorithmto solve the optimization problem. In the experiments,we evaluate LCLR with four benchmarks, including one face recognition dataset (extended Yale B), one handwrittendigit recognition dataset (USPS), and two image datasets (Scene13 for scene recognition and Caltech101 for object recognition). Experimental results show thatour approach outperforms many state-of-the-art algorithmseven with a linear classifier.

algorithm, artificial intelligence, machine learning, (18 more...)

Twenty-Eighth AAAI Conference on Artificial Intelligence

Country:

Asia > China > Beijing > Beijing (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision > Image Understanding (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Similarity-Preserving Binary Signature for Linear Subspaces

Ji, Jianqiu (Tsinghua University) | Li, Jianmin (Tsinghua University) | Yan, Shuicheng (National University of Singapore) | Tian, Qi (University of Texas at San Antonio) | Zhang, Bo (Tsinghua University)

Linear subspace is an important representation for many kinds of real-world data in computer vision and pattern recognition, e.g. faces, motion videos, speeches. In this paper, first we define pairwise angular similarity and angular distance for linear subspaces. The angular distance satisfies non-negativity, identity of indiscernibles, symmetry and triangle inequality, and thus it is a metric. Then we propose a method to compress linear subspaces into compact similarity-preserving binary signatures, between which the normalized Hamming distance is an unbiased estimator of the angular distance. We provide a lower bound on the length of the binary signatures which suffices to guarantee uniform distance-preservation within a set of subspaces. Experiments on face recognition demonstrate the effectiveness of the binary signature in terms of recognition accuracy, speed and storage requirement. The results show that, compared with the exact method, the approximation with the binary signatures achieves an order of magnitude speed-up, while requiring significantly smaller amount of storage space, yet it still accurately preserves the similarity, and achieves high recognition accuracy comparable to the exact method in face recognition.

artificial intelligence, machine learning, subspace, (17 more...)

Twenty-Eighth AAAI Conference on Artificial Intelligence

Country:

Asia > Afghanistan > Parwan Province > Charikar (0.05)
Asia > Middle East > Jordan (0.05)
Asia > China > Beijing > Beijing (0.04)
(3 more...)

Genre: Research Report (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision > Face Recognition (0.89)

Towards Topological-Transformation Robust Shape Comparison: A Sparse Representation Based Manifold Embedding Approach

Gao, Longwen (Fudan University) | Zhou, Shuigeng (Fudan University)

Non-rigid shape comparison based on manifold embeddingusing Generalized Multidimensional Scaling(GMDS) has attracted much attention for its highaccuracy. However, this method requires that shape surfaceis not elastic. In other words, it is sensitive totopological transformations such as stretching and compressing.To tackle this problem, we propose a new approachthat constructs a high-dimensional space to embedthe manifolds of shapes based on sparse representation,which is able to completely withstand rigid transformationsand considerably tolerate topological transformations.Experiments on TOSCA shapes validate theproposed approach.

artificial intelligence, optimization problem, transformation, (16 more...)

Twenty-Eighth AAAI Conference on Artificial Intelligence

Country:

North America > United States (0.14)
Asia > China > Shanghai > Shanghai (0.05)

Industry: Health & Medicine (0.47)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.47)

Double Configuration Checking in Stochastic Local Search for Satisfiability

Luo, Chuan (Peking University) | Cai, Shaowei (Chinese Academy of Sciences) | Wu, Wei (Peking University) | Su, Kaile (Peking University)

Stochastic local search (SLS) algorithms have shown effectiveness on satisfiable instances of the Boolean satisfiability (SAT) problem. However, their performance is still unsatisfactory on random k-SAT at the phase transition, which is of significance and is one of the empirically hardest distributions of SAT instances. In this paper, we propose a new heuristic called DCCA, which combines two configuration checking (CC) strategies with different definitions of configuration in a novel way. We use the DCCA heuristic to design an efficient SLS solver for SAT dubbed DCCASat. The experiments show that the DCCASat solver significantly outperforms a number of state-of-the-art solvers on extensive random k-SAT benchmarks at the phase transition. Moreover, DCCASat shows good performance on structured benchmarks, and a combination of DCCASat with a complete solver achieves state-of-the-art performance on structured benchmarks.

artificial intelligence, benchmark, optimization problem, (16 more...)

Twenty-Eighth AAAI Conference on Artificial Intelligence

Country:

Oceania > Australia > Queensland > Brisbane (0.04)
Asia > China > Beijing > Beijing (0.04)
Europe > Italy > Friuli Venezia Giulia > Trieste Province > Trieste (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.72)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.48)

Lee, Jimmy H.M. (The Chinese University of Hong Kong) | Zhu, Zichen (The Chinese University of Hong Kong)

Boosting SBDS for Partial Symmetry Breaking in Constraint Programming

The paper proposes a dynamic method, Recursive SBDS(ReSBDS), for efficient partial symmetry breaking. Wefirst demonstrate how (partial) Symmetry BreakingDuring Search (SBDS) misses important pruning opportunitieswhen given only a subset of symmetries tobreak. The investigation pinpoints the culprit and in turnsuggests rectification. The main idea is to add extra conditionalconstraints during search recursively to prunealso symmetric nodes of some pruned subtrees. Thus,ReSBDS can break extra symmetry compositions, butis carefully designed to break only the ones that areeasy to identify and inexpensive to break. We presenttheorems to guarantee the soundness and terminationof our approach, and compare our method with popularstatic and dynamic methods. When the variable (value)heuristic is static, ReSBDS is also complete in eliminatingall interchangeable variables (values) given only thegenerator symmetries. Extensive experimentations confirmthe efficiency of ReSBDS, when compared againststate of the art methods.

artificial intelligence, constraint-based reasoning, symmetry, (16 more...)

Twenty-Eighth AAAI Conference on Artificial Intelligence

Country: Asia > China > Hong Kong (0.04)

Genre: Research Report (0.68)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)