AITopics | klsh

Collaborating Authors

klsh

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

AMoreExperiments

Neural Information Processing SystemsFeb-9-2026, 18:15:35 GMT

In our experiments, we adopt the standard exact Hamming search by linear scan. On a single core 2.0GHz CPU compiled with C++, searching over 1M samples onSIFT takes 17 approximately 0.15s per query withb = 512. Note that linear scan is a naive strategy. Firstly,we see that the differences among the curvesare very small. B.2 RankingEfficiency:MorecandρValues we provide more theoretical comparisons on the ranking efficiency at moreρ and c values. Figure 14: CIFAR-VGGTop-10 retrieved images (right) for two example query images (left, automobile and cat) withb = 512.

amoreexperiment, artificial intelligence, efficiency, (11 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.32)

Add feedback

718a3c5cf135894db6e718725f52ef9a-Paper-Conference.pdf

Neural Information Processing SystemsFeb-9-2026, 18:15:32 GMT

neural information processing system, proceedings, signrff, (11 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.28)
North America > Canada > Quebec > Montreal (0.04)
North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
(24 more...)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Information Management (0.93)
(3 more...)

Add feedback

718a3c5cf135894db6e718725f52ef9a-Supplemental-Conference.pdf

Neural Information Processing SystemsAug-15-2025, 19:20:13 GMT

artificial intelligence, ranking efficiency ratio 1, signrff sq-rff signrff sq-rff 1, (9 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.94)

Add feedback

SignRFF: Sign Random Fourier Features

Neural Information Processing SystemsAug-15-2025, 19:20:09 GMT

The industry practice has been moving to embedding based retrieval (EBR).

data mining, machine learning, natural language, (15 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.28)
North America > Canada > Quebec > Montreal (0.04)
North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
(24 more...)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Information Management (0.93)
(3 more...)

Add feedback

SignRFF: Sign Random Fourier Features

Neural Information Processing SystemsOct-11-2024, 14:03:27 GMT

The industry practice has been moving to embedding based retrieval (EBR). For example, in many applications, the embedding vectors are trained by some form of two-tower models. During serving phase, candidates (embedding vectors) are retrieved according to the rankings of cosine similarities either exhaustively or by approximate near neighbor (ANN) search algorithms. For those applications, it is natural to apply sign random projections'' (SignRP) or variants, on the trained embedding vectors to facilitate efficient data storage and cosine distance computations. SignRP is also one of the standard indexing schemes for conducting approximate near neighbor search.

sign random fourier feature, signrff, signrp, (8 more...)

Neural Information Processing Systems

Genre: Research Report (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.56)

Add feedback

Probabilistic Blocking with An Application to the Syrian Conflict

Steorts, Rebecca C., Shrivastava, Anshumali

arXiv.org Machine LearningOct-10-2018

Entity resolution seeks to merge databases as to remove duplicate entries where unique identifiers are typically unknown. We review modern blocking approaches for entity resolution, focusing on those based upon locality sensitive hashing (LSH). First, we introduce $k$-means locality sensitive hashing (KLSH), which is based upon the information retrieval literature and clusters similar records into blocks using a vector-space representation and projections. Second, we introduce a subquadratic variant of LSH to the literature, known as Densified One Permutation Hashing (DOPH). Third, we propose a weighted variant of DOPH. We illustrate each method on an application to a subset of the ongoing Syrian conflict, giving a discussion of each method.

data mining, information retrieval, machine learning, (18 more...)

arXiv.org Machine Learning

1810.05497

Country: Asia > Middle East > Syria (0.87)

Genre:

Overview (0.66)
Research Report (0.40)

Industry:

Government > Regional Government > Asia Government > Middle East Government > Syria Government (0.63)
Government > Military (0.63)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.77)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Binary Embedding with Additive Homogeneous Kernels

Kim, Saehoon ( Pohang University of Science and Technology (POSTECH) ) | Choi, Seungjin ( Pohang University of Science and Technology (POSTECH) )

AAAI ConferencesFeb-14-2017

Binary embedding transforms vectors in Euclidean space into the vertices of Hamming space such that Hamming distance between binary codes reflects a particular distance metric. In machine learning, the similarity metrics induced by Mercer kernels are frequently used, leading to the development of binary embedding with Mercer kernels (BE-MK) where the approximate nearest neighbor search is performed in a reproducing kernel Hilbert space (RKHS). Kernelized locality-sensitive hashing (KLSH), which is one of the representative BE-MK, uses kernel PCA to embed data points into a Euclidean space, followed by the random hyperplane binary embedding. In general, it works well when the query and data points in the database follow the same probability distribution. The streaming data environment, however, continuously requires KLSH to update the leading eigenvectors of the Gram matrix, which can be costly or hard to carry out in practice. In this paper we present a completely randomized binary embedding to work with a family of additive homogeneous kernels, referred to as BE-AHK. The proposed algorithm is easy to implement, built on Vedaldi and Zisserman's work on explicit feature maps for additive homogeneous kernels. We show that our BE-AHK is able to preserve kernel values by developing an upper- and lower-bound on its Hamming distance, which guarantees to solve approximate nearest neighbor search efficiently. Numerical experiments demonstrate that BE-AHK actually yields similarity-preserving binary codes in terms of additive homogeneous kernels and is superior to existing methods in case that training data and queries are generated from different distributions. Moreover, in cases where a large code size is allowed, the performance of BE-AHK is comparable to that of KLSH in general cases.

information retrieval, machine learning, natural language, (20 more...)

AAAI Conferences

Thirty-First AAAI Conference on Artificial Intelligence

Country: Asia (0.15)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.55)

Add feedback

Revisiting Kernelized Locality-Sensitive Hashing for Improved Large-Scale Image Retrieval

Jiang, Ke, Que, Qichao, Kulis, Brian

arXiv.org Machine LearningNov-15-2014

We present a simple but powerful reinterpretation of kernelized locality-sensitive hashing (KLSH), a general and popular method developed in the vision community for performing approximate nearest-neighbor searches in an arbitrary reproducing kernel Hilbert space (RKHS). Our new perspective is based on viewing the steps of the KLSH algorithm in an appropriately projected space, and has several key theoretical and practical benefits. First, it eliminates the problematic conceptual difficulties that are present in the existing motivation of KLSH. Second, it yields the first formal retrieval performance bounds for KLSH. Third, our analysis reveals two techniques for boosting the empirical performance of KLSH. We evaluate these extensions on several large-scale benchmark image retrieval data sets, and show that our analysis leads to improved recall performance of at least 12%, and sometimes much higher, over the standard KLSH method.

kernel, klsh, vector, (12 more...)

arXiv.org Machine Learning

1411.4199

Country:

North America > United States > Ohio (0.04)
Asia > Afghanistan > Parwan Province > Charikar (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.67)

Add feedback

Kernelized Locality-Sensitive Hashing for Semi-Supervised Agglomerative Clustering

Xie, Boyi, Zheng, Shuheng

arXiv.org Machine LearningJan-15-2013

Large scale agglomerative clustering is hindered by computational burdens. We propose a novel scheme where exact inter-instance distance calculation is replaced by the Hamming distance between Kernelized Locality-Sensitive Hashing (KLSH) hashed values. This results in a method that drastically decreases computation time. Additionally, we take advantage of certain labeled data points via distance metric learning to achieve a competitive precision and recall comparing to K-Means but in much less computation time.

artificial intelligence, distance metric learning, machine learning, (14 more...)

arXiv.org Machine Learning

1301.3575

Country: North America > United States (0.29)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Add feedback