Leveraging Spatial and Temporal Correlations in Sparsified Mean Estimation
We study the problem of estimating, at a central server, the mean of a set of vectors distributed across several nodes (one vector per node). When the vectors are high-dimensional, the communication cost of sending entire vectors may be prohibitive, and the nodes may need to use sparsification techniques. While most existing work on sparsified mean estimation is agnostic to the characteristics of the data vectors, in many practical applications, such as federated learning, the data may exhibit spatial correlations (similarities between the vectors sent by different nodes) or temporal correlations (similarities in the data sent by a single node over different iterations of the algorithm). We leverage these correlations by simply modifying the decoding method the server uses to estimate the mean. We provide an analysis of the resulting estimation error, as well as experiments on PCA, K-Means, and logistic regression, which show that our estimators consistently outperform more sophisticated and expensive sparsification methods.
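The core idea of decoding with temporal correlation can be sketched as follows. This is a toy simulation, not the paper's exact estimator: `rand_k_sparsify`, the drift model, and the plug-in (unscaled) decoding are illustrative assumptions. The server fills coordinates a node did not send with that node's previously decoded vector rather than with zero.

```python
import numpy as np

def rand_k_sparsify(x, k, rng):
    """Send only k randomly chosen coordinates of x (indices + values)."""
    idx = rng.choice(x.size, size=k, replace=False)
    return idx, x[idx]

def decode_with_memory(msgs, memory):
    """Decoding that exploits temporal correlation: unsent coordinates
    are filled with the node's previously decoded vector (a plug-in
    sketch, ignoring the unbiasedness scaling a rand-k estimator uses)."""
    decoded = []
    for (idx, vals), prev in zip(msgs, memory):
        est = prev.copy()      # start from last round's estimate
        est[idx] = vals        # overwrite freshly received coordinates
        decoded.append(est)
    return np.mean(decoded, axis=0), decoded

def decode_zero_fill(msgs, d):
    """Baseline decoding: unsent coordinates are treated as zero."""
    decoded = []
    for idx, vals in msgs:
        est = np.zeros(d)
        est[idx] = vals
        decoded.append(est)
    return np.mean(decoded, axis=0)
```

With slowly drifting node vectors (strong temporal correlation), the memory-based decoder's error falls well below the zero-fill baseline after a few rounds, since stale coordinates remain close to their true values.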
Random Projections with Asymmetric Quantization
The method of random projection has been a popular tool for data compression, similarity search, and machine learning. In many practical scenarios, applying quantization to randomly projected data can be very helpful for further reducing storage cost and enabling more efficient retrieval, while incurring only a small loss in accuracy.
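A minimal sketch of the asymmetric setting: both query and database vectors are projected with a shared Gaussian matrix, but only the database side is quantized while the query stays full precision. The specific projection scaling and uniform scalar quantizer below are illustrative assumptions, not the paper's exact scheme.

```python
import numpy as np

def random_projection(X, m, rng):
    """JL-style Gaussian random projection from d to m dimensions."""
    R = rng.normal(size=(X.shape[-1], m)) / np.sqrt(m)
    return X @ R

def uniform_quantize(z, bits):
    """Uniform scalar quantizer with 2**bits levels over z's range."""
    c = np.max(np.abs(z))
    step = 2 * c / (2 ** bits - 1)
    return np.round((z + c) / step) * step - c
```

Even at 4 bits on the database side, cosine similarity between a full-precision projected query and a quantized projected database vector stays close to the original similarity, since the quantization noise is roughly uncorrelated with the query.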
Generating Synthetic Relational Tabular Data via Structural Causal Models
Hoppe, Frederik, Franz, Astrid, Kleinemeier, Lars, Göbel, Udo
Synthetic tabular data generation has received increasing attention in recent years, particularly with the emergence of foundation models for tabular data. The breakthrough success of TabPFN (Hollmann et al., 2025), which leverages vast quantities of synthetic tabular datasets derived from structural causal models (SCMs), demonstrates the critical role synthetic data plays in developing powerful tabular foundation models. However, most real-world tabular data exists in relational formats spanning multiple interconnected tables, a structure not adequately addressed by current generation methods. In this work, we extend the SCM-based approach by developing a novel framework that generates realistic synthetic relational tabular data, including causal relationships across tables. Our experiments confirm that this framework is able to construct relational datasets with complex inter-table dependencies mimicking real-world scenarios.
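The flavor of cross-table causal edges can be illustrated with a toy SCM, not the authors' framework: a parent table with a within-table causal edge, and a child table whose feature depends causally on the linked parent row. All variable names and coefficients here are hypothetical.

```python
import numpy as np

def sample_relational_scm(n_parents, rng):
    """Toy relational SCM: parents carry x1 -> x2, children inherit a
    causal signal from their parent's x2 across the table boundary."""
    # Parent table: within-table causal edge x1 -> x2
    x1 = rng.normal(size=n_parents)
    x2 = 0.8 * x1 + 0.2 * rng.normal(size=n_parents)
    parents = {"id": np.arange(n_parents), "x1": x1, "x2": x2}
    # Child table: random fan-out per parent, foreign key parent_id
    n_children = rng.poisson(3, size=n_parents)
    pid = np.repeat(parents["id"], n_children)
    # Cross-table causal edge: child's y depends on its parent's x2
    y = 1.5 * x2[pid] + 0.3 * rng.normal(size=pid.size)
    children = {"parent_id": pid, "y": y}
    return parents, children
```

A generated child feature then correlates strongly with its parent's feature through the foreign-key link, which is exactly the kind of inter-table dependency a relational generator needs to reproduce.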
Kernel Recursive Least Squares Dictionary Learning Algorithm
Alipoor, Ghasem, Skretting, Karl
Data factorization methods have met with considerable success in discovering latent features of signals encountered in a wide range of applications. In this framework, the representation bases, which make up the columns of the basis matrix or dictionary, are learned from the available samples of the target environment. An example is sparse representation (SR), in which the dictionary is intended to best represent the data with a small number of atoms, far fewer than the dimension of the signal space. It has been shown that, in addition to yielding a more informative representation of signals, imposing sparsity constraints on the representation coefficients can improve generalization performance and computational efficiency [1, 2, 3]. Furthermore, sparse representation is more robust to noise, redundancy, and missing data. These features are mainly attributed to the fact that the intrinsic dimension of natural signals is usually much smaller than their apparent dimension, and hence SR in an appropriate dictionary can extract these intrinsic features more efficiently. SR has been a successful strategy, receiving considerable attention and achieving state-of-the-art results in many applications, e.g.
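The sparse-coding step that dictionary learning alternates with can be sketched via orthogonal matching pursuit. This illustrates representing a signal with a few atoms of a given dictionary only; the paper's contribution, a kernel recursive least squares algorithm for learning the dictionary itself, is not implemented here.

```python
import numpy as np

def omp(D, y, k):
    """Orthogonal matching pursuit: greedily select up to k atoms of
    dictionary D (columns, assumed unit norm) to sparsely represent y."""
    residual = y.copy()
    support = []
    x = np.zeros(D.shape[1])
    coef = np.zeros(0)
    for _ in range(k):
        j = int(np.argmax(np.abs(D.T @ residual)))  # most correlated atom
        if j in support:
            break
        support.append(j)
        # Least-squares refit on the selected atoms
        coef, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ coef
    x[support] = coef
    return x
```

On a signal synthesized from a few atoms of a random dictionary, the recovered coefficient vector is k-sparse and the residual drops sharply after the greedy selections.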
Quasicyclic Principal Component Analysis
Rumsey, Susanna E., Draper, Stark C., Kschischang, Frank R.
We present quasicyclic principal component analysis (QPCA), a generalization of principal component analysis (PCA), that determines an optimized basis for a dataset in terms of families of shift-orthogonal principal vectors. This is of particular interest when analyzing cyclostationary data, whose cyclic structure is not exploited by the standard PCA algorithm. We first formulate QPCA as an optimization problem, which we show may be decomposed into a series of PCA problems in the frequency domain. We then formalize our solution as an explicit algorithm and analyze its computational complexity. Finally, we provide some examples of applications of QPCA to cyclostationary signal processing data, including an investigation of carrier pulse recovery, a presentation of methods for estimating an unknown oversampling rate, and a discussion of an appropriate approach for pre-processing data with a non-integer oversampling rate in order to better apply the QPCA algorithm.
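Since QPCA decomposes into a series of standard PCA problems in the frequency domain, the building block it repeatedly invokes is ordinary PCA via eigendecomposition of the sample covariance. The sketch below shows only that building block, not the QPCA algorithm itself.

```python
import numpy as np

def pca(X, r):
    """Top-r principal components of X (rows = samples, cols = features)."""
    Xc = X - X.mean(axis=0)
    C = Xc.T @ Xc / (X.shape[0] - 1)  # sample covariance
    w, V = np.linalg.eigh(C)          # eigenvalues in ascending order
    order = np.argsort(w)[::-1][:r]   # keep the r largest
    return w[order], V[:, order]
```

In QPCA this routine would be applied once per DFT bin of the cyclostationary data; as a sanity check, on rank-one-plus-noise data the top component recovers the planted direction.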