AITopics

Country:

North America > United States (0.04)
Asia > Afghanistan > Parwan Province > Charikar (0.04)

Industry: Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Security & Privacy (0.93)
Information Technology > Data Science > Data Mining (0.68)
Information Technology > Communications (0.68)
Information Technology > Artificial Intelligence > Machine Learning (0.46)

Neural Information Processing SystemsFeb-12-2026, 08:04:16 GMT

Private Federated Frequency Estimation: Adapting to the Hardness of the Instance

Work done during an internship at Google Research 37th Conference on Neural Information Processing Systems (NeurIPS 2023).

artificial intelligence, data mining, machine learning, (18 more...)

Country:

North America > United States (0.14)
Asia > Afghanistan > Parwan Province > Charikar (0.04)

Industry: Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Security & Privacy (0.93)
Information Technology > Data Science > Data Mining (0.68)
Information Technology > Artificial Intelligence > Machine Learning (0.66)

Neural Information Processing SystemsFeb-8-2026, 07:45:45 GMT

AdversariallyRobustDense-SparseTradeoffsvia Heavy-Hitters

In the adversarial streaming model, the input is a sequence of adaptive updates that defines an underlying dataset and the goal is to approximate, collect, or compute some statistic while using space sublinear in the size of the dataset.

algorithm, artificial intelligence, machine learning, (19 more...)

Country: North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)

Genre: Research Report (0.46)

Industry: Banking & Finance (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Neural Information Processing SystemsOct-9-2025, 19:09:12 GMT

14c00f4bc19a5498982b16647998e894-Paper-Conference.pdf

algorithm, frequency vector, vector, (17 more...)

Country:

North America > United States > Texas (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Asia > Afghanistan > Parwan Province > Charikar (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology > Services (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Communications (0.93)
Information Technology > Data Science (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Neural Information Processing SystemsOct-8-2025, 18:27:49 GMT

Private Federated Frequency Estimation: Adapting to the Hardness of the Instance Anonymous Author(s) Affiliation Address email

In what follows, we will focus on this regime and assume that d > n.

artificial intelligence, data mining, machine learning, (20 more...)

Country:

North America > United States (0.04)
Asia > Afghanistan > Parwan Province > Charikar (0.04)

Industry: Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Security & Privacy (0.93)
Information Technology > Data Science > Data Mining (0.68)
Information Technology > Communications (0.68)
Information Technology > Artificial Intelligence > Machine Learning (0.46)

Neural Information Processing SystemsOct-8-2025, 18:27:46 GMT

5bf40077b2bac53399676d33d564ef58-Paper-Conference.pdf

artificial intelligence, data mining, machine learning, (18 more...)

Country:

Asia > Afghanistan > Parwan Province > Charikar (0.04)
North America > United States > New York > New York County > New York City (0.04)

Industry: Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Security & Privacy (0.93)
Information Technology > Data Science > Data Mining (0.68)
Information Technology > Communications (0.68)
Information Technology > Artificial Intelligence > Machine Learning (0.46)

Pogăcean, Paul-Andrei, Avram, Sanda-Maria

Language Detection by Means of the Minkowski Norm: Identification Through Character Bigrams and Frequency Analysis

arXiv.org Artificial IntelligenceJul-24-2025

The debate surrounding language identification has gained renewed attention in recent years, especially with the rapid evolution of AI-powered language models. However, the non-AI-based approaches to language identification have been overshadowed. This research explores a mathematical implementation of an algorithm for language determinism by leveraging monograms and bigrams frequency rankings derived from established linguistic research. The datasets used comprise texts varying in length, historical period, and genre, including short stories, fairy tales, and poems. Despite these variations, the method achieves over 80\% accuracy on texts shorter than 150 characters and reaches 100\% accuracy for longer texts. These results demonstrate that classical frequency-based approaches remain effective and scalable alternatives to AI-driven models for language detection.

artificial intelligence, machine learning, natural language, (17 more...)

2507.16284

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.47)

Jonáš, Martin, Kučera, Antonín, Kůr, Vojtěch, Mačák, Jan

Steady-State Strategy Synthesis for Swarms of Autonomous Agents

arXiv.org Artificial IntelligenceMay-21-2025

Steady-state synthesis aims to construct a policy for a given MDP $D$ such that the long-run average frequencies of visits to the vertices of $D$ satisfy given numerical constraints. This problem is solvable in polynomial time, and memoryless policies are sufficient for approximating an arbitrary frequency vector achievable by a general (infinite-memory) policy. We study the steady-state synthesis problem for multiagent systems, where multiple autonomous agents jointly strive to achieve a suitable frequency vector. We show that the problem for multiple agents is computationally hard (PSPACE or NP hard, depending on the variant), and memoryless strategy profiles are insufficient for approximating achievable frequency vectors. Furthermore, we prove that even evaluating the frequency vector achieved by a given memoryless profile is computationally hard. This reveals a severe barrier to constructing an efficient synthesis algorithm, even for memoryless profiles. Nevertheless, we design an efficient and scalable synthesis algorithm for a subclass of full memoryless profiles, and we evaluate this algorithm on a large class of randomly generated instances. The experimental results demonstrate a significant improvement against a naive algorithm based on strategy sharing.

agent, artificial intelligence, vertex, (16 more...)

2505.12406

Country: Europe (0.28)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Wu, Jingfeng, Zhu, Wennan, Kairouz, Peter, Braverman, Vladimir

Private Federated Frequency Estimation: Adapting to the Hardness of the Instance

arXiv.org Artificial IntelligenceDec-2-2023

In federated frequency estimation (FFE), multiple clients work together to estimate the frequencies of their collective data by communicating with a server that respects the privacy constraints of Secure Summation (SecSum), a cryptographic multi-party computation protocol that ensures that the server can only access the sum of client-held vectors. For single-round FFE, it is known that count sketching is nearly information-theoretically optimal for achieving the fundamental accuracy-communication trade-offs [Chen et al., 2022]. However, we show that under the more practical multi-round FEE setting, simple adaptations of count sketching are strictly sub-optimal, and we propose a novel hybrid sketching algorithm that is provably more accurate. We also address the following fundamental question: how should a practitioner set the sketch size in a way that adapts to the hardness of the underlying problem? We propose a two-phase approach that allows for the use of a smaller sketch size for simpler problems (e.g., near-sparse or light-tailed distributions). We conclude our work by showing how differential privacy can be added to our algorithm and verifying its superior performance through extensive experiments conducted on large-scale datasets.

countsketch, frequency vector, vector, (16 more...)

2306.09396

Country:

Asia > Afghanistan > Parwan Province > Charikar (0.04)
North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (0.64)

Industry: Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Security & Privacy (0.93)
Information Technology > Data Science > Data Mining (0.68)
Information Technology > Communications (0.68)
Information Technology > Artificial Intelligence > Machine Learning (0.46)

arXiv.org Artificial IntelligenceFeb-9-2021

COLOGNE: Coordinated Local Graph Neighborhood Sampling

Kutzkov, Konstantin

Representation learning for graphs enables the application of standard machine learning algorithms and data analysis tools to graph data. Replacing discrete unordered objects such as graph nodes by real-valued vectors is at the heart of many approaches to learning from graph data. Such vector representations, or embeddings, capture the discrete relationships in the original data by representing nodes as vectors in a high-dimensional space. In most applications graphs model the relationship between real-life objects and often nodes contain valuable meta-information about the original objects. While being a powerful machine learning tool, embeddings are not able to preserve such node attributes. We address this shortcoming and consider the problem of learning discrete node embeddings such that the coordinates of the node vector representations are graph nodes. This opens the door to designing interpretable machine learning algorithms for graphs as all attributes originally present in the nodes are preserved. We present a framework for coordinated local graph neighborhood sampling (COLOGNE) such that each node is represented by a fixed number of graph nodes, together with their attributes. Individual samples are coordinated and they preserve the similarity between node neighborhoods. We consider different notions of similarity for which we design scalable algorithms. We show theoretical results for all proposed algorithms. Experiments on benchmark graphs evaluate the quality of the designed embeddings and demonstrate how the proposed embeddings can be used in training interpretable machine learning algorithms for graph data.

algorithm, neighborhood, node, (15 more...)

2102.0477

Country: Asia > Afghanistan > Parwan Province > Charikar (0.04)

Genre: Research Report (0.64)

Industry:

Leisure & Entertainment (0.46)
Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)