AITopics

2312.0518

Country:

North America > United States > Pennsylvania (0.04)
Asia > Vietnam (0.04)
North America > United States > New York > New York County > New York City (0.04)
(4 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)

arXiv.org Artificial Intelligence, Dec-12-2023

Faster Stochastic Variance Reduction Methods for Compositional MiniMax Optimization

Liu, Jin, Pan, Xiaokang, Duan, Junwen, Li, Hongdong, Li, Youqi, Qu, Zhe

This paper delves into the realm of stochastic optimization for compositional minimax optimization - a pivotal challenge across various machine learning domains, including deep AUC and reinforcement learning policy evaluation. Despite its significance, the problem of compositional minimax optimization is still under-explored. Adding to the complexity, current methods of compositional minimax optimization are plagued by sub-optimal complexities or heavy reliance on sizable batch sizes. To respond to these constraints, this paper introduces a novel method, called Nested STOchastic Recursive Momentum (NSTORM), which can achieve the optimal sample complexity of $O(\kappa^3 /\epsilon^3 )$ to obtain the $\epsilon$-accuracy solution. We also demonstrate that NSTORM can achieve the same sample complexity under the Polyak-Łojasiewicz (PL)-condition - an insightful extension of its capabilities. Yet, NSTORM encounters an issue with its requirement for low learning rates, potentially constraining its real-world applicability in machine learning. To overcome this hurdle, we present ADAptive NSTORM (ADA-NSTORM) with adaptive learning rates. We demonstrate that ADA-NSTORM can achieve the same sample complexity, and the experimental results show that it is more effective. All the proposed complexities indicate that our proposed methods can match the lower bounds for existing minimax optimization, without requiring a large batch size in each iteration. Extensive experiments support the efficiency of our proposed methods.
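For orientation, the recursive momentum idea that NSTORM's name refers to can be sketched with the generic single-level STORM estimator. This is an illustrative assumption, not the paper's nested or adaptive variant; the function name `storm_update` and the parameter `a` are hypothetical.

```python
def storm_update(grad_new, grad_old, d_prev, a):
    """One generic STORM-style recursive momentum step (single level).

    grad_new: stochastic gradient at the current iterate x_t
    grad_old: stochastic gradient at the previous iterate x_{t-1},
              evaluated on the same sample as grad_new
    d_prev:   previous gradient estimate d_{t-1}
    a:        momentum parameter in [0, 1] (hypothetical name)
    """
    # d_t = g(x_t; xi_t) + (1 - a) * (d_{t-1} - g(x_{t-1}; xi_t))
    return grad_new + (1.0 - a) * (d_prev - grad_old)
```

With `a = 1` the estimator reduces to the plain stochastic gradient; smaller values reuse the previous estimate, corrected by the gradient difference on the shared sample.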

faster stochastic variance reduction method, optimization, sample complexity, (9 more...)

2308.09604

Country:

Asia > Middle East > Jordan (0.04)
North America (0.04)
Europe (0.04)
(2 more...)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.45)

arXiv.org Machine Learning, Dec-12-2023

Multi-granularity Causal Structure Learning

Liang, Jiaxuan, Wang, Jun, Yu, Guoxian, Xia, Shuyin, Wang, Guoyin

Data science is moving from the data-centric paradigm forward the science-centric paradigm, and causal revolution is sweeping across various research fields. Causality learning endeavors to unearth causal relationships among variables from observational data and generate causal graph, that is, directed acyclic graph (DAG). Unlike correlation-based study, causality analysis reveals the causal mechanism of data generation. Identifying causality holds paramount significance for stable inference and rational decisions in many applications, such as recommendation systems (Wang et al. 2020), medical diagnostics (Richens, Lee, and Johri 2020), epidemiology (Vandenbroucke, Broadbent, and Pearce 2016) and many others (Von Kügelgen et al. 2022). [...] However, these algorithms simply deem causal relationships stand exclusively at the level of individual variables (micro-variable), ignoring the collective interactions from multiple variables (macro-variable). For instance, the brain can be characterized at a micro granularity of neurons and their synapses, but high-order synergistic subsystems are widespread, which typically sit between canonical functional networks and may serve an integrative role (Varley et al. 2023). Actually, observational data can be regarded as knowledge in the lowest granularity level, while knowledge can be regarded as the abstraction of data at different granularity levels (Wang 2017; Wang et al. 2022). Similar viewpoints appear in the research of complex systems, which suggests that causal relationship is more pronounced

artificial intelligence, machine learning, optimization problem, (18 more...)

2312.05549

Country:

Asia > China > Chongqing Province > Chongqing (0.05)
Asia > China > Shandong Province > Jinan (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine > Diagnostic Medicine (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.83)

Hahm, Jaehoon, Kim, Hayeon, Park, Young June

Improvement in Variational Quantum Algorithms by Measurement Simplification

arXiv.org Artificial Intelligence, Dec-11-2023

After the discovery of Shor's algorithm and Grover's search algorithm, there has been much research on the concept of quantum advantage, which holds that quantum computers will exhibit specific advantages over classical computers. Google named this advantage "Quantum Supremacy"[1] and explains that, for specific problems, quantum computers can surpass classical computers in computation time and required memory capacity. However, complex quantum algorithms such as Shor's algorithm require exponentially more qubits and gate fidelity than are currently available, and therefore investigating executable algorithms that show quantum advantage even with few, noisy qubits has arisen as an important question in the NISQ (Noisy Intermediate-Scale Quantum) era[2]. Among them, VQAs (Variational Quantum Algorithms)[3] have been noted as efficient algorithms that can be executed on NISQ devices with few limitations. A VQA is a hybrid quantum algorithm that uses a classical optimizer and a Variational Quantum Circuit (VQC): it first measures a state's probability after the quantum circuit and passes the result to the classical optimizer.
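The hybrid loop described above can be illustrated with a toy sketch. It assumes a single qubit prepared by an RY(theta) rotation and simulated exactly in plain Python, with gradient descent and the parameter-shift rule standing in for the classical optimizer. None of this comes from the paper; all function names are hypothetical, and the paper's measurement simplification is not shown.

```python
import math

def expectation_z(theta):
    # "Quantum" step (simulated): apply RY(theta) to |0> and measure.
    # <Z> = P(0) - P(1) = cos^2(theta/2) - sin^2(theta/2)
    p0 = math.cos(theta / 2) ** 2
    p1 = math.sin(theta / 2) ** 2
    return p0 - p1

def parameter_shift_gradient(theta):
    # Gradient of <Z> from two shifted circuit evaluations.
    shift = math.pi / 2
    return 0.5 * (expectation_z(theta + shift) - expectation_z(theta - shift))

def run_vqa(theta=0.1, learning_rate=0.4, steps=200):
    # Classical step: plain gradient descent on the measured expectation.
    for _ in range(steps):
        theta -= learning_rate * parameter_shift_gradient(theta)
    return theta, expectation_z(theta)
```

Calling `run_vqa()` drives theta toward pi, where the measured expectation reaches its minimum.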

algorithm, measurement simplification, quantum circuit, (13 more...)

2312.06176

Country: Asia > South Korea > Seoul > Seoul (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.34)

Audemard, Gilles, Lecoutre, Christophe, Lonca, Emmanuel

Proceedings of the 2023 XCSP3 Competition

This short paper gives an overview of the XCSP3 solver implemented in Picat. Picat provides several constraint modules, and the Picat XCSP3 solver uses the sat module. The XCSP3 solver mainly consists of a parser implemented in Picat, which converts constraints from XCSP3 format to Picat. The solver demonstrates the strengths of Picat, a logic-based language, in parsing, modeling, and encoding constraints into SAT. The solver submitted to the 2022 XCSP competition is based on the one that won the 2019 XCSP competition.

competition, constraint, vararray, (16 more...)

2312.05877

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Austria > Vienna (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
(30 more...)

Genre:

Research Report (1.00)
Overview (0.74)

Industry: Leisure & Entertainment > Sports (0.46)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Software Engineering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
(4 more...)

Frank, Natalie S., Niles-Weed, Jonathan

Existence and Minimax Theorems for Adversarial Surrogate Risks in Binary Classification

Adversarial training is one of the most popular methods for training models robust to adversarial attacks; however, it is not well understood from a theoretical perspective. We prove existence, regularity, and minimax theorems for adversarial surrogate risks. Our results explain some empirical observations on adversarial robustness from prior work and suggest new directions in algorithm development. Furthermore, our results extend previously known existence and minimax theorems for the adversarial classification risk to surrogate risks.

dp 1, existence and minimax theorem, minimizer, (12 more...)

2206.09098

Country:

North America > United States > New York > New York County > New York City (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.54)

Industry: Government (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.82)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Audemard, Gilles, Lecoutre, Christophe, Lonca, Emmanuel

Proceedings of the 2022 XCSP3 Competition

competition, constraint, vararray, (16 more...)

2209.00917

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
(28 more...)

Genre: Research Report (1.00)

Industry: Government > Military (0.46)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
(5 more...)

Sheikh, Jannik, Melnik, Andrew, Nandi, Gora Chand, Haschke, Robert

Language-Conditioned Semantic Search-Based Policy for Robotic Manipulation Tasks

Reinforcement Learning and Imitation Learning approaches rely on policy learning strategies that struggle to generalize from just a few examples of a task. In this work, we propose a language-conditioned semantic search-based method to produce an online search-based policy from the available demonstration dataset of state-action trajectories. Here we directly acquire actions from the most similar manipulation trajectories found in the dataset. Our approach surpasses the performance of the baselines on the CALVIN benchmark and exhibits strong zero-shot adaptation capabilities. This holds great potential for expanding the use of our online search-based policy approach to tasks typically addressed by Imitation Learning or Reinforcement Learning-based policies.
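A minimal sketch of the "acquire actions from the most similar trajectories" idea, assuming that instructions and states have already been encoded as fixed-length vectors and that similarity is cosine similarity. The encoder, the similarity measure, and the names `cosine` and `search_policy` are assumptions for illustration, not the authors' implementation.

```python
import math

def cosine(a, b):
    # Cosine similarity between two equal-length vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def search_policy(query_embedding, dataset):
    """Return the action stored with the most similar embedding.

    dataset: list of (embedding, action) pairs taken from demonstrations
             (hypothetical layout).
    """
    _, best_action = max(dataset, key=lambda pair: cosine(query_embedding, pair[0]))
    return best_action
```

For example, with `dataset = [([1.0, 0.0], "push"), ([0.0, 1.0], "lift")]`, the query `[0.9, 0.1]` retrieves the action stored with the first entry.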

instruction, latent representation, trajectory, (12 more...)

2312.05925

Country:

Europe > Germany (0.05)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Europe > Italy (0.04)
Asia > India (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Patania, Alice, Allard, Antoine, Young, Jean-Gabriel

Exact and rapid linear clustering of networks with dynamic programming

arXiv.org Artificial Intelligence, Dec-8-2023

We study the problem of clustering networks whose nodes have imputed or physical positions in a single dimension, for example prestige hierarchies or the similarity dimension of hyperbolic embeddings. Existing algorithms, such as the critical gap method and other greedy strategies, only offer approximate solutions to this problem. Here, we introduce a dynamic programming approach that returns provably optimal solutions in polynomial time -- O(n^2) steps -- for a broad class of clustering objectives. We demonstrate the algorithm through applications to synthetic and empirical networks and show that it outperforms existing heuristics by a significant margin, with a similar execution time.
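The quadratic-time dynamic program can be sketched as follows for items already sorted along the single dimension. The objective here (negative within-cluster squared deviation minus a fixed per-cluster `penalty`) is a stand-in chosen so that each interval can be scored in O(1) from prefix sums. It is an assumption for illustration, not one of the network clustering objectives studied in the paper.

```python
from itertools import accumulate

def optimal_linear_clustering(values, penalty):
    """Best split of an ordered sequence into contiguous clusters."""
    n = len(values)
    # Prefix sums of values and squared values give O(1) interval scores.
    s1 = [0.0] + list(accumulate(values))
    s2 = [0.0] + list(accumulate(v * v for v in values))

    def score(i, j):
        # Score of one cluster holding values[i:j] (hypothetical objective).
        total = s1[j] - s1[i]
        sse = (s2[j] - s2[i]) - total * total / (j - i)
        return -sse - penalty

    best = [0.0] + [float("-inf")] * n   # best[j]: optimum for values[:j]
    cut = [0] * (n + 1)                  # cut[j]: start of the last cluster
    for j in range(1, n + 1):
        for i in range(j):
            candidate = best[i] + score(i, j)
            if candidate > best[j]:
                best[j], cut[j] = candidate, i

    # Walk the stored cut points backwards to recover the clusters.
    clusters, j = [], n
    while j > 0:
        clusters.append((cut[j], j))
        j = cut[j]
    clusters.reverse()
    return best[n], clusters
```

The two nested loops visit every (start, end) pair once, which gives the O(n^2) step count; any other objective that is additive over contiguous clusters could be substituted for `score`.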

algorithm, node, partition, (15 more...)

doi: 10.1098/rspa.2023.0159

2301.10403

Country:

North America > United States > Vermont > Chittenden County > Burlington (0.14)
North America > Canada > Quebec (0.04)
North America > United States > North Carolina (0.04)
(4 more...)

Genre: Research Report (0.82)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)

arXiv.org Machine Learning, Dec-8-2023

Construction of Hierarchical Neural Architecture Search Spaces based on Context-free Grammars

Schrodi, Simon, Stoll, Danny, Ru, Binxin, Sukthanker, Rhea, Brox, Thomas, Hutter, Frank

The discovery of neural architectures from simple building blocks is a long-standing goal of Neural Architecture Search (NAS). Hierarchical search spaces are a promising step towards this goal but lack a unifying search space design framework and typically only search over some limited aspect of architectures. In this work, we introduce a unifying search space design framework based on context-free grammars that can naturally and compactly generate expressive hierarchical search spaces that are 100s of orders of magnitude larger than common spaces from the literature. By enhancing and using their properties, we effectively enable search over the complete architecture and can foster regularity. Further, we propose an efficient hierarchical kernel design for a Bayesian Optimization search strategy to efficiently search over such huge spaces. We demonstrate the versatility of our search space design framework and show that our search strategy can be superior to existing NAS approaches. Code is available at https://github.com/automl/hierarchical_nas_construction.
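To make the grammar idea concrete, here is a toy sketch that samples architecture strings from a small context-free grammar. The grammar, its symbols, and the depth cap are invented for illustration. They are not taken from the paper or from the linked repository.

```python
import random

# Hypothetical toy grammar: nonterminals map to lists of productions.
GRAMMAR = {
    "BLOCK": [
        ["seq(", "BLOCK", ",", "BLOCK", ")"],
        ["residual(", "BLOCK", ")"],
        ["OP"],
    ],
    "OP": [["conv3x3"], ["conv1x1"], ["identity"]],
}

def sample(symbol, rng, depth=0, max_depth=4):
    """Expand `symbol` into a string by random derivation."""
    if symbol not in GRAMMAR:
        return symbol  # terminal token
    productions = GRAMMAR[symbol]
    if depth >= max_depth:
        # Past the depth cap, drop self-recursive productions so the
        # derivation is guaranteed to terminate.
        non_recursive = [p for p in productions if symbol not in p]
        productions = non_recursive or productions
    production = rng.choice(productions)
    return "".join(sample(s, rng, depth + 1, max_depth) for s in production)
```

Calling `sample("BLOCK", random.Random(0))` yields one nested architecture string; raising `max_depth` or adding productions enlarges the space of strings the grammar can generate.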

machine learning, natural language, sequential3, (19 more...)

2211.01842

Country:

North America > Canada > Ontario > Toronto (0.04)
Europe > Germany > Baden-Württemberg > Freiburg (0.04)
Oceania > Australia > New South Wales (0.04)
(4 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)