AITopics | prox

prox

6a8018b3a00b69c008601b8becae392b-AuthorFeedback.pdf

Neural Information Processing Systems, Feb-12-2026, 11:47:45 GMT

algorithm, spla, splitting, (16 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.30)

d630537fc4402cfa3ebbc7450a0cac91-Paper-Conference.pdf

Neural Information Processing Systems, Feb-12-2026, 04:11:11 GMT

assumption, convergence, convergence result, (15 more...)

Country:

Asia > China > Guangdong Province > Shenzhen (0.05)
Asia > China > Hong Kong (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.32)

6ceb6c2150bbf46fd75528a6cd6be793-Paper-Conference.pdf

Neural Information Processing Systems, Feb-9-2026, 15:54:52 GMT

artificial intelligence, arxiv preprint arxiv, machine learning, (15 more...)

Country: Asia > Middle East > Jordan (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.51)

Stochastic Optimization with Laggard Data Pipelines

Neural Information Processing Systems, Oct-3-2025, 06:22:26 GMT

State-of-the-art optimization is steadily shifting towards massively parallel pipelines with extremely large batch sizes.

algorithm, arxiv preprint arxiv, gradient descent, (13 more...)

Country:

Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
(4 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

We thank Reviewers 1, 2 and 3 (who gave us marks 7, 8 and 6, respectively) for their pertinent remarks

Neural Information Processing Systems, Oct-2-2025, 22:31:19 GMT

We agree that we could improve the experimental section by using a ground truth.

algorithm, artificial intelligence, machine learning, (18 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.30)

We restricted our attention to the 1D case for two reasons: 1) It is not trivial to

Neural Information Processing Systems, Aug-22-2025, 00:22:10 GMT

Figure 4 depicts the value of the proximal loss, F(z), in Eq. (3).

algorithm, contribution, formulation, (13 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.50)

RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs

Bi, Baolong, Liu, Shenghua, Ren, Xingzhang, Liu, Dayiheng, Lin, Junyang, Wang, Yiwei, Mei, Lingrui, Fang, Junfeng, Guo, Jiafeng, Cheng, Xueqi

arXiv.org Artificial Intelligence, Jul-10-2025

The foundational capabilities of large language models (LLMs) are deeply influenced by the quality of their pre-training corpora. However, enhancing data quality at scale remains a significant challenge, primarily due to the trade-off between refinement effectiveness and processing efficiency. While rule-based filtering remains the dominant paradigm, it typically operates at the document level and lacks the granularity needed to refine specific content within documents. Inspired by emerging work such as ProX, we propose RefineX, a novel framework for large-scale, surgical refinement of pre-training data through programmatic editing tasks. RefineX enables efficient and fine-grained data refinement while reliably preserving the diversity and naturalness of raw text. The core strength of RefineX lies in distilling high-quality, expert-guided end-to-end refinement results into minimal edit-based deletion programs. This high-precision distillation pipeline is used to train an efficient and reliable refine model that can systematically improve every instance in the corpus at scale. We evaluate RefineX across from-scratch pre-training at multiple model scales and find that it consistently outperforms models trained on raw, filtered, or alternatively refined data across diverse downstream tasks. On the 750M model, RefineX yields 2.6%-7.2% average gains on lighteval tasks, and achieves comparable performance using significantly fewer training tokens. Further analysis shows that RefineX reliably enhances text quality with both high efficiency and precision, outperforming prior approaches such as end-to-end generation and Prox-C. These results position RefineX as a scalable, effective, and reliable solution for optimizing pre-training data in modern LLM pipelines.

large language model, machine learning, natural language, (19 more...)

2507.03253

Country:

Asia (1.00)
Europe (0.67)
North America > United States > Florida (0.46)
(2 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Education (1.00)
Leisure & Entertainment (0.67)
Energy > Renewable > Solar (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Formal Models of Active Learning from Contrastive Examples

Mansouri, Farnam, Simon, Hans U., Singla, Adish, Chen, Yuxin, Zilles, Sandra

arXiv.org Artificial Intelligence, Jun-23-2025

Machine learning can greatly benefit from providing learning algorithms with pairs of contrastive training examples -- typically pairs of instances that differ only slightly, yet have different class labels. Intuitively, the difference in the instances helps explain the difference in the class labels. This paper proposes a theoretical framework in which the effect of various types of contrastive examples on active learners is studied formally. The focus is on the sample complexity of learning concept classes and how it is influenced by the choice of contrastive examples. We illustrate our results with geometric concept classes and classes of Boolean functions. Interestingly, we reveal a connection between learning from contrastive examples and the classical model of self-directed learning.

artificial intelligence, inductive learning, machine learning, (17 more...)

2506.15893

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.86)

Non-Euclidean High-Order Smooth Convex Optimization

Contreras, Juan Pablo, Guzmán, Cristóbal, Martínez-Rubio, David

arXiv.org Machine Learning, Nov-13-2024

We develop algorithms for the optimization of convex objectives that have Hölder continuous $q$-th derivatives with respect to a $p$-norm by using a $q$-th order oracle, for $p, q \geq 1$. We can also optimize other structured functions. We do this by developing a non-Euclidean inexact accelerated proximal point method that makes use of an inexact uniformly convex regularizer. We also provide nearly matching lower bounds for any deterministic algorithm that interacts with the function via a local oracle.

algorithm, def, oracle, (14 more...)

2411.08987

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
South America > Chile (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(3 more...)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.94)

ADMM for Structured Fractional Minimization

Yuan, Ganzhao

arXiv.org Artificial Intelligence, Nov-11-2024

We consider a class of structured fractional minimization problems, where the numerator includes a differentiable function, a simple nonconvex nonsmooth function, a concave nonsmooth function, and a convex nonsmooth function composed with a linear operator, while the denominator is a continuous function that is either weakly convex or has a weakly convex square root. These problems are widespread and span numerous essential applications in machine learning and data science. Existing methods are mainly based on subgradient methods and smoothing proximal gradient methods, which may suffer from slow convergence and numerical stability issues. In this paper, we introduce FADMM, the first Alternating Direction Method of Multipliers tailored for this class of problems. FADMM decouples the original problem into linearized proximal subproblems, featuring two variants: one using Dinkelbach's parametric method (FADMM-D) and the other using the quadratic transform method (FADMM-Q). By introducing a novel Lyapunov function, we establish that FADMM converges to $\epsilon$-approximate critical points of the problem within an oracle complexity of $\mathcal{O}(1/\epsilon^{3})$. Our experiments on synthetic and real-world data for sparse Fisher discriminant analysis, robust Sharpe ratio minimization, and robust sparse recovery demonstrate the effectiveness of our approach.

Keywords: Fractional Minimization, Nonconvex Optimization, Proximal Linearized ADMM, Nonsmooth Optimization, Convergence Analysis

artificial intelligence, machine learning, optimization problem, (15 more...)

2411.07496

Country:

Asia > Middle East > Israel (0.04)
Asia > China (0.04)

Genre: Research Report (0.49)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
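
The abstract above mentions Dinkelbach's parametric method for fractional minimization. As a rough illustration of that classical idea only (this is not FADMM, and not the paper's code), the sketch below minimizes a ratio f(x)/g(x) with g > 0 by repeatedly solving the parametric subproblem min f(x) - lam * g(x) and updating lam to the current ratio. The toy functions `f`, `g`, the interval [0, 3], and the helper names `dinkelbach` and `argmin_sub` are all assumptions chosen so that the subproblem has a closed-form solution.

```python
def dinkelbach(f, g, argmin_sub, x0, tol=1e-12, max_iter=100):
    """Minimize f(x) / g(x), assuming g(x) > 0 on the feasible set.

    argmin_sub(lam) must return a minimizer of f(x) - lam * g(x)
    over the feasible set (hypothetical helper supplied by the caller).
    """
    x = x0
    lam = f(x) / g(x)                 # current value of the ratio
    for _ in range(max_iter):
        x = argmin_sub(lam)           # solve the parametric subproblem
        new_lam = f(x) / g(x)         # update the ratio estimate
        converged = abs(new_lam - lam) <= tol
        lam = new_lam
        if converged:
            break
    return x, lam


# Toy problem (an assumption for illustration): minimize (x^2 + 1) / (x + 2) on [0, 3].
def f(x):
    return x * x + 1.0


def g(x):
    return x + 2.0


def argmin_sub(lam):
    # x^2 + 1 - lam * (x + 2) is minimized at x = lam / 2; clip to [0, 3].
    return min(max(lam / 2.0, 0.0), 3.0)


x_star, lam_star = dinkelbach(f, g, argmin_sub, x0=0.0)
print(x_star, lam_star)
```

For this toy problem the stationarity condition x^2 + 4x - 1 = 0 gives the minimizer x = sqrt(5) - 2, which the iteration should approach within a few steps.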
