AITopics | piecewise linear

Collaborating Authors

piecewise linear

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

OnEmbeddingsforNumericalFeatures inTabularDeepLearning

Neural Information Processing SystemsFeb-11-2026, 01:31:00 GMT

Unlike traditional models, e.g., MLP,these architectures mapscalar valuesofnumerical features tohigh-dimensional embeddings before mixing them inthemain backbone.

artificial intelligence, machine learning, numerical feature, (19 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

c4c28b367e14df88993ad475dedf6b77-Paper.pdf

Neural Information Processing SystemsFeb-10-2026, 05:44:02 GMT

algorithm, artificial intelligence, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

Add feedback

c4c28b367e14df88993ad475dedf6b77-Paper.pdf

Neural Information Processing SystemsAug-16-2025, 07:48:15 GMT

algorithm, ova, spwlin, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

Add feedback

Injective Sliced-Wasserstein embedding for weighted sets and point clouds

Amir, Tal, Dym, Nadav

arXiv.org Artificial IntelligenceMay-26-2024

We present the $\textit{Sliced Wasserstein Embedding}$ $\unicode{x2014}$ a novel method to embed multisets and distributions over $\mathbb{R}^d$ into Euclidean space. Our embedding is injective and approximately preserves the Sliced Wasserstein distance. Moreover, when restricted to multisets, it is bi-Lipschitz. We also prove that it is $\textit{impossible}$ to embed distributions over $\mathbb{R}^d$ into a Euclidean space in a bi-Lipschitz manner, even under the assumption that their support is bounded and finite. We demonstrate empirically that our embedding offers practical advantage in learning tasks over existing methods for handling multisets.

architecture, multiset, wasserstein distance, (16 more...)

arXiv.org Artificial Intelligence

2405.16519

Country:

Asia > Middle East > Israel > Haifa District > Haifa (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Genre: Research Report > Promising Solution (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

Characterization of the Distortion-Perception Tradeoff for Finite Channels with Arbitrary Metrics

Freirich, Dror, Weinberger, Nir, Meir, Ron

arXiv.org Machine LearningFeb-3-2024

Whenever inspected by humans, reconstructed signals should not be distinguished from real ones. Typically, such a high perceptual quality comes at the price of high reconstruction error, and vice versa. We study this distortion-perception (DP) tradeoff over finite-alphabet channels, for the Wasserstein-$1$ distance induced by a general metric as the perception index, and an arbitrary distortion matrix. Under this setting, we show that computing the DP function and the optimal reconstructions is equivalent to solving a set of linear programming problems. We provide a structural characterization of the DP tradeoff, where the DP function is piecewise linear in the perception index. We further derive a closed-form expression for the case of binary sources.

constraint, dp function, tradeoff, (17 more...)

arXiv.org Machine Learning

2402.02265

Country:

Europe > Switzerland > Zürich > Zürich (0.04)
Asia > Middle East > Israel (0.04)
North America > United States > Massachusetts > Middlesex County > Belmont (0.04)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.49)

Add feedback

On Embeddings for Numerical Features in Tabular Deep Learning

Gorishniy, Yury, Rubachev, Ivan, Babenko, Artem

arXiv.org Artificial IntelligenceOct-26-2023

Recently, Transformer-like deep architectures have shown strong performance on tabular data problems. Unlike traditional models, e.g., MLP, these architectures map scalar values of numerical features to high-dimensional embeddings before mixing them in the main backbone. In this work, we argue that embeddings for numerical features are an underexplored degree of freedom in tabular DL, which allows constructing more powerful DL models and competing with GBDT on some traditionally GBDT-friendly benchmarks. We start by describing two conceptually different approaches to building embedding modules: the first one is based on a piecewise linear encoding of scalar values, and the second one utilizes periodic activations. Then, we empirically demonstrate that these two approaches can lead to significant performance boosts compared to the embeddings based on conventional blocks such as linear layers and ReLU activations. Importantly, we also show that embedding numerical features is beneficial for many backbones, not only for Transformers. Specifically, after proper embeddings, simple MLP-like models can perform on par with the attention-based architectures. Overall, we highlight embeddings for numerical features as an important design aspect with good potential for further improvements in tabular DL.

dataset, hyperparameter, numerical feature, (17 more...)

arXiv.org Artificial Intelligence

2203.05556

Country: North America > United States > California (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Optimal Scoring Rule Design under Partial Knowledge

Chen, Yiling, Yu, Fang-Yi

arXiv.org Artificial IntelligenceOct-26-2023

This paper studies the design of optimal proper scoring rules when the principal has partial knowledge of an agent's signal distribution. Recent work characterizes the proper scoring rules that maximize the increase of an agent's payoff when the agent chooses to access a costly signal to refine a posterior belief from her prior prediction, under the assumption that the agent's signal distribution is fully known to the principal. In our setting, the principal only knows about a set of distributions where the agent's signal distribution belongs. We formulate the scoring rule design problem as a max-min optimization that maximizes the worst-case increase in payoff across the set of distributions. We propose an efficient algorithm to compute an optimal scoring rule when the set of distributions is finite, and devise a fully polynomial-time approximation scheme that accommodates various infinite sets of distributions. We further remark that widely used scoring rules, such as the quadratic and log rules, as well as previously identified optimal scoring rules under full knowledge, can be far from optimal in our partial knowledge settings.

convex function, information gain, information structure, (16 more...)

arXiv.org Artificial Intelligence

2107.0742

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Africa > South Sudan > Equatoria > Central Equatoria > Juba (0.04)

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.93)

Add feedback

Solution Path Algorithm for Twin Multi-class Support Vector Machine

Chen, Liuyuan, Zhou, Kanglei, Jing, Junchang, Fan, Haiju, Li, Juntao

arXiv.org Machine LearningMay-30-2020

The twin support vector machine and its extensions have made great achievements in dealing with binary classification problems, however, which is faced with some difficulties such as model selection and solving multi-classification problems quickly. This paper is devoted to the fast regularization parameter tuning algorithm for the twin multi-class support vector machine. A new sample dataset division method is adopted and the Lagrangian multipliers are proved to be piecewise linear with respect to the regularization parameters by combining the linear equations and block matrix theory. Eight kinds of events are defined to seek for the starting event and then the solution path algorithm is designed, which greatly reduces the computational cost. In addition, only few points are combined to complete the initialization and Lagrangian multipliers are proved to be 1 as the regularization parameter tends to infinity. Simulation results based on UCI datasets show that the proposed method can achieve good classification performance with reducing the computational cost of grid search method from exponential level to the constant level.

artificial intelligence, machine learning, support vector machine, (16 more...)

arXiv.org Machine Learning

2006.00276

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
South America > Uruguay > Maldonado > Maldonado (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
Asia > China > Henan Province (0.04)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)

Add feedback

Challenge of the week: Piecewise linear clustering versus SVM

@machinelearnbotMay-20-2016, 06:00:24 GMT

In this challenge, we ask you to invent a new technique for clustering, based on separating hyperplanes. SVM (support vector machines) add many fictitious (dummy) variables and a non-linear mapping (to increase dimensionality and find hyperplanes on transformed variables), thus providing nearly or exact class separation (the purpose of clustering!) when traditional linear clustering fails.

artificial intelligence, machine learning, piecewise linear, (2 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.89)

Add feedback

Filters

Collaborating Authors

piecewise linear

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

OnEmbeddingsforNumericalFeatures inTabularDeepLearning

c4c28b367e14df88993ad475dedf6b77-Paper.pdf

9e9f0ffc3d836836ca96cbf8fe14b105-Paper-Conference.pdf

c4c28b367e14df88993ad475dedf6b77-Paper.pdf

Injective Sliced-Wasserstein embedding for weighted sets and point clouds

Characterization of the Distortion-Perception Tradeoff for Finite Channels with Arbitrary Metrics

On Embeddings for Numerical Features in Tabular Deep Learning

Optimal Scoring Rule Design under Partial Knowledge

Solution Path Algorithm for Twin Multi-class Support Vector Machine

Challenge of the week: Piecewise linear clustering versus SVM