First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. Mallows models are a classically studied class of distributions over permutations that can be viewed as a sequential model in which items are inserted one by one into a ranking. This paper proposes an interesting hierarchical generalization of Mallows models in which groups of items are sequentially ``merged'' together (as they would be in mergesort). The model can also be viewed as a special case of a recently proposed class of ``riffle independent'' models by Huang/Guestrin, but with a more tractable number of parameters in general and better computational properties. There are several nice contributions in this paper, including a simple and elegant characterization of identifiability of the structure, as well as an interesting structure estimation algorithm based on the inside-outside parsing algorithm for stochastic context free grammars.
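The "items are inserted one by one" view of a Mallows model that the review refers to is the standard repeated-insertion construction, which can be sketched in a few lines. The dispersion parameter `theta` and the exponential insertion weights below are the textbook formulation, not anything specific to the reviewed paper:

```python
import math
import random

def sample_mallows(n, theta, rng=random):
    """Sample a permutation of range(n) from a Mallows model via
    repeated insertion: item i is inserted into one of i+1 slots,
    with slot j weighted exp(-theta * (i - j)), so positions far
    from the end are exponentially penalized. theta = 0 gives a
    uniform random permutation; large theta concentrates on the
    identity ranking."""
    ranking = []
    for i in range(n):
        weights = [math.exp(-theta * (i - j)) for j in range(i + 1)]
        total = sum(weights)
        r = rng.random() * total
        acc = 0.0
        for j, w in enumerate(weights):
            acc += w
            if r <= acc:
                ranking.insert(j, i)
                break
    return ranking
```

The hierarchical model in the paper generalizes this by merging whole groups of items rather than inserting single items, but the single-item sampler above is the base case.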
Uncertainty Estimation using Variance-Gated Distributions
Gillis, H. Martin, Xu, Isaac, Trappenberg, Thomas
Evaluation of per-sample uncertainty quantification from neural networks is essential for decision-making involving high-risk applications. A common approach is to use the predictive distribution from Bayesian or approximation models and decompose the corresponding predictive uncertainty into epistemic (model-related) and aleatoric (data-related) components. However, additive decomposition has recently been questioned. In this work, we propose an intuitive framework for uncertainty estimation and decomposition based on the signal-to-noise ratio of class probability distributions across different model predictions. We introduce a variance-gated measure that scales predictions by a confidence factor derived from ensembles. We use this measure to discuss the existence of a collapse in the diversity of committee machines.
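The abstract does not give the exact formula, but one plausible reading of a "variance-gated" measure built from a signal-to-noise ratio is the following sketch: scale the ensemble-mean class probabilities by a per-class factor mu / (mu + sigma), then renormalize. The gating function and the epsilon are illustrative assumptions, not the authors' definition:

```python
import numpy as np

def variance_gated(probs):
    """probs: (M, K) array of class probabilities from M ensemble members.
    Illustrative gating (an assumption, not the paper's exact measure):
    scale the mean prediction mu by a signal-to-noise-style confidence
    factor mu / (mu + sigma), where sigma is the across-member standard
    deviation, then renormalize to a distribution. Classes the committee
    disagrees on (large sigma) are down-weighted."""
    probs = np.asarray(probs, dtype=float)
    mu = probs.mean(axis=0)
    sigma = probs.std(axis=0)
    gate = mu / (mu + sigma + 1e-12)  # equals 1 when members agree exactly
    gated = mu * gate
    return gated / gated.sum()
```

When all committee members produce identical predictions (a collapse in diversity, as discussed in the abstract), sigma is zero everywhere, the gate is 1, and the measure reduces to the plain ensemble mean.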
Reviews: Sequence Modeling with Unconstrained Generation Order
Updated review: The authors have indicated that they will run additional experiments and make the clarifications I requested, so I will raise my score to 7, in agreement with the other reviews leading to an "accept" consensus. However, I do note that in their rebuttal the authors describe Gu et al., Stern et al., and Welleck et al. as "concurrent work". To be totally clear, all three of those papers were posted to arXiv in early February; the NeurIPS deadline was over 3 months later, and it is now 6 months after the papers appeared online. I would argue that 3 (or 6) months is long enough to provide a more direct comparison, and I would not consider this submission "concurrent work". I don't think this warrants rejecting the paper, but I do want to note that I disagree with the authors here and still believe that a more direct comparison is appropriate.
Reviews: End-to-End Kernel Learning with Supervised Convolutional Kernel Networks
This paper proposes an original idea and theoretically appealing solutions to realize it. Quality: The first part (Sections 1 and 2) is excellent, but the latter part may be a little weak. Section 3 is somewhat dense, with numerous details and heuristics; pseudo-code showing the overall framework would be helpful for readers. Section 4 is a little short to validate the potential effectiveness of the proposed method. A direct comparison between a CKN and a CNN with the same architecture would be more informative for readers.
A self-organizing multiple-view representation of 3D objects
The form in which these models are best stored depends on the kind of information available in the input, and on the trade-off between the amount of memory allocated for the storage and the degree of sophistication required of the recognition process. In computer vision, a distinction can be made between representation schemes that use 3D object-centered coordinate systems and schemes that store viewpoint-specific information such as 2D views of objects.
The Odious Comparisons Of GPU Inference Performance And Value
While AI training dims the lights at hyperscalers and cloud builders and costs billions of dollars a year, in the long run there will be a whole lot more aggregate processing done on AI inference than on AI training. Inference might require 2X to 3X more compute capacity than training soon, and anywhere from 10X to 100X more within a decade. What we all do suspect, however, is that there will be relatively few heavy-duty AI training devices, and platforms that use them, and myriad AI inference devices. And so the relative performance and price/performance of compute engines that run inference are going to be important as they are deployed at scale. Meta Platforms helped invent many of the machine learning techniques and technologies that are being deployed in production these days, and it was no surprise to us that the company had created a unified inference framework, called AITemplate, which it open sourced and described earlier this month in a Meta AI engineering blog post.
Toward A Formalized Approach for Spike Sorting Algorithms and Hardware Evaluation
Zhang, Tim, Lammie, Corey, Azghadi, Mostafa Rahimi, Amirsoleimani, Amirali, Ahmadi, Majid, Genov, Roman
Spike sorting algorithms are used to separate extracellular recordings of neuronal populations into single-unit spike activities. The development of customized hardware implementing spike sorting algorithms is burgeoning. However, there is a lack of a systematic approach and of standardized evaluation criteria to facilitate direct comparison of both software and hardware implementations. In this paper, we formalize a set of standardized criteria and present a publicly available synthetic dataset entitled Synthetic Simulations Of Extracellular Recordings (SSOER), which was constructed by aggregating existing synthetic datasets with varying Signal-To-Noise Ratios (SNRs). Furthermore, we present a benchmark for future comparison, and use our criteria to evaluate a simulated Resistive Random-Access Memory (RRAM) In-Memory Computing (IMC) system using the Discrete Wavelet Transform (DWT) for feature extraction. Our system consumes approximately 10.72 mW per channel and occupies an area of 0.66 mm$^2$ in a 22 nm FDSOI Complementary Metal-Oxide-Semiconductor (CMOS) process.
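The DWT feature-extraction step can be illustrated with a plain-numpy multi-level Haar transform; the choice of the Haar wavelet and of two decomposition levels here is an assumption for illustration, not necessarily what the evaluated system uses:

```python
import numpy as np

def haar_dwt_features(waveform, levels=2):
    """Illustrative multi-level Haar DWT: at each level, split the signal
    into approximation coefficients (scaled pairwise sums) and detail
    coefficients (scaled pairwise differences). The concatenated detail
    coefficients plus the final approximation serve as spike features.
    The 1/sqrt(2) scaling makes the transform orthonormal, so signal
    energy is preserved."""
    a = np.asarray(waveform, dtype=float)
    feats = []
    s = 1.0 / np.sqrt(2.0)
    for _ in range(levels):
        if len(a) % 2:          # drop a trailing sample if the length is odd
            a = a[:-1]
        approx = (a[0::2] + a[1::2]) * s
        detail = (a[0::2] - a[1::2]) * s
        feats.append(detail)
        a = approx
    feats.append(a)             # final low-frequency approximation
    return np.concatenate(feats)
```

In a spike-sorting pipeline, features like these would be computed per detected spike waveform and then clustered into single units.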
An AGM Approach to Revising Preferences
Haret, Adrian, Wallner, Johannes P.
We look at preference change arising out of an interaction between two elements: the first is an initial preference ranking encoding a pre-existing attitude; the second element is new preference information signaling input from an authoritative source, which may come into conflict with the initial preference. The aim is to adjust the initial preference and bring it in line with the new preference, without having to give up more information than necessary. We model this process using the formal machinery of belief change, along the lines of the well-known AGM approach. We propose a set of fundamental rationality postulates, and derive the main results of the paper: a set of representation theorems showing that preference change according to these postulates can be rationalized as a choice function guided by a ranking on the comparisons in the initial preference order. We conclude by presenting operators satisfying our proposed postulates. Our approach thus allows us to situate preference revision within the larger family of belief change operators.
MaxDropout: Deep Neural Network Regularization Based on Maximum Output Values
Santos, Claudio Filipi Goncalves do, Colombo, Danilo, Roder, Mateus, Papa, João Paulo
Different techniques have emerged in the deep learning scenario, such as Convolutional Neural Networks, Deep Belief Networks, and Long Short-Term Memory Networks, to cite a few. In lockstep, regularization methods, which aim to prevent overfitting by penalizing the weight connections or turning off some units, have also been widely studied. In this paper, we present a novel approach called MaxDropout, a regularizer for deep neural network models that works in a supervised fashion by removing (shutting off) the most prominent (i.e., most active) neurons in each hidden layer. By forcing fewer activated units to learn more representative information, the model provides sparsity. In our experiments, we show that it is possible to improve existing neural networks and obtain better results when Dropout is replaced by MaxDropout. The proposed method was evaluated on image classification, achieving results comparable to existing regularizers such as Cutout and RandomErasing, and also improving the accuracy of neural networks that use Dropout, by replacing the existing layer with MaxDropout.
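The core idea — shut off the most active units rather than random ones — can be sketched as a framework-agnostic function. Details such as the min-max normalization and the absence of rescaling are illustrative choices and may differ from the authors' implementation:

```python
import numpy as np

def max_dropout(activations, drop_rate, training=True):
    """Illustrative MaxDropout sketch: normalize the layer's activations
    to [0, 1] and zero those above (1 - drop_rate), i.e. the most active
    units. Unlike standard Dropout, the dropped units are chosen by
    magnitude, not at random. At inference time the layer is a no-op."""
    a = np.asarray(activations, dtype=float)
    if not training:
        return a
    lo, hi = a.min(), a.max()
    if hi == lo:                      # constant layer: nothing to rank
        return a
    norm = (a - lo) / (hi - lo)       # min-max normalize to [0, 1]
    mask = norm <= (1.0 - drop_rate)  # keep only the less-active units
    return a * mask
```

Note that `drop_rate` here thresholds the normalized activation *range*, so the number of units dropped depends on how the activations are distributed, not on a fixed fraction of units.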
Simple coarse graining and sampling strategies for image recognition
A conceptually simple way to recognize images is to directly compare test-set data and training-set data. The accuracy of this approach is limited by the method of comparison used, and by the extent to which the training-set data covers the required configuration space. Here we show that this coverage can be substantially increased using simple strategies of coarse graining (replacing groups of images by their centroids) and sampling (using distinct sets of centroids in combination). We use the MNIST data set to show that coarse graining can be used to convert a subset of training images into about an order of magnitude fewer image centroids, with no loss of accuracy of classification of test-set images by direct (nearest-neighbor) classification. Distinct batches of centroids can be used in combination as a means of sampling configuration space, and can classify test-set data more accurately than can the unaltered training set. The approach works most naturally with multiple processors in parallel.
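The coarse-graining strategy described above — replace groups of training images with their centroids, then classify test points by nearest centroid — can be sketched with a small per-class k-means. Function names and the initialization scheme are illustrative:

```python
import numpy as np

def class_centroids(X, y, per_class, n_iter=20, seed=0):
    """Coarse-grain a training set: run a small k-means within each class
    and return the centroid vectors with their class labels. A minimal
    sketch of the strategy described above (random-subset initialization
    is an illustrative choice)."""
    rng = np.random.default_rng(seed)
    cents, labels = [], []
    for c in np.unique(y):
        Xc = X[y == c]
        k = min(per_class, len(Xc))
        C = Xc[rng.choice(len(Xc), size=k, replace=False)].astype(float)
        for _ in range(n_iter):
            d = ((Xc[:, None, :] - C[None, :, :]) ** 2).sum(-1)
            assign = d.argmin(1)
            for j in range(k):
                pts = Xc[assign == j]
                if len(pts):
                    C[j] = pts.mean(0)
        cents.append(C)
        labels.extend([c] * k)
    return np.vstack(cents), np.array(labels)

def nearest_centroid_predict(cents, labels, X):
    """Direct (nearest-neighbor) classification against the centroids."""
    d = ((X[:, None, :] - cents[None, :, :]) ** 2).sum(-1)
    return labels[d.argmin(1)]
```

The sampling strategy in the abstract then amounts to building several such centroid batches with different seeds and combining their predictions.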