AITopics | mote

Collaborating Authors

mote

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

MOTE-NAS: Multi-Objective Training-based Estimate for Efficient Neural Architecture Search

Neural Information Processing SystemsMar-22-2026, 04:39:56 GMT

Neural Architecture Search (NAS) methods seek effective optimization toward performance metrics regarding model accuracy and generalization while facing challenges regarding search costs and GPU resources. Recent Neural Tangent Kernel (NTK) NAS methods achieve remarkable search efficiency based on a training-free model estimate; however, they overlook the non-convex nature of the DNNs in the search process. In this paper, we develop Multi-Objective Training-based Estimate (MOTE) for efficient NAS, retaining search effectiveness and achieving the new state-of-the-art in the accuracy and cost trade-off. To improve NTK and inspired by the Training Speed Estimation (TSE) method, MOTE is designed to model the actual performance of DNNs from macro to micro perspective by draw loss landscape and convergence speed simultaneously. Using two reduction strategies, the MOTE is generated based on a reduced architecture and a reduced dataset. Inspired by evolutionary search, our iterative ranking-based, coarse-to-fine architecture search is highly effective. Experiments on NASBench-201 show MOTE-NAS achieves 94.32% accuracy on CIFAR-10, 72.81% on CIFAR-100, and 46.38% on ImageNet-16-120, outperforming NTK-based NAS approaches. An evaluation-free (EF) version of MOTE-NAS delivers high efficiency in only 5 minutes, delivering a model more accurate than KNAS.

artificial intelligence, machine learning, proceedings, (5 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.79)
Information Technology > Artificial Intelligence > Cognitive Science (0.62)

Add feedback

MOTE-NAS: Multi-Objective Training-based Estimate for Efficient Neural Architecture Search Y u-Ming Zhang 1 Jun-Wei Hsieh

Neural Information Processing SystemsFeb-17-2026, 16:32:48 GMT

architecture, artificial intelligence, machine learning, (18 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Add feedback

MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge Transfer

Neural Information Processing SystemsFeb-15-2026, 11:29:24 GMT

Code is available at https://github.com/ZMHH-H/MoTE .

category, knowledge management, machine learning, (15 more...)

Neural Information Processing Systems

Country: Asia > China > Shanghai > Shanghai (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment (0.67)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Add feedback

MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge Transfer

Neural Information Processing SystemsDec-26-2025, 05:46:32 GMT

Transferring visual-language knowledge from large-scale foundation models for video recognition has proved to be effective. To bridge the domain gap, additional parametric modules are added to capture the temporal information. However, zero-shot generalization diminishes with the increase in the number of specialized parameters, making existing works a trade-off between zero-shot and close-set performance. In this paper, we present MoTE, a novel framework that enables generalization and specialization to be balanced in one unified model. Our approach tunes a mixture of temporal experts to learn multiple task views with various degrees of data fitting. To maximally preserve the knowledge of each expert, we propose Weight Merging Regularization, which regularizes the merging process of experts in weight space.

knowledge management, large language model, reconciling generalization, (9 more...)

Neural Information Processing Systems

Technology:

Information Technology > Visual Languages (0.66)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.54)
Information Technology > Knowledge Management > Knowledge Engineering (0.43)

Add feedback

This brain implant is smaller than a grain of rice

Popular ScienceNov-7-2025, 17:56:14 GMT

The wireless neural transmitter safely delivers brain signals like a microchip. Breakthroughs, discoveries, and DIY tips sent every weekday. Today's neural implants are smaller than ever, but often remain cumbersome and prone to complications . According to researchers at Cornell University, a new iteration detailed this week in the journal may offer a novel path forward for brain implants. Small enough to fit on a grain of rice, the microscale optoelectronic tetherless electrode (or MOTE) is vastly smaller than similar implants and its design could be adapted to work in other delicate areas of the body.

andrew paul, brain implant, implant, (15 more...)

Popular Science

Genre: Research Report > New Finding (0.36)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Health Care Technology (1.00)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

Concentration and excess risk bounds for imbalanced classification with synthetic oversampling

Ahmad, Touqeer, Kalan, Mohammadreza M., Portier, François, Stupfler, Gilles

arXiv.org Machine LearningOct-24-2025

Synthetic oversampling of minority examples using SMOTE and its variants is a leading strategy for addressing imbalanced classification problems. Despite the success of this approach in practice, its theoretical foundations remain underexplored. We develop a theoretical framework to analyze the behavior of SMOTE and related methods when classifiers are trained on synthetic data. We first derive a uniform concentration bound on the discrepancy between the empirical risk over synthetic minority samples and the population risk on the true minority distribution. We then provide a nonparametric excess risk guarantee for kernel-based classifiers trained using such synthetic data. These results lead to practical guidelines for better parameter tuning of both SMOTE and the downstream learning algorithm. Numerical experiments are provided to illustrate and support the theoretical findings

artificial intelligence, classifier, machine learning, (19 more...)

arXiv.org Machine Learning

2510.20472

Country:

North America > United States > California (0.05)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Netherlands (0.04)
Europe > France > Brittany > Ille-et-Vilaine > Rennes (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

MOTE-NAS: Multi-Objective Training-based Estimate for Efficient Neural Architecture Search Y u-Ming Zhang 1 Jun-Wei Hsieh

Neural Information Processing SystemsOct-10-2025, 14:13:38 GMT

architecture, architecture search, mote, (16 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Add feedback

641ce6c0e22483f34cd58625fcc7630e-Paper-Conference.pdf

Neural Information Processing SystemsOct-10-2025, 04:36:58 GMT

category, international conference, knowledge, (12 more...)

Neural Information Processing Systems

Country: Asia > China > Shanghai > Shanghai (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment (0.67)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

Add feedback

Beyond instruction-conditioning, MoTE: Mixture of Task Experts for Multi-task Embedding Models

Romero, Miguel, Ding, Shuoyang, Barret, Corey D., Dinu, Georgiana, Karypis, George

arXiv.org Artificial IntelligenceJun-24-2025

Dense embeddings are fundamental to modern machine learning systems, powering Retrieval-Augmented Generation (RAG), information retrieval, and representation learning. While instruction-conditioning has become the dominant approach for embedding specialization, its direct application to low-capacity models imposes fundamental representational constraints that limit the performance gains derived from specialization. In this paper, we analyze these limitations and introduce the Mixture of Task Experts (MoTE) transformer block, which leverages task-specialized parameters trained with Task-Aware Contrastive Learning (\tacl) to enhance the model ability to generate specialized embeddings. Empirical results show that MoTE achieves $64\%$ higher performance gains in retrieval datasets ($+3.27 \rightarrow +5.21$) and $43\%$ higher performance gains across all datasets ($+1.81 \rightarrow +2.60$). Critically, these gains are achieved without altering instructions, training data, inference time, or number of active parameters.

arxiv preprint arxiv, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2506.17781

Country: North America > United States (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.94)

Add feedback

MoTE: Mixture of Task-specific Experts for Pre-Trained ModelBased Class-incremental Learning

Li, Linjie, Wu, Zhenyu, Ji, Yang

arXiv.org Artificial IntelligenceJun-16-2025

Class-incremental learning (CIL) requires deep learning models to continuously acquire new knowledge from streaming data while preserving previously learned information. Recently, CIL based on pre-trained models (PTMs) has achieved remarkable success. However, prompt-based approaches suffer from prompt overwriting, while adapter-based methods face challenges such as dimensional misalignment between tasks. While the idea of expert fusion in Mixture of Experts (MoE) can help address dimensional inconsistency, both expert and routing parameters are prone to being overwritten in dynamic environments, making MoE challenging to apply directly in CIL. To tackle these issues, we propose a mixture of task-specific experts (MoTE) framework that effectively mitigates the miscalibration caused by inconsistent output dimensions across tasks. Inspired by the weighted feature fusion and sparse activation mechanisms in MoE, we introduce task-aware expert filtering and reliable expert joint inference during the inference phase, mimicking the behavior of routing layers without inducing catastrophic forgetting. Extensive experiments demonstrate the superiority of our method without requiring an exemplar set. Furthermore, the number of tasks in MoTE scales linearly with the number of adapters. Building on this, we further explore the trade-off between adapter expansion and model performance and propose the Adapter-Limited MoTE. The code is available at https://github.com/Franklilinjie/MoTE.

adapter, artificial intelligence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2506.11038

Country: Asia (0.28)

Genre: Research Report > New Finding (0.46)

Industry: Education (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback