moses


Mixtures of SubExperts for Large Language Continual Learning

Kang, Haeyong

arXiv.org Artificial Intelligence

Adapting Large Language Models (LLMs) to a continuous stream of tasks is a critical yet challenging endeavor. While Parameter-Efficient Fine-Tuning (PEFT) methods have become a standard for this, they face a fundamental dilemma in continual learning. Reusing a single set of PEFT parameters for new tasks often leads to catastrophic forgetting of prior knowledge. Conversely, allocating distinct parameters for each task prevents forgetting but results in a linear growth of the model's size and fails to facilitate knowledge transfer between related tasks. To overcome these limitations, we propose \textit{Mixtures of SubExperts (MoSEs)}, a novel adaptive PEFT framework for continual learning designed for minimal forgetting and efficient scalability. MoSEs integrate a sparse Mixture of SubExperts into the transformer layers, governed by a task-specific routing mechanism. This architecture allows the model to isolate and protect knowledge within dedicated SubExperts, thereby minimizing parameter interference and catastrophic forgetting. Crucially, the router can adaptively select and combine previously learned sparse parameters for new tasks, enabling effective knowledge transfer while ensuring that the model's capacity grows sublinearly. We evaluate MoSEs on the comprehensive TRACE benchmark datasets. Our experiments demonstrate that MoSEs significantly outperform conventional continual learning approaches in both knowledge retention and scalability to new tasks, achieving state-of-the-art performance with substantial memory and computational savings.
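The routing idea in the abstract can be illustrated with a minimal sketch. All names here (`MoSELayer`, the binary routing masks) are hypothetical, not from the paper: each task owns a sparse route over a shared pool of low-rank sub-experts, so parameters for old tasks stay untouched while new tasks may reuse earlier sub-experts.

```python
import numpy as np

rng = np.random.default_rng(0)

class MoSELayer:
    """Hypothetical sketch of a Mixture-of-SubExperts adapter layer.

    Each task owns a sparse binary route over a shared pool of small
    sub-experts; routes for new tasks may reuse sub-experts learned
    earlier, so capacity grows sublinearly with the number of tasks.
    """

    def __init__(self, d_model, n_subexperts, d_expert):
        # Shared pool of low-rank sub-experts (down/up projections).
        self.down = rng.standard_normal((n_subexperts, d_model, d_expert)) * 0.02
        self.up = rng.standard_normal((n_subexperts, d_expert, d_model)) * 0.02
        self.routes = {}  # task_id -> binary mask over sub-experts

    def add_task(self, task_id, mask):
        self.routes[task_id] = np.asarray(mask, dtype=bool)

    def forward(self, x, task_id):
        # Only the sub-experts selected for this task contribute, so
        # parameters routed exclusively to other tasks are never touched.
        out = x.copy()
        for i in np.flatnonzero(self.routes[task_id]):
            out = out + x @ self.down[i] @ self.up[i]
        return out

layer = MoSELayer(d_model=8, n_subexperts=4, d_expert=2)
layer.add_task("task_A", [1, 1, 0, 0])
layer.add_task("task_B", [0, 1, 1, 0])  # reuses sub-expert 1 from task A

x = rng.standard_normal((3, 8))
ya = layer.forward(x, "task_A")
yb = layer.forward(x, "task_B")
print(ya.shape, yb.shape)
```

In the real method the routes are learned per task; the sketch only shows why isolation (disjoint mask entries) prevents interference while overlap enables transfer.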


Multi-Task Vehicle Routing Solver via Mixture of Specialized Experts under State-Decomposable MDP

Pan, Yuxin, Cao, Zhiguang, Gu, Chengyang, Liu, Liu, Zhao, Peilin, Chen, Yize, Lin, Fangzhen

arXiv.org Artificial Intelligence

Existing neural methods for multi-task vehicle routing problems (VRPs) typically learn unified solvers to handle multiple constraints simultaneously. However, they often underutilize the compositional structure of VRP variants, each derivable from a common set of basis VRP variants. This critical oversight causes unified solvers to miss out on the potential benefits of basis solvers, each specialized for a basis VRP variant. To overcome this limitation, we propose a framework that enables unified solvers to perceive the shared-component nature across VRP variants by proactively reusing basis solvers, while mitigating the exponential growth of trained neural solvers. Specifically, we introduce a State-Decomposable MDP (SDMDP) that reformulates VRPs by expressing the state space as the Cartesian product of basis state spaces associated with basis VRP variants. More crucially, this formulation inherently yields the optimal basis policy for each basis VRP variant. Furthermore, a Latent Space-based SDMDP extension is developed by incorporating both the optimal basis policies and a learnable mixture function to enable the policy reuse in the latent space. Under mild assumptions, this extension provably recovers the optimal unified policy of SDMDP through the mixture function that computes the state embedding as a mapping from the basis state embeddings generated by optimal basis policies. For practical implementation, we introduce the Mixture-of-Specialized-Experts Solver (MoSES), which realizes basis policies through specialized Low-Rank Adaptation (LoRA) experts, and implements the mixture function via an adaptive gating mechanism. Extensive experiments conducted across VRP variants showcase the superiority of MoSES over prior methods.
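The "LoRA experts plus adaptive gating" construction can be sketched as follows. This is an illustrative toy, not the paper's implementation; all names (`MoSESMixer`, the gate matrix) are assumptions. A gating network maps the state embedding to softmax weights over specialized LoRA experts, and the weighted low-rank updates are added to a frozen base projection.

```python
import numpy as np

rng = np.random.default_rng(1)

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

class MoSESMixer:
    """Hypothetical sketch of mixing specialized LoRA experts.

    Each basis variant has its own LoRA expert (A_i, B_i); a gate maps
    the state embedding to mixture weights, and the mixed low-rank
    update is added to the frozen base projection W.
    """

    def __init__(self, d, r, n_experts):
        self.W = rng.standard_normal((d, d)) * 0.05       # frozen base weight
        self.A = rng.standard_normal((n_experts, d, r)) * 0.05
        self.B = np.zeros((n_experts, r, d))              # standard LoRA init: B = 0
        self.gate = rng.standard_normal((d, n_experts)) * 0.05

    def forward(self, h):
        w = softmax(h @ self.gate)  # adaptive gate over experts
        delta = sum(w[i] * (h @ self.A[i] @ self.B[i]) for i in range(len(w)))
        return h @ self.W + delta

mixer = MoSESMixer(d=6, r=2, n_experts=3)
h = rng.standard_normal(6)
out = mixer.forward(h)
print(out.shape)
```

With the standard LoRA initialization (B = 0) the mixed update starts at zero, so the layer initially reproduces the frozen base projection exactly; training would then move the experts away from zero.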


Discovery in Egypt offers new evidence for the Bible's story of Moses

Daily Mail - Science & tech



MoSEs: Uncertainty-Aware AI-Generated Text Detection via Mixture of Stylistics Experts with Conditional Thresholds

Wu, Junxi, Wang, Jinpeng, Liu, Zheng, Chen, Bin, Hu, Dongjian, Wu, Hao, Xia, Shu-Tao

arXiv.org Artificial Intelligence

The rapid advancement of large language models has intensified public concerns about the potential misuse. Therefore, it is important to build trustworthy AI-generated text detection systems. Existing methods neglect stylistic modeling and mostly rely on static thresholds, which greatly limits the detection performance. In this paper, we propose the Mixture of Stylistic Experts (MoSEs) framework that enables stylistics-aware uncertainty quantification through conditional threshold estimation. MoSEs contain three core components, namely, the Stylistics Reference Repository (SRR), the Stylistics-Aware Router (SAR), and the Conditional Threshold Estimator (CTE). For input text, the SAR activates the appropriate reference data in the SRR and provides it to the CTE. Subsequently, the CTE jointly models the linguistic statistical properties and semantic features to dynamically determine the optimal threshold. With a discrimination score, MoSEs yield prediction labels with the corresponding confidence level. Our framework achieves an average improvement of 11.34% in detection performance compared to baselines. More inspiringly, MoSEs show a more evident improvement of 39.15% in the low-resource case. Our code is available at https://github.com/creator-xi/MoSEs.
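The conditional-threshold idea can be illustrated with a toy sketch (all data and names here are made up for illustration, not taken from the paper or its code): instead of one global cutoff, the detector retrieves the stylistically nearest reference items and sets a cutoff from the detector scores of human-written references in that neighborhood.

```python
import numpy as np

rng = np.random.default_rng(2)

# Synthetic stand-ins for a stylistics reference repository:
# style embeddings plus detector scores on known human-written text.
ref_style = rng.standard_normal((100, 4))
ref_human_score = rng.normal(0.3, 0.1, 100)

def conditional_threshold(style_emb, k=10, fpr=0.05):
    # Route to the k stylistically nearest reference items, then set the
    # cutoff so that at most `fpr` of those human references are flagged.
    d = np.linalg.norm(ref_style - style_emb, axis=1)
    nearest = np.argsort(d)[:k]
    return np.quantile(ref_human_score[nearest], 1 - fpr)

x_style = rng.standard_normal(4)
score = 0.8  # discrimination score of the input text (illustrative)
thr = conditional_threshold(x_style)
label = "AI-generated" if score > thr else "human"
print(thr, label)
```

The point of the sketch is that the threshold varies with the input's style neighborhood, which is what makes the decision rule "conditional" rather than static.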


Hidden 'fingerprints' found in the Bible after thousands of years rewrite the story of the Ark of the Covenant

Daily Mail - Science & tech

Scientists have uncovered hidden patterns in the Bible that challenge ancient beliefs about its origins. Using artificial intelligence, they discovered 'fingerprints' in text throughout the Old Testament, suggesting multiple people wrote the stories. The traditional Jewish and Christian understanding is that Moses wrote the first five books of the Old Testament, including stories about creation, Noah's flood and the Ark of the Covenant. The new study found three distinct writing styles, each with its own vocabulary, tone and focus areas, suggesting that multiple authors and sources contributed to the books over time. Researchers used AI to analyze 50 chapters across five books, uncovering inconsistencies in language and content, repeated stories, shifts in tone and internal contradictions.


Sparse Regression for Machine Translation

Biçici, Ergun

arXiv.org Artificial Intelligence

We use transductive regression techniques to learn mappings between source and target features of given parallel corpora and use these mappings to generate machine translation outputs. We show the effectiveness of $L_1$ regularized regression (\textit{lasso}) to learn the mappings between sparsely observed feature sets versus $L_2$ regularized regression. Proper selection of training instances plays an important role in learning correct feature mappings within limited computational resources and at expected accuracy levels. We introduce the \textit{dice} instance selection method, which improves the source and target coverage of the training set. We show that $L_1$ regularized regression performs better than $L_2$ regularized regression both in regression measurements and in translation experiments using graph decoding. We present encouraging results when translating from German to English and Spanish to English. We also demonstrate results when the phrase table of a phrase-based decoder is replaced with the mappings we find with the regression model.
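The $L_1$ vs $L_2$ contrast the abstract reports can be reproduced on synthetic data. This is a generic sketch of lasso versus ridge on a sparse mapping problem, not the paper's setup; the data and parameter values are invented.

```python
import numpy as np

rng = np.random.default_rng(3)

# Synthetic sparse mapping: only 3 of 30 source features matter.
n, p = 80, 30
X = rng.standard_normal((n, p))
true_w = np.zeros(p)
true_w[:3] = [2.0, -1.5, 1.0]
y = X @ true_w + 0.1 * rng.standard_normal(n)

def ridge(X, y, lam):
    # L2: closed form w = (X^T X + lam I)^-1 X^T y
    return np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ y)

def lasso_ista(X, y, lam, steps=500):
    # L1: proximal gradient descent (ISTA) with soft-thresholding.
    w = np.zeros(X.shape[1])
    L = np.linalg.norm(X, 2) ** 2  # Lipschitz constant of the gradient
    for _ in range(steps):
        z = w - X.T @ (X @ w - y) / L
        w = np.sign(z) * np.maximum(np.abs(z) - lam / L, 0.0)
    return w

w_l2 = ridge(X, y, lam=1.0)
w_l1 = lasso_ista(X, y, lam=5.0)
n_l2 = int((np.abs(w_l2) > 1e-6).sum())
n_l1 = int((np.abs(w_l1) > 1e-6).sum())
print(n_l2, n_l1)  # ridge keeps all 30 coefficients; lasso keeps far fewer
```

The soft-thresholding step drives irrelevant coefficients to exactly zero, which is why $L_1$ regularization suits sparsely observed feature sets better than $L_2$, whose closed-form solution shrinks but never zeroes coefficients.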


Computer vision-based model for detecting turning lane features on Florida's public roadways

Antwi, Richard Boadu, Takyi, Samuel, Michael, Kimollo, Karaer, Alican, Ozguven, Eren Erman, Moses, Ren, Dulebenets, Maxim A., Sando, Thobias

arXiv.org Artificial Intelligence

Efficient and current roadway geometry data collection is a critical task for transportation agencies to undertake effective road planning, maintenance, design, and rehabilitation efforts. The methods for gathering such data can be broadly classified into two categories: a) land-based methods, which encompass field inventory, mobile mapping, and image logging, and b) aerial-based methods, which involve satellite imagery, drones, and laser scanning. However, employing land-based techniques for extensive highway networks covering thousands of miles proves arduous and costly, and poses safety risks for crew members. Consequently, there exists a pressing need to develop more efficient methodologies for acquiring this data promptly, safely, and economically. Fortunately, with the increasing availability of high-resolution images and recent strides in computer vision and object detection technologies, automated extraction of roadway geometry features has become feasible.


Marginalization Consistent Mixture of Separable Flows for Probabilistic Irregular Time Series Forecasting

Yalavarthi, Vijaya Krishna, Scholz, Randolf, Madhusudhanan, Kiran, Born, Stefan, Schmidt-Thieme, Lars

arXiv.org Artificial Intelligence

Probabilistic forecasting models for joint distributions of targets in irregular time series are a heavily under-researched area in machine learning with, to the best of our knowledge, only three models researched so far: GPR, the Gaussian Process Regression model [16], TACTiS, the Transformer-Attentional Copulas for Time Series [14, 2] and ProFITi [43], a multivariate normalizing flow model based on invertible attention layers. While ProFITi, thanks to using multivariate normalizing flows, is the more expressive model with a better predictive performance, we will show that it suffers from marginalization inconsistency: it does not guarantee that the marginal distributions of a subset of variables in its predictive distributions coincide with the directly predicted distributions of these variables. Also, TACTiS does not provide any guarantees for marginalization consistency. We develop a novel probabilistic irregular time series forecasting model, Marginalization Consistent Mixtures of Separable Flows (moses), that mixes several normalizing flows with (i) Gaussian Processes with full covariance matrix as source distributions and (ii) a separable invertible transformation, aiming to combine the expressivity of normalizing flows with the marginalization consistency of Gaussians. In experiments on four different datasets we show that moses outperforms other state-of-the-art marginalization-consistent models and performs on par with ProFITi but, unlike ProFITi, guarantees marginalization consistency.
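The marginalization-consistency property that moses inherits from its Gaussian source distributions can be checked numerically on a small example: for a mixture of Gaussians, integrating the joint density over one variable gives exactly the mixture of the components' 1-D marginals. The mixture below is invented for illustration.

```python
import numpy as np

def gauss2d(x1, x2, mu, cov):
    # Bivariate Gaussian density evaluated on a grid.
    d = np.stack([x1 - mu[0], x2 - mu[1]], axis=-1)
    inv = np.linalg.inv(cov)
    q = np.einsum("...i,ij,...j->...", d, inv, d)
    return np.exp(-0.5 * q) / (2 * np.pi * np.sqrt(np.linalg.det(cov)))

def gauss1d(x, m, v):
    return np.exp(-0.5 * (x - m) ** 2 / v) / np.sqrt(2 * np.pi * v)

comps = [  # (weight, mean, covariance)
    (0.4, np.array([0.0, 1.0]), np.array([[1.0, 0.6], [0.6, 2.0]])),
    (0.6, np.array([2.0, -1.0]), np.array([[0.5, -0.2], [-0.2, 1.0]])),
]

x1 = np.linspace(-4.0, 6.0, 100)
x2 = np.linspace(-10.0, 10.0, 4001)
X1, X2 = np.meshgrid(x1, x2, indexing="ij")

# Numerically marginalize the joint mixture over x2 (trapezoid rule) ...
joint = sum(w * gauss2d(X1, X2, mu, cov) for w, mu, cov in comps)
dx = x2[1] - x2[0]
marg_numeric = 0.5 * dx * (joint[:, :-1] + joint[:, 1:]).sum(axis=1)

# ... and compare with the mixture of the analytic 1-D marginals,
# which for a Gaussian component is N(mu[0], cov[0, 0]).
marg_analytic = sum(w * gauss1d(x1, mu[0], cov[0, 0]) for w, mu, cov in comps)
err = np.max(np.abs(marg_numeric - marg_analytic))
print(err)
```

Flows such as ProFITi do not have this closed-form marginal structure, which is exactly the gap the abstract describes: the same check would fail for a generic multivariate normalizing flow.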


Preference Optimization for Molecular Language Models

Park, Ryan, Theisen, Ryan, Sahni, Navriti, Patek, Marcel, Cichońska, Anna, Rahman, Rayees

arXiv.org Machine Learning

Molecular language modeling is an effective approach to generating novel chemical structures. However, these models do not \emph{a priori} encode certain preferences a chemist may desire. We investigate the use of fine-tuning using Direct Preference Optimization to better align generated molecules with chemist preferences. Our findings suggest that this approach is simple, efficient, and highly effective.
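For readers unfamiliar with Direct Preference Optimization, its loss is simple to state. The sketch below is a generic scalar version with invented log-probability values, not the paper's code: given log-probabilities of a chosen and a rejected molecule under the policy and a frozen reference model, the loss is the negative log-sigmoid of the scaled preference margin.

```python
import math

def dpo_loss(logp_chosen, logp_rejected, ref_chosen, ref_rejected, beta=0.1):
    # DPO: -log sigmoid(beta * [(logpi - logref)_chosen - (logpi - logref)_rejected])
    margin = beta * ((logp_chosen - ref_chosen) - (logp_rejected - ref_rejected))
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# When the policy already prefers the chosen molecule more strongly than
# the reference does, the margin is positive and the loss drops below log 2.
better = dpo_loss(-10.0, -14.0, -12.0, -13.0)
neutral = dpo_loss(-12.0, -13.0, -12.0, -13.0)  # policy == reference
print(better, neutral)
```

The appeal noted in the abstract follows from this form: the objective needs only paired preference data and a frozen reference model, with no reward model or reinforcement-learning loop.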


The First Parallel Corpora for Kurdish Sign Language

Kamal, Zina, Hassani, Hossein

arXiv.org Artificial Intelligence

Kurdish Sign Language (KuSL) is the natural language of the Kurdish Deaf people. We work on automatic translation between spoken Kurdish and KuSL. Sign languages evolve rapidly and follow grammatical rules that differ from spoken languages. Consequently, those differences should be considered during any translation. We propose an avatar-based automatic translation of Kurdish texts in the Sorani (Central Kurdish) dialect into Kurdish Sign Language. We developed the first parallel corpora for that pair, which we use to train a Statistical Machine Translation (SMT) engine. We tested the output's understandability and evaluated it using the Bilingual Evaluation Understudy (BLEU). Results showed 53.8% accuracy. Compared to the previous experiments in the field, the result is considerably high. We suspect the reason to be the similarity between the structure of the two pairs. We plan to make the resources publicly available under the CC BY-NC-SA 4.0 license on Kurdish-BLARK (https://kurdishblark.github.io/).