
Molecular Machine Learning Using Euler Characteristic Transforms

Toscano-Duran, Victor, Rottach, Florian, Rieck, Bastian

arXiv.org Artificial Intelligence

The shape of a molecule determines its physicochemical and biological properties. However, it is often underrepresented in standard molecular representation learning approaches. Here, we propose using the Euler Characteristic Transform (ECT) as a geometrical-topological descriptor. Computed directly on a molecular graph derived from handcrafted atomic features, the ECT enables the extraction of multiscale structural features, offering a novel way to represent and encode molecular shape in the feature space. We assess the predictive performance of this representation across nine benchmark regression datasets, all centered around predicting the inhibition constant $K_i$. In addition, we compare our proposed ECT-based representation against traditional molecular representations and methods, such as molecular fingerprints/descriptors and graph neural networks (GNNs). Our results show that our ECT-based representation achieves competitive performance, ranking among the best-performing methods on several datasets. More importantly, its combination with traditional representations, particularly with the AVALON fingerprint, significantly enhances predictive performance, outperforming other methods on most datasets. These findings highlight the complementary value of multiscale topological information and its potential for being combined with established techniques. Our study suggests that hybrid approaches incorporating explicit shape information can lead to more informative and robust molecular representations, enhancing and opening new avenues in molecular machine learning tasks. To support reproducibility and foster open biomedical research, we provide open access to all experiments and code used in this work.
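The core computation behind the ECT can be illustrated in a few lines. The sketch below (plain NumPy, with a hypothetical toy graph rather than a real molecular graph with atomic features) builds Euler characteristic curves of an embedded graph over several directions and thresholds; it is a minimal illustration of the idea, not the authors' implementation.

```python
import numpy as np

# Toy graph: nodes with 2-D coordinates (standing in for atomic features),
# plus an edge list. A triangle, for illustration only.
coords = np.array([[0.0, 0.0], [1.0, 0.0], [0.5, 1.0]])
edges = [(0, 1), (1, 2), (0, 2)]

def ect(coords, edges, n_dirs=8, n_thresh=16):
    """Euler Characteristic Transform of an embedded graph.

    For each direction v and threshold t, count the vertices and edges
    whose height <x, v> lies below t; the Euler characteristic of that
    sublevel set is #vertices - #edges.
    """
    angles = np.linspace(0, 2 * np.pi, n_dirs, endpoint=False)
    dirs = np.stack([np.cos(angles), np.sin(angles)], axis=1)
    heights = coords @ dirs.T                      # (n_nodes, n_dirs)
    # An edge appears once both endpoints do, i.e. at the max endpoint height.
    edge_h = np.maximum(heights[[e[0] for e in edges]],
                        heights[[e[1] for e in edges]])
    thresholds = np.linspace(heights.min(), heights.max(), n_thresh)
    out = np.zeros((n_dirs, n_thresh), dtype=int)
    for j, t in enumerate(thresholds):
        out[:, j] = (heights <= t).sum(axis=0) - (edge_h <= t).sum(axis=0)
    return out

E = ect(coords, edges)
# At the largest threshold the whole triangle is present: chi = 3 - 3 = 0.
assert (E[:, -1] == 0).all()
```

Flattening the resulting direction-by-threshold matrix yields the kind of fixed-length feature vector that can be concatenated with fingerprints for downstream regression.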


Identifying Heterogeneity in Distributed Learning

Xiao, Zelin, Gu, Jia, Chen, Song Xi

arXiv.org Machine Learning

We study methods for identifying heterogeneous parameter components in distributed M-estimation with minimal data transmission. The first is based on a re-normalized Wald test, which is shown to be consistent as long as the number of distributed data blocks $K$ is of a smaller order than the minimum block sample size and the heterogeneity is dense. The second is an extreme contrast test (ECT) based on the difference between the largest and smallest component-wise estimated parameters across data blocks. By introducing a sample-splitting procedure, the ECT avoids the bias accumulation arising from the M-estimation procedures and remains consistent even when $K$ is much larger than the block sample size, provided the heterogeneity is sparse. The ECT procedure is easy to implement and communication-efficient. A combination of the Wald and extreme contrast tests is formulated to attain more robust power under varying levels of sparsity of the heterogeneity. We also conduct extensive numerical experiments to compare the family-wise error rate (FWER) and the power of the proposed methods, and present a case study demonstrating their implementation and validity.
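As a schematic illustration of the extreme contrast idea (not the paper's exact procedure): estimate the parameter within each block, take the spread between the largest and smallest estimates, and compare it to a cutoff. The block data, the simple mean parameter, and the crude Bonferroni-style normal calibration below are all illustrative assumptions.

```python
import numpy as np
from statistics import NormalDist

def extreme_contrast_test(blocks, alpha=0.05):
    """Schematic range-based heterogeneity test across K data blocks.

    Compares the spread of per-block mean estimates against a crude
    Bonferroni-style cutoff for the range of K standard normals.
    """
    K = len(blocks)
    est = np.array([b.mean() for b in blocks])         # per-block estimates
    n = min(len(b) for b in blocks)
    se = np.array([b.std(ddof=1) for b in blocks]).mean() / np.sqrt(n)
    T = (est.max() - est.min()) / se                   # extreme contrast statistic
    cutoff = 2 * NormalDist().inv_cdf(1 - alpha / (2 * K))
    return T, cutoff

rng = np.random.default_rng(0)
K, n = 50, 200
blocks = [rng.normal(0.0, 1.0, n) for _ in range(K)]
blocks[0] = rng.normal(1.0, 1.0, n)    # one heterogeneous block, shifted by 1

T, cutoff = extreme_contrast_test(blocks)
assert T > cutoff                      # the shifted block is detected
```

Note that only the per-block estimates (one number per block), not the raw data, enter the statistic, which is what makes the approach communication-efficient.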


Causal Decomposition Analysis with Synergistic Interventions: A Triply-Robust Machine Learning Approach to Addressing Multiple Dimensions of Social Disparities

Park, Soojin, Kim, Su Yeon, Zheng, Xinyao, Lee, Chioun

arXiv.org Machine Learning

Educational disparities are rooted in and perpetuate social inequalities across multiple dimensions such as race, socioeconomic status, and geography. To reduce disparities, most intervention strategies focus on a single domain and frequently evaluate their effectiveness by using causal decomposition analysis. However, a growing body of research suggests that single-domain interventions may be insufficient for individuals marginalized on multiple fronts. While interventions across multiple domains are increasingly proposed, there is limited guidance on appropriate methods for evaluating their effectiveness. To address this gap, we develop an extended causal decomposition analysis that simultaneously targets multiple causally ordered intervening factors, allowing for the assessment of their synergistic effects. These scenarios often involve challenges related to model misspecification due to complex interactions among group categories, intervening factors, and their confounders with the outcome. To mitigate these challenges, we introduce a triply robust estimator that leverages machine learning techniques to address potential model misspecification. We apply our method to a cohort of students from the High School Longitudinal Study, focusing on math achievement disparities between Black, Hispanic, and White high schoolers. Specifically, we examine how two sequential interventions - equalizing the proportion of students who attend high-performing schools and equalizing enrollment in Algebra I by 9th grade across racial groups - may reduce these disparities.


Simulation-Based Sensitivity Analysis in Optimal Treatment Regimes and Causal Decomposition with Individualized Interventions

Park, Soojin, Kang, Suyeon, Lee, Chioun

arXiv.org Machine Learning

Causal decomposition analysis aims to assess the effect of modifying risk factors on reducing social disparities in outcomes. Recently, this analysis has incorporated individual characteristics when modifying risk factors by utilizing optimal treatment regimes (OTRs). Since the newly defined individualized effects rely on the no omitted confounding assumption, developing sensitivity analyses to account for potential omitted confounding is essential. Moreover, OTRs and individualized effects are primarily based on binary risk factors, and no formal approach currently exists to benchmark the strength of omitted confounding using observed covariates for binary risk factors. To address this gap, we extend a simulation-based sensitivity analysis that simulates unmeasured confounders, addressing two sources of bias emerging from deriving OTRs and estimating individualized effects. Additionally, we propose a formal bounding strategy that benchmarks the strength of omitted confounding for binary risk factors. Using the High School Longitudinal Study 2009 (HSLS:09), we demonstrate this sensitivity analysis and benchmarking method.


Topology meets Machine Learning: An Introduction using the Euler Characteristic Transform

Rieck, Bastian

arXiv.org Artificial Intelligence

Machine learning is shaping up to be the transformative technology of our times: Many of us have played with (and marveled at) models like ChatGPT, new breakthroughs in applications like healthcare research are announced on an almost daily basis, and new avenues for integrating these tools into scientific research are opening up, with some mathematicians already using large language models as proof assistants. This article aims to lift the veil and dispel some myths about machine learning; along the way, it will also show how machine learning itself can benefit from mathematical concepts. Indeed, from the outside, machine learning might look like a homogeneous entity, but in fact, the field is fractured and highly diverse. While the main thrust of the field arises from the undeniable engineering advances, with bigger and better models, there is also a strong community of applied mathematicians. Alongside the classical drivers of machine-learning architectures, i.e., linear algebra and statistics, topology recently started to provide novel insights into the foundations of machine learning: Point-set topology, harnessing concepts like neighborhoods, can be used to extend existing algorithms from graphs to cell complexes [4]. Algebraic topology, making use of effective invariants like homology, improves the results of models for volume reconstruction [13]. Finally, differential topology, providing tools to study smooth properties of data, results in efficient methods for analyzing embedded (simplicial) complexes [6]. These (and many more) methods have now found a home in the nascent field of topological deep learning [8]. Before diving into concrete examples, let us first take a step back and discuss machine learning as such.


Generative Topology for Shape Synthesis

Röell, Ernst, Rieck, Bastian

arXiv.org Artificial Intelligence

The Euler Characteristic Transform (ECT) is a powerful invariant for assessing geometrical and topological characteristics of a large variety of objects, including graphs and embedded simplicial complexes. Although the ECT is invertible in theory, no explicit algorithm for general data sets exists. In this paper, we address this lack and demonstrate that it is possible to learn the inversion, permitting us to develop a novel framework for shape generation tasks on point clouds. Our model exhibits high quality in reconstruction and generation tasks, affords efficient latent-space interpolation, and is orders of magnitude faster than existing methods. Understanding shapes requires understanding their geometrical and topological properties in tandem. Given the large variety of different representations of such data, ranging from point clouds over graphs to simplicial complexes, a general framework for handling such inputs is beneficial. The Euler Characteristic Transform (ECT) provides such a framework based on the idea of studying a shape from multiple directions--sampled from a sphere of appropriate dimensionality--and at multiple scales. In fact, the ECT is an injective map, serving as a unique characterisation of a shape (Ghrist et al., 2018; Turner et al., 2014). Somewhat surprisingly, this even holds when using a finite number of directions (Curry et al., 2022). Hence, while it is known that the ECT can be inverted, i.e. it is possible to reconstruct input data from an ECT, only algorithms for special cases such as planar graphs are currently known (Fasy et al., 2018).


Diss-l-ECT: Dissecting Graph Data with local Euler Characteristic Transforms

von Rohrscheidt, Julius, Rieck, Bastian

arXiv.org Artificial Intelligence

The Euler Characteristic Transform (ECT) is an efficiently computable geometrical-topological invariant that characterizes the global shape of data. In this paper, we introduce the Local Euler Characteristic Transform ($\ell$-ECT), a novel extension of the ECT particularly designed to enhance expressivity and interpretability in graph representation learning. Unlike traditional Graph Neural Networks (GNNs), which may lose critical local details through aggregation, the $\ell$-ECT provides a lossless representation of local neighborhoods. This approach addresses key limitations in GNNs by preserving nuanced local structures while maintaining global interpretability. Moreover, we construct a rotation-invariant metric based on $\ell$-ECTs for spatial alignment of data spaces. Our method outperforms standard GNNs on a variety of node classification tasks, particularly in graphs with high heterophily.


Consistency Models Made Easy

Geng, Zhengyang, Pokle, Ashwini, Luo, William, Lin, Justin, Kolter, J. Zico

arXiv.org Artificial Intelligence

Consistency models (CMs) are an emerging class of generative models that offer faster sampling than traditional diffusion models. CMs enforce that all points along a sampling trajectory are mapped to the same initial point. But this target leads to resource-intensive training: for example, as of 2024, training a SoTA CM on CIFAR-10 takes one week on 8 GPUs. In this work, we propose an alternative scheme for training CMs, vastly improving the efficiency of building such models. Specifically, by expressing CM trajectories via a particular differential equation, we argue that diffusion models can be viewed as a special case of CMs with a specific discretization. We can thus fine-tune a consistency model starting from a pre-trained diffusion model and progressively approximate the full consistency condition to stronger degrees over the training process. Our resulting method, which we term Easy Consistency Tuning (ECT), achieves vastly improved training times while indeed improving upon the quality of previous methods: for example, ECT achieves a 2-step FID of 2.73 on CIFAR-10 within 1 hour on a single A100 GPU, matching Consistency Distillation trained for hundreds of GPU hours. Owing to this computational efficiency, we investigate the scaling law of CMs under ECT, showing that they seem to obey classic power law scaling, hinting at their ability to improve efficiency and performance at larger scales. Code (https://github.com/locuslab/ect) is available.


Instruction-Guided Bullet Point Summarization of Long Financial Earnings Call Transcripts

Khatuya, Subhendu, Sinha, Koushiki, Ganguly, Niloy, Ghosh, Saptarshi, Goyal, Pawan

arXiv.org Artificial Intelligence

While automatic summarization techniques have made significant advancements, their primary focus has been on summarizing short news articles or documents with clear structural patterns, such as scientific articles or government reports. There has been little exploration into developing efficient methods for summarizing financial documents, which often contain complex facts and figures. Here, we study the problem of bullet point summarization of long Earnings Call Transcripts (ECTs) using the recently released ECTSum dataset. We leverage an unsupervised question-based extractive module followed by a parameter-efficient instruction-tuned abstractive module to solve this task. Our proposed model FLAN-FinBPS achieves a new state of the art, outperforming the strongest baseline with a 14.88% average ROUGE score gain, and is capable of generating factually consistent bullet point summaries that capture the important facts discussed in the ECTs.


Learning Evaluation Models from Large Language Models for Sequence Generation

Wang, Chenglong, Zhou, Hang, Chang, Kaiyan, Liu, Tongran, Zhang, Chunliang, Du, Quan, Xiao, Tong, Zhu, Jingbo

arXiv.org Artificial Intelligence

Large language models achieve state-of-the-art performance on sequence generation evaluation, but typically have a large number of parameters, which makes applying their evaluation capability at scale computationally challenging. To overcome this challenge, we propose ECT, an evaluation capability transfer method, to transfer the evaluation capability from LLMs to relatively lightweight language models. Based on the proposed ECT, we learn various evaluation models from ChatGPT and employ them as reward models to improve sequence generation models via reinforcement learning and reranking approaches. Experimental results on machine translation, text style transfer, and summarization tasks demonstrate the effectiveness of ECT. Notably, applying the learned evaluation models to sequence generation models yields better generated sequences, as evaluated by commonly used metrics and ChatGPT.
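The transfer idea can be sketched generically as score distillation: fit a lightweight student model to reproduce teacher (LLM) quality scores, then use the student as a cheap evaluator. The sketch below uses synthetic sequence features and synthetic teacher scores, with ridge regression as the student; it is a toy stand-in for the idea, not the paper's reward-model training pipeline.

```python
import numpy as np

# Toy "evaluation capability transfer": fit a lightweight model to mimic
# teacher (LLM) quality scores. Features and scores here are synthetic.
rng = np.random.default_rng(1)
X = rng.normal(size=(500, 16))            # candidate-sequence features
w_true = rng.normal(size=16)
teacher_scores = X @ w_true + rng.normal(scale=0.1, size=500)

# Ridge regression as the lightweight "student" evaluator.
lam = 1e-2
w = np.linalg.solve(X.T @ X + lam * np.eye(16), X.T @ teacher_scores)

student_scores = X @ w
corr = np.corrcoef(student_scores, teacher_scores)[0, 1]
assert corr > 0.95   # student tracks the teacher closely on this toy data
```

In the paper's setting the student would be a small language model trained on ChatGPT-produced judgments, and its scores would then serve as the reward signal for reinforcement learning or reranking.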