Goto

Collaborating Authors

 Ogallo, William


Domain-agnostic and Multi-level Evaluation of Generative Models

arXiv.org Artificial Intelligence

Machine Learning (ML) methods, particularly generative models, are effective in addressing critical problems across different domains, including the material sciences. Examples include the design of novel molecules by combining data-driven techniques and domain knowledge to efficiently search the space of all plausible molecules and generate new and valid ones [1, 2, 3, 4]. Traditional high-throughput wet-lab experiments, physics-based simulations, and bioinformatics tools for the molecular design process depend heavily on human expertise. These processes require significant resource expenditure to propose, synthesize, and test new molecules, thereby limiting the exploration space [5, 6, 7]. For example, generative models have been applied to facilitate the material discovery process by framing it as an inverse molecular design problem. This approach transforms the conventional, slow discovery process by mapping a desired set of properties to a set of structures; the generative process is then optimized to encourage the generation of molecules with those selected properties. Numerous approaches have been proposed for such tasks, most prominently VAEs with different sampling techniques [8, 9, 10], GANs [11, 12], diffusion models [13], flow networks [14], and Transformers [15].
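As a rough illustration of the inverse-design idea described above, the sketch below samples latent codes, decodes them into candidate descriptors, and keeps candidates whose predicted property lies close to a target value. The decoder and property predictor here are toy stand-ins (assumptions for illustration), not the models evaluated in the paper.

    # Minimal sketch of an inverse-design loop: sample candidates from a
    # generative model and keep those whose predicted property matches a target.
    # decode() and predict_property() are toy stand-ins, not real models.
    import numpy as np

    rng = np.random.default_rng(0)

    def decode(z):
        """Toy 'decoder': maps a latent vector to a candidate descriptor."""
        return np.tanh(z)

    def predict_property(x):
        """Toy property predictor: a scalar property of the candidate."""
        return float(x.sum())

    def inverse_design(target, tol=0.5, n_samples=1000, latent_dim=8):
        """Sample latent codes, decode candidates, keep property matches."""
        hits = []
        for _ in range(n_samples):
            z = rng.normal(size=latent_dim)
            candidate = decode(z)
            if abs(predict_property(candidate) - target) < tol:
                hits.append(candidate)
        return hits

    print(f"{len(inverse_design(target=1.0))} candidates near the target property")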


Sparsity-based Feature Selection for Anomalous Subgroup Discovery

arXiv.org Artificial Intelligence

Anomalous pattern detection aims to identify instances where deviations from normalcy are evident, and is widely applicable across domains. Multiple anomalous pattern detection techniques have been proposed in the state of the art. However, there is a lack of a principled and scalable feature selection method for efficient discovery. Existing feature selection techniques are often driven by optimizing prediction performance rather than the systemic deviation of outcomes from what is expected. In this paper, we propose a sparsity-based automated feature selection (SAFS) framework, which encodes systemic outcome deviations via the sparsity of feature-driven odds ratios. SAFS is a model-agnostic approach that can be used across different discovery techniques. When validated on a publicly available critical care dataset, SAFS achieves more than a $3\times$ reduction in computation time while maintaining detection performance, and it outperforms multiple feature selection baselines.
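To illustrate the core idea of ranking features by the sparsity of value-level odds ratios, the following simplified sketch computes an odds ratio for each value of a categorical feature against a binary outcome and scores the feature by how many of its values deviate markedly from an odds ratio of 1. The scoring rule, threshold, and toy columns are illustrative assumptions; the paper's exact SAFS formulation may differ.

    # Simplified sketch: rank features by the sparsity of their value-level
    # odds ratios with respect to a binary outcome.
    import numpy as np
    import pandas as pd

    def value_odds_ratios(df, feature, outcome):
        """Odds ratio of the outcome for each value of `feature` vs. the rest."""
        ors = {}
        for v in df[feature].unique():
            in_v = df[feature] == v
            a = (in_v & (df[outcome] == 1)).sum() + 0.5   # Haldane correction
            b = (in_v & (df[outcome] == 0)).sum() + 0.5
            c = (~in_v & (df[outcome] == 1)).sum() + 0.5
            d = (~in_v & (df[outcome] == 0)).sum() + 0.5
            ors[v] = (a / b) / (c / d)
        return ors

    def sparsity_score(ors):
        """Fraction of values whose odds ratio deviates markedly from 1."""
        log_or = np.abs(np.log(np.array(list(ors.values()))))
        return (log_or > np.log(1.5)).mean()

    # toy categorical data with a binary outcome (illustrative only)
    rng = np.random.default_rng(1)
    df = pd.DataFrame({
        "age_band": rng.choice(["<40", "40-60", ">60"], 500),
        "unit": rng.choice(["ICU", "ward"], 500),
        "outcome": rng.integers(0, 2, 500),
    })
    ranking = {f: sparsity_score(value_odds_ratios(df, f, "outcome"))
               for f in ["age_band", "unit"]}
    print(sorted(ranking.items(), key=lambda kv: -kv[1]))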


Post-discovery Analysis of Anomalous Subsets

arXiv.org Artificial Intelligence

Analyzing the behaviour of a population in response to disease and interventions is critical to unearthing variability in healthcare, understanding sub-populations that require specialized attention, and designing future interventions. Two aspects are essential in such analysis: i) discovery of the differentiating patterns exhibited by sub-populations, and ii) characterization of the identified sub-populations. For the discovery phase, an array of approaches from the anomalous pattern detection literature have been employed to reveal differentiating patterns, especially to identify anomalous subgroups. However, these techniques are limited to describing the anomalous subgroups and offer little in the form of insightful characterization, thereby limiting the interpretability and understanding of these data-driven techniques in clinical practice. In this work, we propose an analysis of the differentiated output (rather than the discovery itself) and quantify anomalousness similarly to the counterfactual setting. To this end, we design an approach to perform post-discovery analysis of anomalous subsets: we first identify the features that contribute most to the anomalousness of the subsets, and then, through perturbation, seek the smallest number of changes necessary for a subset to lose its anomalousness. Evaluation results on the 2019 MarketScan Commercial Claims and Medicare data show that additional insights can be obtained through this post-discovery examination of the identified subgroups.
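A minimal sketch of the perturbation step described above follows: given a subset defined by feature-value rules, it greedily drops the rule whose removal most reduces a toy anomalousness score until the subset is no longer anomalous, and reports how many rules had to change. The scoring function, threshold, and column names are illustrative assumptions, not the scan statistic or claims data used in the paper.

    # Hedged sketch of post-discovery perturbation: remove rules from an
    # anomalous subset until its (toy) anomalousness score falls below a threshold.
    import numpy as np
    import pandas as pd

    def anomalousness(df, rules, outcome="outcome"):
        """Toy score: |subset outcome rate - overall rate| weighted by subset size."""
        mask = np.ones(len(df), dtype=bool)
        for feat, val in rules.items():
            mask &= (df[feat] == val).to_numpy()
        if mask.sum() == 0:
            return 0.0
        return abs(df.loc[mask, outcome].mean() - df[outcome].mean()) * np.sqrt(mask.sum())

    def minimal_changes_to_lose(df, rules, threshold):
        """Greedily remove rules until the score falls below `threshold`."""
        rules = dict(rules)
        removed = []
        while rules and anomalousness(df, rules) >= threshold:
            # drop the rule whose removal reduces the score the most
            best = min(rules, key=lambda r: anomalousness(
                df, {k: v for k, v in rules.items() if k != r}))
            removed.append((best, rules.pop(best)))
        return removed

    # toy data and a "discovered" subset defined by two rules (illustrative only)
    rng = np.random.default_rng(2)
    df = pd.DataFrame({
        "plan": rng.choice(["HMO", "PPO"], 1000),
        "region": rng.choice(["south", "north"], 1000),
    })
    df["outcome"] = ((df["plan"] == "HMO") & (df["region"] == "south")).astype(int)
    changes = minimal_changes_to_lose(df, {"plan": "HMO", "region": "south"}, threshold=2.0)
    print(f"rules removed to lose anomalousness: {changes}")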


Automated Supervised Feature Selection for Differentiated Patterns of Care

arXiv.org Artificial Intelligence

An automated feature selection pipeline was developed using several state-of-the-art feature selection techniques to select optimal features for Differentiating Patterns of Care (DPOC). The pipeline included three types of feature selection techniques, namely filter, wrapper, and embedded methods, to select the top K features. Five datasets with binary dependent variables were used, and the top K optimal features were selected for each. The selected features were then tested in the existing multi-dimensional subset scanning (MDSS) pipeline, where the most anomalous subpopulations, the most anomalous subsets, propensity scores, and effect measures were recorded to assess their performance. This performance was compared with the same four metrics obtained when all covariates in the dataset were used in the MDSS pipeline. We found that, regardless of the feature selection technique used, the data distribution is a key consideration when determining which technique to apply.
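The sketch below illustrates the three families of techniques with one scikit-learn selector each (a mutual-information filter, RFE as a wrapper, and L1-regularized logistic regression as an embedded method) on synthetic data. It is a simplified stand-in: the actual pipeline in the paper covers several techniques per family and feeds the selected features into MDSS.

    # Hedged sketch: pick the top-K features with one filter, one wrapper,
    # and one embedded method, then compare the selected feature indices.
    from sklearn.datasets import make_classification
    from sklearn.feature_selection import (SelectKBest, mutual_info_classif,
                                           RFE, SelectFromModel)
    from sklearn.linear_model import LogisticRegression

    X, y = make_classification(n_samples=500, n_features=20,
                               n_informative=5, random_state=0)
    K = 5

    selectors = {
        "filter (mutual information)": SelectKBest(mutual_info_classif, k=K),
        "wrapper (RFE)": RFE(LogisticRegression(max_iter=1000),
                             n_features_to_select=K),
        "embedded (L1 logistic)": SelectFromModel(
            LogisticRegression(penalty="l1", solver="liblinear"),
            max_features=K, threshold=-float("inf")),
    }

    for name, selector in selectors.items():
        selector.fit(X, y)
        chosen = [i for i, keep in enumerate(selector.get_support()) if keep]
        print(f"{name}: top-{K} feature indices {chosen}")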