
 Dodhia, Rahul


Weak Labeling for Cropland Mapping in Africa

arXiv.org Artificial Intelligence

Cropland mapping can play a vital role in addressing environmental, agricultural, and food security challenges. However, in the context of Africa, practical applications are often hindered by the limited availability of high-resolution cropland maps. Such maps typically require extensive human labeling, thereby creating a scalability bottleneck. To address this, we propose an approach that utilizes unsupervised object clustering to refine existing weak labels, such as those obtained from global cropland maps. The refined labels, in conjunction with sparse human annotations, serve as training data for a semantic segmentation network designed to identify cropland areas. We conduct experiments to demonstrate the benefits of the improved weak labels generated by our method, including a scenario where we train our model with only 33 human-annotated labels.

If the goal is to achieve better results in specific regions, models that are tailored to those regions usually perform better than models that are designed for the whole world. To this end, we develop a modeling workflow for generating high-resolution cropland maps that are tailored toward a given area of interest (AOI), using Kenya as a use case. We use a deep learning based semantic segmentation workflow - an approach often employed for land-cover maps [9, 10, 11, 12, 13]. In order to train the models, we used a mixture of sparse human labels gathered in the AOI and weak labels from global cropland maps. Specifically, we use the area of intersection between an unsupervised object based clustering of the input satellite imagery and the weak labels to mine stronger cropland (positive class) and non-cropland (negative class) samples (see Figure 1 for an overview of this approach).
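
As a rough illustration of the label-mining step described above, the sketch below intersects an unsupervised clustering with a weak cropland mask: a cluster contributes confident positive or negative training pixels only when the weak map is nearly unanimous inside it. The function name, the purity threshold, and the 255 ignore value are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def mine_refined_labels(cluster_ids, weak_cropland_mask, purity=0.9):
    """Keep a cluster's pixels as confident cropland (1) or
    non-cropland (0) only when the weak global map is nearly
    unanimous within that cluster; everything else stays
    unlabeled (255 = ignore during training)."""
    refined = np.full(cluster_ids.shape, 255, dtype=np.uint8)
    for cid in np.unique(cluster_ids):
        member = cluster_ids == cid
        frac_crop = weak_cropland_mask[member].mean()
        if frac_crop >= purity:        # cluster agrees with weak positives
            refined[member] = 1
        elif frac_crop <= 1 - purity:  # cluster agrees with weak negatives
            refined[member] = 0
    return refined

# Toy usage: a 4x4 scene with two clusters and a weak cropland mask.
clusters = np.array([[0, 0, 1, 1]] * 4)
weak = np.array([[1, 1, 0, 0]] * 4)
print(mine_refined_labels(clusters, weak))
```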


Seeing the roads through the trees: A benchmark for modeling spatial dependencies with aerial imagery

arXiv.org Artificial Intelligence

Fully understanding a complex high-resolution satellite or aerial imagery scene often requires spatial reasoning over a broad relevant context. The human object recognition system is able to understand objects in a scene over a long-range relevant context. For example, if a human observes an aerial scene that shows sections of road broken up by tree canopy, they are unlikely to conclude that the road has actually been broken into disjoint pieces by trees; instead, they will infer that the canopy of nearby trees is occluding the road. However, there is limited research on the long-range context understanding of modern machine learning models. In this work we propose a road segmentation benchmark dataset, Chesapeake Roads Spatial Context (RSC), for evaluating the long-range spatial context understanding of geospatial machine learning models and show how commonly used semantic segmentation models can fail at this task. For example, we show that a U-Net trained to segment roads from background in aerial imagery achieves 84% recall on unoccluded roads, but just 63.5% recall on roads covered by tree canopy, despite being trained to model both the same way. We further analyze how model performance changes as the relevant context for a decision (unoccluded roads in our case) varies in distance. We release the code to reproduce our experiments and the dataset of imagery and masks to encourage future research in this direction -- https://github.com/isaaccorley/ChesapeakeRSC.
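
The headline comparison, recall on unoccluded versus canopy-covered roads, can be computed per category as in the minimal sketch below; the three-class mask encoding is an assumption for illustration, not necessarily the dataset's actual label scheme.

```python
import numpy as np

def recall_by_context(pred_road, gt_class):
    """Recall of the 'road' prediction computed separately for road
    pixels that are unoccluded vs. covered by tree canopy.
    Assumed gt encoding: 0=background, 1=road, 2=road under canopy."""
    out = {}
    for name, cls in [("unoccluded_road", 1), ("occluded_road", 2)]:
        positives = gt_class == cls
        tp = np.logical_and(pred_road, positives).sum()
        out[name] = tp / max(positives.sum(), 1)
    return out

pred = np.array([[1, 1, 0], [0, 0, 0]], dtype=bool)
gt = np.array([[1, 1, 2], [0, 2, 0]])
print(recall_by_context(pred, gt))  # {'unoccluded_road': 1.0, 'occluded_road': 0.0}
```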


Comprehensive Evaluation and Insights into the Use of Deep Neural Networks to Detect and Quantify Lymphoma Lesions in PET/CT Images

arXiv.org Artificial Intelligence

This study performs a comprehensive evaluation of four neural network architectures (UNet, SegResNet, DynUNet, and SwinUNETR) for lymphoma lesion segmentation from PET/CT images. These networks were trained, validated, and tested on a diverse, multi-institutional dataset of 611 cases. Internal testing (88 cases; total metabolic tumor volume (TMTV) range [0.52, 2300] ml) showed SegResNet as the top performer with a median Dice similarity coefficient (DSC) of 0.76 and median false positive volume (FPV) of 4.55 ml; all networks had a median false negative volume (FNV) of 0 ml. On the unseen external test set (145 cases with TMTV range: [0.10, 2480] ml), SegResNet achieved the best median DSC of 0.68 and FPV of 21.46 ml, while UNet had the best FNV of 0.41 ml. We assessed reproducibility of six lesion measures, calculated their prediction errors, and examined DSC performance in relation to these lesion measures, offering insights into segmentation accuracy and clinical relevance. Additionally, we introduced three lesion detection criteria, addressing the clinical need for identifying lesions, counting them, and segmenting based on metabolic characteristics. We also performed expert intra-observer variability analysis revealing the challenges in segmenting "easy" vs. "hard" cases, to assist in the development of more resilient segmentation algorithms. Finally, we performed inter-observer agreement assessment underscoring the importance of a standardized ground truth segmentation protocol involving multiple expert annotators. Code is available at: https://github.com/microsoft/lymphoma-segmentation-dnn
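
The reported lesion measures (DSC, FPV, FNV) can be computed from binary 3D masks as in the sketch below. This is a voxel-wise simplification: the study may aggregate false positive and false negative volumes per connected lesion rather than per voxel, so treat the names and details as assumptions.

```python
import numpy as np

def lesion_metrics(pred, gt, voxel_ml):
    """Voxel-wise Dice similarity coefficient (DSC) plus false positive /
    false negative volumes in ml for binary 3D lesion masks.
    voxel_ml is the volume of a single voxel in millilitres."""
    pred, gt = pred.astype(bool), gt.astype(bool)
    inter = np.logical_and(pred, gt).sum()
    dsc = 2.0 * inter / max(pred.sum() + gt.sum(), 1)
    fpv = np.logical_and(pred, ~gt).sum() * voxel_ml  # predicted, not lesion
    fnv = np.logical_and(~pred, gt).sum() * voxel_ml  # lesion, not predicted
    return {"DSC": dsc, "FPV_ml": fpv, "FNV_ml": fnv}

pred = np.zeros((4, 4, 4), bool); pred[:2] = True
gt = np.zeros((4, 4, 4), bool); gt[1:3] = True
print(lesion_metrics(pred, gt, voxel_ml=0.5))  # DSC 0.5, FPV 8.0 ml, FNV 8.0 ml
```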


Multimodal Foundation Models for Zero-shot Animal Species Recognition in Camera Trap Images

arXiv.org Artificial Intelligence

Due to deteriorating environmental conditions and increasing human activity, conservation efforts directed towards wildlife are crucial. Motion-activated camera traps constitute an efficient tool for tracking and monitoring wildlife populations across the globe. Supervised learning techniques have been successfully deployed to analyze such imagery; however, training such models requires annotations from experts. Reducing the reliance on costly labelled data therefore has immense potential in developing large-scale wildlife tracking solutions with markedly less human labor. In this work we propose WildMatch, a novel zero-shot species classification framework that leverages multimodal foundation models. In particular, we instruction tune vision-language models to generate detailed visual descriptions of camera trap images using similar terminology to experts. Then, we match the generated caption to an external knowledge base of descriptions in order to determine the species in a zero-shot manner. We investigate techniques to build instruction tuning datasets for detailed animal description generation and propose a novel knowledge augmentation technique to enhance caption quality. We demonstrate the performance of WildMatch on a new camera trap dataset collected in the Magdalena Medio region of Colombia.
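
A minimal sketch of the zero-shot matching step, assuming TF-IDF cosine similarity as the matcher (the paper's actual caption-to-knowledge-base matching may use learned embeddings instead); the miniature knowledge base entries below are invented for illustration.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Hypothetical miniature knowledge base of expert species descriptions.
knowledge_base = {
    "ocelot": "medium-sized spotted cat, dark rosettes, pale coat, long tail",
    "agouti": "small brown rodent, short ears, slender legs, no visible tail",
    "collared peccary": "pig-like body, coarse grey hair, pale collar on shoulders",
}

def match_species(generated_caption):
    """Match a VLM-generated caption against the knowledge base and
    return the most similar species (zero-shot: no training on labels)."""
    names = list(knowledge_base)
    corpus = [knowledge_base[n] for n in names] + [generated_caption]
    tfidf = TfidfVectorizer().fit_transform(corpus)
    sims = cosine_similarity(tfidf[-1], tfidf[:-1]).ravel()
    return names[sims.argmax()], sims.max()

print(match_species("a spotted cat with rosettes and a long tail walks at night"))
```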


Assessment of Differentially Private Synthetic Data for Utility and Fairness in End-to-End Machine Learning Pipelines for Tabular Data

arXiv.org Artificial Intelligence

Differentially private (DP) synthetic data sets are a solution for sharing data while preserving the privacy of individual data providers. Understanding the effects of utilizing DP synthetic data in end-to-end machine learning pipelines impacts areas such as health care and humanitarian action, where data is scarce and regulated by restrictive privacy laws. In this work, we investigate the extent to which synthetic data can replace real, tabular data in machine learning pipelines and identify the most effective synthetic data generation techniques for training and evaluating machine learning models. We systematically investigate the impacts of differentially private synthetic data on downstream classification tasks from the point of view of utility as well as fairness. Our analysis is comprehensive and includes representatives of the two main types of synthetic data generation algorithms: marginal-based and GAN-based. To the best of our knowledge, our work is the first that: (i) proposes a training and evaluation framework that does not assume that real data is available for testing the utility and fairness of machine learning models trained on synthetic data; (ii) presents the most extensive analysis of synthetic data set generation algorithms in terms of utility and fairness when used for training machine learning models; and (iii) encompasses several different definitions of fairness. Our findings demonstrate that marginal-based synthetic data generators surpass GAN-based ones regarding model training utility for tabular data. Indeed, we show that models trained using data generated by marginal-based algorithms can exhibit similar utility to models trained using real data. Our analysis also reveals that the marginal-based synthetic data generator MWEM PGM can train models that simultaneously achieve utility and fairness characteristics close to those obtained by models trained with real data.
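
A minimal sketch of an evaluation protocol that never touches real data, assuming two independently generated DP-synthetic splits, a logistic regression classifier, and demographic parity difference as the single fairness measure; the paper's actual framework, generators, and fairness definitions are considerably broader.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

def synthetic_only_eval(train_synth, test_synth, sensitive_col):
    """Fit on one DP-synthetic set and evaluate utility plus a simple
    fairness gap on a second synthetic set (last column = label)."""
    Xtr, ytr = train_synth[:, :-1], train_synth[:, -1]
    Xte, yte = test_synth[:, :-1], test_synth[:, -1]
    clf = LogisticRegression(max_iter=1000).fit(Xtr, ytr)
    pred = clf.predict(Xte)
    group = Xte[:, sensitive_col] > 0.5  # binarized sensitive attribute
    dp_gap = abs(pred[group].mean() - pred[~group].mean())
    return {"accuracy": accuracy_score(yte, pred), "dp_gap": dp_gap}

rng = np.random.default_rng(0)
synth = rng.random((400, 4)); synth[:, -1] = synth[:, 0] > 0.5
print(synthetic_only_eval(synth[:200], synth[200:], sensitive_col=1))
```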


Rapid building damage assessment workflow: An implementation for the 2023 Rolling Fork, Mississippi tornado event

arXiv.org Artificial Intelligence

Rapid and accurate building damage assessments from high-resolution satellite imagery following a natural disaster are essential to inform and optimize first responder efforts. However, performing such building damage assessments in an automated manner is non-trivial due to the challenges posed by variations in disaster-specific damage, diversity in satellite imagery, and the dearth of extensive, labeled datasets. To circumvent these issues, this paper introduces a human-in-the-loop workflow for rapidly training building damage assessment models after a natural disaster. This article details a case study using this workflow, executed in partnership with the American Red Cross during a tornado event in Rolling Fork, Mississippi in March 2023. The output from our human-in-the-loop modeling process achieved a precision of 0.86 and recall of 0.80 for damaged buildings when compared to ground truth data collected post-disaster. This workflow was implemented end-to-end in under 2 hours per satellite imagery scene, highlighting its potential for real-time deployment.
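
Given building footprints matched by ID to post-disaster ground truth, the reported precision and recall for the damaged class reduce to simple set arithmetic, as in this sketch (ID-based matching is an assumption for illustration, not the paper's pipeline):

```python
def damage_scores(predicted_damaged, truth_damaged):
    """Building-level precision/recall for the 'damaged' class, assuming
    predictions and ground truth are sets of matched footprint IDs."""
    tp = len(predicted_damaged & truth_damaged)
    precision = tp / max(len(predicted_damaged), 1)
    recall = tp / max(len(truth_damaged), 1)
    return precision, recall

# Toy example: 5 footprints flagged damaged, 4 of them truly damaged.
pred = {"b1", "b2", "b3", "b4", "b7"}
truth = {"b1", "b2", "b3", "b4", "b5"}
print(damage_scores(pred, truth))  # (0.8, 0.8)
```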


Poverty rate prediction using multi-modal survey and earth observation data

arXiv.org Artificial Intelligence

This work presents an approach for combining household demographic and living standards survey questions with features derived from satellite imagery to predict the poverty rate of a region. Our approach utilizes visual features obtained from a single-step featurization method applied to freely available 10m/px Sentinel-2 surface reflectance satellite imagery. These visual features are combined with ten survey questions in a proxy means test (PMT) to estimate whether a household is below the poverty line. We show that the inclusion of visual features reduces the mean error in poverty rate estimates from 4.09% to 3.88% over a nationally representative out-of-sample test set. In addition to including satellite imagery features in proxy means tests, we propose an approach for selecting a subset of survey questions that are complementary to the visual features extracted from satellite imagery. Specifically, we design a survey variable selection approach guided by the full survey and image features and use the approach to determine the most relevant set of small survey questions to include in a PMT. We validate the choice of small survey questions in a downstream task of predicting the poverty rate using the small set of questions. This approach results in the best performance -- errors in poverty rate decrease from 4.09% to 3.71%. We show that extracted visual features encode geographic and urbanization differences between regions.
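
A minimal sketch of the augmented proxy means test, assuming the imagery featurization yields one fixed-length vector per household that is simply concatenated with the survey answers; the feature dimensions and the logistic-regression PMT below are illustrative assumptions rather than the paper's exact models.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

def pmt_with_imagery(survey_X, image_X, below_poverty_y):
    """Concatenate survey answers with per-household satellite image
    features and score a below-poverty-line classifier by cross-validation."""
    X = np.hstack([survey_X, image_X])
    clf = LogisticRegression(max_iter=1000)
    return cross_val_score(clf, X, below_poverty_y, cv=5).mean()

rng = np.random.default_rng(1)
survey = rng.random((300, 10))    # ten survey questions (assumed numeric)
imagery = rng.random((300, 32))   # featurized Sentinel-2 embedding (assumed dim)
y = (survey[:, 0] + imagery[:, 0] > 1.0).astype(int)
print(pmt_with_imagery(survey, imagery, y))
```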


Revisiting pre-trained remote sensing model benchmarks: resizing and normalization matters

arXiv.org Artificial Intelligence

Research in self-supervised learning (SSL) with natural images has progressed rapidly in recent years and is now increasingly being applied to and benchmarked with datasets containing remotely sensed imagery. A common benchmark case is to evaluate SSL pre-trained model embeddings on datasets of remotely sensed imagery with small patch sizes, e.g., 32x32 pixels, whereas standard SSL pre-training takes place with larger patch sizes, e.g., 224x224. Furthermore, pre-training methods tend to use different image normalization preprocessing steps depending on the dataset. In this paper, we show, across seven satellite and aerial imagery datasets of varying resolution, that by simply following the preprocessing steps used in pre-training (specifically, image resizing and normalization), one can achieve significant performance improvements when evaluating the extracted features on downstream tasks -- an important detail overlooked in previous work in this space. We show that by following these steps, ImageNet pre-training remains a competitive baseline for satellite imagery-based transfer learning tasks -- for example, we find that these steps yield gains of +32.28 points in overall accuracy on the So2Sat random split dataset and +11.16 points on the EuroSAT dataset. Finally, we report comprehensive benchmark results with a variety of simple baseline methods for each of the seven datasets, forming an initial benchmark suite for remote sensing imagery.
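
The paper's central recommendation, matching the evaluation preprocessing to the pre-training pipeline, looks roughly like the following for an ImageNet-pre-trained backbone (torchvision; ImageNet statistics assumed as the example pre-training setup):

```python
import torch
from torchvision import models, transforms

# Match the pre-training pipeline: upsample small patches to the
# pre-training input size and reuse the pre-training normalization,
# rather than feeding raw 32x32 patches to the frozen backbone.
preprocess = transforms.Compose([
    transforms.Resize(224, antialias=True),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])

backbone = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)
backbone.fc = torch.nn.Identity()   # keep the 512-d embedding
backbone.eval()

patch = torch.rand(1, 3, 32, 32)    # e.g., one small RGB patch in [0, 1]
with torch.no_grad():
    feats = backbone(preprocess(patch))
print(feats.shape)  # torch.Size([1, 512])
```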


Dwelling Type Classification for Disaster Risk Assessment Using Satellite Imagery

arXiv.org Artificial Intelligence

Vulnerability and risk assessment of neighborhoods is essential for effective disaster preparedness. Existing traditional systems, due to dependency on time-consuming and cost-intensive field surveying, do not provide a scalable way to decipher warnings and assess the precise extent of the risk at a hyper-local level. In this work, machine learning was used to automate the process of identifying dwellings and their type to build a potentially more effective disaster vulnerability assessment system. First, satellite imagery of low-income settlements and vulnerable areas in India was used to identify 7 different dwelling types. Specifically, we formulated the dwelling type classification as a semantic segmentation task and trained a U-net based neural network model, namely TernausNet, with the data we collected. Then a risk score assessment model was employed, using the determined dwelling type along with an inundation model of the regions. The entire pipeline was deployed to multiple locations prior to natural hazards in India in 2020. Post hoc ground-truth data from those regions was collected to validate the efficacy of this model, which showed promising performance. This work can aid disaster response organizations and communities at risk by providing household-level risk information that can inform preemptive actions.
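
A heavily hypothetical sketch of combining the segmented dwelling type with an inundation model into a household risk score; the type names, weights, and scaling below are invented for illustration and are not from the paper.

```python
# Hypothetical vulnerability weights per dwelling type (illustrative only;
# the paper's scoring model and weights are not given in the abstract).
DWELLING_WEIGHTS = {
    "tarpaulin": 1.0, "thatched": 0.9, "tin_sheet": 0.7,
    "asbestos": 0.6, "tiled": 0.4, "concrete": 0.2, "mixed": 0.5,
}

def household_risk(dwelling_type, inundation_depth_m, max_depth_m=3.0):
    """Combine the segmented dwelling type with modeled flood depth into
    a 0-1 risk score: structural vulnerability scaled by exposure."""
    exposure = min(inundation_depth_m / max_depth_m, 1.0)
    return DWELLING_WEIGHTS[dwelling_type] * exposure

print(household_risk("thatched", inundation_depth_m=1.5))  # 0.45
```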


Defending Democracy: Using Deep Learning to Identify and Prevent Misinformation

arXiv.org Artificial Intelligence

The rise in online misinformation in recent years threatens democracies by distorting authentic public discourse and causing confusion, fear, and even, in extreme cases, violence. There is a need to understand the spread of false content through online networks for developing interventions that disrupt misinformation before it achieves virality. Using a Deep Bidirectional Transformer for Language Understanding (BERT) and propagation graphs, this study classifies and visualizes the spread of misinformation on a social media network using publicly available Twitter data. The results confirm prior research around user clusters and the virality of false content while improving the precision of deep learning models for misinformation detection. The study further demonstrates the suitability of BERT for providing a scalable model for false information detection, which can contribute to the development of more timely and accurate interventions to slow the spread of misinformation in online environments.
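
A minimal sketch of the model class the study builds on, a BERT sequence classifier over tweets (Hugging Face transformers API; the labels and fine-tuning data would come from the study's annotated Twitter corpus, which is not reproduced here):

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Binary classifier head on top of BERT: true vs. false content.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2)

tweets = ["Officials confirm the report.", "Miracle cure doctors hate!"]
batch = tokenizer(tweets, padding=True, truncation=True, return_tensors="pt")
with torch.no_grad():
    logits = model(**batch).logits
print(logits.softmax(dim=-1))  # untrained head: fine-tune on labeled tweets first
```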