 wilderness area


InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation

Si, Jacob, Cheng, Wendy Yusi, Cooper, Michael, Krishnan, Rahul G.

arXiv.org Artificial Intelligence

Tabular data are omnipresent across industry sectors. Neural networks for tabular data, such as TabNet, have been proposed to make predictions while leveraging the attention mechanism for interpretability. However, the inferred attention masks are often dense, making it challenging to derive rationales about the predictive signal. To remedy this, we propose InterpreTabNet, a variant of the TabNet model that models the attention mechanism as a latent variable sampled from a Gumbel-Softmax distribution. This enables us to regularize the model to learn distinct concepts in the attention masks via a KL divergence regularizer. The regularizer prevents overlapping feature selection by promoting sparsity, which maximizes the model's efficacy and makes it easier to determine which features are important when predicting the outcome. To assist in the interpretation of feature interdependencies from our model, we employ a large language model (GPT-4) and use prompt engineering to map the learned feature mask onto natural language text describing the learned signal. Through comprehensive experiments on real-world datasets, we demonstrate that InterpreTabNet outperforms previous methods for interpreting tabular data while attaining competitive accuracy.
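As a rough illustration of the idea in this abstract, the sketch below draws a relaxed feature mask from a Gumbel-Softmax distribution and measures how far apart two masks are with a KL divergence. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the temperature value, and the choice to compare two masks directly are all hypothetical, and the abstract does not say exactly how the KL term enters the training loss.

```python
import math
import random


def gumbel_softmax_sample(logits, temperature=0.5, rng=None):
    """Draw one relaxed (soft one-hot) mask over features.

    Hypothetical helper: `logits` holds one unnormalized score per feature.
    """
    rng = rng or random.Random()
    noisy = []
    for logit in logits:
        # Clamp the uniform draw away from 0 and 1 so both logs stay finite.
        u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
        gumbel_noise = -math.log(-math.log(u))  # Gumbel(0, 1) sample
        noisy.append((logit + gumbel_noise) / temperature)
    # Numerically stable softmax over the perturbed, temperature-scaled logits.
    top = max(noisy)
    exps = [math.exp(value - top) for value in noisy]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) between two masks that each sum to 1."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


if __name__ == "__main__":
    rng = random.Random(0)
    feature_logits = [2.0, 0.5, -1.0, 0.0]  # assumed scores for four features
    mask_a = gumbel_softmax_sample(feature_logits, rng=rng)
    mask_b = gumbel_softmax_sample(feature_logits, rng=rng)
    print("mask A:", [round(m, 3) for m in mask_a])
    print("mask B:", [round(m, 3) for m in mask_b])
    print("KL(A || B):", round(kl_divergence(mask_a, mask_b), 3))
```

Lower temperatures push each sampled mask closer to a one-hot vector, which is one way a mask can become sparse; a larger KL value between two masks indicates that they concentrate on different features.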


Explaining Multimodal Data Fusion: Occlusion Analysis for Wilderness Mapping

Ekim, Burak, Schmitt, Michael

arXiv.org Artificial Intelligence

Jointly harnessing complementary features of multi-modal input data in a common latent space has long been found to be beneficial. However, the influence of each modality on the model's decision remains a puzzle. This study proposes a deep learning framework for the modality-level interpretation of multimodal earth observation data in an end-to-end fashion. Leveraging an explainable machine learning method, namely Occlusion Sensitivity, the proposed framework investigates the influence of modalities under an early-fusion scenario in which the modalities are fused before the learning process. We show that the task of wilderness mapping largely benefits from auxiliary data such as land cover and nighttime light data.
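The modality-level occlusion idea described in this abstract can be sketched in a few lines: occlude one modality at a time, re-run the model, and record how much the score drops. This is a hedged sketch, not the paper's framework. The dictionary-of-modalities input, the zero baseline, the modality names, and `toy_model` with its weights are all assumptions made for illustration; in an early-fusion setup the modalities would typically be stacked into one input tensor rather than kept in a dictionary.

```python
def modality_occlusion_sensitivity(model, sample, baseline=0.0):
    """Return the score drop caused by occluding each modality in turn.

    `sample` maps a modality name to a list of values, and `model` maps such
    a dictionary to a scalar score. Both conventions are hypothetical.
    """
    reference = model(sample)
    drops = {}
    for name, values in sample.items():
        occluded = dict(sample)  # shallow copy; other modalities are untouched
        occluded[name] = [baseline] * len(values)
        drops[name] = reference - model(occluded)
    return drops


def toy_model(inputs):
    """Hypothetical stand-in for a trained network: weighted mean per modality."""
    weights = {"optical": 0.2, "land_cover": 0.5, "night_lights": 0.3}
    return sum(weights[name] * sum(vals) / len(vals) for name, vals in inputs.items())


if __name__ == "__main__":
    sample = {
        "optical": [1.0, 1.0],
        "land_cover": [1.0, 1.0],
        "night_lights": [1.0, 1.0],
    }
    for name, drop in modality_occlusion_sensitivity(toy_model, sample).items():
        print(f"{name}: score drop {drop:.2f}")
```

A larger drop means the model relied more on that modality for this sample. The baseline value matters: zeros are a common choice, but a dataset mean or a noise pattern can give different rankings.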


Web scraping and text analysis in R and GGplot2 – A.Z. Andis Arietta

#artificialintelligence

I recently needed to learn text mining for a project at work. I generally learn more quickly with a real-world project, so I turned to a topic I love, wilderness, to see how I could apply the skills of text scrubbing and natural language processing. You can clone my Git repo for the project or follow along in the post below. The first portion of this post covers web scraping, followed by text mining, and finally analysis and visualization.


MapInWild: A Remote Sensing Dataset to Address the Question What Makes Nature Wild

Ekim, Burak, Stomberg, Timo T., Roscher, Ribana, Schmitt, Michael

arXiv.org Artificial Intelligence

I. INTRODUCTION The advancement in deep learning (DL) techniques has led to a notable increase in the number and size of annotated datasets in a variety of domains, with remote sensing (RS) being no exception [1]. Also, an increase in earth observation (EO) missions and easy access to globally available and free geodata have opened up new research opportunities. Although numerous RS datasets have been published in the past years [2]-[6], most of them addressed tasks concerning man-made environments such as building footprint extraction and road network classification, leaving the environmental and ecology-related sub-areas of remote sensing underrepresented. ... The ESA WorldCover map legend is given below the figure. ... In this community, the classification task can be ... (usually called semantic segmentation by the computer vision community) the task outputs densely annotated prediction maps on a pixel scale by separating the input into distinct and semantically coherent segments. ... machine learning model in the form of deep neural networks. While some methods frame the RS-related classification tasks within the context of perturbation-seeking generative adversarial networks [14], some others made use of uncertainty estimation applied to deep ensembles [15] and self-attention ...