Goto

Collaborating Authors

 Information Fusion


The Extended Dawid-Skene Model: Fusing Information from Multiple Data Schemas

arXiv.org Machine Learning

While label fusion from multiple noisy annotations is a well understood concept in data wrangling (tackled for example by the Dawid-Skene (DS) model), we consider the extended problem of carrying out learning when the labels themselves are not consistently annotated with the same schema. We show that even if annotators use disparate, albeit related, label-sets, we can still draw inferences for the underlying full label-set. We propose the Inter-Schema AdapteR (ISAR) to translate the fully-specified label-set to the one used by each annotator, enabling learning under such heterogeneous schemas, without the need to re-annotate the data. We apply our method to a mouse behavioural dataset, achieving significant gains (compared with DS) in out-of-sample log-likelihood (-3.40 to -2.39) and F1-score (0.785 to 0.864).


Reasoning about disclosure in data integration in the presence of source constraints

arXiv.org Artificial Intelligence

Data integration systems allow users to access data sitting in multiple sources by means of queries over a global schema, related to the sources via mappings. Data sources often contain sensitive information, and thus an analysis is needed to verify that a schema satisfies a privacy policy, given as a set of queries whose answers should not be accessible to users. Such an analysis should take into account not only knowledge that an attacker may have about the mappings, but also what they may know about the semantics of the sources. In this paper, we show that source constraints can have a dramatic impact on disclosure analysis. We study the problem of determining whether a given data integration system discloses a source query to an attacker in the presence of constraints, providing both lower and upper bounds on source-aware disclosure analysis.


Predicting Crop Losses using Machine Learning CGIAR Platform for Big Data in Agriculture

#artificialintelligence

Timely and accurate agricultural impact assessments for droughts are critical for designing appropriate interventions and policy. These assessments are often ad hoc, late, or spatially imprecise, with reporting at the zonal or regional level. This is problematic as we find substantial variability in losses at the village-level, which is missing when reporting at the zonal level. In this paper, we propose a new data fusion method--combining remotely sensed data with agricultural survey data--that might address these limitations. We apply the method to Ethiopia, which is regularly hit by droughts and is a substantial recipient of ad hoc imported food aid.


RGB and LiDAR fusion based 3D Semantic Segmentation for Autonomous Driving

arXiv.org Artificial Intelligence

LiDAR has become a standard sensor for autonomous driving applications as they provide highly precise 3D point clouds. LiDAR is also robust for low-light scenarios at night-time or due to shadows where the performance of cameras is degraded. LiDAR perception is gradually becoming mature for algorithms including object detection and SLAM. However, semantic segmentation algorithm remains to be relatively less explored. Motivated by the fact that semantic segmentation is a mature algorithm on image data, we explore sensor fusion based 3D segmentation. To the best of our knowledge, this is the first attempt at RGB and LiDAR based 3D segmentation for autonomous driving. Our main contribution is to convert the RGB image to a polar-grid mapping representation used for LiDAR and design early and mid-level fusion architectures. Additionally, we design a hybrid fusion architecture that combines both fusion algorithms. We evaluate our algorithm on KITTI dataset which provides segmentation annotation for cars, pedestrians and cyclists. We evaluate two state-of-the-art architectures namely SqueezeSeg and PointSeg and improve the mIoU score by 10 % in both cases relative to the LiDAR only baseline.


A survey of advances in vision-based vehicle re-identification

arXiv.org Artificial Intelligence

Vehicle re-identification (V-reID) has become significantly popular in the community due to its applications and research significance. In particular, the V-reID is an important problem that still faces numerous open challenges. This paper reviews different V-reID methods including sensor based methods, hybrid methods, and vision based methods which are further categorized into hand-crafted feature based methods and deep feature based methods. The vision based methods make the V-reID problem particularly interesting, and our review systematically addresses and evaluates these methods for the first time. We conduct experiments on four comprehensive benchmark datasets and compare the performances of recent hand-crafted feature based methods and deep feature based methods. We present the detail analysis of these methods in terms of mean average precision (mAP) and cumulative matching curve (CMC). These analyses provide objective insight into the strengths and weaknesses of these methods. We also provide the details of different V-reID datasets and critically discuss the challenges and future trends of V-reID methods.


Fusion of Heterogeneous Earth Observation Data for the Classification of Local Climate Zones

arXiv.org Machine Learning

This paper proposes a novel framework for fusing multi-temporal, multispectral satellite images and OpenStreetMap (OSM) data for the classification of local climate zones (LCZs). Feature stacking is the most commonly-used method of data fusion but does not consider the heterogeneity of multimodal optical images and OSM data, which becomes its main drawback. The proposed framework processes two data sources separately and then combines them at the model level through two fusion models (the landuse fusion model and building fusion model), which aim to fuse optical images with landuse and buildings layers of OSM data, respectively. In addition, a new approach to detecting building incompleteness of OSM data is proposed. The proposed framework was trained and tested using data from the 2017 IEEE GRSS Data Fusion Contest, and further validated on one additional test set containing test samples which are manually labeled in Munich and New York. Experimental results have indicated that compared to the feature stacking-based baseline framework the proposed framework is effective in fusing optical images with OSM data for the classification of LCZs with high generalization capability on a large scale. The classification accuracy of the proposed framework outperforms the baseline framework by more than 6% and 2%, while testing on the test set of 2017 IEEE GRSS Data Fusion Contest and the additional test set, respectively. In addition, the proposed framework is less sensitive to spectral diversities of optical satellite images and thus achieves more stable classification performance than state-of-the art frameworks.


How to Solve Data Integration Challenges with AI

#artificialintelligence

Data is streaming through businesses at increasingly faster rates, creating time-to-insight and time-to-action data integration challenges which can be crucial to rising above competitors. As fast access to information becomes a greater demand for businesses, a more apparent challenge is in condensing data into useful information for developing insights. Many companies are spending more time and resources on increasing data and preparing it than they are on analysis. Agile, efficient, flexible data integration strategies are crucial to expanding big data analysis. If you're looking for easier ways to handle data integration projects, you may want to try looking into introducing artificial intelligence (AI) capabilities to your data integration strategies toolkit.


Noise-cancelling headsets worn by soldiers can reveal the position of a sniper after a single shot

Daily Mail - Science & tech

Locations of enemy snipers shooting at troops may soon be revealed instantly on the smartphones of the ambushed troops. Cutting-edge audio technology is being developed to use microphones in the ears of the soldiers to track two notable noises from a bullet - supersonic air in front of the bullet and the blast as it leaves the muzzle. Technology is being developed to use these two sounds to trace the original location and reveal where it was fired from. The data and location will then be relayed to the handset of the beleaguered troops to help them identify and neutralise the threat. Audio experts that developed the technology say it builds on existing technology and could be employed on the battlefield in just two years.


Data Integration Engineer - IoT BigData Jobs

#artificialintelligence

C3 IoT has an opening for a Data Integration Engineer. You will be required to create advanced application integration solutions and configure, deploy and enhance enterprise cloud applications. C3 IoT product suite is entirely data-driven, so a great candidate will have a passion for acquiring, analyzing, and transforming data to generate insight. Qualified candidates will have a solid knowledge of integration and data manipulation technologies.


Human Activity Recognition Using Visual Object Detection

arXiv.org Machine Learning

Visual Human Activity Recognition (HAR) and data fusion with other sensors can help us at tracking the behavior and activity of underground miners with little obstruction. Existing models, such as Single Shot Detector (SSD), trained on the Common Objects in Context (COCO) dataset is used in this paper to detect the current state of a miner, such as an injured miner vs a non-injured miner. Tensorflow is used for the abstraction layer of implementing machine learning algorithms, and although it uses Python to deal with nodes and tensors, the actual algorithms run on C++ libraries, providing a good balance between performance and speed of development. The paper further discusses evaluation methods for determining the accuracy of the machine-learning and an approach to increase the accuracy of the detected activity/state of people in a mining environment, by means of data fusion.