Goto

Collaborating Authors

 Materials


Material Segmentation of Multi-View Satellite Imagery

arXiv.org Artificial Intelligence

Material recognition methods use image context and local cues for pixel-wise classification. In many cases only a single image is available to make a material prediction. Image sequences, routinely acquired in applications such as mutliview stereo, can provide a sampling of the underlying reflectance functions that reveal pixel-level material attributes. We investigate multi-view material segmentation using two datasets generated for building material segmentation and scene material segmentation from the SpaceNet Challenge satellite image dataset. In this paper, we explore the impact of multi-angle reflectance information by introducing the \textit{reflectance residual encoding}, which captures both the multi-angle and multispectral information present in our datasets. The residuals are computed by differencing the sparse-sampled reflectance function with a dictionary of pre-defined dense-sampled reflectance functions. Our proposed reflectance residual features improves material segmentation performance when integrated into pixel-wise and semantic segmentation architectures. At test time, predictions from individual segmentations are combined through softmax fusion and refined by building segment voting. We demonstrate robust and accurate pixelwise segmentation results using the proposed material segmentation pipeline.


Noisy Quantum Computers Could Be Good for Chemistry Problems

WIRED

Scientists and researchers have long extolled the extraordinary potential capabilities of universal quantum computers, like simulating physical and natural processes or breaking cryptographic codes in practical time frames. Yet important developments in the technology--the ability to fabricate the necessary number of high-quality qubits (the basic units of quantum information) and gates (elementary operations between qubits)--is most likely still decades away. However, there is a class of quantum devices--ones that currently exist--that could address otherwise intractable problems much sooner than that. These near-term quantum devices, coined Noisy Intermediate-Scale Quantum (NISQ) by Caltech professor John Preskill, are single-purpose, highly imperfect, and modestly sized. Dr. Anton Toutov is the cofounder and chief science officer of Fuzionaire and holds a PhD in organic chemistry from Caltech.


Nassim Taleb's Case Against Nate Silver Is Bad Math - Facts So Romantic

Nautilus

It began, late last year, with Silver boasting about the success of his election models and Taleb shooting back that Silver doesn't "know how math works." Silver said Taleb was "consumed by anger" and hadn't had any new ideas since 2001. The argument has gotten personal, with Silver calling Taleb an "intellectual-yet-idiot" (an insult taken from Taleb's own book) and Taleb calling Silver "klueless" and "butthurt." Here is a recap of what they're fighting about so you can know who's right (Silver, mostly) and who's wrong (Taleb). The origin of Taleb's ire can be found in Silver's success since 2008--and his some-time failures. As I described in Nautilus last month, evaluating probabilistic election forecasts can be conceptually slippery, made especially difficult by the counterintuitive properties of mathematical probability.


Plant-wide fault and disturbance screening using combined transfer entropy and eigenvector centrality analysis

arXiv.org Artificial Intelligence

Finding the source of a disturbance or fault in complex systems such as industrial chemical processing plants can be a difficult task and consume a significant number of engineering hours. In many cases, a systematic elimination procedure is considered to be the only feasible approach but can cause undesired process upsets. Practitioners desire robust alternative approaches. This paper presents an unsupervised, data-driven method for ranking process elements according to the magnitude and novelty of their influence. Partial bivariate transfer entropy estimation is used to infer a weighted directed graph of process elements. Eigenvector centrality is applied to rank network nodes according to their overall effect. As the ranking of process elements rely on emerging properties that depend on the aggregate of many connections, the results are robust to errors in the estimation of individual edge properties and the inclusion of indirect connections that do not represent the true causal structure of the process. A monitoring chart of continuously calculated process element importance scores over multiple overlapping time regions can assist with incipient fault detection. Ranking results combined with visual inspection of information transfer networks is also useful for root cause analysis of known faults and disturbances. A software implementation of the proposed method is available.


How artificial intelligence is helping farmers and babies in the developing world

#artificialintelligence

Businesses and nonprofits are finding novel ways to employ artificial intelligence in the developing world, using the tools to improve agriculture yields, infant health care, and entrepreneur earnings, according to speakers at MIT Technology Review's EmTech Digital conference in San Francisco on Tuesday. Solomon Assefa, who oversees IBM's research labs in Kenya and South Africa, said the company has been using AI to more accurately predict crop yields in specific regions, based on shifting weather patterns, soil moisture, and other conditions. This insight into growing conditions has helped local farmers raise financing to expand their operations, or make better decisions about the right seeds, appropriate fertilizer, and ideal times to plant and harvest. Separately, the tech giant's research lab has partnered with a startup, Hello Tractor, that links farmers in need of tractors with owners looking to lease equipment. By forecasting demand for the vehicles, IBM has also helped owners raise money to expand their fleet, boosting their profits, Assefa said.


Are Learned Molecular Representations Ready For Prime Time?

arXiv.org Machine Learning

Advancements in neural machinery have led to a wide range of algorithmic solutions for molecular property prediction. Two classes of models in particular have yielded promising results: neural networks applied to computed molecular fingerprints or expert-crafted descriptors, and graph convolutional neural networks that construct a learned molecular representation by operating on the graph structure of the molecule. However, recent literature has yet to clearly determine which of these two methods is superior when generalizing to new chemical space. Furthermore, prior research has rarely examined these new models in industry research settings in comparison to existing employed models. In this paper, we benchmark models extensively on 19 public and 15 proprietary industrial datasets spanning a wide variety of chemical endpoints. In addition, we introduce a graph convolutional model that consistently outperforms models using fixed molecular descriptors as well as previous graph neural architectures on both public and proprietary datasets. Our empirical findings indicate that while approaches based on these representations have yet to reach the level of experimental reproducibility, our proposed model nevertheless offers significant improvements over models currently used in industrial workflows.


'Cow toilets' to cut farm ammonia emissions by collecting up to 35 pints of urine a day

Daily Mail - Science & tech

A Dutch inventor has developed a'cow toilet' to help cut ammonia emissions from cow urine that cause environmental pollution. Tests on the device, which collects around 26 to 35 pints (15 to 20 litres) of urine produced daily by a single cow, have started on a farm in the country. Inventor Henk Hanskamp claims the device could halve the ammonia emissions from cows, which account for almost half (49 per cent) of agricultural ammonia pollution. This type of contamination has multiple negative impacts on both the environment and human health and can threaten aquatic wildlife and contribute to smog. The small-scale tests are being conducted in the Netherlands, the world's second-biggest agricultural exporter behind the United States.


The Impact of Extraneous Variables on the Performance of Recurrent Neural Network Models in Clinical Tasks

arXiv.org Machine Learning

Electronic Medical Records (EMR) are a rich source of patient information, including measurements reflecting physiologic signs and administered therapies. Identifying which variables are useful in predicting clinical outcomes can be challenging. Advanced algorithms such as deep neural networks were designed to process high-dimensional inputs containing variables in their measured form, thus bypass separate feature selection or engineering steps. We investigated the effect of extraneous input variables on the predictive performance of Recurrent Neural Networks (RNN) by including in the input vector extraneous variables randomly drawn from theoretical and empirical distributions. RNN models using different input vectors (EMR variables; EMR and extraneous variables; extraneous variables only) were trained to predict three clinical outcomes: in-ICU mortality, 72-hour ICU re-admission, and 30-day ICU-free days. The measured degradations of the RNN's predictive performance with the addition of extraneous variables to EMR variables were negligible.


Transfer Learning Using Ensemble Neural Networks for Organic Solar Cell Screening

arXiv.org Machine Learning

Organic Solar Cells are a promising technology for solving the clean energy crisis in the world. However, generating candidate chemical compounds for solar cells is a time-consuming process requiring thousands of hours of laboratory analysis. For a solar cell, the most important property is the power conversion efficiency which is dependent on the highest occupied molecular orbitals (HOMO) values of the donor molecules. Recently, machine learning techniques have proved to be very useful in building predictive models for HOMO values of donor structures of Organic Photovoltaic Cells (OPVs). Since experimental datasets are limited in size, current machine learning models are trained on data derived from calculations based on density functional theory (DFT). Molecular line notations such as SMILES or InChI are popular input representations for describing the molecular structure of donor molecules. The two types of line representations encode different information, such as SMILES defines the bond types while InChi defines protonation. In this work, we present an ensemble deep neural network architecture, called SINet, which harnesses both the SMILES and InChI molecular representations to predict HOMO values and leverage the potential of transfer learning from a sizeable DFT-computed dataset- Harvard CEP to build more robust predictive models for relatively smaller HOPV datasets. Harvard CEP dataset contains molecular structures and properties for 2.3 million candidate donor structures for OPV while HOPV contains DFT-computed and experimental values of 350 and 243 molecules respectively. Our results demonstrate significant performance improvement from the use of transfer learning and leveraging both molecular representations.


Measuring the Similarity between Materials with an Emphasis on the Materials Distinctiveness

arXiv.org Machine Learning

In this study, we establish a basis for selecting similarity measures when applying machine learning techniques to solve materials science problems. This selection is considered with an emphasis on the distinctiveness between materials that reflect their nature well. We perform a case study with a dataset of rare-earth transition metal crystalline compounds represented using the Orbital Field Matrix descriptor and the Coulomb Matrix descriptor. We perform predictions of the formation energies using k-nearest neighbors regression, ridge regression, and kernel ridge regression. Through detailed analyses of the yield prediction accuracy, we examine the relationship between the characteristics of the material representation and similarity measures, and the complexity of the energy function they can capture. Empirical experiments and theoretical analysis reveal that similarity measures and kernels that minimize the loss of materials distinctiveness improve the prediction performance.