Goto

Collaborating Authors

 Atlantic Ocean


Eigenvalue initialisation and regularisation for Koopman autoencoders

arXiv.org Artificial Intelligence

Regularising the parameter matrices of neural networks is ubiquitous in training deep models. Typical regularisation approaches suggest initialising weights using small random values, and to penalise weights to promote sparsity. However, these widely used techniques may be less effective in certain scenarios. Here, we study the Koopman autoencoder model which includes an encoder, a Koopman operator layer, and a decoder. These models have been designed and dedicated to tackle physics-related problems with interpretable dynamics and an ability to incorporate physics-related constraints. However, the majority of existing work employs standard regularisation practices. In our work, we take a step toward augmenting Koopman autoencoders with initialisation and penalty schemes tailored for physics-related settings. Specifically, we propose the "eigeninit" initialisation scheme that samples initial Koopman operators from specific eigenvalue distributions. In addition, we suggest the "eigenloss" penalty scheme that penalises the eigenvalues of the Koopman operator during training. We demonstrate the utility of these schemes on two synthetic data sets: a driven pendulum and flow past a cylinder; and two real-world problems: ocean surface temperatures and cyclone wind fields. We find on these datasets that eigenloss and eigeninit improves the convergence rate by up to a factor of 5, and that they reduce the cumulative long-term prediction error by up to a factor of 3. Such a finding points to the utility of incorporating similar schemes as an inductive bias in other physics-related deep learning approaches.


Towards Sustainable Artificial Intelligence: An Overview of Environmental Protection Uses and Issues

arXiv.org Artificial Intelligence

Artificial Intelligence (AI) is used to create more sustainable production methods and model climate change, making it a valuable tool in the fight against environmental degradation. This paper describes the paradox of an energy-consuming technology serving the ecological challenges of tomorrow. The study provides an overview of the sectors that use AI-based solutions for environmental protection. It draws on numerous examples from AI for Green players to present use cases and concrete examples. In the second part of the study, the negative impacts of AI on the environment and the emerging technological solutions to support Green AI are examined. It is also shown that the research on less energy-consuming AI is motivated more by cost and energy autonomy constraints than by environmental considerations. This leads to a rebound effect that favors an increase in the complexity of models. Finally, the need to integrate environmental indicators into algorithms is discussed. The environmental dimension is part of the broader ethical problem of AI, and addressing it is crucial for ensuring the sustainability of AI in the long term.


Set-Transformer BeamsNet for AUV Velocity Forecasting in Complete DVL Outage Scenarios

arXiv.org Artificial Intelligence

Autonomous underwater vehicles (AUVs) are regularly used for deep ocean applications. Commonly, the autonomous navigation task is carried out by a fusion between two sensors: the inertial navigation system and the Doppler velocity log (DVL). The DVL operates by transmitting four acoustic beams to the sea floor, and once reflected back, the AUV velocity vector can be estimated. However, in real-life scenarios, such as an uneven seabed, sea creatures blocking the DVL's view and, roll/pitch maneuvers, the acoustic beams' reflection is resulting in a scenario known as DVL outage. Consequently, a velocity update is not available to bind the inertial solution drift. To cope with such situations, in this paper, we leverage our BeamsNet framework and propose a Set-Transformer-based BeamsNet (ST-BeamsNet) that utilizes inertial data readings and previous DVL velocity measurements to regress the current AUV velocity in case of a complete DVL outage. The proposed approach was evaluated using data from experiments held in the Mediterranean Sea with the Snapir AUV and was compared to a moving average (MA) estimator. Our ST-BeamsNet estimated the AUV velocity vector with an 8.547% speed error, which is 26% better than the MA approach.


Controllable Text Generation with Language Constraints

arXiv.org Artificial Intelligence

We consider the task of text generation in language models with constraints specified in natural language. To this end, we first create a challenging benchmark Cognac that provides as input to the model a topic with example text, along with a constraint on text to be avoided. Unlike prior work, our benchmark contains knowledge-intensive constraints sourced from databases like Wordnet and Wikidata, which allows for straightforward evaluation while striking a balance between broad attribute-level and narrow lexical-level controls. We find that even state-of-the-art language models like GPT-3 fail often on this task, and propose a solution to leverage a language model's own internal knowledge to guide generation. Our method, called CognacGen, first queries the language model to generate guidance terms for a specified topic or constraint, and uses the guidance to modify the model's token generation probabilities. We propose three forms of guidance (binary verifier, top-k tokens, textual example), and employ prefix-tuning approaches to distill the guidance to tackle diverse natural language constraints. Through extensive empirical evaluations, we demonstrate that CognacGen can successfully generalize to unseen instructions and outperform competitive baselines in generating constraint conforming text.


Dolphins discovered with signs of Alzheimer's disease in their brains

Daily Mail - Science & tech

Stranded dolphins have been discovered with brain changes associated with Alzheimer's disease in humans. Researchers from the University of Glasgow studied the brains of 22 odontocetes - toothed whales - thathad died in coastal waters off Scotland. One bottlenose dolphin, one white-beaked dolphin and two long-finned pilot whales had accumulated amyloid-beta plaques, which is a hallmark of dementia. The researchers say these ill creatures could have led their otherwise healthy group, or pod, into shallow waters by mistake after getting confused or lost. Whales, dolphins and porpoises are regularly found stranded in shallow waters or beaches around the UK coastline, and often in pods.


An ensemble neural network approach to forecast Dengue outbreak based on climatic condition

arXiv.org Artificial Intelligence

Dengue fever is a virulent disease spreading over 100 tropical and subtropical countries in Africa, the Americas, and Asia. This arboviral disease affects around 400 million people globally, severely distressing the healthcare systems. The unavailability of a specific drug and ready-to-use vaccine makes the situation worse. Hence, policymakers must rely on early warning systems to control intervention-related decisions. Forecasts routinely provide critical information for dangerous epidemic events. However, the available forecasting models (e.g., weather-driven mechanistic, statistical time series, and machine learning models) lack a clear understanding of different components to improve prediction accuracy and often provide unstable and unreliable forecasts. This study proposes an ensemble wavelet neural network with exogenous factor(s) (XEWNet) model that can produce reliable estimates for dengue outbreak prediction for three geographical regions, namely San Juan, Iquitos, and Ahmedabad. The proposed XEWNet model is flexible and can easily incorporate exogenous climate variable(s) confirmed by statistical causality tests in its scalable framework. The proposed model is an integrated approach that uses wavelet transformation into an ensemble neural network framework that helps in generating more reliable long-term forecasts. The proposed XEWNet allows complex non-linear relationships between the dengue incidence cases and rainfall; however, mathematically interpretable, fast in execution, and easily comprehensible. The proposal's competitiveness is measured using computational experiments based on various statistical metrics and several statistical comparison tests. In comparison with statistical, machine learning, and deep learning methods, our proposed XEWNet performs better in 75% of the cases for short-term and long-term forecasting of dengue incidence.


Robust Recurrent Neural Network to Identify Ship Motion in Open Water with Performance Guarantees -- Technical Report

arXiv.org Artificial Intelligence

Recurrent neural networks are capable of learning the dynamics of an unknown nonlinear system purely from input-output measurements. However, the resulting models do not provide any stability guarantees on the input-output mapping. In this work, we represent a recurrent neural network as a linear time-invariant system with nonlinear disturbances. By introducing constraints on the parameters, we can guarantee finite gain stability and incremental finite gain stability. We apply this identification method to learn the motion of a four-degrees-of-freedom ship that is moving in open water and compare it against other purely learning-based approaches with unconstrained parameters. Our analysis shows that the constrained recurrent neural network has a lower prediction accuracy on the test set, but it achieves comparable results on an out-of-distribution set and respects stability conditions.


Dismantling Sellafield: the epic task of shutting down a nuclear site

The Guardian > Energy

If you take the cosmic view of Sellafield, the superannuated nuclear facility in north-west England, its story began long before the Earth took shape. About 9bn years ago, tens of thousands of giant stars ran out of fuel, collapsed upon themselves, and then exploded. Flung out by such explosions, trillions of tonnes of uranium traversed the cold universe and wound up near our slowly materialising solar system. And here, over roughly 20m years, the uranium and other bits of space dust and debris cohered to form our planet in such a way that the violent tectonics of the young Earth pushed the uranium not towards its hot core but up into the folds of its crust. Within reach, so to speak, of the humans who eventually came along circa 300,000BC, and who mined the uranium beginning in the 1500s, learned about its radioactivity in 1896 and started feeding it into their nuclear reactors 70-odd years ago, making electricity that could be relayed to their houses to run toasters and light up Christmas trees. Sellafield compels this kind of gaze into the abyss of deep time because it is a place where multiple time spans – some fleeting, some cosmic – drift in and out of view. Laid out over six square kilometres, Sellafield is like a small town, with nearly a thousand buildings, its own roads and even a rail siding – all owned by the government, and requiring security clearance to visit. Sellafield's presence, at the end of a road on the Cumbrian coast, is almost hallucinatory. Then, having driven through a high-security gate, you're surrounded by towering chimneys, pipework, chugging cooling plants, everything dressed in steampunk. The sun bounces off metal everywhere. In some spots, the air shakes with the noise of machinery. It feels like the most manmade place in the world. Since it began operating in 1950, Sellafield has had different duties. First it manufactured plutonium for nuclear weapons.


Machine learning versus data science – demystifying the scene

#artificialintelligence

Who better to ask about artificial intelligence (AI) than the current darling of the scene, ChatGPT? Its answer ('Machine learning and data science are closely related fields, but they are not the same thing') is a useful starting point. But unpacking the differences between machine learning versus data science requires human effort, for the time being at least. Until the machines take over. If you are new to machine learning, it's worth skipping back to the late 1950s to gain an understanding of its origins.


Regression modelling of spatiotemporal extreme U.S. wildfires via partially-interpretable neural networks

arXiv.org Artificial Intelligence

Risk management in many environmental settings requires an understanding of the mechanisms that drive extreme events. Useful metrics for quantifying such risk are extreme quantiles of response variables conditioned on predictor variables that describe, e.g., climate, biosphere and environmental states. Typically these quantiles lie outside the range of observable data and so, for estimation, require specification of parametric extreme value models within a regression framework. Classical approaches in this context utilise linear or additive relationships between predictor and response variables and suffer in either their predictive capabilities or computational efficiency; moreover, their simplicity is unlikely to capture the truly complex structures that lead to the creation of extreme wildfires. In this paper, we propose a new methodological framework for performing extreme quantile regression using artificial neutral networks, which are able to capture complex non-linear relationships and scale well to high-dimensional data. The ``black box" nature of neural networks means that they lack the desirable trait of interpretability often favoured by practitioners; thus, we unify linear, and additive, regression methodology with deep learning to create partially-interpretable neural networks that can be used for statistical inference but retain high prediction accuracy. To complement this methodology, we further propose a novel point process model for extreme values which overcomes the finite lower-endpoint problem associated with the generalised extreme value class of distributions. Efficacy of our unified framework is illustrated on U.S. wildfire data with a high-dimensional predictor set and we illustrate vast improvements in predictive performance over linear and spline-based regression techniques.