South America
Unlocking the potential of deep learning for marine ecology: overview, applications, and outlook
Goodwin, Morten, Halvorsen, Kim Tallaksen, Jiao, Lei, Knausgård, Kristian Muri, Martin, Angela Helen, Moyano, Marta, Oomen, Rebekah A., Rasmussen, Jeppe Have, Sørdalen, Tonje Knutsen, Thorbjørnsen, Susanna Huneide
The deep learning revolution is touching all scientific disciplines and corners of our lives as a means of harnessing the power of big data. Marine ecology is no exception. These new methods provide analysis of data from sensors, cameras, and acoustic recorders, even in real time, in ways that are reproducible and rapid. Off-the-shelf algorithms can find, count, and classify species from digital images or video and detect cryptic patterns in noisy data. Using these opportunities requires collaboration across ecological and data science disciplines, which can be challenging to initiate. To facilitate these collaborations and promote the use of deep learning towards ecosystem-based management of the sea, this paper aims to bridge the gap between marine ecologists and computer scientists. We provide insight into popular deep learning approaches for ecological data analysis in plain language, focusing on the techniques of supervised learning with deep neural networks, and illustrate challenges and opportunities through established and emerging applications of deep learning to marine ecology. We use established and future-looking case studies on plankton, fishes, marine mammals, pollution, and nutrient cycling that involve object detection, classification, tracking, and segmentation of visualized data. We conclude with a broad outlook of the field's opportunities and challenges, including potential technological advances and issues with managing complex data sets.
From Organisational Structure to Organisational Behaviour Formalisation
Jonker, Catholijn M., Treur, Jan
As the complexity of systems based on multiple software agents increases, as is the case, for example in the context of Internet, their dynamics are less easy to predict and to manage. A recent development is to incorporate organisation modelling methods within the software engineering process of multi-agent systems. Indeed, like complex agent-based software systems, societies are characterised by complex dynamics involving interaction between large numbers of actors and groups of actors. If within society such complex dynamics would take place in an completely unstructured, incoherent manner, any actor involved has not much to rely on to do prediction, and therefore is not able to function in a knowledgeable manner. This has serious disadvantages, which is a reason why in history within human societies organisational structure has been developed as a means to manage complex dynamics. Here it is assumed that organisational structure provides co-ordination of the processes in such a manner that a process or agent involved can function in a more adequate manner. So the basic assumption is that providing organisational structure has implications to organisational dynamics. The dynamics induced by a given organisational structure are much more dependable than in an entirely unstructured situation. It is assumed that the organisational structure itself is relatively stable, i.e., the structure may change, but the frequency and scale of change are
Insect farm uses artificial intelligence to promote food security
"The events from this year and last have shown how fragile our global food system really is," said Fotis Fotiadis, CEO and co-founder of Better Origin, the UK-based insect mini-farm. Detailing the current food security outlook in Europe, Fotiadis relayed: " While we are still trying to digest how the pandemic has affected our global food supply chain, it is becoming more evident that we cannot rely solely on imported food products." The food industry currently places reliance of our animal feed sector on soy, most of which comes from South America. "A disturbance in the global soy supply chain can, therefore, have a dramatic impact on livestock production in the UK and EU," Fotiadis observed. The solution can be found in technologies that are being developed that allow for food and feed products to be grown locally.
Turning old models fashion again: Recycling classical CNN networks using the Lattice Transformation
de Almeida, Ana Paula G. S., Vidal, Flavio de Barros
In the early 1990s, the first signs of life of the CNN era were given: LeCun et al. proposed a CNN model trained by the backpropagation algorithm to classify low-resolution images of handwritten digits. Undoubtedly, it was a breakthrough in the field of computer vision. But with the rise of other classification methods, it fell out fashion. That was until 2012, when Krizhevsky et al. revived the interest in CNNs by exhibiting considerably higher image classification accuracy on the ImageNet challenge. Since then, the complexity of the architectures are exponentially increasing and many structures are rapidly becoming obsolete. Using multistream networks as a base and the feature infusion precept, we explore the proposed LCNN cross-fusion strategy to use the backbones of former state-of-the-art networks on image classification in order to discover if the technique is able to put these designs back in the game. In this paper, we showed that we can obtain an increase of accuracy up to 63.21% on the NORB dataset we comparing with the original structure. However, no technique is definitive. While our goal is to try to reuse previous state-of-the-art architectures with few modifications, we also expose the disadvantages of our explored strategy.
Identifying and Mitigating Gender Bias in Hyperbolic Word Embeddings
Kumar, Vaibhav, Bhotia, Tenzin Singhay, Kumar, Vaibhav, Chakraborty, Tanmoy
Euclidean word embedding models such as GloVe and Word2Vec have been shown to reflect human-like gender biases. In this paper, we extend the study of gender bias to the recently popularized hyperbolic word embeddings. We propose gyrocosine bias, a novel measure for quantifying gender bias in hyperbolic word representations and observe a significant presence of gender bias. To address this problem, we propose Poincar\'e Gender Debias (PGD), a novel debiasing procedure for hyperbolic word representations. Experiments on a suit of evaluation tests show that PGD effectively reduces bias while adding a minimal semantic offset.
Real-Time Glaucoma Detection from Digital Fundus Images using Self-ONNs
Devecioglu, Ozer Can, Malik, Junaid, Ince, Turker, Kiranyaz, Serkan, Atalay, Eray, Gabbouj, Moncef
Glaucoma leads to permanent vision disability by damaging the optical nerve that transmits visual images to the brain. The fact that glaucoma does not show any symptoms as it progresses and cannot be stopped at the later stages, makes it critical to be diagnosed in its early stages. Although various deep learning models have been applied for detecting glaucoma from digital fundus images, due to the scarcity of labeled data, their generalization performance was limited along with high computational complexity and special hardware requirements. In this study, compact Self-Organized Operational Neural Networks (Self- ONNs) are proposed for early detection of glaucoma in fundus images and their performance is compared against the conventional (deep) Convolutional Neural Networks (CNNs) over three benchmark datasets: ACRIMA, RIM-ONE, and ESOGU. The experimental results demonstrate that Self-ONNs not only achieve superior detection performance but can also significantly reduce the computational complexity making it a potentially suitable network model for biomedical datasets especially when the data is scarce.
Spectroscopy and Chemometrics/Machine Learning News Weekly #38, 2021
Classification and adulterant detection in" LINK "Agronomy: Phenotyping and Validation of Root Morphological Traits in Barley (Hordeum vulgare L.)" LINK "Comparison of wavelength selected methods for improving of prediction performance of PLS model to determine aflatoxin B1 (AFB1) in wheat samples during storage" LINK "A Brief History of Whiskey Adulteration and the Role of Spectroscopy Combined with Chemometrics in the Detection of Modern Whiskey Fraud" LINK Equipment for Spectroscopy "Feasibility study of detecting palm oil adulteration with recycled cooking oil using handheld Near-infrared spectrometer" LINK Environment NIR-Spectroscopy Application "Spatial Prediction of Calcium Carbonate and Clay Content in Soils using Airborne Hyperspectral Data" LINK "Monitoring the soil copper pollution degree based on the reflectance spectrum of an arid desert plant" LINK "Micromachines: Visualization of Local Concentration and Viscosity Distribution during Glycerol-Water Mixing in a Y-Shape ...
Click-through Rate Prediction with Auto-Quantized Contrastive Learning
Pan, Yujie, Yao, Jiangchao, Han, Bo, Jia, Kunyang, Zhang, Ya, Yang, Hongxia
Click-through rate (CTR) prediction becomes indispensable in ubiquitous web recommendation applications. Nevertheless, the current methods are struggling under the cold-start scenarios where the user interactions are extremely sparse. We consider this problem as an automatic identification about whether the user behaviors are rich enough to capture the interests for prediction, and propose an Auto-Quantized Contrastive Learning (AQCL) loss to regularize the model. Different from previous methods, AQCL explores both the instance-instance and the instance-cluster similarity to robustify the latent representation, and automatically reduces the information loss to the active users due to the quantization. The proposed framework is agnostic to different model architectures and can be trained in an end-to-end fashion. Extensive results show that it consistently improves the current state-of-the-art CTR models.
A spatiotemporal machine learning approach to forecasting COVID-19 incidence at the county level in the United States
Lucas, Benjamin, Vahedi, Behzad, Karimzadeh, Morteza
With COVID-19 affecting every country globally and changing everyday life, the ability to forecast the spread of the disease is more important than any previous epidemic. The conventional methods of disease-spread modeling, compartmental models, are based on the assumption of spatiotemporal homogeneity of the spread of the virus, which may cause forecasting to underperform, especially at high spatial resolutions. In this paper we approach the forecasting task with an alternative technique - spatiotemporal machine learning. We present COVID-LSTM, a data-driven model based on a Long Short-term Memory deep learning architecture for forecasting COVID-19 incidence at the county-level in the US. We use the weekly number of new positive cases as temporal input, and hand-engineered spatial features from Facebook movement and connectedness datasets to capture the spread of the disease in time and space. COVID-LSTM outperforms the COVID-19 Forecast Hub's Ensemble model (COVIDhub-ensemble) on our 17-week evaluation period, making it the first model to be more accurate than the COVIDhub-ensemble over one or more forecast periods. Over the 4-week forecast horizon, our model is on average 50 cases per county more accurate than the COVIDhub-ensemble. We highlight that the underutilization of data-driven forecasting of disease spread prior to COVID-19 is likely due to the lack of sufficient data available for previous diseases, in addition to the recent advances in machine learning methods for spatiotemporal forecasting. We discuss the impediments to the wider uptake of data-driven forecasting, and whether it is likely that more deep learning-based models will be used in the future.
Path Based Hierarchical Clustering on Knowledge Graphs
Pietrasik, Marcin, Reformat, Marek
Knowledge graphs have emerged as a widely adopted medium for storing relational data, making methods for automatically reasoning with them highly desirable. In this paper, we present a novel approach for inducing a hierarchy of subject clusters, building upon our earlier work done in taxonomy induction. Our method first constructs a tag hierarchy before assigning subjects to clusters on this hierarchy. We quantitatively demonstrate our method's ability to induce a coherent cluster hierarchy on three real-world datasets.