AITopics | Arctic Ocean

Collaborating Authors

Arctic Ocean

REPLUG: Retrieval-Augmented Black-Box Language Models

Shi, Weijia, Min, Sewon, Yasunaga, Michihiro, Seo, Minjoon, James, Rich, Lewis, Mike, Zettlemoyer, Luke, Yih, Wen-tau

arXiv.org Artificial IntelligenceMay-24-2023

We introduce REPLUG, a retrieval-augmented language modeling framework that treats the language model (LM) as a black box and augments it with a tuneable retrieval model. Unlike prior retrieval-augmented LMs that train language models with special cross attention mechanisms to encode the retrieved text, REPLUG simply prepends retrieved documents to the input for the frozen black-box LM. This simple design can be easily applied to any existing retrieval and language models. Furthermore, we show that the LM can be used to supervise the retrieval model, which can then find documents that help the LM make better predictions. Our experiments demonstrate that REPLUG with the tuned retriever significantly improves the performance of GPT-3 (175B) on language modeling by 6.3%, as well as the performance of Codex on five-shot MMLU by 5.1%.

artificial intelligence, large language model, natural language, (18 more...)

arXiv.org Artificial Intelligence

2301.12652

Country:

Europe > Germany (0.04)
Asia > Japan (0.04)
Asia > India (0.04)
(14 more...)

Genre: Research Report (0.50)

Industry:

Leisure & Entertainment (1.00)
Transportation > Air (0.84)
Energy > Renewable > Solar (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.71)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)

Add feedback

A benchmark for computational analysis of animal behavior, using animal-borne tags

Hoffman, Benjamin, Cusimano, Maddie, Baglione, Vittorio, Canestrari, Daniela, Chevallier, Damien, DeSantis, Dominic L., Jeantet, Lorène, Ladds, Monique A., Maekawa, Takuya, Mata-Silva, Vicente, Moreno-González, Víctor, Trapote, Eva, Vainio, Outi, Vehkaoja, Antti, Yoda, Ken, Zacarian, Katherine, Friedlaender, Ari, Rutz, Christian

arXiv.org Artificial IntelligenceMay-18-2023

Animal-borne sensors ('bio-loggers') can record a suite of kinematic and environmental data, which can elucidate animal ecophysiology and improve conservation efforts. Machine learning techniques are useful for interpreting the large amounts of data recorded by bio-loggers, but there exists no standard for comparing the different machine learning techniques in this domain. To address this, we present the Bio-logger Ethogram Benchmark (BEBE), a collection of datasets with behavioral annotations, standardized modeling tasks, and evaluation metrics. BEBE is to date the largest, most taxonomically diverse, publicly available benchmark of this type, and includes 1654 hours of data collected from 149 individuals across nine taxa. We evaluate the performance of ten different machine learning methods on BEBE, and identify key challenges to be addressed in future work. Datasets, models, and evaluation code are made publicly available at https://github.com/earthspecies/BEBE, to enable community use of BEBE as a point of comparison in methods development.

artificial intelligence, deep learning, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2305.1074

Country:

North America > Martinique (0.04)
Oceania > New Zealand (0.04)
Europe > Finland > Uusimaa > Helsinki (0.04)
(12 more...)

Genre: Research Report > New Finding (0.92)

Industry:

Information Technology (0.68)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.69)

Add feedback

Balloons, 'objects' – what's in the sky above the US?

Al JazeeraFeb-16-2023, 18:43:03 GMT

Los Angeles, California – The United States military shot down a flurry of objects this month: a large object it identified as a Chinese surveillance balloon followed by three smaller objects that the government said might be "benign". The airborne objects were drifting through airspace increasingly crowded with commercial and amateur balloons, drones and possible aerial surveillance craft belonging to adversaries. Their rising numbers pose a challenge to aviators and government agencies. Experts say that while heavy commercial balloons must meet strict Federal Aviation Administration (FAA) regulations, lighter amateur balloons are exempt from most rules, and the FAA might not be able to track them. Military and intelligence officials found no evidence that the three smaller objects were conducting surveillance for another country, and they were not sending communication signals, National Security Council spokesman John Kirby said at a White House briefing on Monday.

balloon, chinese surveillance balloon, nelson, (13 more...)

Al Jazeera

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.55)
North America > United States > Alaska (0.05)
Asia > China (0.05)
(7 more...)

Industry:

Transportation > Infrastructure & Services (1.00)
Transportation > Air (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Government > Military (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.35)

Add feedback

Using Artificial Intelligence to aid Scientific Discovery of Climate Tipping Points

Sleeman, Jennifer, Chung, David, Ashcraft, Chace, Brett, Jay, Gnanadesikan, Anand, Kevrekidis, Yannis, Hughes, Marisa, Haine, Thomas, Pradal, Marie-Aude, Gelderloos, Renske, Tang, Caroline, Saksena, Anshu, White, Larry

arXiv.org Artificial IntelligenceFeb-14-2023

We propose a hybrid Artificial Intelligence (AI) climate modeling approach that enables climate modelers in scientific discovery using a climate-targeted simulation methodology based on a novel combination of deep neural networks and mathematical methods for modeling dynamical systems. The simulations are grounded by a neuro-symbolic language that both enables question answering of what is learned by the AI methods and provides a means of explainability. We describe how this methodology can be applied to the discovery of climate tipping points and, in particular, the collapse of the Atlantic Meridional Overturning Circulation (AMOC). We show how this methodology is able to predict AMOC collapse with a high degree of accuracy using a surrogate climate model for ocean interaction. We also show preliminary results of neuro-symbolic method performance when translating between natural language questions and symbolically learned representations. Our AI methodology shows promising early results, potentially enabling faster climate tipping point related research that would otherwise be computationally infeasible.

artificial intelligence, climate model, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2302.06852

Country:

Southern Ocean (0.05)
North America > United States > North Carolina > Durham County > Durham (0.04)
North America > United States > Maryland > Prince George's County > Laurel (0.04)
(5 more...)

Genre: Research Report (0.82)

Industry: Government (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

UW-CVGAN: UnderWater Image Enhancement with Capsules Vectors Quantization

Pucci, Rita, Micheloni, Christian, Martinel, Niki

arXiv.org Artificial IntelligenceFeb-2-2023

The degradation in the underwater images is due to wavelength-dependent light attenuation, scattering, and to the diversity of the water types in which they are captured. Deep neural networks take a step in this field, providing autonomous models able to achieve the enhancement of underwater images. We introduce Underwater Capsules Vectors GAN UWCVGAN based on the discrete features quantization paradigm from VQGAN for this task. The proposed UWCVGAN combines an encoding network, which compresses the image into its latent representation, with a decoding network, able to reconstruct the enhancement of the image from the only latent representation. In contrast with VQGAN, UWCVGAN achieves feature quantization by exploiting the clusterization ability of capsule layer, making the model completely trainable and easier to manage. The model obtains enhanced underwater images with high quality and fine details. Moreover, the trained encoder is independent of the decoder giving the possibility to be embedded onto the collector as compressing algorithm to reduce the memory space required for the images, of factor $3\times$. \myUWCVGAN{ }is validated with quantitative and qualitative analysis on benchmark datasets, and we present metrics results compared with the state of the art.

artificial intelligence, machine learning, uw-cvgan, (17 more...)

arXiv.org Artificial Intelligence

2302.01144

Country:

Europe > Italy (0.04)
Arctic Ocean > Barents Sea > White Sea (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

A Deep Learning Method for Real-time Bias Correction of Wind Field Forecasts in the Western North Pacific

Zhang, Wei, Jiang, Yueyue, Dong, Junyu, Song, Xiaojiang, Pang, Renbo, Guoan, Boyu, Yu, Hui

arXiv.org Artificial IntelligenceDec-28-2022

Forecasts by the European Centre for Medium-Range Weather Forecasts (ECMWF; EC for short) can provide a basis for the establishment of maritime-disaster warning systems, but they contain some systematic biases.The fifth-generation EC atmospheric reanalysis (ERA5) data have high accuracy, but are delayed by about 5 days. To overcome this issue, a spatiotemporal deep-learning method could be used for nonlinear mapping between EC and ERA5 data, which would improve the quality of EC wind forecast data in real time. In this study, we developed the Multi-Task-Double Encoder Trajectory Gated Recurrent Unit (MT-DETrajGRU) model, which uses an improved double-encoder forecaster architecture to model the spatiotemporal sequence of the U and V components of the wind field; we designed a multi-task learning loss function to correct wind speed and wind direction simultaneously using only one model. The study area was the western North Pacific (WNP), and real-time rolling bias corrections were made for 10-day wind-field forecasts released by the EC between December 2020 and November 2021, divided into four seasons. Compared with the original EC forecasts, after correction using the MT-DETrajGRU model the wind speed and wind direction biases in the four seasons were reduced by 8-11% and 9-14%, respectively. In addition, the proposed method modelled the data uniformly under different weather conditions. The correction performance under normal and typhoon conditions was comparable, indicating that the data-driven mode constructed here is robust and generalizable.

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.atmosres.2022.106586

2212.1416

Country:

North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
Pacific Ocean > North Pacific Ocean (0.04)
Europe > United Kingdom (0.04)
(4 more...)

Genre: Research Report > New Finding (0.49)

Industry: Energy > Renewable > Wind (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Exposure and Emergence in Usage-Based Grammar: Computational Experiments in 35 Languages

Dunn, Jonathan

arXiv.org Artificial IntelligenceNov-25-2022

This paper uses computational experiments to explore the role of exposure in the emergence of construction grammars. While usage-based grammars are hypothesized to depend on a learner's exposure to actual language use, the mechanisms of such exposure have only been studied in a few constructions in isolation. This paper experiments with (i) the growth rate of the constructicon, (ii) the convergence rate of grammars exposed to independent registers, and (iii) the rate at which constructions are forgotten when they have not been recently observed. These experiments show that the lexicon grows more quickly than the grammar and that the growth rate of the grammar is not dependent on the growth rate of the lexicon. At the same time, register-specific grammars converge onto more similar constructions as the amount of exposure increases. This means that the influence of specific registers becomes less important as exposure increases. Finally, the rate at which constructions are forgotten when they have not been recently observed mirrors the growth rate of the constructicon. This paper thus presents a computational model of usage-based grammar that includes both the emergence and the unentrenchment of constructions.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

doi: 10.18710/CES0L8

2211.1416

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Pacific Ocean (0.04)
Oceania > New Zealand > South Island > Canterbury Region > Christchurch (0.04)
(5 more...)

Genre:

Research Report > New Finding (0.92)
Research Report > Experimental Study (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
(2 more...)

Add feedback

AutoReply: Detecting Nonsense in Dialogue Introspectively with Discriminative Replies

Shi, Weiyan, Dinan, Emily, Renduchintala, Adi, Fried, Daniel, Jacob, Athul Paul, Yu, Zhou, Lewis, Mike

arXiv.org Artificial IntelligenceNov-22-2022

Existing approaches built separate classifiers to detect nonsense in dialogues. In this paper, we show that without external classifiers, dialogue models can detect errors in their own messages introspectively, by calculating the likelihood of replies that are indicative of poor messages. For example, if an agent believes its partner is likely to respond "I don't understand" to a candidate message, that message may not make sense, so an alternative message should be chosen. We evaluate our approach on a dataset from the game Diplomacy, which contains long dialogues richly grounded in the game state, on which existing models make many errors. We first show that hand-crafted replies can be effective for the task of detecting nonsense in applications as complex as Diplomacy. We then design AutoReply, an algorithm to search for such discriminative replies automatically, given a small number of annotated dialogue examples. We find that AutoReply-generated replies outperform handcrafted replies and perform on par with carefully fine-tuned large supervised models. Results also show that one single reply without much computation overheads can also detect dialogue nonsense reasonably well.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2211.12615

Country:

Asia > Middle East > Republic of Türkiye (0.05)
Europe > Spain (0.04)
Europe > Russia (0.04)
(16 more...)

Genre: Research Report (0.82)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.46)

Add feedback

Robust Causality and False Attribution in Data-Driven Earth Science Discoveries

Eldhose, Elizabeth, Chauhan, Tejasvi, Chandel, Vikram, Ghosh, Subimal, Ganguly, Auroop R.

arXiv.org Machine LearningSep-26-2022

Causal and attribution studies are essential for earth scientific discoveries and critical for informing climate, ecology, and water policies. However, the current generation of methods needs to keep pace with the complexity of scientific and stakeholder challenges and data availability combined with the adequacy of data-driven methods. Unless carefully informed by physics, they run the risk of conflating correlation with causation or getting overwhelmed by estimation inaccuracies. Given that natural experiments, controlled trials, interventions, and counterfactual examinations are often impractical, information-theoretic methods have been developed and are being continually refined in the earth sciences. Here we show that transfer entropy-based causal graphs, which have recently become popular in the earth sciences with high-profile discoveries, can be spurious even when augmented with statistical significance. We develop a subsample-based ensemble approach for robust causality analysis. Simulated data, and observations in climate and ecohydrology, suggest the robustness and consistency of this approach.

artificial intelligence, machine learning, subsample, (15 more...)

arXiv.org Machine Learning

2209.1258

Country:

Asia > India > Maharashtra > Mumbai (0.04)
North America > United States > Washington > Benton County > Richland (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
(6 more...)

Genre: Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Therapeutic Area (0.66)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

Efficient Unsupervised Learning for Plankton Images

Alfano, Paolo Didier, Rando, Marco, Letizia, Marco, Odone, Francesca, Rosasco, Lorenzo, Pastore, Vito Paolo

arXiv.org Artificial IntelligenceSep-14-2022

Monitoring plankton populations in situ is fundamental to preserve the aquatic ecosystem. Plankton microorganisms are in fact susceptible of minor environmental perturbations, that can reflect into consequent morphological and dynamical modifications. Nowadays, the availability of advanced automatic or semi-automatic acquisition systems has been allowing the production of an increasingly large amount of plankton image data. The adoption of machine learning algorithms to classify such data may be affected by the significant cost of manual annotation, due to both the huge quantity of acquired data and the numerosity of plankton species. To address these challenges, we propose an efficient unsupervised learning pipeline to provide accurate classification of plankton microorganisms. We build a set of image descriptors exploiting a two-step procedure. First, a Variational Autoencoder (VAE) is trained on features extracted by a pre-trained neural network. We then use the learnt latent space as image descriptor for clustering. We compare our method with state-of-the-art unsupervised approaches, where a set of pre-defined hand-crafted features is used for clustering of plankton images. The proposed pipeline outperforms the benchmark algorithms for all the plankton datasets included in our analysis, providing better image embedding properties.

artificial intelligence, deep learning, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2209.06726

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Italy (0.05)
North America > United States > New York > New York County > New York City (0.04)
(3 more...)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback