AITopics | Atlantic Ocean

Collaborating Authors

Atlantic Ocean

WeatherBench 2: A benchmark for the next generation of data-driven global weather models

Rasp, Stephan, Hoyer, Stephan, Merose, Alexander, Langmore, Ian, Battaglia, Peter, Russel, Tyler, Sanchez-Gonzalez, Alvaro, Yang, Vivian, Carver, Rob, Agrawal, Shreya, Chantry, Matthew, Bouallegue, Zied Ben, Dueben, Peter, Bromberg, Carla, Sisk, Jared, Barrington, Luke, Bell, Aaron, Sha, Fei

arXiv.org Artificial IntelligenceJan-26-2024

WeatherBench 2 is an update to the global, medium-range (1-14 day) weather forecasting benchmark proposed by Rasp et al. (2020), designed with the aim to accelerate progress in data-driven weather modeling. WeatherBench 2 consists of an open-source evaluation framework, publicly available training, ground truth and baseline data as well as a continuously updated website with the latest metrics and state-of-the-art models: https://sites.research.google/weatherbench. This paper describes the design principles of the evaluation framework and presents results for current state-of-the-art physical and data-driven weather models. The metrics are based on established practices for evaluating weather forecasts at leading operational weather centers. We define a set of headline scores to provide an overview of model performance. In addition, we also discuss caveats in the current evaluation setup and challenges for the future of data-driven weather forecasting.

forecast, lead time, resolution, (16 more...)

arXiv.org Artificial Intelligence

2308.1556

Country:

North America > United States (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Spain (0.04)
(6 more...)

Genre:

Research Report (1.00)
Overview (0.68)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Producing Plankton Classifiers that are Robust to Dataset Shift

Chen, Cheng, Kyathanahally, Sreenath, Reyes, Marta, Merkli, Stefanie, Merz, Ewa, Francazi, Emanuele, Hoege, Marvin, Pomati, Francesco, Baity-Jesi, Marco

arXiv.org Artificial IntelligenceJan-25-2024

Modern plankton high-throughput monitoring relies on deep learning classifiers for species recognition in water ecosystems. Despite satisfactory nominal performances, a significant challenge arises from Dataset Shift, which causes performances to drop during deployment. In our study, we integrate the ZooLake dataset with manually-annotated images from 10 independent days of deployment, serving as test cells to benchmark Out-Of-Dataset (OOD) performances. Our analysis reveals instances where classifiers, initially performing well in In-Dataset conditions, encounter notable failures in practical scenarios. For example, a MobileNet with a 92% nominal test accuracy shows a 77% OOD accuracy. We systematically investigate conditions leading to OOD performance drops and propose a preemptive assessment method to identify potential pitfalls when classifying new data, and pinpoint features in OOD images that adversely impact classification. We present a three-step pipeline: (i) identifying OOD degradation compared to nominal test performance, (ii) conducting a diagnostic analysis of degradation causes, and (iii) providing solutions. We find that ensembles of BEiT vision transformers, with targeted augmentations addressing OOD robustness, geometric ensembling, and rotation-based test-time augmentation, constitute the most robust model, which we call BEsT model. It achieves an 83% OOD accuracy, with errors concentrated on container classes. Moreover, it exhibits lower sensitivity to dataset shift, and reproduces well the plankton abundances. Our proposed pipeline is applicable to generic plankton classifiers, contingent on the availability of suitable test cells. By identifying critical shortcomings and offering practical procedures to fortify models against dataset shift, our study contributes to the development of more reliable plankton classification technologies.

augmentation, classification, dataset, (16 more...)

arXiv.org Artificial Intelligence

2401.14256

Country:

Europe > Switzerland (0.04)
North America > United States > New Jersey > Hudson County > Hoboken (0.04)
North America > United States > California (0.04)
(4 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.46)
Water & Waste Management > Water Management (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
(3 more...)

Add feedback

Towards Consistent Natural-Language Explanations via Explanation-Consistency Finetuning

Chen, Yanda, Singh, Chandan, Liu, Xiaodong, Zuo, Simiao, Yu, Bin, He, He, Gao, Jianfeng

arXiv.org Artificial IntelligenceJan-25-2024

Large language models (LLMs) often generate convincing, fluent explanations. However, different from humans, they often generate inconsistent explanations on different inputs. For example, an LLM may generate the explanation "all birds can fly" when answering the question "Can sparrows fly?" but meanwhile answer "no" to the related question "Can penguins fly?". Explanations should be consistent across related examples so that they allow a human to simulate the LLM's decision process on multiple examples. We propose explanation-consistency finetuning (EC-finetuning), a method that adapts LLMs to generate more consistent natural-language explanations on related examples. EC-finetuning involves finetuning LLMs on synthetic data that is carefully constructed to contain consistent explanations. Across a variety of question-answering datasets in various domains, EC-finetuning yields a 10.0% relative explanation consistency improvement on four finetuning datasets, and generalizes to seven out-of-distribution datasets not seen during finetuning (+4.5% relative). Code is available at https://github.com/yandachen/explanation-consistency-finetuning .

explanation, follow-up question, starter qa, (14 more...)

arXiv.org Artificial Intelligence

2401.13986

Country:

Europe > North Sea (0.05)
Atlantic Ocean > North Atlantic Ocean > North Sea (0.05)
Europe > Netherlands > North Holland > Amsterdam (0.05)
(4 more...)

Genre: Research Report (0.64)

Industry:

Leisure & Entertainment (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Health & Medicine (0.95)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

TrustLLM: Trustworthiness in Large Language Models

Sun, Lichao, Huang, Yue, Wang, Haoran, Wu, Siyuan, Zhang, Qihui, Gao, Chujie, Huang, Yixin, Lyu, Wenhan, Zhang, Yixuan, Li, Xiner, Liu, Zhengliang, Liu, Yixin, Wang, Yijue, Zhang, Zhikun, Kailkhura, Bhavya, Xiong, Caiming, Xiao, Chaowei, Li, Chunyuan, Xing, Eric, Huang, Furong, Liu, Hao, Ji, Heng, Wang, Hongyi, Zhang, Huan, Yao, Huaxiu, Kellis, Manolis, Zitnik, Marinka, Jiang, Meng, Bansal, Mohit, Zou, James, Pei, Jian, Liu, Jian, Gao, Jianfeng, Han, Jiawei, Zhao, Jieyu, Tang, Jiliang, Wang, Jindong, Mitchell, John, Shu, Kai, Xu, Kaidi, Chang, Kai-Wei, He, Lifang, Huang, Lifu, Backes, Michael, Gong, Neil Zhenqiang, Yu, Philip S., Chen, Pin-Yu, Gu, Quanquan, Xu, Ran, Ying, Rex, Ji, Shuiwang, Jana, Suman, Chen, Tianlong, Liu, Tianming, Zhou, Tianyi, Wang, William, Li, Xiang, Zhang, Xiangliang, Wang, Xiao, Xie, Xing, Chen, Xun, Wang, Xuyu, Liu, Yan, Ye, Yanfang, Cao, Yinzhi, Chen, Yong, Zhao, Yue

arXiv.org Artificial IntelligenceJan-25-2024

Large language models (LLMs), exemplified by ChatGPT, have gained considerable attention for their excellent natural language processing capabilities. Nonetheless, these LLMs present many challenges, particularly in the realm of trustworthiness. Therefore, ensuring the trustworthiness of LLMs emerges as an important topic. This paper introduces TrustLLM, a comprehensive study of trustworthiness in LLMs, including principles for different dimensions of trustworthiness, established benchmark, evaluation, and analysis of trustworthiness for mainstream LLMs, and discussion of open challenges and future directions. Specifically, we first propose a set of principles for trustworthy LLMs that span eight different dimensions. Based on these principles, we further establish a benchmark across six dimensions including truthfulness, safety, fairness, robustness, privacy, and machine ethics. We then present a study evaluating 16 mainstream LLMs in TrustLLM, consisting of over 30 datasets. Our findings firstly show that in general trustworthiness and utility (i.e., functional effectiveness) are positively related. Secondly, our observations reveal that proprietary LLMs generally outperform most open-source counterparts in terms of trustworthiness, raising concerns about the potential risks of widely accessible open-source LLMs. However, a few open-source LLMs come very close to proprietary ones. Thirdly, it is important to note that some LLMs may be overly calibrated towards exhibiting trustworthiness, to the extent that they compromise their utility by mistakenly treating benign prompts as harmful and consequently not responding. Finally, we emphasize the importance of ensuring transparency not only in the models themselves but also in the technologies that underpin trustworthiness. Knowing the specific trustworthy technologies that have been employed is crucial for analyzing their effectiveness.

koala-13b oasst-12b ernie palm 2, out-of-distribution generalization, safety and output robustness, (17 more...)

arXiv.org Artificial Intelligence

2401.05561

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.13)
Atlantic Ocean (0.04)
Asia > Middle East > Jordan (0.04)
(35 more...)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)
Personal > Interview (0.92)

Industry:

Media > News (1.00)
Leisure & Entertainment (1.00)
Law > Statutes (1.00)
(14 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Drones attack deep in Russia as Medvedev threatens Ukraine's 'existence'

Al JazeeraJan-24-2024, 15:39:16 GMT

Russia and Ukraine traded deadly aerial attacks on civilian centres in the past week of the war, but Ukraine also scored hits on military and economic infrastructure deep in the Russian heartland, extending its reach to St Petersburg for the first time. Ukrainian military intelligence said it had struck an unspecified military target in St Petersburg on Thursday, using drones launched from Ukrainian soil. Ukrainian strategic industries minister Oleksandr Kamyshin confirmed the attack, telling the World Economic Forum in Davos that the attack was carried out by a Ukrainian-built drone that had travelled 1,250km (780 miles) from Ukrainian soil. Russia's defence ministry said three drones had been launched and it had downed all three over the Gulf of Finland that day, one near an oil terminal. On Sunday, Ukraine attacked again in several locations, and this time, the evidence of its success was clear.

artificial intelligence, russia, ukraine, (15 more...)

Al Jazeera

Country:

Asia > Russia (1.00)
Europe > Finland (0.25)
Europe > Ukraine > Kharkiv Oblast > Kharkiv (0.06)
(14 more...)

Industry:

Government > Military (1.00)
Government > Regional Government > Europe Government > Russia Government (0.91)
Government > Regional Government > Asia Government > Russia Government (0.91)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)

Add feedback

Consistency Guided Knowledge Retrieval and Denoising in LLMs for Zero-shot Document-level Relation Triplet Extraction

Sun, Qi, Huang, Kun, Yang, Xiaocui, Tong, Rong, Zhang, Kun, Poria, Soujanya

arXiv.org Artificial IntelligenceJan-24-2024

Document-level Relation Triplet Extraction (DocRTE) is a fundamental task in information systems that aims to simultaneously extract entities with semantic relations from a document. Existing methods heavily rely on a substantial amount of fully labeled data. However, collecting and annotating data for newly emerging relations is time-consuming and labor-intensive. Recent advanced Large Language Models (LLMs), such as ChatGPT and LLaMA, exhibit impressive long-text generation capabilities, inspiring us to explore an alternative approach for obtaining auto-labeled documents with new relations. In this paper, we propose a Zero-shot Document-level Relation Triplet Extraction (ZeroDocRTE) framework, which generates labeled data by retrieval and denoising knowledge from LLMs, called GenRDK. Specifically, we propose a chain-of-retrieval prompt to guide ChatGPT to generate labeled long-text data step by step. To improve the quality of synthetic data, we propose a denoising strategy based on the consistency of cross-document knowledge. Leveraging our denoised synthetic data, we proceed to fine-tune the LLaMA2-13B-Chat for extracting document-level relation triplets. We perform experiments for both zero-shot document-level relation and triplet extraction on two public datasets. The experimental results illustrate that our GenRDK framework outperforms strong baselines.

extraction, relation type, synthetic data, (15 more...)

arXiv.org Artificial Intelligence

2401.13598

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
Asia > Singapore (0.04)
North America > Canada > Ontario > Toronto (0.04)
(4 more...)

Genre: Research Report (0.50)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Ohio Republican Senate candidates clash over border security, drone strikes in Mexico

FOX NewsJan-23-2024, 17:35:33 GMT

Ohio Republican candidates who are vying to take on Democratic incumbent Sen. Sherrod Brown clashed over border security and drone strikes in Mexico during Monday's first statewide debate. Facing off at WJW Fox 8 Studios in Cleveland, businessman Bernie Moreno, Ohio Secretary of State Frank LaRose and state Sen. Matt Dolan generally agreed on a few issues, including calling for fully securing the U.S.-Mexico border, but then quickly clashed upon delving into the immigration crisis further. Dolan accused Moreno, who was endorsed by former President Trump, of wanting "to militarize the federal government and deport children" for his stance calling for deporting anybody in the country illegally. LaRose called earlier Monday for President Biden to deploy three military divisions to the border, which Dolan said was irresponsible. "We need to work with the Mexican government, we need to be tough with the Mexican government," Dolan said.

artificial intelligence, larose, moreno, (12 more...)

FOX News

Country:

North America > Mexico (1.00)
Pacific Ocean (0.05)
North America > United States > Wyoming (0.05)
(6 more...)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Government > Immigration & Customs (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.72)

Add feedback

From Knowledge Organization to Knowledge Representation and Back

Giunchiglia, Fausto, Bagchi, Mayukh, Das, Subhashis

arXiv.org Artificial IntelligenceJan-22-2024

Knowledge Organization (KO) and Knowledge Representation (KR) have been the two mainstream methodologies of knowledge modelling in the Information Science community and the Artificial Intelligence community, respectively. The facet-analytical tradition of KO has developed an exhaustive set of guiding canons for ensuring quality in organising and managing knowledge but has remained limited in terms of technology-driven activities to expand its scope and services beyond the bibliographic universe of knowledge. KR, on the other hand, boasts of a robust ecosystem of technologies and technology-driven service design which can be tailored to model any entity or scale to any service in the entire universe of knowledge. This paper elucidates both the facet-analytical KO and KR methodologies in detail and provides a functional mapping between them. Out of the mapping, the paper proposes an integrated KR-enriched KO methodology with all the standard components of a KO methodology plus the advanced technologies provided by the KR approach. The practical benefits of the methodological integration has been exemplified through the flagship application of the Digital University at the University of Trento, Italy.

artefact, ko methodology, kr methodology, (14 more...)

arXiv.org Artificial Intelligence

2401.11753

Country:

Europe > Italy > Trentino-Alto Adige/Südtirol > Trentino Province > Trento (0.25)
North America > United States > New York (0.05)
Europe > United Kingdom > UK North Sea (0.04)
(10 more...)

Genre: Research Report (0.53)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.71)

Add feedback

A-KIT: Adaptive Kalman-Informed Transformer

Cohen, Nadav, Klein, Itzik

arXiv.org Artificial IntelligenceJan-18-2024

The extended Kalman filter (EKF) is a widely adopted method for sensor fusion in navigation applications. A crucial aspect of the EKF is the online determination of the process noise covariance matrix reflecting the model uncertainty. While common EKF implementation assumes a constant process noise, in real-world scenarios, the process noise varies, leading to inaccuracies in the estimated state and potentially causing the filter to diverge. To cope with such situations, model-based adaptive EKF methods were proposed and demonstrated performance improvements, highlighting the need for a robust adaptive approach. In this paper, we derive and introduce A-KIT, an adaptive Kalman-informed transformer to learn the varying process noise covariance online. The A-KIT framework is applicable to any type of sensor fusion. Here, we present our approach to nonlinear sensor fusion based on an inertial navigation system and Doppler velocity log. By employing real recorded data from an autonomous underwater vehicle, we show that A-KIT outperforms the conventional EKF by more than 49.5% and model-based adaptive EKF by an average of 35.4% in terms of position accuracy.

covariance, matrix, noise covariance, (15 more...)

arXiv.org Artificial Intelligence

2401.09987

Country:

Asia > Middle East > Israel > Haifa District > Haifa (0.04)
Atlantic Ocean > Mediterranean Sea (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Sensing and Signal Processing (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

GA-SmaAt-GNet: Generative Adversarial Small Attention GNet for Extreme Precipitation Nowcasting

Reulen, Eloy, Mehrkanoon, Siamak

arXiv.org Artificial IntelligenceJan-18-2024

In recent years, data-driven modeling approaches have gained considerable traction in various meteorological applications, particularly in the realm of weather forecasting. However, these approaches often encounter challenges when dealing with extreme weather conditions. In light of this, we propose GA-SmaAt-GNet, a novel generative adversarial architecture that makes use of two methodologies aimed at enhancing the performance of deep learning models for extreme precipitation nowcasting. Firstly, it uses a novel SmaAt-GNet built upon the successful SmaAt-UNet architecture as generator. This network incorporates precipitation masks (binarized precipitation maps) as an additional data source, leveraging valuable information for improved predictions. Additionally, GA-SmaAt-GNet utilizes an attention-augmented discriminator inspired by the well-established Pix2Pix architecture. Furthermore, we assess the performance of GA-SmaAt-GNet using real-life precipitation dataset from the Netherlands. Our experimental results reveal a notable improvement in both overall performance and for extreme precipitation events. Furthermore, we conduct uncertainty analysis on the proposed GA-SmaAt-GNet model as well as on the precipitation dataset, providing additional insights into the predictive capabilities of the model. Finally, we offer further insights into the predictions of our proposed model using Grad-CAM. This visual explanation technique generates activation heatmaps, illustrating areas of the input that are more activated for various parts of the network.

architecture, precipitation, prediction, (14 more...)

arXiv.org Artificial Intelligence

2401.09881

Country:

Europe > Netherlands (0.26)
Europe > North Sea (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
(3 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback