Goto

Collaborating Authors

 Indian Ocean


Arabic Fine-Grained Entity Recognition

arXiv.org Artificial Intelligence

Traditional NER systems are typically trained to recognize coarse-grained entities, and less attention is given to classifying entities into a hierarchy of fine-grained lower-level subtypes. This article aims to advance Arabic NER with fine-grained entities. We chose to extend Wojood (an open-source Nested Arabic Named Entity Corpus) with subtypes. In particular, four main entity types in Wojood, geopolitical entity (GPE), location (LOC), organization (ORG), and facility (FAC), are extended with 31 subtypes. To do this, we first revised Wojood's annotations of GPE, LOC, ORG, and FAC to be compatible with the LDC's ACE guidelines, which yielded 5, 614 changes. Second, all mentions of GPE, LOC, ORG, and FAC (~44K) in Wojood are manually annotated with the LDC's ACE sub-types. We refer to this extended version of Wojood as WojoodF ine. To evaluate our annotations, we measured the inter-annotator agreement (IAA) using both Cohen's Kappa and F1 score, resulting in 0.9861 and 0.9889, respectively. To compute the baselines of WojoodF ine, we fine-tune three pre-trained Arabic BERT encoders in three settings: flat NER, nested NER and nested NER with subtypes and achieved F1 score of 0.920, 0.866, and 0.885, respectively. Our corpus and models are open-source and available at https://sina.birzeit.edu/wojood/.


Tensions Spilling Over From Gaza Impact Shipping in the Red Sea

NYT > Middle East

The tensions spilling over from the war in Gaza to merchant shipping in the Red Sea escalated on Saturday when Britain and the United States said their militaries had shot down more than a dozen attack drones. The Houthis, an armed group that controls much of northern Yemen, have been staging drone and missile assaults on Israeli and American targets since the Oct. 7 Hamas-led attacks on Israel. They have said they intend to prevent Israeli ships from sailing the Red Sea until Israel stops its war on Hamas, which rules Gaza. Both the Houthis and Hamas, like Hezbollah in Lebanon, are backed by Iran. The shipping industry was also bracing for potential economic fallout as the Red Sea, a vital sea lane, is increasingly drawn into the regional unrest.


US, UK say they shot down 15 drones from Yemen's Houthis over Red Sea

Al Jazeera

The United States and United Kingdom authorities say their warships have shot down 15 attack drones over the Red Sea as Israel's war on Gaza threatens to spread in the region. The US Central Command (CENTCOM) on Saturday said its guided-missile destroyer responded to a wave of drones from "Houthi-controlled areas of Yemen" over the Red Sea, downing 14 suspected attack drones. It described the launches as "one-way attack drones", saying they were "shot down with no damage to ships in the area or reported injuries". UK Defence Secretary Grant Shapps also said the Royal Navy destroyer HMS Diamond fired a Sea Viper missile and destroyed a drone that was "targeting merchant shipping". Meanwhile, Yemen's Iran-aligned Houthis said the group attacked the Israeli city of Eilat on Saturday with a swarm of drones, according to spokesman Yahya Sarea who referred to the Red Sea resort city as being in "southern occupied Palestine".


HMS Diamond: British warship shoots down suspected attack drone in Red Sea

BBC News

Earlier this month, the US military said the Unity Explorer, sailing under the flag of the Bahamas and owned by a British company, was among three commercial vessels targeted in an attack by Iranian-backed Houthi rebels.


ResoNet: Robust and Explainable ENSO Forecasts with Hybrid Convolution and Transformer Networks

arXiv.org Artificial Intelligence

Recent studies have shown that deep learning (DL) models can skillfully predict the El Ni\~no-Southern Oscillation (ENSO) forecasts over 1.5 years ahead. However, concerns regarding the reliability of predictions made by DL methods persist, including potential overfitting issues and lack of interpretability. Here, we propose ResoNet, a DL model that combines convolutional neural network (CNN) and Transformer architectures. This hybrid architecture design enables our model to adequately capture local SSTA as well as long-range inter-basin interactions across oceans. We show that ResoNet can robustly predict ESNO at lead times between 19 and 26 months, thus outperforming existing approaches in terms of the forecast horizon. According to an explainability method applied to ResoNet predictions of El Ni\~no and La Ni\~na events from 1- to 18-month lead, we find that it predicts the Ni\~no3.4 index based on multiple physically reasonable mechanisms, such as the Recharge Oscillator concept, Seasonal Footprint Mechanism, and Indian Ocean capacitor effect. Moreover, we demonstrate that for the first time, the asymmetry between El Ni\~no and La Ni\~na development can be captured by ResoNet. Our results could help alleviate skepticism about applying DL models for ENSO prediction and encourage more attempts to discover and predict climate phenomena using AI methods.


Deep learning model to help detect plastic in oceans

AIHub

Our society relies heavily on plastic products and the amount of plastic waste is expected to increase in the future. If not properly discarded or recycled, much of it accumulates in rivers and lakes. Eventually it will flow into the oceans, where it can form aggregations of marine debris together with natural materials like driftwood and algae. A new study from Wageningen University and EPFL researchers, recently published in Cell iScience, has developed an artificial intelligence-based detector that estimates the probability of marine debris shown in satellite images. This could help to systematically remove plastic litter from the oceans with ships.


FuXi-S2S: An accurate machine learning model for global subseasonal forecasts

arXiv.org Artificial Intelligence

Skillful subseasonal forecasts beyond 2 weeks are crucial for a wide range of applications across various sectors of society. Recently, state-of-the-art machine learning based weather forecasting models have made significant advancements, outperforming the high-resolution forecast (HRES) from the European Centre for Medium-Range Weather Forecasts (ECMWF). However, the full potential of machine learning models in subseasonal forecasts has yet to be fully explored. In this study, we introduce FuXi Subseasonal-to-Seasonal (FuXi-S2S), a machine learning based subseasonal forecasting model that provides global daily mean forecasts up to 42 days, covering 5 upper-air atmospheric variables at 13 pressure levels and 11 surface variables. FuXi-S2S integrates an enhanced FuXi base model with a perturbation module for flow-dependent perturbations in hidden features, and incorporates Perlin noise to perturb initial conditions. The model is developed using 72 years of daily statistics from ECMWF ERA5 reanalysis data. When compared to the ECMWF Subseasonal-to-Seasonal (S2S) reforecasts, the FuXi-S2S forecasts demonstrate superior deterministic and ensemble forecasts for total precipitation (TP), outgoing longwave radiation (OLR), and geopotential at 500 hPa (Z500). Although it shows slightly inferior performance in predicting 2-meter temperature (T2M), it has clear advantages over land area. Regarding the extreme forecasts, FuXi-S2S outperforms ECMWF S2S globally for TP. Furthermore, FuXi-S2S forecasts surpass the ECMWF S2S reforecasts in predicting the Madden Julian Oscillation (MJO), a key source of subseasonal predictability. They extend the skillful prediction of MJO from 30 days to 36 days.


Visual Instruction Tuning with Polite Flamingo

arXiv.org Artificial Intelligence

Recent research has demonstrated that the multi-task fine-tuning of multi-modal Large Language Models (LLMs) using an assortment of annotated downstream vision-language datasets significantly enhances their performance. Yet, during this process, a side effect, which we termed as the "multi-modal alignment tax", surfaces. This side effect negatively impacts the model's ability to format responses appropriately -- for instance, its "politeness" -- due to the overly succinct and unformatted nature of raw annotations, resulting in reduced human preference. In this paper, we introduce Polite Flamingo, a multi-modal response rewriter that transforms raw annotations into a more appealing, "polite" format. Polite Flamingo is trained to reconstruct high-quality responses from their automatically distorted counterparts and is subsequently applied to a vast array of vision-language datasets for response rewriting. After rigorous filtering, we generate the PF-1M dataset and further validate its value by fine-tuning a multi-modal LLM with it. Combined with novel methodologies including U-shaped multi-stage tuning and multi-turn augmentation, the resulting model, Clever Flamingo, demonstrates its advantages in both multi-modal understanding and response politeness according to automated and human evaluations.


US destroyer in Red Sea shoots down another Houthi drone

FOX News

Fox News chief national security correspondent Jennifer Griffin reports on the repeated attacks on U.S. forces in the Middle East on'Faulkner Focus.' U.S. Navy destroyer USS Mason shot down a Houthi drone coming out of Yemen on Wednesday, a U.S. defense official told Fox News. The drone was headed toward USS Mason, which was responding to reports that Houthis were attacking the tanker Ardmore Encounter by using skiffs and then firing two missiles that missed, according to the official. No damage or injuries were initially reported, and the Ardmore Encounter went on its way. The incident occurred around 8 a.m. A Pentagon official confirmed to Fox News that the two missiles were anti-ship ballistic missiles fired from ground-based locations in Yemen.


Zebra: Extending Context Window with Layerwise Grouped Local-Global Attention

arXiv.org Artificial Intelligence

This paper introduces a novel approach to enhance the capabilities of Large Language Models (LLMs) in processing and understanding extensive text sequences, a critical aspect in applications requiring deep comprehension and synthesis of large volumes of information. Recognizing the inherent challenges in extending the context window for LLMs, primarily built on Transformer architecture, we propose a new model architecture, referred to as Zebra. This architecture efficiently manages the quadratic time and memory complexity issues associated with full attention in the Transformer by employing grouped local-global attention layers. Our model, akin to a zebra's alternating stripes, balances local and global attention layers, significantly reducing computational requirements and memory consumption. Comprehensive experiments, including pretraining from scratch, continuation of long context adaptation training, and long instruction tuning, are conducted to evaluate the Zebra's performance. The results show that Zebra achieves comparable or superior performance on both short and long sequence benchmarks, while also enhancing training and inference efficiency.