Indian Ocean
Combining deep generative models with extreme value theory for synthetic hazard simulation: a multivariate and spatially coherent approach
Climate hazards can cause major disasters when they occur simultaneously as compound hazards. To understand the distribution of climate risk and inform adaptation policies, scientists need to simulate a large number of physically realistic and spatially coherent events. Current methods are limited by computational constraints and the probabilistic spatial distribution of compound events is not given sufficient attention. The bottleneck in current approaches lies in modelling the dependence structure between variables, as inference on parametric models suffers from the curse of dimensionality. Generative adversarial networks (GANs) are well-suited to such a problem due to their ability to implicitly learn the distribution of data in high-dimensional settings. We employ a GAN to model the dependence structure for daily maximum wind speed, significant wave height, and total precipitation over the Bay of Bengal, combining this with traditional extreme value theory for controlled extrapolation of the tails. Once trained, the model can be used to efficiently generate thousands of realistic compound hazard events, which can inform climate risk assessments for climate adaptation and disaster preparedness. The method developed is flexible and transferable to other multivariate and spatial climate datasets.
Extending Explainable Boosting Machines to Scientific Image Data
Schug, Daniel, Yerramreddy, Sai, Caruana, Rich, Greenberg, Craig, Zwolak, Justyna P.
As the deployment of computer vision technology becomes increasingly common in science, the need for explanations of the system and its output has become a focus of great concern. Driven by the pressing need for interpretable models in science, we propose the use of Explainable Boosting Machines (EBMs) for scientific image data. Inspired by an important application underpinning the development of quantum technologies, we apply EBMs to cold-atom soliton image data tabularized using Gabor Wavelet Transform-based techniques that preserve the spatial structure of the data. In doing so, we demonstrate the use of EBMs for image data for the first time and show that our approach provides explanations that are consistent with human intuition about the data.
US Navy warship shoots down Iranian-made Houthi drone launched from Yemen
Former USS Cole commander Kirk Lippold discusses how released Hamas hostages are arriving at an Israeli hospital on'Your World.' The U.S. Navy destroyer USS Carney has shot down an Iranian-made Houthi drone launched from Yemen, a military official confirms to Fox News. There was no damage to the Carney or any injuries to the U.S. personnel onboard. The warship had been sailing near the Bab el-Mandeb Strait at the time of the attack. The USS Carney shot down 15 drones and four cruise missiles fired from Yemen in the northern Red Sea last month during a nine-hour span, using its SM-2 surface-to-air missiles.
Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning
Nath, Swaroop, Khadilkar, Harshad, Bhattacharyya, Pushpak
Query-focused Summarization (QfS) deals with systems that generate summaries from document(s) based on a query. Motivated by the insight that Reinforcement Learning (RL) provides a generalization to Supervised Learning (SL) for Natural Language Generation, and thereby performs better (empirically) than SL, we use an RL-based approach for this task of QfS. Additionally, we also resolve the conflict of employing RL in Transformers with Teacher Forcing. We develop multiple Policy Gradient networks, trained on various reward signals: ROUGE, BLEU, and Semantic Similarity, which lead to a 10-point improvement over the State-of-the-Art approach on the ROUGE-L metric for a benchmark dataset (ELI5). We also show performance of our approach in zero-shot setting for another benchmark dataset (DebatePedia) -- our approach leads to results comparable to baselines, which were specifically trained on DebatePedia. To aid the RL training, we propose a better semantic similarity reward, enabled by a novel Passage Embedding scheme developed using Cluster Hypothesis. Lastly, we contribute a gold-standard test dataset to further research in QfS and Long-form Question Answering (LfQA).
GlycoNMR: Dataset and benchmarks for NMR chemical shift prediction of carbohydrates with graph neural networks
Chen, Zizhang, Badman, Ryan Paul, Foley, Lachele, Woods, Robert, Hong, Pengyu
Molecular representation learning (MRL) is a powerful tool for bridging the gap between machine learning and chemical sciences, as it converts molecules into numerical representations while preserving their chemical features. These encoded representations serve as a foundation for various downstream biochemical studies, including property prediction and drug design. MRL has had great success with proteins and general biomolecule datasets. Yet, in the growing sub-field of glycoscience (the study of carbohydrates, where longer carbohydrates are also called glycans), MRL methods have been barely explored. This under-exploration can be primarily attributed to the limited availability of comprehensive and well-curated carbohydrate-specific datasets and a lack of Machine learning (ML) pipelines specifically tailored to meet the unique problems presented by carbohydrate data. Since interpreting and annotating carbohydrate-specific data is generally more complicated than protein data, domain experts are usually required to get involved. The existing MRL methods, predominately optimized for proteins and small biomolecules, also cannot be directly used in carbohydrate applications without special modifications. To address this challenge, accelerate progress in glycoscience, and enrich the data resources of the MRL community, we introduce GlycoNMR. GlycoNMR contains two laboriously curated datasets with 2,609 carbohydrate structures and 211,543 annotated nuclear magnetic resonance (NMR) chemical shifts for precise atomic-level prediction. We tailored carbohydrate-specific features and adapted existing MRL models to tackle this problem effectively. For illustration, we benchmark four modified MRL models on our new datasets.
Optimal Clustering of Discrete Mixtures: Binomial, Poisson, Block Models, and Multi-layer Networks
Lyu, Zhongyuan, Li, Ting, Xia, Dong
In this paper, we first study the fundamental limit of clustering networks when a multi-layer network is present. Under the mixture multi-layer stochastic block model (MMSBM), we show that the minimax optimal network clustering error rate, which takes an exponential form and is characterized by the Renyi divergence between the edge probability distributions of the component networks. We propose a novel two-stage network clustering method including a tensor-based initialization algorithm involving both node and sample splitting and a refinement procedure by likelihood-based Lloyd algorithm. Network clustering must be accompanied by node community detection. Our proposed algorithm achieves the minimax optimal network clustering error rate and allows extreme network sparsity under MMSBM. Numerical simulations and real data experiments both validate that our method outperforms existing methods. Oftentimes, the edges of networks carry count-type weights. We then extend our methodology and analysis framework to study the minimax optimal clustering error rate for mixture of discrete distributions including Binomial, Poisson, and multi-layer Poisson networks. The minimax optimal clustering error rates in these discrete mixtures all take the same exponential form characterized by the Renyi divergences. These optimal clustering error rates in discrete mixtures can also be achieved by our proposed two-stage clustering algorithm.
Israeli-owned ship targeted in suspected drone attack: Reports
A suspected drone attack has hit a container ship owned by an Israeli businessman in the Indian Ocean, according to a United States defence official. The attack was likely carried out using an Iranian-made Shahed-136 drone on Friday, an unnamed US defence official told The Associated Press news agency on Saturday. Pan-Arab satellite channel Al Mayadeen also reported that an Israeli ship had been targeted in the Indian Ocean. The drone targeted the Malta-flagged, French-operated CMA CGM Symi vessel while in international waters. The ship reportedly suffered damage after the drone exploded, but no crew members were injured.
US warship cruising Red Sea shoots down attack drones fired from Yemen
A US warship cruising the Red Sea has shot down drones fired from Houthi-held territory in Yemen, according to the US Central Command. The USS Thomas Hudner, a guided-missile destroyer, shot down "multiple one-way attack drones" launched on Thursday morning from Yemen's Houthi-controlled areas, CENTCOM said in a post on X, formerly Twitter. CENTCOM said there was no damage to the US vessel or injuries to its crew. On the morning (Yemen time) of November 23, the USS Thomas Hudner (DDG 116) shot down multiple one-way attack drones launched from Houthi controlled areas in Yemen. The drones were shot down while the U.S. warship was on patrol in the Red Sea.
US Navy destroyer shoots down drone from Yemen in the Red Sea
The U.S. Department of Defense released video footage of a U.S. air strike on a training and weapons facility in Abul Kamal, Syria. The USS Thomas Hudner, an Arleigh Burke-class destroyer, shot down a drone from Yemen in the Red Sea on Wednesday, two U.S. defense officials confirmed to Fox News. A defense official said the drone was shot down in self-defense. "The drone was heading towards the Hudner," the official said. The drone attack is the latest in a series of attacks on American troops stationed in the Middle East amid the ongoing Israel-Hamas war.
Adaptation with Self-Evaluation to Improve Selective Prediction in LLMs
Chen, Jiefeng, Yoon, Jinsung, Ebrahimi, Sayna, Arik, Sercan O, Pfister, Tomas, Jha, Somesh
Large language models (LLMs) have recently shown great advances in a variety of tasks, including natural language understanding and generation. However, their use in high-stakes decision-making scenarios is still limited due to the potential for errors. Selective prediction is a technique that can be used to improve the reliability of the LLMs by allowing them to abstain from making predictions when they are unsure of the answer. In this work, we propose a novel framework for adaptation with self-evaluation to improve the selective prediction performance of LLMs. Our framework is based on the idea of using parameter-efficient tuning to adapt the LLM to the specific task at hand while improving its ability to perform self-evaluation. We evaluate our method on a variety of question-answering (QA) datasets and show that it outperforms state-of-the-art selective prediction methods. For example, on the CoQA benchmark, our method improves the AUACC from 91.23% to 92.63% and improves the AUROC from 74.61% to 80.25%.