AITopics | Pacific Ocean

Collaborating Authors

Pacific Ocean

Mobulas, a Wonder of the Gulf of California, Are Disappearing

WIREDFeb-18-2025, 12:30:00 GMT

These magnificent rays are at risk of disappearing due to targeted fishing, being caught as bycatch, and climate change. Scientists at the research collaboration Mobula Conservation are teaming up with artisanal and industrial fishermen to protect them. Also known as "Devil Rays," mobulas are elasmobranchs: a subclass of fish--including sharks, skates, and sawfish--that are distinguished by having skeletons primarily made from cartilage. More than a third of the species in this group are threatened with extinction. Of the nine species of mobulas, seven are endangered and two are vulnerable according to the International Union for Conservation of Nature.

artificial intelligence, california, mobula, (14 more...)

WIRED

Country:

North America > United States > California (0.59)
Pacific Ocean > North Pacific Ocean > Gulf of California (0.45)

Industry: Food & Agriculture > Fishing (0.54)

Technology: Information Technology > Artificial Intelligence (0.37)

Add feedback

Private Text Generation by Seeding Large Language Model Prompts

Nagesh, Supriya, Chen, Justin Y., Mishra, Nina, Wagner, Tal

arXiv.org Artificial IntelligenceFeb-18-2025

We explore how private synthetic text can be generated by suitably prompting a large language model (LLM). This addresses a challenge for organizations like hospitals, which hold sensitive text data like patient medical records, and wish to share it in order to train machine learning models for medical tasks, while preserving patient privacy. Methods that rely on training or finetuning a model may be out of reach, either due to API limits of third-party LLMs, or due to ethical and legal prohibitions on sharing the private data with the LLM itself. We propose Differentially Private Keyphrase Prompt Seeding (DP-KPS), a method that generates a private synthetic text corpus from a sensitive input corpus, by accessing an LLM only through privatized prompts. It is based on seeding the prompts with private samples from a distribution over phrase embeddings, thus capturing the input corpus while achieving requisite output diversity and maintaining differential privacy. We evaluate DP-KPS on downstream ML text classification tasks, and show that the corpora it generates preserve much of the predictive power of the original ones. Our findings offer hope that institutions can reap ML insights by privately sharing data with simple prompts and little compute.

dataset, keyphrase, sequence, (16 more...)

arXiv.org Artificial Intelligence

2502.13193

Country:

Europe > United Kingdom > England (0.05)
Asia > India (0.04)
North America > United States > Utah (0.04)
(24 more...)

Genre: Research Report > New Finding (0.66)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)
Information Technology > Security & Privacy (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

CondensNet: Enabling stable long-term climate simulations via hybrid deep learning models with adaptive physical constraints

Wang, Xin, Yang, Juntao, Adie, Jeff, See, Simon, Furtado, Kalli, Chen, Chen, Arcomano, Troy, Maulik, Romit, Mengaldo, Gianmarco

arXiv.org Artificial IntelligenceFeb-18-2025

Accurate and efficient climate simulations are crucial for understanding Earth's evolving climate. However, current general circulation models (GCMs) face challenges in capturing unresolved physical processes, such as cloud and convection. A common solution is to adopt cloud resolving models, that provide more accurate results than the standard subgrid parametrisation schemes typically used in GCMs. However, cloud resolving models, also referred to as super paramtetrizations, remain computationally prohibitive. Hybrid modeling, which integrates deep learning with equation-based GCMs, offers a promising alternative but often struggles with long-term stability and accuracy issues. In this work, we find that water vapor oversaturation during condensation is a key factor compromising the stability of hybrid models. To address this, we introduce CondensNet, a novel neural network architecture that embeds a self-adaptive physical constraint to correct unphysical condensation processes. CondensNet effectively mitigates water vapor oversaturation, enhancing simulation stability while maintaining accuracy and improving computational efficiency compared to super parameterization schemes. We integrate CondensNet into a GCM to form PCNN-GCM (Physics-Constrained Neural Network GCM), a hybrid deep learning framework designed for long-term stable climate simulations in real-world conditions, including ocean and land. PCNN-GCM represents a significant milestone in hybrid climate modeling, as it shows a novel way to incorporate physical constraints adaptively, paving the way for accurate, lightweight, and stable long-term climate simulations.

condensnet, parametrization, pcnn-gcm, (14 more...)

arXiv.org Artificial Intelligence

2502.13185

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Indian Ocean > Eastern Indian Ocean (0.04)
Asia > Singapore (0.04)
(3 more...)

Genre: Research Report > New Finding (0.93)

Industry: Energy (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Generalized Temporal Tensor Decomposition with Rank-revealing Latent-ODE

Chen, Panqi, Cheng, Lei, Li, Jianlong, Li, Weichang, Liu, Weiqing, Bian, Jiang, Fang, Shikai

arXiv.org Machine LearningFeb-10-2025

Tensor decomposition is a fundamental tool for analyzing multi-dimensional data by learning low-rank factors to represent high-order interactions. While recent works on temporal tensor decomposition have made significant progress by incorporating continuous timestamps in latent factors, they still struggle with general tensor data with continuous indexes not only in the temporal mode but also in other modes, such as spatial coordinates in climate data. Additionally, the problem of determining the tensor rank remains largely unexplored in temporal tensor models. To address these limitations, we propose \underline{G}eneralized temporal tensor decomposition with \underline{R}ank-r\underline{E}vealing laten\underline{T}-ODE (GRET). Our approach encodes continuous spatial indexes as learnable Fourier features and employs neural ODEs in latent space to learn the temporal trajectories of factors. To automatically reveal the rank of temporal tensors, we introduce a rank-revealing Gaussian-Gamma prior over the factor trajectories. We develop an efficient variational inference scheme with an analytical evidence lower bound, enabling sampling-free optimization. Through extensive experiments on both synthetic and real-world datasets, we demonstrate that GRET not only reveals the underlying ranks of temporal tensors but also significantly outperforms existing methods in prediction performance and robustness against noise.

artificial intelligence, factor trajectory, machine learning, (11 more...)

arXiv.org Machine Learning

2502.06164

Country:

Africa > Senegal > Kolda Region > Kolda (0.04)
Pacific Ocean (0.04)
North America > United States > California (0.04)
(4 more...)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Testing software for non-discrimination: an updated and extended audit in the Italian car insurance domain

Rondina, Marco, Vetrò, Antonio, Coppola, Riccardo, Regragrui, Oumaima, Fabris, Alessandro, Silvello, Gianmaria, Susto, Gian Antonio, De Martin, Juan Carlos

arXiv.org Artificial IntelligenceFeb-10-2025

Context. As software systems become more integrated into society's infrastructure, the responsibility of software professionals to ensure compliance with various non-functional requirements increases. These requirements include security, safety, privacy, and, increasingly, non-discrimination. Motivation. Fairness in pricing algorithms grants equitable access to basic services without discriminating on the basis of protected attributes. Method. We replicate a previous empirical study that used black box testing to audit pricing algorithms used by Italian car insurance companies, accessible through a popular online system. With respect to the previous study, we enlarged the number of tests and the number of demographic variables under analysis. Results. Our work confirms and extends previous findings, highlighting the problematic permanence of discrimination across time: demographic variables significantly impact pricing to this day, with birthplace remaining the main discriminatory factor against individuals not born in Italian cities. We also found that driver profiles can determine the number of quotes available to the user, denying equal opportunities to all. Conclusion. The study underscores the importance of testing for non-discrimination in software systems that affect people's everyday lives. Performing algorithmic audits over time makes it possible to evaluate the evolution of such algorithms. It also demonstrates the role that empirical software engineering can play in making software systems more accountable.

artificial intelligence, audit, discrimination, (18 more...)

arXiv.org Artificial Intelligence

2502.06439

Country:

North America > United States > New York > New York County > New York City (0.05)
Africa > Middle East > Morocco (0.04)
Europe > Romania (0.04)
(7 more...)

Genre:

Research Report > Experimental Study (0.68)
Research Report > New Finding (0.68)

Industry:

Banking & Finance > Insurance (1.00)
Government > Regional Government > Europe Government (0.68)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Software (0.75)

Add feedback

Linear Transformers as VAR Models: Aligning Autoregressive Attention Mechanisms with Autoregressive Forecasting

Lu, Jiecheng, Yang, Shihao

arXiv.org Machine LearningFeb-10-2025

Autoregressive attention-based time series forecasting (TSF) has drawn increasing interest, with mechanisms like linear attention sometimes outperforming vanilla attention. However, deeper Transformer architectures frequently misalign with autoregressive objectives, obscuring the underlying VAR structure embedded within linear attention and hindering their ability to capture the data generative processes in TSF. In this work, we first show that a single linear attention layer can be interpreted as a dynamic vector autoregressive (VAR) structure. We then explain that existing multi-layer Transformers have structural mismatches with the autoregressive forecasting objective, which impair interpretability and generalization ability. To address this, we show that by rearranging the MLP, attention, and input-output flow, multi-layer linear attention can also be aligned as a VAR model. Then, we propose Structural Aligned Mixture of VAR (SAMoVAR), a linear Transformer variant that integrates interpretable dynamic VAR weights for multivariate TSF. By aligning the Transformer architecture with autoregressive objectives, SAMoVAR delivers improved performance, interpretability, and computational efficiency, comparing to SOTA TSF models.

large language model, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

2502.07244

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry: Energy > Power Industry (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Fast Multivariate Spatio-temporal Analysis via Low Rank Tensor Learning

Neural Information Processing SystemsFeb-9-2025, 16:26:45 GMT

Accurate and efficient analysis of multivariate spatio-temporal data is critical in climatology, geology, and sociology applications. Existing models usually assume simple inter-dependence among variables, space, and time, and are computationally expensive. We propose a unified low rank tensor learning framework for multivariate spatio-temporal analysis, which can conveniently incorporate different properties in spatio-temporal data, such as spatial clustering and shared structure among variables. We demonstrate how the general framework can be applied to cokriging and forecasting tasks, and develop an efficient greedy algorithm to solve the resulting optimization problem with convergence guarantee. We conduct experiments on both synthetic datasets and real application datasets to demonstrate that our method is not only significantly faster than existing methods but also achieves lower estimation error.

artificial intelligence, machine learning, temporal reasoning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.29)
Pacific Ocean (0.04)
North America > United States > Rocky Mountains (0.04)
(7 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Temporal Reasoning (0.62)

Add feedback

Powerformer: A Transformer with Weighted Causal Attention for Time-series Forecasting

Hegazy, Kareem, Mahoney, Michael W., Erichson, N. Benjamin

arXiv.org Machine LearningFeb-9-2025

Transformers have recently shown strong performance in time-series forecasting, but their all-to-all attention mechanism overlooks the (temporal) causal and often (temporally) local nature of data. We introduce Powerformer, a novel Transformer variant that replaces noncausal attention weights with causal weights that are reweighted according to a smooth heavy-tailed decay. This simple yet effective modification endows the model with an inductive bias favoring temporally local dependencies, while still allowing sufficient flexibility to learn the unique correlation structure of each dataset. Our empirical results demonstrate that Powerformer not only achieves state-of-the-art accuracy on public time-series benchmarks, but also that it offers improved interpretability of attention patterns. Our analyses show that the model's locality bias is amplified during training, demonstrating an interplay between time-series data and power-law-based attention. These findings highlight the importance of domain-specific modifications to the Transformer architecture for time-series forecasting, and they establish Powerformer as a strong, efficient, and principled baseline for future research and real-world applications.

data mining, large language model, machine learning, (19 more...)

arXiv.org Machine Learning

2502.06151

Country:

Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
(3 more...)

Genre: Research Report > New Finding (0.47)

Industry:

Energy (0.67)
Government (0.45)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

EPBC-YOLOv8: An efficient and accurate improved YOLOv8 underwater detector based on an attention mechanism

Jiang, Xing, Zhuang, Xiting, Chen, Jisheng, Zhang, Jian

arXiv.org Artificial IntelligenceFeb-9-2025

In this study, we enhance underwater target detection by integrating channel and spatial attention into YOLOv8's backbone, applying Pointwise Convolution in FasterNeXt for the FasterPW model, and leveraging Weighted Concat in a BiFPN-inspired WFPN structure for improved cross-scale connections and robustness. Utilizing CARAFE for refined feature reassembly, our framework addresses underwater image degradation, achieving mAP at 0.5 scores of 76.7 percent and 79.0 percent on URPC2019 and URPC2020 datasets, respectively. These scores are 2.3 percent and 0.7 percent higher than the original YOLOv8, showcasing enhanced precision in detecting marine organisms.

information, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2502.05788

Country:

North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
Asia > China (0.04)
Pacific Ocean (0.04)
(3 more...)

Genre: Research Report > New Finding (0.88)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
(3 more...)

Add feedback

Toward Copyright Integrity and Verifiability via Multi-Bit Watermarking for Intelligent Transportation Systems

Wang, Yihao, Li, Lingxiao, Tang, Yifan, Zhang, Ru, Liu, Jianyi

arXiv.org Artificial IntelligenceFeb-7-2025

Intelligent transportation systems (ITS) use advanced technologies such as artificial intelligence to significantly improve traffic flow management efficiency, and promote the intelligent development of the transportation industry. However, if the data in ITS is attacked, such as tampering or forgery, it will endanger public safety and cause social losses. Therefore, this paper proposes a watermarking that can verify the integrity of copyright in response to the needs of ITS, termed ITSmark. ITSmark focuses on functions such as extracting watermarks, verifying permission, and tracing tampered locations. The scheme uses the copyright information to build the multi-bit space and divides this space into multiple segments. These segments will be assigned to tokens. Thus, the next token is determined by its segment which contains the copyright. In this way, the obtained data contains the custom watermark. To ensure the authorization, key parameters are encrypted during copyright embedding to obtain cipher data. Only by possessing the correct cipher data and private key, can the user entirely extract the watermark. Experiments show that ITSmark surpasses baseline performances in data quality, extraction accuracy, and unforgeability. It also shows unique capabilities of permission verification and tampered location tracing, which ensures the security of extraction and the reliability of copyright verification. Furthermore, ITSmark can also customize the watermark embedding position and proportion according to user needs, making embedding more flexible.

data mining, large language model, machine learning, (21 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/TITS.2025.3535932

2502.05425

Country:

Asia > China > Beijing > Beijing (0.04)
North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)
Pacific Ocean > North Pacific Ocean > Philippine Sea > Leyte Gulf (0.04)
(2 more...)

Genre:

Research Report (1.00)
Overview (0.67)

Industry:

Information Technology > Security & Privacy (1.00)
Transportation > Ground > Road (0.93)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(5 more...)

Add feedback