AITopics | Pacific Ocean

Pacific Ocean

After months fighting Houthis on the USS Eisenhower, sailors face a new kind of sea threat

FOX News, Feb-15-2024, 16:50:45 GMT

Kirk Lippold discusses the reported three U.S. strikes against Houthis in Yemen on 'Your World.' Sailors aboard the aircraft carrier USS Dwight D. Eisenhower and its accompanying warships have spent four months straight at sea defending against ballistic missiles and flying attack drones fired by Iranian-backed Houthis, and are now more regularly also defending against a new threat -- fast unmanned vessels that are fired at them through the water. While the Houthis have launched unmanned surface vessels, or USVs, in the past against Saudi coalition forces that have intervened in Yemen's civil war, they were used for the first time against U.S. military and commercial vessels in the Red Sea on Jan. 4. In the weeks since, the Navy has had to intercept and destroy multiple USVs. It's "more of an unknown threat that we don't have a lot of intel on, that could be extremely lethal -- an unmanned surface vessel," said Rear Adm. Marc Miguez, commander of Carrier Strike Group Two, of which the Eisenhower is the flagship. The Houthis "have ways of obviously controlling them just like they do the (unmanned aerial vehicles), and we have very little fidelity as to all the stockpiles of what they have USV-wise," Miguez said.

eisenhower, houthis, threat, (14 more...)

FOX News

Country:

North America > United States (1.00)
Asia > Middle East > Yemen (0.97)
Africa > Middle East > Djibouti (0.39)
(11 more...)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Government > Military > Navy (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.56)

Parametric Augmentation for Time Series Contrastive Learning

Zheng, Xu, Wang, Tianchun, Cheng, Wei, Ma, Aitian, Chen, Haifeng, Sha, Mo, Luo, Dongsheng

arXiv.org Artificial Intelligence, Feb-15-2024

Modern techniques like contrastive learning have been effectively used in many areas, including computer vision, natural language processing, and graph-structured data. Creating positive examples that assist the model in learning robust and discriminative representations is a crucial stage in contrastive learning approaches. Usually, preset human intuition directs the selection of relevant data augmentations. Due to patterns that are easily recognized by humans, this rule of thumb works well in the vision and language domains. However, it is impractical to visually inspect the temporal structures in time series. The diversity of time series augmentations at both the dataset and instance levels makes it difficult to choose meaningful augmentations on the fly. In this study, we address this gap by analyzing time series data augmentation using information theory and summarizing the most commonly adopted augmentations in a unified format. We then propose a contrastive learning framework with parametric augmentation, AutoTCL, which can be adaptively employed to support time series representation learning. The proposed approach is encoder-agnostic, allowing it to be seamlessly integrated with different backbone encoders. Experiments on univariate forecasting tasks demonstrate the highly competitive results of our method, with an average 6.5% reduction in MSE and 4.7% in MAE over the leading baselines. In classification tasks, AutoTCL achieves a 1.2% increase in average accuracy.
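
As a rough illustration of the contrastive objective that frameworks like this build on, the sketch below computes an InfoNCE-style loss over a batch of embedding pairs with NumPy. It is not AutoTCL: the `info_nce` function, the temperature value, and the fixed scale-and-noise augmentation are assumptions made for illustration, whereas the abstract describes an augmentation whose parameters are learned.

```python
import numpy as np

def info_nce(anchors, positives, temperature=0.1):
    """InfoNCE-style loss: row i of `positives` is the positive for row i of `anchors`."""
    # Normalise rows so the dot product is a cosine similarity.
    a = anchors / np.linalg.norm(anchors, axis=1, keepdims=True)
    p = positives / np.linalg.norm(positives, axis=1, keepdims=True)
    logits = a @ p.T / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # The matching pair sits on the diagonal; all other rows act as negatives.
    return float(-np.mean(np.diag(log_probs)))

rng = np.random.default_rng(0)
series = rng.normal(size=(16, 32))  # 16 toy series, used directly as embeddings
# Hypothetical fixed augmentation (an assumption): rescale and add small noise.
augmented = 1.5 * series + 0.05 * rng.normal(size=series.shape)

aligned_loss = info_nce(series, augmented)
shuffled_loss = info_nce(series, np.roll(augmented, 1, axis=0))
print(aligned_loss, shuffled_loss)
```

The loss is small when each series is paired with its own augmented view and much larger when the pairs are misaligned, which is the signal an encoder would be trained on.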

augmentation, contrastive learning, learning, (15 more...)

arXiv.org Artificial Intelligence

2402.10434

Country:

Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
North America > United States > Pennsylvania (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
(2 more...)

Genre: Research Report > New Finding (0.66)

Industry: Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

Yuan, Huizhuo, Chen, Zixiang, Ji, Kaixuan, Gu, Quanquan

arXiv.org Artificial Intelligence, Feb-15-2024

Fine-tuning Diffusion Models remains an underexplored frontier in generative artificial intelligence (GenAI), especially when compared with the remarkable progress made in fine-tuning Large Language Models (LLMs). While cutting-edge diffusion models such as Stable Diffusion (SD) and SDXL rely on supervised fine-tuning, their performance inevitably plateaus after seeing a certain volume of data. Recently, reinforcement learning (RL) has been employed to fine-tune diffusion models with human preference data, but it requires at least two images ("winner" and "loser" images) for each text prompt. In this paper, we introduce an innovative technique called self-play fine-tuning for diffusion models (SPIN-Diffusion), where the diffusion model engages in competition with its earlier versions, facilitating an iterative self-improvement process. Our approach offers an alternative to conventional supervised fine-tuning and RL strategies, significantly improving both model performance and alignment. Our experiments on the Pick-a-Pic dataset reveal that SPIN-Diffusion outperforms the existing supervised fine-tuning method in aspects of human preference alignment and visual appeal right from its first iteration. By the second iteration, it exceeds the performance of RLHF-based methods across all metrics, achieving these results with less data.

diffusion model, diffusion-dpo, spin-diffusion, (15 more...)

arXiv.org Artificial Intelligence

2402.1021

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.28)
Europe > Switzerland > Zürich > Zürich (0.14)
Pacific Ocean > North Pacific Ocean > San Francisco Bay > Golden Gate (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)

Genre: Research Report > New Finding (0.93)

Industry: Leisure & Entertainment > Games (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.48)

Generative Representational Instruction Tuning

Muennighoff, Niklas, Su, Hongjin, Wang, Liang, Yang, Nan, Wei, Furu, Yu, Tao, Singh, Amanpreet, Kiela, Douwe

arXiv.org Artificial Intelligence, Feb-15-2024

All text-based language problems can be reduced to either generation or embedding. Current models only perform well at one or the other. We introduce generative representational instruction tuning (GRIT) whereby a large language model is trained to handle both generative and embedding tasks by distinguishing between them through instructions. Compared to other open models, our resulting GritLM 7B sets a new state of the art on the Massive Text Embedding Benchmark (MTEB) and outperforms all models up to its size on a range of generative tasks. By scaling up further, GritLM 8x7B outperforms all open generative language models that we tried while still being among the best embedding models. Notably, we find that GRIT matches training on only generative or embedding data, thus we can unify both at no performance loss. Among other benefits, the unification via GRIT speeds up Retrieval-Augmented Generation (RAG) by > 60% for long documents, by no longer requiring separate retrieval and generation models. Models, code, etc. are freely available at https://github.com/ContextualAI/gritlm.

instruction, query, question title, (13 more...)

arXiv.org Artificial Intelligence

2402.09906

Country:

South America (0.04)
Pacific Ocean > North Pacific Ocean > Gulf of California (0.04)
North America > United States > Alaska (0.04)
(13 more...)

Genre: Research Report > New Finding (0.92)

Industry:

Health & Medicine (1.00)
Information Technology > Services (0.67)
Energy > Power Industry (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Spectral Filters, Dark Signals, and Attention Sinks

Cancedda, Nicola

arXiv.org Artificial Intelligence, Feb-14-2024

Projecting intermediate representations onto the vocabulary is an increasingly popular interpretation tool for transformer-based LLMs, also known as the logit lens. We propose a quantitative extension to this approach and define spectral filters on intermediate representations based on partitioning the singular vectors of the vocabulary embedding and unembedding matrices into bands. We find that the signals exchanged in the tail end of the spectrum are responsible for attention sinking (Xiao et al. 2023), of which we provide an explanation. We find that the loss of pretrained models can be kept low despite suppressing sizable parts of the embedding spectrum in a layer-dependent way, as long as attention sinking is preserved. Finally, we discover that the representations of tokens that draw attention from many tokens have large projections on the tail end of the spectrum.
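
One way to picture a filter of this kind is to take the singular value decomposition of an unembedding matrix, split its right singular vectors into contiguous bands, and project a hidden vector onto each band. The sketch below does that with NumPy on a random matrix. The function name, the equal-width banding, and the toy dimensions are assumptions; the paper's exact band definition may differ.

```python
import numpy as np

def spectral_band_filters(unembedding, num_bands):
    """Build one projection matrix per band of right singular vectors.

    `unembedding` has shape (vocab_size, hidden_dim) with vocab_size >= hidden_dim.
    Singular vectors come back ordered by decreasing singular value, so the
    last band corresponds to the tail end of the spectrum.
    """
    _, _, vt = np.linalg.svd(unembedding, full_matrices=False)
    bands = np.array_split(np.arange(vt.shape[0]), num_bands)
    return [vt[idx].T @ vt[idx] for idx in bands]

rng = np.random.default_rng(0)
W = rng.normal(size=(50, 8))   # toy stand-in for a real unembedding matrix
filters = spectral_band_filters(W, num_bands=4)

h = rng.normal(size=8)         # toy intermediate representation
parts = [P @ h for P in filters]
tail_share = np.linalg.norm(parts[-1]) ** 2 / np.linalg.norm(h) ** 2
print(tail_share)
```

Because the bands partition an orthonormal basis, the per-band components add back up to the original vector, so suppressing a band amounts to dropping one term of that sum.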

residual stream, subspace, tuatara, (15 more...)

arXiv.org Artificial Intelligence

2402.09221

Country:

Pacific Ocean > North Pacific Ocean > Gulf of Alaska (0.05)
North America > United States > Alaska > Gulf of Alaska (0.05)
Pacific Ocean > North Pacific Ocean > Bering Sea > Bristol Bay (0.04)
(11 more...)

Genre: Research Report (1.00)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.92)

Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping

Wang, Haoyu, Ma, Guozheng, Meng, Ziqiao, Qin, Zeyu, Shen, Li, Zhang, Zhong, Wu, Bingzhe, Liu, Liu, Bian, Yatao, Xu, Tingyang, Wang, Xueqian, Zhao, Peilin

arXiv.org Artificial Intelligence, Feb-12-2024

Self-alignment is an effective way to reduce the cost of human annotation while ensuring promising model capability. However, most current methods complete the data collection and training steps in a single round, which may overlook the continuously improving ability of self-aligned models. This gives rise to a key query: What if we do multi-time bootstrapping self-alignment? Does this strategy enhance model performance or lead to rapid degradation? In this paper, our pioneering exploration delves into the impact of bootstrapping self-alignment on large language models. Our findings reveal that bootstrapping self-alignment markedly surpasses the single-round approach, by guaranteeing data diversity from in-context learning. To further exploit the capabilities of bootstrapping, we investigate and adjust the training order of data, which yields improved performance of the model. Drawing on these findings, we propose Step-On-Feet Tuning (SOFT), which leverages the model's continuously enhanced few-shot ability to boost zero- or one-shot performance. Based on an easy-to-hard training recipe, we propose SOFT+, which further boosts self-alignment's performance. Our experiments demonstrate the efficiency of SOFT (SOFT+) across various classification and generation tasks, highlighting the potential of bootstrapping self-alignment for continually enhancing model alignment performance.

iclexample, internal thought, reliable assistant, (12 more...)

arXiv.org Artificial Intelligence

2402.0761

Country:

North America > Canada (0.14)
Asia > China (0.05)
Europe > Spain (0.04)
(9 more...)

Genre:

Personal (1.00)
Research Report > New Finding (0.87)

Industry:

Leisure & Entertainment (1.00)
Law (1.00)
Health & Medicine > Consumer Health (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Generalizing Conversational Dense Retrieval via LLM-Cognition Data Augmentation

Chen, Haonan, Dou, Zhicheng, Mao, Kelong, Liu, Jiongnan, Zhao, Ziliang

arXiv.org Artificial Intelligence, Feb-10-2024

Conversational search utilizes multi-turn natural language contexts to retrieve relevant passages. Existing conversational dense retrieval models mostly view a conversation as a fixed sequence of questions and responses, overlooking the severe data sparsity problem -- that is, users can perform a conversation in various ways, and these alternate conversations are unrecorded. Consequently, they often struggle to generalize to diverse conversations in real-world scenarios. In this work, we propose a framework for generalizing Conversational dense retrieval via LLM-cognition data Augmentation (ConvAug). ConvAug first generates multi-level augmented conversations to capture the diverse nature of conversational contexts. Inspired by human cognition, we devise a cognition-aware process to mitigate the generation of false positives, false negatives, and hallucinations. Moreover, we develop a difficulty-adaptive sample filter that selects challenging samples for complex conversations, thereby giving the model a larger learning space. A contrastive learning objective is then employed to train a better conversational context encoder. Extensive experiments conducted on four public datasets, under both normal and zero-shot settings, demonstrate the effectiveness, generalizability, and applicability of ConvAug.

comprehension synthesis, computational linguistic, onv, (12 more...)

arXiv.org Artificial Intelligence

2402.07092

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
Asia > Singapore (0.04)
North America > Canada > Ontario > Toronto (0.04)
(13 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

EntGPT: Linking Generative Large Language Models with Knowledge Bases

Ding, Yifan, Poudel, Amrit, Zeng, Qingkai, Weninger, Tim, Veeramani, Balaji, Bhattacharya, Sanmitra

arXiv.org Artificial Intelligence, Feb-9-2024

The ability of Large Language Models (LLMs) to generate factually correct output remains relatively unexplored due to the lack of fact-checking and knowledge grounding during training and inference. In this work, we aim to address this challenge through the Entity Disambiguation (ED) task. We first consider prompt engineering, and design a three-step hard-prompting method to probe LLMs' ED performance without supervised fine-tuning (SFT). Overall, the prompting method improves the micro-F1 score of the original vanilla models by a large margin, in some cases up to 36% and higher, and obtains comparable performance across 10 datasets when compared to existing methods with SFT. We further improve the knowledge grounding ability through instruction tuning (IT) with similar prompts and responses. The instruction-tuned model not only achieves a higher micro-F1 score than several baseline methods on supervised entity disambiguation tasks, with an average micro-F1 improvement of 2.1% over the existing baseline models, but also obtains higher accuracy on six Question Answering (QA) tasks in the zero-shot setting. Our methodologies apply to both open- and closed-source LLMs.
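
For readers unfamiliar with the metric, micro-F1 pools true positives, false positives, and false negatives over all evaluated items before computing precision and recall. The sketch below shows that arithmetic in plain Python. The `micro_f1` helper and the toy counts are illustrative assumptions, not figures or code from the paper.

```python
def micro_f1(counts):
    """Micro-averaged F1 from a list of (true_pos, false_pos, false_neg) tuples."""
    tp = sum(c[0] for c in counts)
    fp = sum(c[1] for c in counts)
    fn = sum(c[2] for c in counts)
    if tp == 0:
        return 0.0  # no correct predictions, so precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Toy counts for two hypothetical datasets (invented for illustration).
toy_counts = [(8, 2, 2), (1, 1, 3)]
score = micro_f1(toy_counts)
print(round(score, 4))
```

Pooling the counts first means larger datasets weigh more heavily in the score, unlike a macro average that treats every dataset equally.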

arxiv preprint arxiv, computational linguistic, entgpt-i, (12 more...)

arXiv.org Artificial Intelligence

2402.06738

Country:

North America > United States > New York > New York County > New York City (0.28)
Africa > Middle East > Egypt (0.28)
North America > United States > Washington > King County > Seattle (0.14)
(34 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Education (1.00)
Government > Regional Government (0.93)
Government > Military (0.68)
Leisure & Entertainment > Sports > Football (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

FusionSF: Fuse Heterogeneous Modalities in a Vector Quantized Framework for Robust Solar Power Forecasting

Ma, Ziqing, Wang, Wenwei, Zhou, Tian, Chen, Chao, Peng, Bingqing, Sun, Liang, Jin, Rong

arXiv.org Artificial Intelligence, Feb-8-2024

Accurate solar power forecasting is crucial for integrating photovoltaic plants into the electric grid and for scheduling and safeguarding the power grid. This problem becomes more demanding for newly installed solar plants that lack sufficient data. Current research predominantly relies on historical solar power data or numerical weather prediction in a single-modality format, ignoring the complementary information provided in different modalities. In this paper, we propose a multi-modality fusion framework to integrate historical power data, numerical weather prediction, and satellite images, significantly improving forecast performance. We introduce a vector quantized framework that aligns modalities with varying information densities, striking a balance between integrating sufficient information and averting model overfitting. Our framework demonstrates strong zero-shot forecasting capability, which is especially useful for newly installed plants. Moreover, we collect and release a multi-modal solar power (MMSP) dataset from real-world plants to further promote the research of multi-modal solar forecasting algorithms. Our extensive experiments show that our model not only operates with robustness but also boosts accuracy in both zero-shot forecasting and scenarios rich with training data, surpassing leading models. We have incorporated it into our eForecaster platform and deployed it for more than 300 solar plants with a capacity of over 15GW.
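
The core operation in vector quantization generally is a nearest-codebook lookup: each feature vector is replaced by the closest entry in a shared codebook. The NumPy sketch below shows only that generic step. It is not the FusionSF architecture, and the `quantize` function, codebook values, and feature values are all invented for illustration.

```python
import numpy as np

def quantize(vectors, codebook):
    """Map each row of `vectors` to its nearest codebook entry (squared Euclidean)."""
    dists = ((vectors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    indices = dists.argmin(axis=1)
    return codebook[indices], indices

# Hypothetical 3-entry codebook and three toy feature vectors.
codebook = np.array([[0.0, 0.0], [1.0, 1.0], [4.0, 0.0]])
features = np.array([[0.1, -0.2], [0.9, 1.2], [3.5, 0.4]])

quantized, indices = quantize(features, codebook)
print(indices)
```

Mapping features from different sources into one discrete codebook is one plausible reading of how inputs with different information densities could be aligned, though the abstract does not spell out the mechanism.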

dataset, forecasting, prediction, (12 more...)

arXiv.org Artificial Intelligence

2402.05823

Country:

North America > United States > District of Columbia > Washington (0.05)
Asia > China (0.05)
Asia > Japan (0.04)
(8 more...)

Genre: Research Report (0.82)

Industry:

Energy > Renewable > Solar (1.00)
Energy > Power Industry (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)

Digital Twin Mobility Profiling: A Spatio-Temporal Graph Learning Approach

Chen, Xin, Hou, Mingliang, Tang, Tao, Kaur, Achhardeep, Xia, Feng

arXiv.org Artificial Intelligence, Feb-6-2024

With the arrival of the big data era, mobility profiling has become a viable method of utilizing enormous amounts of mobility data to create an intelligent transportation system. Mobility profiling can extract potential patterns in urban traffic from mobility data and is critical for a variety of traffic-related applications. However, due to the high level of complexity and the huge amount of data, mobility profiling faces huge challenges. Digital Twin (DT) technology paves the way for cost-effective and performance-optimised management by digitally creating a virtual representation of the network to simulate its behaviour. In order to capture the complex spatio-temporal features in traffic scenarios, we construct alignment diagrams to assist in completing the spatio-temporal correlation representation and design a dilated alignment convolution network (DACN) to learn the fine-grained correlations, i.e., spatio-temporal interactions. We propose a digital twin mobility profiling (DTMP) framework to learn node profiles on a mobility network DT model. Extensive experiments have been conducted on three real-world datasets. Experimental results demonstrate the effectiveness of DTMP.

graph, node, spatial graph, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/HPCC-DSS-SmartCity-DependSys53884.2021.00182

2402.0375

Country:

North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
Oceania > Australia (0.04)
Asia > China > Liaoning Province > Dalian (0.04)
(5 more...)

Genre: Research Report (0.70)

Industry:

Transportation > Infrastructure & Services (1.00)
Transportation > Ground > Road (1.00)
Information Technology (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
