AITopics | Kutaisi

Collaborating Authors

Kutaisi

Adversarial Tokenization

Geh, Renato Lui, Shao, Zilei, Broeck, Guy Van den

arXiv.org Artificial IntelligenceMar-3-2025

Current LLM pipelines account for only one possible tokenization for a given string, ignoring exponentially many alternative tokenizations during training and inference. For example, the standard Llama3 tokenization of penguin is [p,enguin], yet [peng,uin] is another perfectly valid alternative. In this paper, we show that despite LLMs being trained solely on one tokenization, they still retain semantic understanding of other tokenizations, raising questions about their implications in LLM safety. Put succinctly, we answer the following question: can we adversarially tokenize an obviously malicious string to evade safety and alignment restrictions? We show that not only is adversarial tokenization an effective yet previously neglected axis of attack, but it is also competitive against existing state-of-the-art adversarial approaches without changing the text of the harmful request. We empirically validate this exploit across three state-of-the-art LLMs and adversarial datasets, revealing a previously unknown vulnerability in subword models.

fulfill, preprint, tokenization, (15 more...)

arXiv.org Artificial Intelligence

2503.02174

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Africa > Cameroon (0.14)
North America > United States > Florida > Miami-Dade County > Miami (0.04)
(19 more...)

Genre: Research Report > New Finding (0.92)

Industry:

Information Technology > Security & Privacy (1.00)
Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Stationary Processes, Wiener-Granger Causality, and Matrix Spectral Factorization

Ephremidze, Lasha

arXiv.org Machine LearningDec-25-2024

Granger causality has become an indispensable tool for analyzing causal relationships between time series. In this paper, we provide a detailed overview of its mathematical foundations, trace its historical development, and explore how recent computational advancements can enhance its application in various fields. We will not hesitate to present the proofs in full if they are simple and transparent. For more complex theorems on which we rely, we will provide supporting citations. We also discuss potential future directions for the method, particularly in the context of largescale data analysis.

artificial intelligence, factorization, matrix, (11 more...)

arXiv.org Machine Learning

2412.18901

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > New York > New York County > New York City (0.04)
Asia > Georgia > Tbilisi > Tbilisi (0.04)
Asia > Georgia > Imereti > Kutaisi (0.04)

Genre:

Research Report (0.50)
Overview (0.49)

Technology: Information Technology > Artificial Intelligence (0.69)

Add feedback