Goto

Collaborating Authors

 spoiler


The Correspondence Between Bounded Graph Neural Networks and Fragments of First-Order Logic

Grau, Bernardo Cuenca, Feng, Eva, Wałęga, Przemysław A.

arXiv.org Artificial Intelligence

Graph Neural Networks (GNNs) address two key challenges in applying deep learning to graph-structured data: they handle varying size input graphs and ensure invariance under graph isomorphism. While GNNs have demonstrated broad applicability, understanding their expressive power remains an important question. In this paper, we propose GNN architectures that correspond precisely to prominent fragments of first-order logic (FO), including various modal logics as well as more expressive two-variable fragments. To establish these results, we apply methods from finite model theory of first-order and modal logics to the domain of graph representation learning. Our results provide a unifying framework for understanding the logical expressiveness of GNNs within FO.


Man tests if Tesla on Autopilot will slam through foam wall (spoiler: it did)

Popular Science

It turns out Tesla's camera-vision-only approach to self-driving is no match for a Wile E. Coyote-style fake wall. Earlier this week, former NASA engineer and YouTuber Mark Rober posted a video where he tried to see if he could trick a Tesla Model Y using its Autopilot driver-assist function into driving through a Styrofoam wall disguised to look like part of the road in front of it. The Tesla hurls towards the wall at 40 mph and, rather than stopping, plows straight through it, leaving a giant hole. "It turns out my Tesla is less Road Runner, more Wile E. Coyote," Rober says as he inspects the damage on the front hood. The video, posted only a couple days ago, had racked up over 20 million views by Wednesday morning.


Fine-Grained Expressive Power of Weisfeiler-Leman: A Homomorphism Counting Perspective

Zhou, Junru, Zhang, Muhan

arXiv.org Artificial Intelligence

The ability of graph neural networks (GNNs) to count homomorphisms has recently been proposed as a practical and fine-grained measure of their expressive power. Although several existing works have investigated the homomorphism counting power of certain GNN families, a simple and unified framework for analyzing the problem is absent. In this paper, we first propose \emph{generalized folklore Weisfeiler-Leman (GFWL)} algorithms as a flexible design basis for expressive GNNs, and then provide a theoretical framework to algorithmically determine the homomorphism counting power of an arbitrary class of GNN within the GFWL design space. As the considered design space is large enough to accommodate almost all known powerful GNNs, our result greatly extends all existing works, and may find its application in the automation of GNN model design.


Generating clickbait spoilers with an ensemble of large language models

Woźny, Mateusz, Lango, Mateusz

arXiv.org Artificial Intelligence

Clickbait posts are a widespread problem in the webspace. The generation of spoilers, i.e. short texts that neutralize clickbait by providing information that satisfies the curiosity induced by it, is one of the proposed solutions to the problem. Current state-of-the-art methods are based on passage retrieval or question answering approaches and are limited to generating spoilers only in the form of a phrase or a passage. In this work, we propose an ensemble of fine-tuned large language models for clickbait spoiler generation. Our approach is not limited to phrase or passage spoilers, but is also able to generate multipart spoilers that refer to several non-consecutive parts of text. Experimental evaluation demonstrates that the proposed ensemble model outperforms the baselines in terms of BLEU, METEOR and BERTScore metrics.


Mitigating Clickbait: An Approach to Spoiler Generation Using Multitask Learning

Pal, Sayantan, Das, Souvik, Srihari, Rohini K.

arXiv.org Artificial Intelligence

This study introduces 'clickbait spoiling', a novel technique designed to detect, categorize, and generate spoilers as succinct text responses, countering the curiosity induced by clickbait content. By leveraging a multi-task learning framework, our model's generalization capabilities are significantly enhanced, effectively addressing the pervasive issue of clickbait. The crux of our research lies in generating appropriate spoilers, be it a phrase, an extended passage, or multiple, depending on the spoiler type required. Our methodology integrates two crucial techniques: a refined spoiler categorization method and a modified version of the Question Answering (QA) mechanism, incorporated within a multi-task learning paradigm for optimized spoiler extraction from context. Notably, we have included fine-tuning methods for models capable of handling longer sequences to accommodate the generation of extended spoilers. This research highlights the potential of sophisticated text processing techniques in tackling the omnipresent issue of clickbait, promising an enhanced user experience in the digital realm.


MMoE: Robust Spoiler Detection with Multi-modal Information and Domain-aware Mixture-of-Experts

Zeng, Zinan, Ye, Sen, Cai, Zijian, Wang, Heng, Liu, Yuhan, Zhang, Haokai, Luo, Minnan

arXiv.org Artificial Intelligence

Online movie review websites are valuable for information and discussion about movies. However, the massive spoiler reviews detract from the movie-watching experience, making spoiler detection an important task. Previous methods simply focus on reviews' text content, ignoring the heterogeneity of information in the platform. For instance, the metadata and the corresponding user's information of a review could be helpful. Besides, the spoiler language of movie reviews tends to be genre-specific, thus posing a domain generalization challenge for existing methods. To this end, we propose MMoE, a multi-modal network that utilizes information from multiple modalities to facilitate robust spoiler detection and adopts Mixture-of-Experts to enhance domain generalization. MMoE first extracts graph, text, and meta feature from the user-movie network, the review's textual content, and the review's metadata respectively. To handle genre-specific spoilers, we then adopt Mixture-of-Experts architecture to process information in three modalities to promote robustness. Finally, we use an expert fusion layer to integrate the features from different perspectives and make predictions based on the fused embedding. Experiments demonstrate that MMoE achieves state-of-the-art performance on two widely-used spoiler detection datasets, surpassing previous SOTA methods by 2.56% and 8.41% in terms of accuracy and F1-score. Further experiments also demonstrate MMoE's superiority in robustness and generalization.


An extension of May's Theorem to three alternatives: axiomatizing Minimax voting

Holliday, Wesley H., Pacuit, Eric

arXiv.org Artificial Intelligence

O. May, Econometrica 20 (1952) 680-684] characterizes majority voting on two alternatives as the unique preferential voting method satisfying several simple axioms. Here we show that by adding some desirable axioms to May's axioms, we can uniquely determine how to vote on three alternatives. In particular, we add two axioms stating that the voting method should mitigate spoiler effects and avoid the so-called strong no show paradox. We prove a theorem stating that any preferential voting method satisfying our enlarged set of axioms, which includes some weak homogeneity and preservation axioms, agrees with Minimax voting in all three-alternative elections, except perhaps in some improbable knife-edged elections in which ties may arise and be broken in different ways.


Low-Resource Clickbait Spoiling for Indonesian via Question Answering

Maharani, Ni Putu Intan, Purwarianti, Ayu, Aji, Alham Fikri

arXiv.org Artificial Intelligence

Clickbait spoiling aims to generate a short text to satisfy the curiosity induced by a clickbait post. As it is a newly introduced task, the dataset is only available in English so far. Our contributions include the construction of manually labeled clickbait spoiling corpus in Indonesian and an evaluation on using cross-lingual zero-shot question answering-based models to tackle clikcbait spoiling for low-resource language like Indonesian. We utilize selection of multilingual language models. The experimental results suggest that XLM-RoBERTa (large) model outperforms other models for phrase and passage spoilers, meanwhile, mDeBERTa (base) model outperforms other models for multipart spoilers.


Clickbait Classification and Spoiling Using Natural Language Processing

Thirumala, Adhitya, Ferracane, Elisa

arXiv.org Artificial Intelligence

Clickbait is the practice of engineering titles to incentivize readers to click through to articles. Such titles with sensationalized language reveal as little information as possible. Occasionally, clickbait will be intentionally misleading, so natural language processing (NLP) can scan the article and answer the question posed by the clickbait title, or spoil it. We tackle two tasks: classifying the clickbait into one of 3 types (Task 1), and spoiling the clickbait (Task 2). For Task 1, we propose two binary classifiers to determine the final spoiler type. For Task 2, we experiment with two approaches: using a question-answering model to identify the span of text of the spoiler, and using a large language model (LLM) to generate the spoiler. Because the spoiler is contained in the article, we frame the second task as a question-answering approach for identifying the starting and ending positions of the spoiler. We created models for Task 1 that were better than the baselines proposed by the dataset authors and engineered prompts for Task 2 that did not perform as well as the baselines proposed by the dataset authors due to the evaluation metric performing worse when the output text is from a generative model as opposed to an extractive model.


'The Legend of Zelda: Tears of the Kingdom' spoilers flood the internet after pre-launch leak

Engadget

If you were hoping to avoid spoilers before playing Nintendo's highly anticipated follow-up to Breath of the Wild, that goal just got a lot harder: The Legend of Zelda: Tears of the Kingdom has been leaked. Physical copies of the game appeared on sites over the weekend, apparently selling for up to $300 each. Now, digital copies of the game have shown up online. Some who have downloaded the pirated files have streamed footage of the game's intro and gameplay. How this all happened still isn't completely clear.