Goto

Collaborating Authors

 South America


The top 3 factors heightening the risk of terror attacks on the homeland

FOX News

As a former military intelligence officer, serving in the Defense Intelligence Agency (DIA), I tracked foreign threats to the U.S. homeland, identifying adversaries' plans, intentions and capabilities that could harm Americans. I predicted Russia's invasion of Ukraine more than a year before it took place. In March, in my Fox News Digital article titled "Ignore FBI director's urgent warning about terrorist threats at our own peril," I predicted terrorist attacks striking inside the U.S. homeland, the kind that took place on New Year's Day in New Orleans and in Las Vegas. Here are the top three reasons why we will likely face more terrorism in America this year. This time, it will be something we haven't seen before.


Large Language Models, Knowledge Graphs and Search Engines: A Crossroads for Answering Users' Questions

arXiv.org Artificial Intelligence

Much has been discussed about how Large Language Models, Knowledge Graphs and Search Engines can be combined in a synergistic manner. A dimension largely absent from current academic discourse is the user perspective. In particular, there remain many open questions regarding how best to address the diverse information needs of users, incorporating varying facets and levels of difficulty. This paper introduces a taxonomy of user information needs, which guides us to study the pros, cons and possible synergies of Large Language Models, Knowledge Graphs and Search Engines. From this study, we derive a roadmap for future research.


TP-Link's Tapo smart home ecosystem is expanding rapidly

PCWorld

TP-Link's Tapo smart home ecosystem gained a host of new products at CES 2025, including new home security cameras, smart lighting products--including a bona fide NVR system. But if the recent stories about security flaws in TP-Link routers gives you pause, TP-Link assures us that there's no longer any connection between it--TP-Link Systems--and China's TP-Link Technologies. A TP-Link Systems spokesperson told us the company manufactures its products in Brazil and Vietnam, not China, and that it "owns its own factories, designs and manufactures its products, and controls its full supply chain." With that out of the way, let's talk about the new Tapo products, starting with home security. The Tapo PalmKey Smart Door Lock features palm-vein recognition technology that scans the intricate, unique patterns of veins in your hand.


Extractive Structures Learned in Pretraining Enable Generalization on Finetuned Facts

arXiv.org Artificial Intelligence

Pretrained language models (LMs) can generalize to implications of facts that they are finetuned on. For example, if finetuned on ``John Doe lives in Tokyo," LMs can correctly answer ``What language do the people in John Doe's city speak?'' with ``Japanese''. However, little is known about the mechanisms that enable this generalization or how they are learned during pretraining. We introduce extractive structures as a framework for describing how components in LMs (e.g., MLPs or attention heads) coordinate to enable this generalization. The structures consist of informative components that store training facts as weight changes, and upstream and downstream extractive components that query and process the stored information to produce the correct implication. We hypothesize that extractive structures are learned during pretraining when encountering implications of previously known facts. This yields two predictions: a data ordering effect where extractive structures can be learned only if facts precede their implications, and a weight grafting effect where extractive structures can be transferred to predict counterfactual implications. We empirically demonstrate these phenomena in the OLMo-7b, Llama 3-8b, Gemma 2-9b, and Qwen 2-7b models. Of independent interest, our results also indicate that fact learning can occur at both early and late layers, which lead to different forms of generalization.


From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training

arXiv.org Machine Learning

We study the problem of training neural stochastic differential equations, or diffusion models, to sample from a Boltzmann distribution without access to target samples. Existing methods for training such models enforce time-reversal of the generative and noising processes, using either differentiable simulation or off-policy reinforcement learning (RL). We prove equivalences between families of objectives in the limit of infinitesimal discretization steps, linking entropic RL methods (GFlowNets) with continuous-time objects (partial differential equations and path space measures). We further show that an appropriate choice of coarse time discretization during training allows greatly improved sample efficiency and the use of time-local objectives, achieving competitive performance on standard sampling benchmarks with reduced computational cost.


Neural Network Verification is a Programming Language Challenge

arXiv.org Artificial Intelligence

Neural network verification is a new and rapidly developing field of research. So far, the main priority has been establishing efficient verification algorithms and tools, while proper support from the programming language perspective has been considered secondary or unimportant. Yet, there is mounting evidence that insights from the programming language community may make a difference in the future development of this domain. In this paper, we formulate neural network verification challenges as programming language challenges and suggest possible future solutions.


Towards a Probabilistic Framework for Analyzing and Improving LLM-Enabled Software

arXiv.org Artificial Intelligence

Ensuring the reliability and verifiability of large language model (LLM)-enabled systems remains a significant challenge in software engineering. We propose a probabilistic framework for systematically analyzing and improving these systems by modeling and refining distributions over clusters of semantically equivalent outputs. This framework facilitates the evaluation and iterative improvement of Transference Models -- key software components that utilize LLMs to transform inputs into outputs for downstream tasks. To illustrate its utility, we apply the framework to the autoformalization problem, where natural language documentation is transformed into formal program specifications. Our case illustrates how probabilistic analysis enables the identification of weaknesses and guides focused alignment improvements, resulting in more reliable and interpretable outputs. This principled approach offers a foundation for addressing critical challenges in the development of robust LLM-enabled systems.


Video game giant Ubisoft delays release date of Assassin's Creed Shadows again

BBC News

Assassin's Creed Shadows delayed again UbisoftFemale ninja Naoe is one of Assassin's Creed Shadows' two playable protagonists Video game giant Ubisoft has announced a further delay to its upcoming Assassin's Creed Shadows. The long-running series is one of the French publisher's flagship franchises, with recent instalment, Valhalla, reportedly making more than 1bn. Assassin's Creed Shadows, set in 16th Century Japan, was due to be released on PC, PlayStation and Xbox last November before an initial delay to February 2025. Announcing the new release date of 20 March, executive producer Marc-Alexis Coté said a "few additional weeks are needed" to ensure the game's launch goes smoothly. Players complained that Ubisoft's major 2024 release, Star Wars Outlaws, was launched with bugs and glitches.


Generative Flow Networks: Theory and Applications to Structure Learning

arXiv.org Artificial Intelligence

Without any assumptions about data generation, multiple causal models may explain our observations equally well. To avoid selecting a single arbitrary model that could result in unsafe decisions if it does not match reality, it is therefore essential to maintain a notion of epistemic uncertainty about our possible candidates. This thesis studies the problem of structure learning from a Bayesian perspective, approximating the posterior distribution over the structure of a causal model, represented as a directed acyclic graph (DAG), given data. It introduces Generative Flow Networks (GFlowNets), a novel class of probabilistic models designed for modeling distributions over discrete and compositional objects such as graphs. They treat generation as a sequential decision making problem, constructing samples of a target distribution defined up to a normalization constant piece by piece. In the first part of this thesis, we present the mathematical foundations of GFlowNets, their connections to existing domains of machine learning and statistics such as variational inference and reinforcement learning, and their extensions beyond discrete problems. In the second part of this thesis, we show how GFlowNets can approximate the posterior distribution over DAG structures of causal Bayesian Networks, along with the parameters of its causal mechanisms, given observational and experimental data.


Towards Automatic Evaluation for Image Transcreation

arXiv.org Artificial Intelligence

Beyond conventional paradigms of translating speech and text, recently, there has been interest in automated transcreation of images to facilitate localization of visual content across different cultures. Attempts to define this as a formal Machine Learning (ML) problem have been impeded by the lack of automatic evaluation mechanisms, with previous work relying solely on human evaluation. In this paper, we seek to close this gap by proposing a suite of automatic evaluation metrics inspired by machine translation (MT) metrics, categorized into: a) Object-based, b) Embedding-based, and c) VLM-based. Drawing on theories from translation studies and real-world transcreation practices, we identify three critical dimensions of image transcreation: cultural relevance, semantic equivalence and visual similarity, and design our metrics to evaluate systems along these axes. Our results show that proprietary VLMs best identify cultural relevance and semantic equivalence, while vision-encoder representations are adept at measuring visual similarity. Meta-evaluation across 7 countries shows our metrics agree strongly with human ratings, with average segment-level correlations ranging from 0.55-0.87. Finally, through a discussion of the merits and demerits of each metric, we offer a robust framework for automated image transcreation evaluation, grounded in both theoretical foundations and practical application. Our code can be found here: https://github.com/simran-khanuja/automatic-eval-transcreation