
 Law


Storm Karen to explode in just DAYS with hurricane models revealing dangerous path to the US

Daily Mail - Science & tech

A weather system developing right off the US East Coast could strengthen into a dangerous tropical storm in the coming days, experts have warned. Meteorologists said that this tropical rainstorm is expected to form quickly over the weekend, potentially bringing 'prolonged coastal flooding' to the entire East Coast, from southern Florida to New England.
The National Hurricane Center (NHC) has noted that the next two Atlantic storm names will be Jerry and Karen, with Jerry likely to form farther away from the US this week. As for the storm that could become Karen, forecasters have already predicted that North Carolina, Virginia, and Maryland are expected to get heavy rainfall between Friday and Saturday.


Artificial Armageddon? AI can now be used to design brand-new VIRUSES - sparking fears it could come up with a catastrophic bioweapon

Daily Mail - Science & tech

It sounds like the start of a sci-fi film, but scientists have shown that AI can design brand-new infectious viruses for the first time. Experts at Stanford University in California used 'Evo' - an AI tool that creates genomes from scratch. Amazingly, the tool was able to create viruses that are able to infect and kill specific bacteria.


Learning Survival Models with Right-Censored Reporting Delays

arXiv.org Machine Learning

Survival analysis is a statistical technique used to estimate the time until an event occurs. Although it is applied across a wide range of fields, adjusting for reporting delays under practical constraints remains a significant challenge in the insurance industry. Such delays render event occurrences unobservable when their reports are subject to right censoring. This issue becomes particularly critical when estimating hazard rates for newly enrolled cohorts with limited follow-up due to administrative censoring. Our study addresses this challenge by jointly modeling the parametric hazard functions of event occurrences and report timings. The joint probability distribution is marginalized over the latent event occurrence status. We construct an estimator for the proposed survival model and establish its asymptotic consistency. Furthermore, we develop an expectation-maximization algorithm to compute its estimates. Using these findings, we propose a two-stage estimation procedure based on a parametric proportional hazards model to evaluate observations subject to administrative censoring. Experimental results demonstrate that our method effectively improves the timeliness of risk evaluation for newly enrolled cohorts.


Bias and Coverage Properties of the WENDy-IRLS Algorithm

arXiv.org Machine Learning

The Weak form Estimation of Nonlinear Dynamics (WENDy) method is a recently proposed class of parameter estimation algorithms that exhibits notable noise robustness and computational efficiency. This work examines the coverage and bias properties of the original WENDy-IRLS algorithm's parameter and state estimators in the context of the following differential equations: Logistic, Lotka-Volterra, FitzHugh-Nagumo, Hindmarsh-Rose, and a Protein Transduction Benchmark. The estimators' performance was studied in simulated data examples, under four different noise distributions (normal, log-normal, additive censored normal, and additive truncated normal), and a wide range of noise, reaching levels much higher than previously tested for this algorithm.


SSFO: Self-Supervised Faithfulness Optimization for Retrieval-Augmented Generation

arXiv.org Artificial Intelligence

Retrieval-Augmented Generation (RAG) systems require Large Language Models (LLMs) to generate responses that are faithful to the retrieved context. However, faithfulness hallucination remains a critical challenge, as existing methods often require costly supervision and post-training or significant inference burdens. To overcome these limitations, we introduce Self-Supervised Faithfulness Optimization (SSFO), the first self-supervised alignment approach for enhancing RAG faithfulness. SSFO constructs preference data pairs by contrasting the model's outputs generated with and without the context. Leveraging Direct Preference Optimization (DPO), SSFO aligns model faithfulness without incurring labeling costs or additional inference burden. We theoretically and empirically demonstrate that SSFO leverages a benign form of likelihood displacement, transferring probability mass from parametric-based tokens to context-aligned tokens. Based on this insight, we propose a modified DPO loss function to encourage likelihood displacement. Comprehensive evaluations show that SSFO significantly outperforms existing methods, achieving state-of-the-art faithfulness on multiple context-based question-answering datasets. Notably, SSFO exhibits strong generalization, improving cross-lingual faithfulness and preserving general instruction-following capabilities. We release our code and model at the anonymous link: https://github.com/chkwy/SSFO
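As a rough illustration of the preference objective mentioned above, the following is a minimal sketch of the standard DPO loss for a single preference pair. It is not the modified loss proposed in the paper, which the abstract does not specify. The function name `dpo_loss`, its arguments, and the default `beta` are hypothetical, and the sketch assumes the response generated with the retrieved context is treated as the preferred ("chosen") one and the response generated without it as the rejected one.

```python
import math


def dpo_loss(logp_chosen, logp_rejected, ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """Standard DPO loss for one preference pair.

    Each argument is a sequence-level log-probability: the policy's and the
    frozen reference model's log-probability of the chosen response (assumed
    here: generated WITH context) and the rejected response (assumed here:
    generated WITHOUT context). All names are illustrative.
    """
    # Implicit reward margin between the chosen and rejected responses.
    margin = beta * (
        (logp_chosen - ref_logp_chosen) - (logp_rejected - ref_logp_rejected)
    )
    # -log(sigmoid(margin)) equals softplus(-margin); guard against overflow.
    if margin < -30.0:
        return -margin
    return math.log1p(math.exp(-margin))


# When the policy equals the reference, the margin is zero and the loss is log(2).
baseline = dpo_loss(-1.0, -1.0, -1.0, -1.0)
# Raising the chosen response's likelihood relative to the rejected one lowers the loss.
improved = dpo_loss(-0.5, -2.0, -1.0, -1.0)
```

In this form the loss only depends on how far the policy has moved from the reference on each response, which is why no human labels are needed once the chosen and rejected responses are fixed.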


Negative Shanshui: Real-time Interactive Ink Painting Synthesis

arXiv.org Artificial Intelligence

This paper presents Negative Shanshui, a real-time interactive AI synthesis approach that reinterprets classical Chinese landscape ink painting, i.e., shanshui, to engage with ecological crises in the Anthropocene. Negative Shanshui optimizes a fine-tuned Stable Diffusion model for real-time inference and integrates it with gaze-driven inpainting and frame interpolation; it enables dynamic morphing animations in response to the viewer's gaze and is presented as an interactive virtual reality (VR) experience. The paper describes the complete technical pipeline, covering the system framework, optimization strategies, gaze-based interaction, and multimodal deployment at an art festival. Further analysis of audience feedback collected during its public exhibition highlights how participants variously engaged with the work through empathy, ambivalence, and critical reflection.


LegalSearchLM: Rethinking Legal Case Retrieval as Legal Elements Generation

arXiv.org Artificial Intelligence

Legal Case Retrieval (LCR), which retrieves relevant cases from a query case, is a fundamental task for legal professionals in research and decision-making. However, existing studies on LCR face two major limitations. First, they are evaluated on relatively small-scale retrieval corpora (e.g., 100-55K cases) and use a narrow range of criminal query types, which cannot sufficiently reflect the complexity of real-world legal retrieval scenarios. Second, their reliance on embedding-based or lexical matching methods often results in limited representations and legally irrelevant matches. To address these issues, we present: (1) LEGAR BENCH, the first large-scale Korean LCR benchmark, covering 411 diverse crime types in queries over 1.2M candidate cases; and (2) LegalSearchLM, a retrieval model that performs legal element reasoning over the query case and directly generates content containing those elements, grounded in the target cases through constrained decoding. Experimental results show that LegalSearchLM outperforms baselines by 6-20% on LEGAR BENCH, achieving state-of-the-art performance. It also demonstrates strong generalization to out-of-domain cases, outperforming naive generative models trained on in-domain data by 15%.
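The abstract does not say how the constrained decoding is implemented. One common way to ground generation in a fixed corpus is to restrict each step to tokens that continue some sequence in the corpus, using a prefix trie. The sketch below illustrates that general idea only; `build_trie`, `allowed_next`, `greedy_constrained_decode`, the toy corpus, and the toy scoring function are all hypothetical and are not taken from the paper.

```python
def build_trie(sequences):
    """Build a nested-dict prefix trie from token sequences."""
    trie = {}
    for seq in sequences:
        node = trie
        for tok in seq:
            node = node.setdefault(tok, {})
    return trie


def allowed_next(trie, prefix):
    """Return the set of tokens that can follow `prefix` within the corpus."""
    node = trie
    for tok in prefix:
        node = node.get(tok)
        if node is None:
            return set()
    return set(node)


def greedy_constrained_decode(trie, score, max_len=10):
    """Greedy decoding where every step is limited to trie-allowed tokens.

    `score(prefix, token)` stands in for a language model's next-token score.
    """
    out = []
    for _ in range(max_len):
        options = allowed_next(trie, out)
        if not options:
            break
        out.append(max(options, key=lambda tok: score(out, tok)))
    return out


# Toy corpus of "legal element" phrases, tokenized by whitespace (illustrative only).
corpus = [
    ["theft", "at", "night"],
    ["theft", "with", "violence"],
    ["fraud", "by", "deception"],
]
trie = build_trie(corpus)
toy_scores = {"theft": 2, "fraud": 1, "with": 3, "at": 1}
decoded = greedy_constrained_decode(trie, lambda prefix, tok: toy_scores.get(tok, 0))
```

Because every emitted token must extend a path in the trie, the output is guaranteed to be a phrase that actually occurs in the corpus, whatever the scoring function prefers.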


StressTest: Can YOUR Speech LM Handle the Stress?

arXiv.org Artificial Intelligence

Sentence stress refers to emphasis on words within a spoken utterance to highlight or contrast an idea. It is often used to imply an underlying intention not explicitly stated. Recent speech-aware language models (SLMs) have enabled direct audio processing, allowing models to access the full richness of speech to perform audio reasoning tasks such as spoken question answering. Despite the crucial role of sentence stress in shaping meaning and intent, it remains largely overlooked in the evaluation and development of SLMs. We address this gap by introducing StressTest, a benchmark designed to evaluate models' ability to distinguish between meanings of speech based on the stress pattern. We evaluate leading SLMs and find that, despite their overall capabilities, they perform poorly on such tasks. Hence, we propose a novel data generation pipeline and create Stress-17k, a training set that simulates the change of meaning implied by stress variation. Results suggest that our finetuned model, StresSLM, generalizes well to real recordings and notably outperforms existing SLMs on sentence stress reasoning and detection. Models, code, data, samples - pages.cs.huji.ac.il/adiyoss-lab/stresstest.


RECAST: Expanding the Boundaries of LLMs' Complex Instruction Following with Multi-Constraint Data

arXiv.org Artificial Intelligence

Large language models (LLMs) are increasingly expected to tackle complex tasks, driven by their expanding applications and users' growing proficiency in crafting sophisticated prompts. However, as the number of explicitly stated requirements increases (particularly more than 10 constraints), LLMs often struggle to accurately follow such complex instructions, which limits their applicability in complex real-world scenarios. To the best of our knowledge, existing datasets do not exceed 10 constraints per instance. To address this challenge, we propose RECAST, an efficient and scalable framework for synthesizing datasets where each example incorporates far more constraints than those in existing benchmarks, aiming to challenge and extend the boundaries of models' ability to follow complex instructions. These constraints are extracted from real-world prompt-response pairs to ensure practical relevance. Using this framework, we construct RECAST-30K, a large-scale, high-quality dataset comprising 30k instances spanning 19 constraint types. Experimental results demonstrate that models finetuned on RECAST-30K substantially improve in following complex instructions while maintaining their general capabilities without degradation. Moreover, RECAST enables automatic verification of constraint satisfaction via rule-based validators for quantitative constraints and LLM-based validators for qualitative ones; the verifiability provided by RECAST enables the design of reward functions for reinforcement learning, which further boosts model performance on complex and challenging tasks.
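To make the idea of rule-based validation for quantitative constraints concrete, here is a minimal sketch of how such validators could feed a scalar reward. The abstract does not describe RECAST's validators or reward design, so the constraint types (`max_words`, `contains_keyword`, `bullet_count`) and the fraction-satisfied reward in `constraint_reward` are assumptions chosen for illustration.

```python
def max_words(limit):
    """Validator: the response has at most `limit` whitespace-separated words."""
    return lambda text: len(text.split()) <= limit


def contains_keyword(keyword):
    """Validator: the response mentions `keyword` (case-insensitive)."""
    return lambda text: keyword.lower() in text.lower()


def bullet_count(n):
    """Validator: the response has exactly `n` lines starting with '- '."""
    return lambda text: sum(
        1 for line in text.splitlines() if line.lstrip().startswith("- ")
    ) == n


def constraint_reward(response, validators):
    """Fraction of constraints satisfied, in [0, 1] (an assumed reward shape)."""
    if not validators:
        return 0.0
    return sum(1 for check in validators if check(response)) / len(validators)


response = "- alpha\n- beta"
validators = [max_words(10), contains_keyword("alpha"), bullet_count(3)]
reward = constraint_reward(response, validators)  # two of three constraints hold
```

Qualitative constraints, such as tone or style, cannot be checked this way, which is where the abstract's LLM-based validators would come in.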


Social Good or Scientific Curiosity? Uncovering the Research Framing Behind NLP Artefacts

arXiv.org Artificial Intelligence

Clarifying the research framing of NLP artefacts (e.g., models, datasets, etc.) is crucial to aligning research with practical applications. Recent studies manually analyzed NLP research across domains, showing that few papers explicitly identify key stakeholders, intended uses, or appropriate contexts. In this work, we propose to automate this analysis, developing a three-component system that infers research framings by first extracting key elements (means, ends, stakeholders), then linking them through interpretable rules and contextual reasoning. We evaluate our approach on two domains: automated fact-checking using an existing dataset, and hate speech detection for which we annotate a new dataset, achieving consistent improvements over strong LLM baselines. Finally, we apply our system to recent automated fact-checking papers and uncover three notable trends: a rise in vague or underspecified research goals, increased emphasis on scientific exploration over application, and a shift toward supporting human fact-checkers rather than pursuing full automation.