spice
- North America > United States > New York > New York County > New York City (0.04)
- Europe > Croatia > Dubrovnik-Neretva County > Dubrovnik (0.04)
- Asia > India > Maharashtra (0.04)
- (11 more...)
- Questionnaire & Opinion Survey (0.68)
- Overview (0.46)
Rise of the digital threesome: British couples are turning to AI to spice up their sex lives, study reveals
Ghislaine Maxwell's ultimate humiliation: Epstein's sex trafficker girlfriend poses in outrageous outfits and exposes herself in dozens of photos released from the billionaire paedophile's files I was falsely accused of being the Brown University shooter... Silent Trump flees growing storm over Epstein'cover-up' as he jets off for holidays without ANY comment I've spotted something else in this photo of Karoline Leavitt's'injection sites'... every woman you know will say the same thing about it: KENNEDY Why Conan O'Brien'stopped party guests calling 911' on Nick Reiner: Insiders reveal disturbing new details of final hours before Rob and Michele murders After 27 years as a TV anchor I was suddenly pulled off screens. My boss's explanation was a brutal lesson in loyalty Emily in Paris cast left'aghast' and'walking on eggshells' as off-camera drama becomes overwhelming... and whispers swirl about a CURSE Doctors said my hip pain was just tendinitis from sitting all day at work. The real cause may kill me... they had left it far too late Chilling mystery of'toddler's foot' in Pedophile Island snap as the worst of THOUSANDS of new Epstein file photos exposed I was dead for 105 minutes and learned exactly how you get into heaven... then Jesus spoke six words into my mind and sent me back Jake Paul's jaw is broken in Anthony Joshua battering: YouTuber-turned-boxer rushes to hospital Kennedy niece vows to attack Trump's name with a PICKAX amid awkward gaffe in center's new signage Andrew's fury at anyone who doesn't bow and scrape. The expletive-ridden bust-up... and final ignominy revealed The devastating story of'feral child' Genie Wiley whose father tied her up and locked her in a room until the age of 13 - and the scientific tug of war which broke out upon her discovery Trump launches massive airstrikes in Syria as'vengeance' for killing three Americans Reiner family bombshell as insiders reveal who is paying for Nick's celebrity lawyer... their secret motive... and who will REALLY inherit $200m fortune'Donald Trump has given white South Africans hope': Refugee fleeing the country for new life in US details horrific torture being inflicted in farm attacks as president is hailed for'fighting evil' Terrifying maps break down exactly who is at risk of new'super flu' exploding across America... as doctors reveal symptoms to really worry about Brave Epstein victims speak out over'lack of transparency' in Epstein files drop READ MORE: Doctor reveals sex trends including'freak matching' Forget couples counselling or a saucy mini-break - when it comes to spicing up their sex lives, British couples are turning to AI . A new report by Lovehoney has detailed the rise of the'digital threesome' in the UK.
- Asia > Middle East > Syria (0.24)
- North America > Canada > Alberta (0.14)
- North America > United States > California > Los Angeles County > Beverly Hills (0.04)
- (21 more...)
- Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)
- Health & Medicine > Therapeutic Area > Oncology (0.93)
- Leisure & Entertainment > Sports > Football (0.68)
- (3 more...)
- Information Technology > Communications > Social Media (1.00)
- Information Technology > Artificial Intelligence (1.00)
- Information Technology > Communications > Mobile (0.94)
SPICE: Self-Play In Corpus Environments Improves Reasoning
Liu, Bo, Jin, Chuanyang, Kim, Seungone, Yuan, Weizhe, Zhao, Wenting, Kulikov, Ilia, Li, Xian, Sukhbaatar, Sainbayar, Lanchantin, Jack, Weston, Jason
Self-improving systems require environmental interaction for continuous adaptation. We introduce SPICE (Self-Play In Corpus Environments), a reinforcement learning framework where a single model acts in two roles: a Challenger that mines documents from a large corpus to generate diverse reasoning tasks, and a Reasoner that solves them. Through adversarial dynamics, the Challenger creates an automatic curriculum at the frontier of the Reasoner's capability, while corpus grounding provides the rich, near-inexhaustible external signal necessary for sustained improvement. Unlike existing ungrounded self-play methods that offer more limited benefits, SPICE achieves consistent gains across mathematical (+8.9%) and general reasoning (+9.8%) benchmarks on multiple model families. Our analysis reveals how document grounding is a key ingredient in SPICE to continuously generate its own increasingly challenging goals and achieve them, enabling sustained self-improvement.
- North America > United States > Virginia (0.04)
- Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
- Asia > Singapore (0.04)
- (2 more...)
DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement
Lin, Shaoqing, Teng, Chong, Li, Fei, Ji, Donghong, Qu, Lizhen, Li, Zhuang
Vision-Language Models (VLMs) generate discourse-level, multi-sentence visual descriptions, challenging text scene graph parsers built for single-sentence caption-to-graph mapping. Current approaches typically merge sentence-level parsing outputs for discourse input, often missing phenomena like cross-sentence coreference, resulting in fragmented graphs and degraded downstream VLM task performance. We introduce a new task, Discourse-level text Scene Graph parsing (DiscoSG), and release DiscoSG-DS, a dataset of 400 expert-annotated and 8,430 synthesised multi-sentence caption-graph pairs. Each caption averages 9 sentences, and each graph contains at least 3 times more triples than those in existing datasets. Fine-tuning GPT-4o on DiscoSG-DS yields over 40% higher SPICE metric than the best sentence-merging baseline. However, its high inference cost and licensing restrict open-source use. Smaller fine-tuned open-source models (e.g., Flan-T5) perform well on simpler graphs yet degrade on denser, more complex graphs. To bridge this gap, we introduce DiscoSG-Refiner, a lightweight open-source parser that drafts a seed graph and iteratively refines it with a novel learned graph-editing model, achieving 30% higher SPICE than the baseline while delivering 86 times faster inference than GPT-4o. It generalises from simple to dense graphs, thereby consistently improving downstream VLM tasks, including discourse-level caption evaluation and hallucination detection, outperforming alternative open-source parsers. Code and data are available at https://github.com/ShaoqLin/DiscoSG .
- Europe > Austria > Vienna (0.14)
- Asia > China > Hubei Province > Wuhan (0.04)
- North America > United States > Washington > King County > Seattle (0.04)
- (2 more...)
- Workflow (0.93)
- Research Report (0.82)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.91)
SPICE: A Synergistic, Precise, Iterative, and Customizable Image Editing Workflow
Tang, Kenan, Li, Yanhong, Qin, Yao
Prompt-based models have demonstrated impressive prompt-following capability at image editing tasks. However, the models still struggle with following detailed editing prompts or performing local edits. Specifically, global image quality often deteriorates immediately after a single editing step. To address these challenges, we introduce SPICE, a training-free workflow that accepts arbitrary resolutions and aspect ratios, accurately follows user requirements, and consistently improves image quality during more than 100 editing steps, while keeping the unedited regions intact. By synergizing the strengths of a base diffusion model and a Canny edge ControlNet model, SPICE robustly handles free-form editing instructions from the user. On a challenging realistic image-editing dataset, SPICE quantitatively outperforms state-of-the-art baselines and is consistently preferred by human annotators. We release the workflow implementation for popular diffusion model Web UIs to support further research and artistic exploration.
- North America > United States > Florida > Miami-Dade County > Miami (0.04)
- North America > United States > California > Santa Barbara County > Santa Barbara (0.04)
- Europe > Switzerland (0.04)
- Asia > Middle East > Jordan (0.04)
- Workflow (1.00)
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.93)
- Information Technology > Sensing and Signal Processing > Image Processing (1.00)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.95)
- Europe > Croatia > Dubrovnik-Neretva County > Dubrovnik (0.04)
- Asia > India > Maharashtra (0.04)
- Asia > India > Haryana (0.04)
- (10 more...)
- Questionnaire & Opinion Survey (0.68)
- Overview (0.46)
- Public Relations > Community Relations (0.41)
Stated Preference for Interaction and Continued Engagement (SPICE): Evaluating an LLM's Willingness to Re-engage in Conversation
Rost, Thomas Manuel, Figlia, Martina, Wallraff, Bernd
We introduce and evaluate Stated Preference for Interaction and Continued Engagement (SPICE), a simple diagnostic signal elicited by asking a Large Language Model a YES or NO question about its willingness to re-engage with a user's behavior after reviewing a short transcript. In a study using a 3-tone (friendly, unclear, abusive) by 10-interaction stimulus set, we tested four open-weight chat models across four framing conditions, resulting in 480 trials. Our findings show that SPICE sharply discriminates by user tone. Friendly interactions yielded a near-unanimous preference to continue (97.5% YES), while abusive interactions yielded a strong preference to discontinue (17.9% YES), with unclear interactions falling in between (60.4% YES). This core association remains decisive under multiple dependence-aware statistical tests, including Rao-Scott adjustment and cluster permutation tests. Furthermore, we demonstrate that SPICE provides a distinct signal from abuse classification. In trials where a model failed to identify abuse, it still overwhelmingly stated a preference not to continue the interaction (81% of the time). An exploratory analysis also reveals a significant interaction effect: a preamble describing the study context significantly impacts SPICE under ambiguity, but only when transcripts are presented as a single block of text rather than a multi-turn chat. The results validate SPICE as a robust, low-overhead, and reproducible tool for auditing model dispositions, complementing existing metrics by offering a direct, relational signal of a model's state. All stimuli, code, and analysis scripts are released to support replication.
SPICE: An Automated SWE-Bench Labeling Pipeline for Issue Clarity, Test Coverage, and Effort Estimation
Oliva, Gustavo A., Rajbahadur, Gopi Krishnan, Bhatia, Aaditya, Zhang, Haoxiang, Chen, Yihao, Chen, Zhilong, Leung, Arthur, Lin, Dayi, Chen, Boyuan, Hassan, Ahmed E.
High-quality labeled datasets are crucial for training and evaluating foundation models in software engineering, but creating them is often prohibitively expensive and labor-intensive. We introduce SPICE, a scalable, automated pipeline for labeling SWE-bench-style datasets with annotations for issue clarity, test coverage, and effort estimation. SPICE combines context-aware code navigation, rationale-driven prompting, and multi-pass consensus to produce labels that closely approximate expert annotations. SPICE's design was informed by our own experience and frustration in labeling more than 800 instances from SWE-Gym. SPICE achieves strong agreement with human-labeled SWE-bench Verified data while reducing the cost of labeling 1,000 instances from around \$100,000 (manual annotation) to just \$5.10. These results demonstrate SPICE's potential to enable cost-effective, large-scale dataset creation for SE-focused FMs. To support the community, we release both SPICE tool and SPICE Bench, a new dataset of 6,802 SPICE-labeled instances curated from 291 open-source projects in SWE-Gym (over 13x larger than SWE-bench Verified).
SPICE-HL3: Single-Photon, Inertial, and Stereo Camera dataset for Exploration of High-Latitude Lunar Landscapes
Rodríguez-Martínez, David, van der Meer, Dave, Song, Junlin, Bera, Abishek, Pérez-del-Pulgar, C. J., Olivares-Mendez, Miguel Angel
Exploring high-latitude lunar regions presents an extremely challenging visual environment for robots. The low sunlight elevation angle and minimal light scattering result in a visual field dominated by a high dynamic range featuring long, dynamic shadows. Reproducing these conditions on Earth requires sophisticated simulators and specialized facilities. We introduce a unique dataset recorded at the LunaLab from the SnT - University of Luxembourg, an indoor test facility designed to replicate the optical characteristics of multiple lunar latitudes. Our dataset includes images, inertial measurements, and wheel odometry data from robots navigating seven distinct trajectories under multiple illumination scenarios, simulating high-latitude lunar conditions from dawn to night time with and without the aid of headlights, resulting in 88 distinct sequences containing a total of 1.3M images. Data was captured using a stereo RGB-inertial sensor, a monocular monochrome camera, and for the first time, a novel single-photon avalanche diode (SPAD) camera. We recorded both static and dynamic image sequences, with robots navigating at slow (5 cm/s) and fast (50 cm/s) speeds. All data is calibrated, synchronized, and timestamped, providing a valuable resource for validating perception tasks from vision-based autonomous navigation to scientific imaging for future lunar missions targeting high-latitude regions or those intended for robots operating across perceptually degraded environments. The dataset can be downloaded from https://zenodo.org/records/13970078?preview=1, and a visual overview is available at https://youtu.be/d7sPeO50_2I. All supplementary material can be found at https://github.com/spaceuma/spice-hl3.
- North America > United States (0.14)
- Europe > Switzerland (0.04)
- Europe > Spain > Andalusia > Málaga Province > Málaga (0.04)
- Asia > India (0.04)
- Government > Space Agency (0.48)
- Energy (0.46)
- Information Technology (0.46)
Looking for something new to spice up your game play? The Tinder of games is here
As any adult who loves video games knows, there are simply too many of them – 19,000 games were released in 2024 on PC games storefront Steam alone, not counting all the playable delights on consoles and smartphones. Most of us have backlogs of unplayed classics that make us feel guilty about buying newer games. Finding things that are actually good, meanwhile, can feel totally impossible. At least 50% of the questions people send in for this newsletter are a variant of "Help, what should I play?" We do our best to help, but even though it's my job to know about games, I still don't have infinite time to play them.