Goto

Collaborating Authors

 Personal


Benchmarking Chinese Commonsense Reasoning with a Multi-hop Reasoning Perspective

arXiv.org Artificial Intelligence

While Large Language Models (LLMs) have demonstrated advanced reasoning capabilities, their comprehensive evaluation in general Chinese-language contexts remains understudied. To bridge this gap, we propose Chinese Commonsense Multi-hop Reasoning (CCMOR), a novel benchmark designed to evaluate LLMs' ability to integrate Chinese-specific factual knowledge with multi-step logical reasoning. Specifically, we first construct a domain-balanced seed set from existing QA datasets, then develop an LLM-powered pipeline to generate multi-hop questions anchored on factual unit chains. To ensure the quality of resulting dataset, we implement a human-in-the-loop verification system, where domain experts systematically validate and refine the generated questions. Using CCMOR, we evaluate state-of-the-art LLMs, demonstrating persistent limitations in LLMs' ability to process long-tail knowledge and execute knowledge-intensive reasoning. Notably, retrieval-augmented generation substantially mitigates these knowledge gaps, yielding significant performance gains.


Text2Stories: Evaluating the Alignment Between Stakeholder Interviews and Generated User Stories

arXiv.org Artificial Intelligence

Large language models (LLMs) can be employed for automating the generation of software requirements from natural language inputs such as the transcripts of elicitation interviews. However, evaluating whether those derived requirements faithfully reflect the stakeholders' needs remains a largely manual task. We introduce Text2Stories, a task and metrics for text-to-story alignment that allow quantifying the extent to which requirements (in the form of user stories) match the actual needs expressed by the elicitation session participants. Given an interview transcript and a set of user stories, our metric quantifies (i) correctness: the proportion of stories supported by the transcript, and (ii) completeness: the proportion of transcript supported by at least one story. We segment the transcript into text chunks and instantiate the alignment as a matching problem between chunks and stories. Experiments over four datasets show that an LLM-based matcher achieves 0.86 macro-F1 on held-out annotations, while embedding models alone remain behind but enable effective blocking. Finally, we show how our metrics enable the comparison across sets of stories (e.g., human vs. generated), positioning Text2Stories as a scalable, source-faithful complement to existing user-story quality criteria.


Move over, Alan Turing: meet the working-class hero of Bletchley Park you didn't see in the movies

The Guardian

Tommy Flowers: nothing like the machine he proposed had ever been contemplated. Tommy Flowers: nothing like the machine he proposed had ever been contemplated. Move over, Alan Turing: meet the working-class hero of Bletchley Park you didn't see in the movies The Oxbridge-educated boffin is feted as the codebreaking genius who helped Britain win the war. But should a little-known Post Office engineer named Tommy Flowers be seen as the real father of computing? T his is a story you know, right? It's early in the war and western Europe has fallen. Only the Channel stands between Britain and the fascist yoke; only Atlantic shipping lanes offer hope of the population continuing to be fed, clothed and armed. But hunting "wolf packs" of Nazi U-boats pick off merchant shipping at will, coordinated by radio instructions the Brits can intercept but can't read, thanks to the fiendish Enigma encryption machine.


Aftermath of RSF drone attack which killed dozens in Sudan's el-Fasher

Al Jazeera

Aftermath of RSF drone attack which killed dozens in Sudan's el-Fasher NewsFeed Aftermath of RSF drone attack which killed dozens in Sudan's el-Fasher Video shows the aftermath of drone and artillery strikes on a shelter in the besieged city of el-Fasher in Sudan's North Darfur state, which killed at least 60 people. The attack was carried out by the paramilitary Rapid Support Forces (RSF), according to a Sudanese medical advocacy group. Al Jazeera reporters follow Palestinians' return to northern Gaza Who is Nobel Peace Prize winner Maria Corina Machado?


Osaka Expo androids to be moved to Kyoto

The Japan Times

Android robots shown at the Osaka Expo in a pavilion produced by University of Osaka professor Hiroshi Ishiguro will be relocated to Kyoto Prefecture. OSAKA - Seven android robots shown at the 2025 World Exposition in Osaka in a pavilion produced by University of Osaka professor Hiroshi Ishiguro will be relocated to Kyoto Prefecture after the end of the event on Monday. In addition, the Dutch pavilion will be moved to Awaji Island, Hyogo Prefecture. People involved in the use of expo assets after the event hope that they will be loved as tourist attractions in their new places. The prefectural government of Kyoto was chosen as the new owner of the androids in an open tender held by the expo organizer, the Japan Association for the 2025 World Exposition, in September. The robots will be shown to the public at a research facility in the Keihanna Science City research district straddling the Kyoto municipalities of Seika and Kizugawa.


Japan group to launch AI service for saury size predictions

The Japan Times

Saury catches from August to the end of September this year totaled about 28,500 tons -- a 2.4-fold increase from the same period last year. The Japan Fisheries Information Service Center will start a service next fishing season that shows expected fishing grounds for saury by size class based on analysis using artificial intelligence technology. The Tokyo-based group of fisheries organizations provides information on fishing and ocean conditions. Since 2020, the group provides its predictions of likely saury fishing spots using AI, based on seawater temperature changes and past fishing records. The accuracy of the predictions has improved year after year.


How can Europe protect its skies against 'escalating' drone menace?

The Japan Times

How can Europe protect its skies against'escalating' drone menace? A drone detection and defense system is parked in Kottingbrunn, Austria, on Oct. 3 | REUTERS Paris - Drones flying over airports, commercial sites and other sensitive infrastructure in Europe is a growing phenomenon which EU leaders blame on Russia, and preventing the disruption they cause will prove a tough technical challenge, observers say. Detecting the drones, making them non-operational by jamming them, or even shooting them down, are all complex and hazardous tasks. And while Russian involvement is suspected, it is difficult to prove. Concerns are growing that such disruptions could be part of Russian hybrid war tactics three-and-a-half years into its invasion of Ukraine, as most European countries double down on their support for Kyiv including by delivering military hardware.


Nasa unveils plan for astronauts to live on the moon - inside glass bubbles made from lunar dust

Daily Mail - Science & tech

'Four dead and 12 injured' in Mississippi shooting after people descend on town for homecoming game Joe Biden, 82, receiving new treatment after'aggressive' cancer spread to his bones REVEALED: The secret George Soros network'behind America's street chaos'... and the dossier that shows how to stop it Tinnitus destroyed Peter's life but doctors dismissed him. Then he tried an extraordinary drug-free University of Cambridge-backed treatment that gives instant relief - no wonder medics say it's so'exciting' KENNEDY: Obama's bitter post about Trump's Gaza peace deal proves what I've long suspected about Barry... and it would make Sigmund Freud blush Gold is soaring... here's what the pros say you should do with your 401(k) before it's too late Model dubbed'the world's most beautiful girl' when she was six is now all grown up and looks VERY different as she poses up a storm at Paris Fashion Week Teacher was'so high on cocaine she thought one of her students was her dog' But now, a royal insider claims they're'just as entitled as their parents' with'shady friends' Heartbreaking moment NFL reporter makes brutal comment about player Xavier Legette's dead father in locker room interview Experts reveal the surprising TRUTH behind RFK Jr's link between circumcision and autism Bombshell records that damn Letitia James and show Trump was RIGHT... and the staggering sum she was swindling Trump starts DOGE 2.0 as mass layoffs take place across federal government amid shutdown Famed'Big Short' investor gives terrifying verdict on Trump hammering China with 100 PERCENT tariff... and issues doomsday warning to Wall Street Jennifer Aniston, you've betrayed every woman with your selfish admission about not having children: CAROLINE BULLOCK Nasa has unveiled plans to send astronauts to live on the moon - inside glass bubbles made from lunar dust. The American space agency is funding research into the large livable spheres which would be created in situ, the Telegraph reports. Tiny pieces of so-called lunar glass - a component of the moon's soil, or regolith, along with rocks and mineral fragments - would be collected upon arrival from Earth. The material would be melted down using the same technology as in a domestic microwave oven, along with a'smart microwave furnace'.


Meteorologist's stark warning to Americans to brace for a harsh winter with less snow but more nor'easters

Daily Mail - Science & tech

'Four dead and 12 injured' in Mississippi shooting after people descend on town for homecoming game Joe Biden, 82, receiving new treatment after'aggressive' cancer spread to his bones REVEALED: The secret George Soros network'behind America's street chaos'... and the dossier that shows how to stop it Tinnitus destroyed Peter's life but doctors dismissed him. Then he tried an extraordinary drug-free University of Cambridge-backed treatment that gives instant relief - no wonder medics say it's so'exciting' KENNEDY: Obama's bitter post about Trump's Gaza peace deal proves what I've long suspected about Barry... and it would make Sigmund Freud blush Gold is soaring... here's what the pros say you should do with your 401(k) before it's too late Model dubbed'the world's most beautiful girl' when she was six is now all grown up and looks VERY different as she poses up a storm at Paris Fashion Week Teacher was'so high on cocaine she thought one of her students was her dog' But now, a royal insider claims they're'just as entitled as their parents' with'shady friends' Heartbreaking moment NFL reporter makes brutal comment about player Xavier Legette's dead father in locker room interview Experts reveal the surprising TRUTH behind RFK Jr's link between circumcision and autism Bombshell records that damn Letitia James and show Trump was RIGHT... and the staggering sum she was swindling Trump starts DOGE 2.0 as mass layoffs take place across federal government amid shutdown Famed'Big Short' investor gives terrifying verdict on Trump hammering China with 100 PERCENT tariff... and issues doomsday warning to Wall Street Jennifer Aniston, you've betrayed every woman with your selfish admission about not having children: CAROLINE BULLOCK Meteorologist's stark warning to Americans to brace for a harsh winter with less snow but more nor'easters Meteorologists are already predicting what the winter months will bring, with some regions of the US expected to see less snow than last year, and nor'easters anticipated to ravage parts of the Northeast. Paul Pastelok, chief meteorologist for AccuWeather's long-range forecasting team, told the Daily Mail that while he didn't expect above normal snowfall for the winter season, he warned that those in the Northeast should brace for nor'easters and it would still be a harsh winter. Pastelok explained that the nor'easter over this weekend is on trend with what is to come, as rapidly developing storms come in off the East Coast. 'People may say, Well, you're forecasting less snow, so it doesn't look like a harsh winter.


I discovered secret tunnels below Egypt's Giza pyramids... and they may lead to a forgotten underworld

Daily Mail - Science & tech

'Four dead and 12 injured' in Mississippi shooting after people descend on town for homecoming game Joe Biden, 82, receiving new treatment after'aggressive' cancer spread to his bones REVEALED: The secret George Soros network'behind America's street chaos'... and the dossier that shows how to stop it Tinnitus destroyed Peter's life but doctors dismissed him. Then he tried an extraordinary drug-free University of Cambridge-backed treatment that gives instant relief - no wonder medics say it's so'exciting' KENNEDY: Obama's bitter post about Trump's Gaza peace deal proves what I've long suspected about Barry... and it would make Sigmund Freud blush Gold is soaring... here's what the pros say you should do with your 401(k) before it's too late Model dubbed'the world's most beautiful girl' when she was six is now all grown up and looks VERY different as she poses up a storm at Paris Fashion Week Teacher was'so high on cocaine she thought one of her students was her dog' But now, a royal insider claims they're'just as entitled as their parents' with'shady friends' Heartbreaking moment NFL reporter makes brutal comment about player Xavier Legette's dead father in locker room interview Experts reveal the surprising TRUTH behind RFK Jr's link between circumcision and autism Bombshell records that damn Letitia James and show Trump was RIGHT... and the staggering sum she was swindling Trump starts DOGE 2.0 as mass layoffs take place across federal government amid shutdown Famed'Big Short' investor gives terrifying verdict on Trump hammering China with 100 PERCENT tariff... and issues doomsday warning to Wall Street Jennifer Aniston, you've betrayed every woman with your selfish admission about not having children: CAROLINE BULLOCK I discovered secret tunnels below Egypt's Giza pyramids... and they may lead to a forgotten underworld On the northeastern edge of the Giza Plateau, I discovered three perfectly cut shafts hidden beneath the sands. They sit in the triangle between the Great Sphinx, Khufu's Pyramid and Khafre's Pyramid, and may open into a long-forgotten underground world. These are not water wells. They bear no inscriptions, no signs of casual digging, and their geometry is too precise, their walls too smooth, their design too deliberate.