Generative AI
MIRAGE: Towards AI-Generated Image Detection in the Wild
Xia, Cheng, Lin, Manxi, Tan, Jiexiang, Du, Xiaoxiong, Qiu, Yang, Zheng, Junjun, Kong, Xiangheng, Jiang, Yuning, Zheng, Bo
The spreading of AI-generated images (AIGI), driven by advances in generative AI, poses a significant threat to information security and public trust. Existing AIGI detectors, while effective against images in clean laboratory settings, fail to generalize to in-the-wild scenarios. These real-world images are noisy, varying from ``obviously fake" images to realistic ones derived from multiple generative models and further edited for quality control. We address in-the-wild AIGI detection in this paper. We introduce Mirage, a challenging benchmark designed to emulate the complexity of in-the-wild AIGI. Mirage is constructed from two sources: (1) a large corpus of Internet-sourced AIGI verified by human experts, and (2) a synthesized dataset created through the collaboration between multiple expert generators, closely simulating the realistic AIGI in the wild. Building on this benchmark, we propose Mirage-R1, a vision-language model with heuristic-to-analytic reasoning, a reflective reasoning mechanism for AIGI detection. Mirage-R1 is trained in two stages: a supervised-fine-tuning cold start, followed by a reinforcement learning stage. By further adopting an inference-time adaptive thinking strategy, Mirage-R1 is able to provide either a quick judgment or a more robust and accurate conclusion, effectively balancing inference speed and performance. Extensive experiments show that our model leads state-of-the-art detectors by 5% and 10% on Mirage and the public benchmark, respectively. The benchmark and code will be made publicly available.
A Survey of LLM-based Deep Search Agents: Paradigm, Optimization, Evaluation, and Challenges
Xi, Yunjia, Lin, Jianghao, Xiao, Yongzhao, Zhou, Zheli, Shan, Rong, Gao, Te, Zhu, Jiachen, Liu, Weiwen, Yu, Yong, Zhang, Weinan
The advent of Large Language Models (LLMs) has significantly revolutionized web search. The emergence of LLM-based Search Agents marks a pivotal shift towards deeper, dynamic, autonomous information seeking. These agents can comprehend user intentions and environmental context and execute multi-turn retrieval with dynamic planning, extending search capabilities far beyond the web. Leading examples like OpenAI's Deep Research highlight their potential for deep information mining and real-world applications. This survey provides the first systematic analysis of search agents. We comprehensively analyze and categorize existing works from the perspectives of architecture, optimization, application, and evaluation, ultimately identifying critical open challenges and outlining promising future research directions in this rapidly evolving field. Our repository is available on https://github.com/YunjiaXi/Awesome-Search-Agent-Papers.
OpenAI makes GPT-5 'friendlier' after widespread user backlash
About two weeks ago, OpenAI released GPT-5. The newest AI model in the GPT line, GPT-5 was put forth as the company's "smartest, fastest, most useful model yet" with "built-in thinking" and "expert-level intelligence." But the release backfired for one important reason. Part of the changes in GPT-5 involved addressing the sycophantic positivity found in previous models, where the AI chatbot would incessantly praise the user to an undo degree and emphatically agree to make the user feel better. Lots of users disliked this, so GPT-5 was made to be "less effusively agreeable" and "use fewer unnecessary emojis."
OpenAI Is Poised To Become The Most Valuable Startup Ever. Should It Be?
OpenAI is reportedly on the verge of a roughly 500 billion valuation, a figure that would make it the most valuable private company in the world--bigger than SpaceX, TikTok's parent company Bytedance, and even public giants like Palantir. It's a staggering number for a company with an "astronomical burn rate." How is this even possible? As Axios reports, there are actually two deals in play: a SoftBank-led round valuing the company at 300 billion, which won't close until year's end, and a secondary sale of employee shares at a far steeper 500 billion valuation. Most of the cheaper shares have already been snapped up, leaving investors to fight over the pricier ones.
The Download: clean energy progress, and OpenAI's trilemma
"We were very much impressed. At the same time, we were afraid." Inside the quest to map the universe with mysterious bursts of radio energy When our universe was less than half as old as it is today, a burst of energy that could cook a sun's worth of popcorn shot out from somewhere amid a compact group of galaxies. Some 8 billion years later, radio waves from that burst reached Earth and were captured by a sophisticated low-frequency radio telescope in the Australian outback. The signal, which arrived in June 2022, and lasted for under half a millisecond, is one of a growing class of mysterious radio signals called fast radio bursts. In the last 10 years, astronomers have picked up nearly 5,000 of them.
Standardization of Neuromuscular Reflex Analysis -- Role of Fine-Tuned Vision-Language Model Consortium and OpenAI gpt-oss Reasoning LLM Enabled Decision Support System
Bandara, Eranga, Gore, Ross, Shetty, Sachin, Mukkamala, Ravi, Rhea, Christopher, Yarlagadda, Atmaram, Kaushik, Shaifali, De Silva, L. H. M. P., Maznychenko, Andriy, Sokolowska, Inna, Hass, Amin, De Zoysa, Kasun
Accurate assessment of neuromuscular reflexes, such as the H-reflex, plays a critical role in sports science, rehabilitation, and clinical neurology. Traditional analysis of H-reflex EMG waveforms is subject to variability and interpretation bias among clinicians and researchers, limiting reliability and standardization. To address these challenges, we propose a Fine-Tuned Vision-Language Model (VLM) Consortium and a reasoning Large-Language Model (LLM)-enabled Decision Support System for automated H-reflex waveform interpretation and diagnosis. Our approach leverages multiple VLMs, each fine-tuned on curated datasets of H-reflex EMG waveform images annotated with clinical observations, recovery timelines, and athlete metadata. These models are capable of extracting key electrophysiological features and predicting neuromuscular states, including fatigue, injury, and recovery, directly from EMG images and contextual metadata. Diagnostic outputs from the VLM consortium are aggregated using a consensus-based method and refined by a specialized reasoning LLM, which ensures robust, transparent, and explainable decision support for clinicians and sports scientists. The end-to-end platform orchestrates seamless communication between the VLM ensemble and the reasoning LLM, integrating prompt engineering strategies and automated reasoning workflows using LLM Agents. Experimental results demonstrate that this hybrid system delivers highly accurate, consistent, and interpretable H-reflex assessments, significantly advancing the automation and standardization of neuromuscular diagnostics. To our knowledge, this work represents the first integration of a fine-tuned VLM consortium with a reasoning LLM for image-based H-reflex analysis, laying the foundation for next-generation AI-assisted neuromuscular assessment and athlete monitoring platforms.