AITopics | Generative AI

Collaborating Authors

Generative AI

News Overviews Instructional Materials AI-Alerts Classics

MIRAGE: Towards AI-Generated Image Detection in the Wild

Xia, Cheng, Lin, Manxi, Tan, Jiexiang, Du, Xiaoxiong, Qiu, Yang, Zheng, Junjun, Kong, Xiangheng, Jiang, Yuning, Zheng, Bo

arXiv.org Artificial IntelligenceAug-20-2025

The spreading of AI-generated images (AIGI), driven by advances in generative AI, poses a significant threat to information security and public trust. Existing AIGI detectors, while effective against images in clean laboratory settings, fail to generalize to in-the-wild scenarios. These real-world images are noisy, varying from ``obviously fake" images to realistic ones derived from multiple generative models and further edited for quality control. We address in-the-wild AIGI detection in this paper. We introduce Mirage, a challenging benchmark designed to emulate the complexity of in-the-wild AIGI. Mirage is constructed from two sources: (1) a large corpus of Internet-sourced AIGI verified by human experts, and (2) a synthesized dataset created through the collaboration between multiple expert generators, closely simulating the realistic AIGI in the wild. Building on this benchmark, we propose Mirage-R1, a vision-language model with heuristic-to-analytic reasoning, a reflective reasoning mechanism for AIGI detection. Mirage-R1 is trained in two stages: a supervised-fine-tuning cold start, followed by a reinforcement learning stage. By further adopting an inference-time adaptive thinking strategy, Mirage-R1 is able to provide either a quick judgment or a more robust and accurate conclusion, effectively balancing inference speed and performance. Extensive experiments show that our model leads state-of-the-art detectors by 5% and 10% on Mirage and the public benchmark, respectively. The benchmark and code will be made publicly available.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2508.13223

Genre: Research Report > New Finding (0.92)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

Add feedback

A Survey of LLM-based Deep Search Agents: Paradigm, Optimization, Evaluation, and Challenges

Xi, Yunjia, Lin, Jianghao, Xiao, Yongzhao, Zhou, Zheli, Shan, Rong, Gao, Te, Zhu, Jiachen, Liu, Weiwen, Yu, Yong, Zhang, Weinan

arXiv.org Artificial IntelligenceAug-20-2025

The advent of Large Language Models (LLMs) has significantly revolutionized web search. The emergence of LLM-based Search Agents marks a pivotal shift towards deeper, dynamic, autonomous information seeking. These agents can comprehend user intentions and environmental context and execute multi-turn retrieval with dynamic planning, extending search capabilities far beyond the web. Leading examples like OpenAI's Deep Research highlight their potential for deep information mining and real-world applications. This survey provides the first systematic analysis of search agents. We comprehensively analyze and categorize existing works from the perspectives of architecture, optimization, application, and evaluation, ultimately identifying critical open challenges and outlining promising future research directions in this rapidly evolving field. Our repository is available on https://github.com/YunjiaXi/Awesome-Search-Agent-Papers.

arxiv preprint arxiv, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2508.05668

Country: Asia (0.46)

Genre: Overview (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Add feedback

ec795aeadae0b7d230fa35cbaf04c041-Supplemental-Conference.pdf

Neural Information Processing SystemsAug-19-2025, 16:58:50 GMT

artificial intelligence, dall-e 2, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > New York (0.04)
North America > Canada > Ontario > Toronto (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.36)

Add feedback

ec795aeadae0b7d230fa35cbaf04c041-Paper-Conference.pdf

Neural Information Processing SystemsAug-19-2025, 16:58:46 GMT

diffusion model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Maryland (0.04)
(3 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Leisure & Entertainment > Sports (0.46)
Law (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Add feedback

OpenAI makes GPT-5 'friendlier' after widespread user backlash

PCWorldAug-19-2025, 16:38:16 GMT

About two weeks ago, OpenAI released GPT-5. The newest AI model in the GPT line, GPT-5 was put forth as the company's "smartest, fastest, most useful model yet" with "built-in thinking" and "expert-level intelligence." But the release backfired for one important reason. Part of the changes in GPT-5 involved addressing the sycophantic positivity found in previous models, where the AI chatbot would incessantly praise the user to an undo degree and emphatically agree to make the user feel better. Lots of users disliked this, so GPT-5 was made to be "less effusively agreeable" and "use fewer unnecessary emojis."

gpt-5, openai make gpt-5, widespread user backlash

PCWorld

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.76)

Add feedback

OpenAI Is Poised To Become The Most Valuable Startup Ever. Should It Be?

WIREDAug-19-2025, 16:00:00 GMT

OpenAI is reportedly on the verge of a roughly 500 billion valuation, a figure that would make it the most valuable private company in the world--bigger than SpaceX, TikTok's parent company Bytedance, and even public giants like Palantir. It's a staggering number for a company with an "astronomical burn rate." How is this even possible? As Axios reports, there are actually two deals in play: a SoftBank-led round valuing the company at 300 billion, which won't close until year's end, and a secondary sale of employee shares at a far steeper 500 billion valuation. Most of the cheaper shares have already been snapped up, leaving investors to fight over the pricier ones.

investor, openai, valuation, (5 more...)

WIRED

Country: North America > United States > New York (0.06)

Industry: Information Technology (0.57)

Technology:

Information Technology > Communications > Social Media (0.99)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.88)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.73)

Add feedback

Deep Generative Model for Periodic Graphs

Neural Information Processing SystemsAug-19-2025, 15:37:51 GMT

Their generative modeling has great potential in real-world applications such as material design and graphics synthesis. Classical models either rely on domain-specific predefined generation principles (e.g., in crystal net design),

artificial intelligence, graph, machine learning, (19 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Overview (0.67)

Industry:

Information Technology (0.46)
Health & Medicine > Diagnostic Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.42)

Add feedback

The Download: clean energy progress, and OpenAI's trilemma

MIT Technology ReviewAug-19-2025, 12:10:00 GMT

"We were very much impressed. At the same time, we were afraid." Inside the quest to map the universe with mysterious bursts of radio energy When our universe was less than half as old as it is today, a burst of energy that could cook a sun's worth of popcorn shot out from somewhere amid a compact group of galaxies. Some 8 billion years later, radio waves from that burst reached Earth and were captured by a sophisticated low-frequency radio telescope in the Australian outback. The signal, which arrived in June 2022, and lasted for under half a millisecond, is one of a growing class of mysterious radio signals called fast radio bursts. In the last 10 years, astronomers have picked up nearly 5,000 of them.

clean energy progress, download, trilemma, (4 more...)

MIT Technology Review

Country: North America > United States > Maryland (0.07)

Industry:

Leisure & Entertainment (0.46)
Media (0.43)
Energy > Renewable (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.40)

Add feedback

43ab1646052dab79731f5d70bf40f6dc-Paper-Conference.pdf

Neural Information Processing SystemsAug-19-2025, 07:19:08 GMT

dimension, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.14)
Europe > Germany (0.14)

Genre: Research Report > Experimental Study (0.93)

Industry: Energy > Oil & Gas > Upstream (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.44)

Add feedback

Standardization of Neuromuscular Reflex Analysis -- Role of Fine-Tuned Vision-Language Model Consortium and OpenAI gpt-oss Reasoning LLM Enabled Decision Support System

Bandara, Eranga, Gore, Ross, Shetty, Sachin, Mukkamala, Ravi, Rhea, Christopher, Yarlagadda, Atmaram, Kaushik, Shaifali, De Silva, L. H. M. P., Maznychenko, Andriy, Sokolowska, Inna, Hass, Amin, De Zoysa, Kasun

arXiv.org Artificial IntelligenceAug-19-2025

Accurate assessment of neuromuscular reflexes, such as the H-reflex, plays a critical role in sports science, rehabilitation, and clinical neurology. Traditional analysis of H-reflex EMG waveforms is subject to variability and interpretation bias among clinicians and researchers, limiting reliability and standardization. To address these challenges, we propose a Fine-Tuned Vision-Language Model (VLM) Consortium and a reasoning Large-Language Model (LLM)-enabled Decision Support System for automated H-reflex waveform interpretation and diagnosis. Our approach leverages multiple VLMs, each fine-tuned on curated datasets of H-reflex EMG waveform images annotated with clinical observations, recovery timelines, and athlete metadata. These models are capable of extracting key electrophysiological features and predicting neuromuscular states, including fatigue, injury, and recovery, directly from EMG images and contextual metadata. Diagnostic outputs from the VLM consortium are aggregated using a consensus-based method and refined by a specialized reasoning LLM, which ensures robust, transparent, and explainable decision support for clinicians and sports scientists. The end-to-end platform orchestrates seamless communication between the VLM ensemble and the reasoning LLM, integrating prompt engineering strategies and automated reasoning workflows using LLM Agents. Experimental results demonstrate that this hybrid system delivers highly accurate, consistent, and interpretable H-reflex assessments, significantly advancing the automation and standardization of neuromuscular diagnostics. To our knowledge, this work represents the first integration of a fine-tuned VLM consortium with a reasoning LLM for image-based H-reflex analysis, laying the foundation for next-generation AI-assisted neuromuscular assessment and athlete monitoring platforms.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2508.12473

Country: Asia (0.28)

Genre: Research Report > New Finding (0.34)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Therapeutic Area > Musculoskeletal (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.53)

Add feedback