Generative AI
Architectures for Building Agentic AI
This chapter argues that the reliability of agentic and generative AI is chiefly an architectural property. We define agentic systems as goal-directed, tool-using decision makers operating in closed loops, and show how reliability emerges from principled componentisation (goal manager, planner, tool-router, executor, memory, verifiers, safety monitor, telemetry), disciplined interfaces (schema-constrained, validated, least-privilege tool calls), and explicit control and assurance loops. Building on classical foundations, we propose a practical taxonomy-tool-using agents, memory-augmented agents, planning and self-improvement agents, multi-agent systems, and embodied or web agents - and analyse how each pattern reshapes the reliability envelope and failure modes. We distil design guidance on typed schemas, idempotency, permissioning, transactional semantics, memory provenance and hygiene, runtime governance (budgets, termination conditions), and simulate-before-actuate safeguards.
A Principle-based Framework for the Development and Evaluation of Large Language Models for Health and Wellness
Winslow, Brent, Shreibati, Jacqueline, Perez, Javier, Su, Hao-Wei, Young-Lin, Nichole, Hammerquist, Nova, McDuff, Daniel, Guss, Jason, Vafeiadou, Jenny, Cain, Nick, Lin, Alex, Schenck, Erik, Rajagopal, Shiva, Chung, Jia-Ru, Venkatakrishnan, Anusha, Lee, Amy Armento, Karimzadehgan, Maryam, Meng, Qingyou, Agarwal, Rythm, Natarajan, Aravind, Giest, Tracy
The incorporation of generative artificial intelligence into personal health applications presents a transformative opportunity for personalized, data-driven health and fitness guidance, yet also poses challenges related to user safety, model accuracy, and personal privacy. To address these challenges, a novel, principle-based framework was developed and validated for the systematic evaluation of LLMs applied to personal health and wellness. First, the development of the Fitbit Insights explorer, a large language model (LLM)-powered system designed to help users interpret their personal health data, is described. Subsequently, the safety, helpfulness, accuracy, relevance, and personalization (SHARP) principle-based framework is introduced as an end-to-end operational methodology that integrates comprehensive evaluation techniques including human evaluation by generalists and clinical specialists, autorater assessments, and adversarial testing, into an iterative development lifecycle. Through the application of this framework to the Fitbit Insights explorer in a staged deployment involving over 13,000 consented users, challenges not apparent during initial testing were systematically identified. This process guided targeted improvements to the system and demonstrated the necessity of combining isolated technical evaluations with real-world user feedback. Finally, a comprehensive, actionable approach is established for the responsible development and deployment of LLM-powered health applications, providing a standardized methodology to foster innovation while ensuring emerging technologies are safe, effective, and trustworthy for users.
Agentic AI as Undercover Teammates: Argumentative Knowledge Construction in Hybrid Human-AI Collaborative Learning
Yan, Lixiang, Jin, Yueqiao, Zhao, Linxuan, Martinez-Maldonado, Roberto, Li, Xinyu, Guan, Xiu, Guo, Wenxin, Han, Xibin, Gaลกeviฤ, Dragan
Generative artificial intelligence (AI) agents are increasingly embedded in collaborative learning environments, yet their impact on the processes of argumentative knowledge construction remains insufficiently understood. Emerging conceptualisations of agentic AI and artificial agency suggest that such systems possess bounded autonomy, interactivity, and adaptability, allowing them to engage as epistemic participants rather than mere instructional tools. Building on this theoretical foundation, the present study investigates how agentic AI, designed as undercover teammates with either supportive or contrarian personas, shapes the epistemic and social dynamics of collaborative reasoning. Drawing on Weinberger and Fischer's (2006) four-dimensional framework, participation, epistemic reasoning, argument structure, and social modes of co-construction, we analysed synchronous discourse data from 212 human and 64 AI participants (92 triads) engaged in an analytical problem-solving task. Mixed-effects and epistemic network analyses revealed that AI teammates maintained balanced participation but substantially reorganised epistemic and social processes: supportive personas promoted conceptual integration and consensus-oriented reasoning, whereas contrarian personas provoked critical elaboration and conflict-driven negotiation. Epistemic adequacy, rather than participation volume, predicted individual learning gains, indicating that agentic AI's educational value lies in enhancing the quality and coordination of reasoning rather than amplifying discourse quantity. These findings extend CSCL theory by conceptualising agentic AI as epistemic and social participants, bounded yet adaptive collaborators that redistribute cognitive and argumentative labour in hybrid human-AI learning environments.
AI firms began to feel the legal wrath of copyright holders in 2025
The three years since the release of ChatGPT, OpenAI's generative AI chatbot, have seen huge changes in every part of our lives. Social media is dead - here's what comes next The most high-profile case was filed by Disney and Universal in June, both of whom alleged in a lawsuit that AI image generator Midjourney had been trained on their intellectual property, allowing users to create images that "blatantly incorporate and copy Disney's and Universal's famous characters". The latest on what's new in science and why it matters each day. In October, the Japanese government formally asked OpenAI, the company behind the Sora 2 AI video generator, to respect the intellectual property rights of its culture, including manga and popular video games such as those published by Nintendo. Sora 2 has faced further controversy due to its ability to create lifelike footage of real people.
OpenAI's house of cards seems primed to collapse
GPU prices could follow RAM's big rise OpenAI's house of cards seems primed to collapse In 2025, it fell behind the one company it couldn't lose ground to: Google. OpenAI CEO Sam Altman speaks during the US Federal Reserve Board of Governors' Integrated Review of the Capital Framework for Large Banks Conference at the Federal Reserve in Washington, DC, on July 22, 2025. OpenAI is in a far less commanding position than it was following the public release of ChatGPT a few short years ago. Back in 2022, the sudden popularity of ChatGPT sent Google into a panic . The company was so worried about the possibility of the upstart chatbot disrupting its Search business, executives sounded a code red alert inside of the company and called Sergey Brin and Larry Page out of retirement to help it formulate a response to OpenAI.
Adobe brings Photoshop, Acrobat and Adobe Express to ChatGPT
GPU prices could follow RAM's big rise You can start using the apps for free, with some limitations. A ChatGPT user asks the chatbot to make an image more vibrant through Photoshop. At the time, the company said more software was on the way, and now one of the most popular professional applications is available through the chatbot. Starting today, you can access Photoshop, Acrobat and Adobe Express inside of ChatGPT. All the apps are free to use through OpenAI's website, though before you can begin generating PDFs and illustrations using Acrobat and Adobe Express, you'll need to sign into your Adobe account.
Keio and OpenAI sign MOU on integration of AI into university
Keio University President Kohei Ito (left) and OpenAI Chief Strategy Officer Jason Kwon sign a memorandum of understanding on Tuesday in Tokyo. Keio University is working with OpenAI to integrate artificial intelligence into its education system. Keio University President Kohei Ito and OpenAI Chief Strategy Officer Jason Kwon signed a memorandum of understanding Tuesday, making Keio the first Japanese university to form a strategic partnership with the producer of ChatGPT. "We will develop an environment where students and researchers can proactively learn and utilize AI," Ito said. In a time of both misinformation and too much information, quality journalism is more crucial than ever.
Exposing Hidden Biases in Text-to-Image Models via Automated Prompt Search
Plitsis, Manos, Bouritsas, Giorgos, Katsouros, Vassilis, Panagakis, Yannis
Text-to-image (TTI) diffusion models have achieved remarkable visual quality, yet they have been repeatedly shown to exhibit social biases across sensitive attributes such as gender, race and age. To mitigate these biases, existing approaches frequently depend on curated prompt datasets - either manually constructed or generated with large language models (LLMs) - as part of their training and/or evaluation procedures. Beside the curation cost, this also risks overlooking unanticipated, less obvious prompts that trigger biased generation, even in models that have undergone debiasing. In this work, we introduce Bias-Guided Prompt Search (BGPS), a framework that automatically generates prompts that aim to maximize the presence of biases in the resulting images. BGPS comprises two components: (1) an LLM instructed to produce attribute-neutral prompts and (2) attribute classifiers acting on the TTI's internal representations that steer the decoding process of the LLM toward regions of the prompt space that amplify the image attributes of interest. We conduct extensive experiments on Stable Diffusion 1.5 and a state-of-the-art debiased model and discover an array of subtle and previously undocumented biases that severely deteriorate fairness metrics. Crucially, the discovered prompts are interpretable, i.e they may be entered by a typical user, quantitatively improving the perplexity metric compared to a prominent hard prompt optimization counterpart. Our findings uncover TTI vulnerabilities, while BGPS expands the bias search space and can act as a new evaluation tool for bias mitigation. Despite significant advances in text-to-image generation, diffusion models (DMs) (Ho et al., 2020; Rombach et al., 2022) perpetuate and amplify social biases, such as gender, race/ethnicity, culture and age (Seshadri et al., 2024; Bianchi et al., 2023), that prove remarkably persistent across various models like Stable Diffusion (Luccioni et al., 2023), DALL E (Cho et al., 2023) and Midjourney. These patterns reveal how descriptive modifiers and contextual cues encode biases throughout the prompt space - regions that current debiasing techniques, despite reporting success on curated datasets, leave entirely unexplored. Manual or LLM-assisted prompt curation yields realistic test cases but explores only a limited fraction of the prompt space. On the other end, gradient-based prompt optimization discovers high-bias regions but produces unreadable text, e.g. "nurse kerala matplotlib tbody" (see section 4.3), unsuitable for practical auditing or understanding bias mechanisms.
Are generative AI text annotations systematically biased?
Stolwijk, Sjoerd B., Boukes, Mark, Trilling, Damian
This paper investigates bias in GLLM annotations by conceptually replicating manual annotations of Boukes (2024). Using various GLLMs (Llama3.1:8b, Llama3.3:70b, GPT4o, Qwen2.5:72b) in combination with five different prompts for five concepts (political content, interactivity, rationality, incivility, and ideology). We find GLLMs perform adequate in terms of F1 scores, but differ from manual annotations in terms of prevalence, yield substantively different downstream results, and display systematic bias in that they overlap more with each other than with manual annotations. Differences in F1 scores fail to account for the degree of bias.
Interpreting Structured Perturbations in Image Protection Methods for Diffusion Models
Martin, Michael R., Chan, Garrick, Ma, Kwan-Liu
Recent image protection mechanisms such as Glaze and Nightshade introduce imperceptible, adversarially designed perturbations intended to disrupt downstream text-to-image generative models. While their empirical effectiveness has been demonstrated, the internal structure, detectability, and representational behavior of these perturbations remain poorly understood. In this study, we demonstrated a systematic explainable AI analysis of image protection perturbations using a unified framework that integrates white-box feature-space inspection and black-box signal-level probing. Through latent-space clustering, feature-channel activation analysis, occlusion-based spatial sensitivity mapping, and frequency-domain spectral characterization, we revealed that modern protection mechanisms operate as structured, low-entropy perturbations that remain tightly coupled to underlying image content across representational, spatial, and spectral domains in all evaluated cases. We showed that protected images preserve content-driven feature organization with protection-specific substructure rather than inducing global representational drift. Detectability is governed by interacting effects of perturbation entropy, spatial deployment, and frequency alignment as revealed through combined synthetic and spectral analyses, with sequential protection amplifying detectable structure rather than suppressing it. Frequency-domain analysis further demonstrated that Glaze and Nightshade redistribute energy along dominant image-aligned frequency axes rather than introducing spectrally diffuse noise. These results suggested that contemporary image protection operates through structured feature-level deformation rather than semantic dislocation, providing mechanistic insight into why protection signals remain visually subtle yet consistently detectable. This work advances the interpretability of adversarial image protection and informs the design of future defenses and detection strategies for generative AI systems.