Goto

Collaborating Authors

 Generative AI


OpenAI has fixed ChatGPT's infamous 'em dash' obsession (somewhat)

PCWorld

When you purchase through links in our articles, we may earn a small commission. The em dash is seen by some as a dead giveaway of AI-generated text, mainly because ChatGPT loves to use it. OpenAI CEO Sam Altman shared in a social media post that the company has now fixed ChatGPT's overuse of the "em dash," which is the extra-long hyphen that's commonly seen in AI-generated text. In the past, ChatGPT was overzealous in its use of the em dash, to the point where it'd continue to include them even when users asked it not to. Now, with the fix, a user can instruct ChatGPT to not use em dashes and it will respect the instruction.


How generative AI in Arc Raiders started a scrap over the gaming industry's future

The Guardian

How generative AI in Arc Raiders started a scrap over the gaming industry's future Don't get Pushing Buttons delivered to your inbox? A rc Raiders is, by all accounts, a late game-of-the-year contender. Dropped into a multiplayer world overrun with hostile drones and military robots, every human player is at the mercy of the machines - and each other. Can you trust the other raider you've spotted on your way back to humanity's safe haven underground, or will they shoot you and take everything you've just scavenged? Perhaps surprisingly, humanity is (mostly) choosing to band together, according to most people I've talked to about this game.


Larry Summers resigns from OpenAI board after Epstein emails made public

BBC News

Former US treasury secretary Larry Summers is stepping down from the board at OpenAI, a week after a tranche of emails between him and late convicted sex offender Jeffrey Epstein was released. Summers said in a statement to the BBC that he was grateful for the opportunity to have served, excited about the potential of the company, and look forward to following their progress. Summers, who was also once the president of Harvard University, said on Monday that he would be stepping back from public commitments over his ties to Epstein. The recently released emails showed Summers communicated with Epstein until the day before Epstein's 2019 arrest for the alleged sex trafficking of minors. In a statement, the artificial intelligence company said it respected Summers' decision to resign.


From Legacy Fortran to Portable Kokkos: An Autonomous Agentic AI Workflow

arXiv.org Artificial Intelligence

Scientific applications continue to rely on legacy Fortran codebases originally developed for homogeneous, CPU-based systems. As High-Performance Computing (HPC) shifts toward heterogeneous GPU-accelerated architectures, many accelerators lack native Fortran bindings, creating an urgent need to modernize legacy codes for portability. Frameworks like Kokkos provide performance portability and a single-source C++ abstraction, but manual Fortran-to-Kokkos porting demands significant expertise and time. Large language models (LLMs) have shown promise in source-to-source code generation, yet their use in fully autonomous workflows for translating and optimizing parallel code remains largely unexplored, especially for performance portability across diverse hardware. This paper presents an agentic AI workflow where specialized LLM "agents" collaborate to translate, validate, compile, run, test, debug, and optimize Fortran kernels into portable Kokkos C++ programs. Results show the pipeline modernizes a range of benchmark kernels, producing performance-portable Kokkos codes across hardware partitions. Paid OpenAI models such as GPT-5 and o4-mini-high executed the workflow for only a few U.S. dollars, generating optimized codes that surpassed Fortran baselines, whereas open-source models like Llama4-Maverick often failed to yield functional codes. This work demonstrates the feasibility of agentic AI for Fortran-to-Kokkos transformation and offers a pathway for autonomously modernizing legacy scientific applications to run portably and efficiently on diverse supercomputers. It further highlights the potential of LLM-driven agentic systems to perform structured, domain-specific reasoning tasks in scientific and systems-oriented applications.


Apo2Mol: 3D Molecule Generation via Dynamic Pocket-Aware Diffusion Models

arXiv.org Artificial Intelligence

Deep generative models are rapidly advancing structure-based drug design, offering substantial promise for generating small molecule ligands that bind to specific protein targets. However, most current approaches assume a rigid protein binding pocket, neglecting the intrinsic flexibility of proteins and the conformational rearrangements induced by ligand binding, limiting their applicability in practical drug discovery. Here, we propose Apo2Mol, a diffusion-based generative framework for 3D molecule design that explicitly accounts for conformational flexibility in protein binding pockets. To support this, we curate a dataset of over 24,000 experimentally resolved apo-holo structure pairs from the Protein Data Bank, enabling the characterization of protein structure changes associated with ligand binding. Apo2Mol employs a full-atom hierarchical graph-based diffusion model that simultaneously generates 3D ligand molecules and their corresponding holo pocket conformations from input apo states. Empirical studies demonstrate that Apo2Mol can achieve state-of-the-art performance in generating high-affinity ligands and accurately capture realistic protein pocket conformational changes.


AfriSpeech-MultiBench: A Verticalized Multidomain Multicountry Benchmark Suite for African Accented English ASR

arXiv.org Artificial Intelligence

Recent advances in speech-enabled AI, including Google's NotebookLM and OpenAI's speech-to-speech API, are driving widespread interest in voice interfaces globally. Despite this momentum, there exists no publicly available application-specific model evaluation that caters to Africa's linguistic diversity. We present AfriSpeech-MultiBench, the first domain-specific evaluation suite for over 100 African English accents across 10+ countries and seven application domains: Finance, Legal, Medical, General dialogue, Call Center, Named Entities and Hallucination Robustness. We benchmark a diverse range of open, closed, unimodal ASR and multimodal LLM-based speech recognition systems using both spontaneous and non-spontaneous speech conversation drawn from various open African accented English speech datasets. Our empirical analysis reveals systematic variation: open-source ASR models excels in spontaneous speech contexts but degrades on noisy, non-native dialogue; multimodal LLMs are more accent-robust yet struggle with domain-specific named entities; proprietary models deliver high accuracy on clean speech but vary significantly by country and domain. Models fine-tuned on African English achieve competitive accuracy with lower latency, a practical advantage for deployment, hallucinations still remain a big problem for most SOTA models. By releasing this comprehensive benchmark, we empower practitioners and researchers to select voice technologies suited to African use-cases, fostering inclusive voice applications for underserved communities.


A Comprehensive Study of Implicit and Explicit Biases in Large Language Models

arXiv.org Artificial Intelligence

Large Language Models (LLMs) inherit explicit and implicit biases from their training datasets. Identifying and mitigating biases in LLMs is crucial to ensure fair outputs, as they can perpetuate harmful stereotypes and misinformation. This study highlights the need to address biases in LLMs amid growing generative AI. We studied bias-specific benchmarks such as StereoSet and CrowSPairs to evaluate the existence of various biases in multiple generative models such as BERT and GPT 3.5. We proposed an automated Bias-Identification Framework to recognize various social biases in LLMs such as gender, race, profession, and religion. We adopted a two-pronged approach to detect explicit and implicit biases in text data. Results indicated fine-tuned models struggle with gender biases but excelled at identifying and avoiding racial biases. Our findings illustrated that despite having some success, LLMs often over-relied on keywords. To illuminate the capability of the analyzed LLMs in detecting implicit biases, we employed Bag-of-Words analysis and unveiled indications of implicit stereotyping within the vocabulary. To bolster the model performance, we applied an enhancement strategy involving fine-tuning models using prompting techniques and data augmentation of the bias benchmarks. The fine-tuned models exhibited promising adaptability during cross-dataset testing and significantly enhanced performance on implicit bias benchmarks, with performance gains of up to 20%.


Scene Graph-Guided Generative AI Framework for Synthesizing and Evaluating Industrial Hazard Scenarios

arXiv.org Artificial Intelligence

Training vision models to detect workplace hazards accurately requires realistic images of unsafe conditions that could lead to accidents. However, acquiring such datasets is difficult because capturing accident-triggering scenarios as they occur is nearly impossible. To overcome this limitation, this study presents a novel scene graph-guided generative AI framework that synthesizes photorealistic images of hazardous scenarios grounded in historical Occupational Safety and Health Administration (OSHA) accident reports. OSHA narratives are analyzed using GPT-4o to extract structured hazard reasoning, which is converted into object-level scene graphs capturing spatial and contextual relationships essential for understanding risk. These graphs guide a text-to-image diffusion model to generate compositionally accurate hazard scenes. To evaluate the realism and semantic fidelity of the generated data, a visual question answering (VQA) framework is introduced. Across four state-of-the-art generative models, the proposed VQA Graph Score outperforms CLIP and BLIP metrics based on entropy-based validation, confirming its higher discriminative sensitivity.


GenAI Voice Mode in Programming Education

arXiv.org Artificial Intelligence

Real-time voice interfaces using multimodal Generative AI (GenAI) can potentially address the accessibility needs of novice programmers with disabilities (e.g., related to vision). Yet, little is known about how novices interact with GenAI tools and their feedback quality in the form of audio output. This paper analyzes audio dialogues from nine 9th-grade students using a voice-enabled tutor (powered by OpenAI's Realtime API) in an authentic classroom setting while learning Python. We examined the students' voice prompts and AI's responses (1210 messages) by using qualitative coding. We also gathered students' perceptions via the Partner Modeling Questionnaire. The GenAI Voice Tutor primarily offered feedback on mistakes and next steps, but its correctness was limited (71.4% correct out of 416 feedback outputs). Quality issues were observed, particularly when the AI attempted to utter programming code elements. Students used the GenAI voice tutor primarily for debugging. They perceived it as competent, only somewhat human-like, and flexible. The present study is the first to explore the interaction dynamics of real-time voice GenAI tutors and novice programmers, informing future educational tool design and potentially addressing accessibility needs of diverse learners.


Microsoft, Nvidia invest in Anthropic in cloud services deal

Al Jazeera

Microsoft and Nvidia plan to invest in Anthropic under a new tie-up that includes a $30bn commitment by the Claude maker to use Microsoft's cloud services, the latest high-profile deal binding together major players in the AI industry. Nvidia will commit up to $10bn to Anthropic and Microsoft up to $5bn, the companies said on Tuesday, without sharing more details. The announcement underscores the AI industry's insatiable appetite for computing power as companies race to build systems that can rival or surpass human intelligence. It also ties major OpenAI-backer Microsoft, as well as key AI chip supplier Nvidia, closer to one of the ChatGPT maker's biggest rivals. "We're increasingly going to be customers of each other. We will use Anthropic models, they will use our infrastructure and we'll go to market together," Microsoft CEO Satya Nadella said in a video.