Generative AI
New AI voice tool trained to copy British regional accents
There is concern that languages and dialects are being lost in the digital era. "Among the over seven thousand languages that still exist today, almost half are endangered according to UNESCO; about a third have some online presence; less than 2 percent are supported by Google Translate; and according to OpenAI's own testing, only fifteen, or 0.2 percent are supported by GPT-4 [an OpenAI model] above an 80 percent accuracy," writes Karen Hao in the book Empire of AI. "Language models are homogenising speech," agrees AI expert Henry Ajder, who advises governments and tech firms, including Synthesia. However, the better these products become, the more effective they will also be in the hands of scammers. Synthesia's product will not be free when it is released in the coming weeks, and will have guardrails around hate speech and explicit material. But there are already many free, open-source voice-cloning tools which are easily accessible and less protected.
OpenAI plans to launch its own browser with built-in ChatGPT soon
According to information provided to Reuters, ChatGPT developer OpenAI plans to launch its own web browser in the coming weeks. The new browser will have ChatGPT built in, allowing users to interact directly with the AI chatbot without having to visit the official website. OpenAI is also considering integrating its so-called Operator Agents, which are AI-powered assistants that can autonomously perform various tasks on your behalf, like booking tickets or filling out forms. The upcoming browser will be built on Google's Chromium codebase, the same one that powers Chrome, Microsoft Edge, and others. It's not yet clear whether it will require a paid subscription, like Perplexity's recently launched AI web browser Comet.
Lighting the Night with Generative Artificial Intelligence
Zhou, Tingting, Zhang, Feng, Fu, Haoyang, Pan, Baoxiang, Zhang, Renhe, Lu, Feng, Yang, Zhixin
The visible light reflectance data from geostationary satellites is crucial for meteorological observations and plays an important role in weather monitoring and forecasting. However, due to the lack of visible light at night, it is impossible to conduct continuous all-day weather observations using visible light reflectance data. This study pioneers the use of generative diffusion models to address this limitation. Based on the multi-band thermal infrared brightness temperature data from the Advanced Geostationary Radiation Imager (AGRI) onboard the Fengyun-4B (FY4B) geostationary satellite, we developed a high-precision visible light reflectance generative model, called Reflectance Diffusion (RefDiff), which enables 0.47~ฮผ\mathrm{m}, 0.65~ฮผ\mathrm{m}, and 0.825~ฮผ\mathrm{m} bands visible light reflectance generation at night. Compared to the classical models, RefDiff not only significantly improves accuracy through ensemble averaging but also provides uncertainty estimation. Specifically, the SSIM index of RefDiff can reach 0.90, with particularly significant improvements in areas with complex cloud structures and thick clouds. The model's nighttime generation capability was validated using VIIRS nighttime product, demonstrating comparable performance to its daytime counterpart. In summary, this research has made substantial progress in the ability to generate visible light reflectance at night, with the potential to expand the application of nighttime visible light data.
AI Safety Should Prioritize the Future of Work
Hazra, Sanchaita, Majumder, Bodhisattwa Prasad, Chakrabarty, Tuhin
Current efforts in AI safety prioritize filtering harmful content, preventing manipulation of human behavior, and eliminating existential risks in cybersecurity or biosecurity. While pressing, this narrow focus overlooks critical human-centric considerations that shape the long-term trajectory of a society. In this position paper, we identify the risks of overlooking the impact of AI on the future of work and recommend comprehensive transition support towards the evolution of meaningful labor with human agency. Through the lens of economic theories, we highlight the intertemporal impacts of AI on human livelihood and the structural changes in labor markets that exacerbate income inequality. Additionally, the closed-source approach of major stakeholders in AI development resembles rent-seeking behavior through exploiting resources, breeding mediocrity in creative labor, and monopolizing innovation. To address this, we argue in favor of a robust international copyright anatomy supported by implementing collective licensing that ensures fair compensation mechanisms for using data to train AI models. We strongly recommend a pro-worker framework of global AI governance to enhance shared prosperity and economic justice while reducing technical debt.
Generating Proto-Personas through Prompt Engineering: A Case Study on Efficiency, Effectiveness and Empathy
Ayach, Fernando, Lameirรฃo, Vitor, Leรฃo, Raul, Felizardo, Jerfferson, Sobrinho, Rafael, Borges, Vanessa, Matsubara, Patrรญcia, Fontรฃo, Awdren
Proto-personas are commonly used during early-stage Product Discovery, such as Lean Inception, to guide product definition and stakeholder alignment. However, the manual creation of proto-personas is often time-consuming, cognitively demanding, and prone to bias. In this paper, we propose and empirically investigate a prompt engineering-based approach to generate proto-personas with the support of Generative AI (GenAI). Our goal is to evaluate the approach in terms of efficiency, effectiveness, user acceptance, and the empathy elicited by the generated personas. We conducted a case study with 19 participants embedded in a real Lean Inception, employing a qualitative and quantitative methods design. The results reveal the approach's efficiency by reducing time and effort and improving the quality and reusability of personas in later discovery phases, such as Minimum Viable Product (MVP) scoping and feature refinement. While acceptance was generally high, especially regarding perceived usefulness and ease of use, participants noted limitations related to generalization and domain specificity. Furthermore, although cognitive empathy was strongly supported, affective and behavioral empathy varied significantly across participants. These results contribute novel empirical evidence on how GenAI can be effectively integrated into software Product Discovery practices, while also identifying key challenges to be addressed in future iterations of such hybrid design processes.
Generative AI in Science: Applications, Challenges, and Emerging Questions
Harries, Ryan, Lawson, Cornelia, Shapira, Philip
This paper examines the impact of Generative Artificial Intelligence (GenAI) on scientific practices, conducting a qualitative review of selected literature to explore its applications, benefits, and challenges. The review draws on the OpenAlex publication database, using a Boolean search approach to identify scientific literature related to GenAI (including large language models and ChatGPT). Thirty-nine highly cited papers and commentaries are reviewed and qualitatively coded. Results are categorized by GenAI applications in science, scientific writing, medical practice, and education and training. The analysis finds that while there is a rapid adoption of GenAI in science and science practice, its long-term implications remain unclear, with ongoing uncertainties about its use and governance. The study provides early insights into GenAI's growing role in science and identifies questions for future research in this evolving field.
Transfer Learning and Mixup for Fine-Grained Few-Shot Fungi Classification
Tam, Jason Kahei, Gustineli, Murilo, Miyaguchi, Anthony
Accurate identification of fungi species presents a unique challenge in computer vision due to fine-grained inter-species variation and high intra-species variation. This paper presents our approach for the FungiCLEF 2025 competition, which focuses on few-shot fine-grained visual categorization (FGVC) using the FungiTastic Few-Shot dataset. Our team (DS@GT) experimented with multiple vision transformer models, data augmentation, weighted sampling, and incorporating textual information. We also explored generative AI models for zero-shot classification using structured prompting but found them to significantly underperform relative to vision-based models. Our final model outperformed both competition baselines and highlighted the effectiveness of domain specific pretraining and balanced sampling strategies. Our approach ranked 35/74 on the private test set in post-completion evaluation, this suggests additional work can be done on metadata selection and domain-adapted multi-modal learning. Our code is available at https://github.com/dsgt-arc/fungiclef-2025.
A Systematic Analysis of Declining Medical Safety Messaging in Generative AI Models
Sharma, Sonali, Alaa, Ahmed M., Daneshjou, Roxana
Generative AI models, including large language models (LLMs) and vision-language models (VLMs), are increasingly used to interpret medical images and answer clinical questions. Their responses often include inaccuracies; therefore, safety measures like medical disclaimers are critical to remind users that AI outputs are not professionally vetted or a substitute for medical advice. This study evaluated the presence of disclaimers in LLM and VLM outputs across model generations from 2022 to 2025. Using 500 mammograms, 500 chest X-rays, 500 dermatology images, and 500 medical questions, outputs were screened for disclaimer phrases. Medical disclaimer presence in LLM and VLM outputs dropped from 26.3% in 2022 to 0.97% in 2025, and from 19.6% in 2023 to 1.05% in 2025, respectively. By 2025, the majority of models displayed no disclaimers. As public models become more capable and authoritative, disclaimers must be implemented as a safeguard adapting to the clinical context of each output.
"Amazing, They All Lean Left" -- Analyzing the Political Temperaments of Current LLMs
Neuman, W. Russell, Coleman, Chad, Dasdan, Ali, Ali, Safinah, Shah, Manan, Meghani, Kund
"Amazing, They All Lean Left" - Analyzing the Political Temperaments of Current LLMs Abstract Recent studies have revealed a consistent liberal orientation in the ethical and political responses generated by most commercial large language models (LLMs), yet the underlying causes and resulting implications remain unclear. This paper systematically i nvestigates the political temperament of seven prominent LLMs -- OpenAI's GPT - 4o, Anthropic's Claude Sonnet 4, Perplexity (Sonar Large), Google's Gemini 2.5 Flash, Meta AI's L l a ma 4, Mistral 7b Le Chat, and High - Flyer ' s DeepSeek R1 -- using a multi - pronged approach that incl udes Moral Foundations Theory, a dozen established political ideology scales, and a new index of current political controversies. We find strong and consistent prioritization of liberal - leaning values, particularly care and fairness, across most models. Fur ther analysis attributes this trend to four overlapping factors: liberal - leaning training corpora, reinforcement learning from human feedback (RLHF), the dominance of liberal frameworks in academic ethical discourse, and safety - driven fine - tuning practices . We also distinguish between political "bias" and legitimate epistemic differences, cautioning against conflating the two. A comparison of base and fine - tuned model pairs reveals that fine - tuning generally increases liberal lean, an effect confirmed throu gh both self - report and empirical testing. We argue that this "liberal tilt" is not a programming error or the personal preferences of programmers but an emergent property of training on democratic, rights - focused discourse. Finally, we propose that LLMs may indirectly echo John Rawls' famous veil - of - igno rance philosophical aspiration, reflecting a moral stance unanchored to personal identity or interest. Rather than undermining democratic discourse, this pattern may offer a new lens through which to examine collective ethical reasoning. In the course of our research on the ethical logics of currently prominent large language models (Neuman et al. 2025a, b; Coleman et al. 2025), we encountered an interesting finding. The responses to various ethical dilemmas and the explanations of the underlying logics used by these models appear to resonate with the liberal side of the political spectrum. One research analytic we utilize draws on Moral Foundation Theory's five - element typology of foundational moral principles (Graham et al. 2009; Haidt 2012). The five foundations emp hasizing in turn, Care, Fairness, Loyalty, Authority and Purity, are traditionally divided into two clusters. The first two, Care and Fairness, are associated with a liberal political perspective, while conservatives who fully acknowledge the first two more often emphasize the latter three -- Loyalty, Authority and Purity in support of traditional norms.
Assessing the Capabilities and Limitations of FinGPT Model in Financial NLP Applications
Djagba, Prudence, Odinakachukwu, Chimezie A.
The financial industry has long been a pioneer in adopting cutting-edge technologies to enhance operational efficiency, accuracy, and strategic decision-making [2]. With the exponential growth of structured and unstructured data, particularly from news feeds, earnings reports, disclosures, and social media, there is an increasing demand for intelligent systems capable of processing human language at scale [11]. Initially, the industry relied on rule-based approaches and traditional statistical techniques such as bag-of-words and TF-IDF [28], which offered limited semantic understanding. As noted by Abubakar et al.[1], these limitations triggered a shift toward machine learning and deep learning models that, while better at capturing patterns, still required substantial domain-specific feature engineering. This landscape was significantly transformed with the introduction of transformer-based architectures, most notably the Generative Pre-trained Transformer (GPT) family [5]. These models demonstrated the power of large-scale pretraining followed by task-specific fine-tuning, enabling generalization across diverse NLP tasks. Models such as GPT-3, GPT-4, BERT, and T5 have delivered state-of-the-art results in sentiment analysis, summarization, question answering, and named entity recognition [13]. Beyond LLMs, the broader field of Generative AI (GAI)--including GANs, V AEs, and diffusion models--has found increasing relevance in finance, facilitating applications such as synthetic data generation, automated reporting, and scenario simulation [32, 31]. LLMs have emerged as essential tools in processing unstructured financial text, especially models fine-tuned on finance-specific corpora like FinBERT, BloombergGPT, and FinGPT [4, 39].