Generative AI
Generative AI Carries Non-Democratic Biases and Stereotypes: Representation of Women, Black Individuals, Age Groups, and People with Disability in AI-Generated Images across Occupations
AI governance and ethics in AI development have become critical concerns, prompting active discussions among tech companies, governments, and researchers about the potential risks AI poses to our democracies. This short essay aims to highlight one such risk: how generative AI includes or excludes equity-deserving groups in its outputs. The findings reveal that generative AI is not equitably inclusive regarding gender, race, age, and visible disability. Mutual Impacts: Technology and Democracy Technology is a human creation and, as such, inherently reflects our values, prejudices, and biases. Additionally, it plays a crucial role in shaping societal norms and social contracts.
Multi-Modality Conditioned Variational U-Net for Field-of-View Extension in Brain Diffusion MRI
Li, Zhiyuan, Yao, Tianyuan, Kanakaraj, Praitayini, Gao, Chenyu, Bao, Shunxing, Zuo, Lianrui, Kim, Michael E., Newlin, Nancy R., Rudravaram, Gaurav, Khairi, Nazirah M., Huo, Yuankai, Schilling, Kurt G., Kukull, Walter A., Toga, Arthur W., Archer, Derek B., Hohman, Timothy J., Landman, Bennett A.
An incomplete field-of-view (FOV) in diffusion magnetic resonance imaging (dMRI) can severely hinder the volumetric and bundle analyses of whole-brain white matter connectivity. Although existing works have investigated imputing the missing regions using deep generative models, it remains unclear how to specifically utilize additional information from paired multi-modality data and whether this can enhance the imputation quality and be useful for downstream tractography. To fill this gap, we propose a novel framework for imputing dMRI scans in the incomplete part of the FOV by integrating the learned diffusion features in the acquired part of the FOV to the complete brain anatomical structure. We hypothesize that by this design the proposed framework can enhance the imputation performance of the dMRI scans and therefore be useful for repairing whole-brain tractography in corrupted dMRI scans with incomplete FOV. We tested our framework on two cohorts from different sites with a total of 96 subjects and compared it with a baseline imputation method that treats the information from T1w and dMRI scans equally. The proposed framework achieved significant improvements in imputation performance, as demonstrated by angular correlation coefficient (p < 1E-5), and in downstream tractography accuracy, as demonstrated by Dice score (p < 0.01). Results suggest that the proposed framework improved imputation performance in dMRI scans by specifically utilizing additional information from paired multi-modality data, compared with the baseline method. The imputation achieved by the proposed framework enhances whole brain tractography, and therefore reduces the uncertainty when analyzing bundles associated with neurodegenerative.
Contextualized AI for Cyber Defense: An Automated Survey using LLMs
Haryanto, Christoforus Yoga, Elvira, Anne Maria, Nguyen, Trung Duc, Vu, Minh Hieu, Hartanto, Yoshiano, Lomempow, Emily, Arakala, Arathi
This paper surveys the potential of contextualized AI in enhancing cyber defense capabilities, revealing significant research growth from 2015 to 2024. We identify a focus on robustness, reliability, and integration methods, while noting gaps in organizational trust and governance frameworks. Our study employs two LLM-assisted literature survey methodologies: (A) ChatGPT 4 for exploration, and (B) Gemma 2:9b for filtering with Claude 3.5 Sonnet for full-text analysis. We discuss the effectiveness and challenges of using LLMs in academic research, providing insights for future researchers.
LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1 on PlanBench
Valmeekam, Karthik, Stechly, Kaya, Kambhampati, Subbarao
The ability to plan a course of action that achieves a desired state of affairs has long been considered a core competence of intelligent agents and has been an integral part of AI research since its inception. With the advent of large language models (LLMs), there has been considerable interest in the question of whether or not they possess such planning abilities. PlanBench, an extensible benchmark we developed in 2022, soon after the release of GPT3, has remained an important tool for evaluating the planning abilities of LLMs. Despite the slew of new private and open source LLMs since GPT3, progress on this benchmark has been surprisingly slow. OpenAI claims that their recent o1 (Strawberry) model has been specifically constructed and trained to escape the normal limitations of autoregressive LLMs--making it a new kind of model: a Large Reasoning Model (LRM). Using this development as a catalyst, this paper takes a comprehensive look at how well current LLMs and new LRMs do on PlanBench. As we shall see, while o1's performance is a quantum improvement on the benchmark, outpacing the competition, it is still far from saturating it. This improvement also brings to the fore questions about accuracy, efficiency, and guarantees which must be considered before deploying such systems.
BoilerTAI: A Platform for Enhancing Instruction Using Generative AI in Educational Forums
Sinha, Anvit, Goyal, Shruti, Sy, Zachary, Kuperus, Rhianna, Dickey, Ethan, Bejarano, Andres
Contribution: This Full paper in the Research Category track describes a practical, scalable platform that seamlessly integrates Generative AI (GenAI) with online educational forums, offering a novel approach to augment the instructional capabilities of staff. The platform empowers instructional staff to efficiently manage, refine, and approve responses by facilitating interaction between student posts and a Large Language Model (LLM). This contribution enhances the efficiency and effectiveness of instructional support and significantly improves the quality and speed of responses provided to students, thereby enriching the overall learning experience. Background: Grounded in Vygotsky's socio-cultural theory and the concept of the More Knowledgeable Other (MKO), the study examines how GenAI can act as an auxiliary MKO to enrich educational dialogue between students and instructors. Research Question: How effective is GenAI in reducing the workload of instructional staff when used to pre-answer student questions posted on educational discussion forums? Methodology: Using a mixed-methods approach in large introductory programming courses, human Teaching Assistants (AI-TAs) employed an AI-assisted platform to pre-answer student queries. We analyzed efficiency indicators like the frequency of modifications to AI-generated responses and gathered qualitative feedback from AI-TAs. Findings: The findings indicate no significant difference in student reception to responses generated by AI-TAs compared to those provided by human instructors. This suggests that GenAI can effectively meet educational needs when adequately managed. Moreover, AI-TAs experienced a reduction in the cognitive load required for responding to queries, pointing to GenAI's potential to enhance instructional efficiency without compromising the quality of education.
What a major movie studio's AI deal could mean for the future of Hollywood
Technology AI What a major movie studio's AI deal could mean for the future of Hollywood Generative AI might save studios'millions and millions of dollars,' but at what cost? Breakthroughs, discoveries, and DIY tips sent every weekday. When Hollywood's actors took to the streets last year for a 118 day strike, many wielded signs reading "no digital clones," "AI is soulless," and "AI is not art." These ticked-off thespians were expressing a sentiment shared by a growing share of writers, video games voice actors, and many other creatives: generative AI tools, trained off their work, may threaten their jobs and shrink the entertainment industry. When the strike ended, actors were awarded new, hard-won protections against AI-generated clones .
LinkedIn is training AI with your data. Here's how to opt out ASAP
As an objective journalistic observer of the tech and business world, I'd like to state the inarguable, quantifiable, unavoidable fact that LinkedIn sucks. LinkedIn is a terrible fusion of the worst parts of social networks, job boards, and office culture -- and it's about to suck even harder with the help of AI. Like seemingly every large tech company these days, LinkedIn is injecting generative AI into its platform. But the Microsoft-owned website is now scraping its user data to train its artificial intelligence systems. Naturally, you're opted into sharing your data with LinkedIn's AI for free, without any kind of message or alert.
Generation and Editing of Mandrill Faces: Application to Sex Editing and Assessment
Dibot, Nicolas M., Renoult, Julien P., Puech, William
Generative AI has seen major developments in recent years, enhancing the realism of synthetic images, also known as computer-generated images. In addition, generative AI has also made it possible to modify specific image characteristics through image editing. Previous work has developed methods based on generative adversarial networks (GAN) for generating realistic images, in particular faces, but also to modify specific features. However, this work has never been applied to specific animal species. Moreover, the assessment of the results has been generally done subjectively, rather than quantitatively. In this paper, we propose an approach based on methods for generating images of faces of male or female mandrills, a non-human primate. The main novelty of proposed method is the ability to edit their sex by identifying a sex axis in the latent space of a specific GAN. In addition, we have developed an assessment of the sex levels based on statistical features extracted from real image distributions. The experimental results we obtained from a specific database are not only realistic, but also accurate, meeting a need for future work in behavioral experiments with wild mandrills.
Deep generative models as an adversarial attack strategy for tabular machine learning
Dyrmishi, Salijona, Stoian, Mihaela Cฤtฤlina, Giunchiglia, Eleonora, Cordy, Maxime
Deep Generative Models (DGMs) have found application in computer vision for generating adversarial examples to test the robustness of machine learning (ML) systems. Extending these adversarial techniques to tabular ML presents unique challenges due to the distinct nature of tabular data and the necessity to preserve domain constraints in adversarial examples. In this paper, we adapt four popular tabular DGMs into adversarial DGMs (AdvDGMs) and evaluate their effectiveness in generating realistic adversarial examples that conform to domain constraints.
The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives
Arif, Samee, Arif, Taimoor, Haroon, Muhammad Saad, Khan, Aamina Jamal, Raza, Agha Ali, Athar, Awais
This paper introduces the concept of an education tool that utilizes Generative Artificial Intelligence (GenAI) to enhance storytelling for children. The system combines GenAI-driven narrative co-creation, text-to-speech conversion, and text-to-video generation to produce an engaging experience for learners. We describe the co-creation process, the adaptation of narratives into spoken words using text-to-speech models, and the transformation of these narratives into contextually relevant visuals through text-to-video technology. Our evaluation covers the linguistics of the generated stories, the text-to-speech conversion quality, and the accuracy of the generated visuals.