Generative AI
Formula 1, AWS team up for AI-inspired trophy ahead of Canadian Grand Prix
Fox News Flash top sports headlines are here. Check out what's clicking on Foxnews.com. Formula 1 and Amazon Web Services (AWS) have been partners for more than six years. But, that longstanding partnership is now set to reach new heights as the popular sports league and the leading tech company will leverage AWS tools to develop a generative artificial intelligence-designed trophy for the upcoming Canadian Grand Prix. The first-of-its-kind approach to the trophy for the highly-anticipated event is expected to help increase creativity.
Inspired by AI? A Novel Generative AI System To Assist Conceptual Automotive Design
Wang, Ye, Damen, Nicole B., Gale, Thomas, Seo, Voho, Shayani, Hooman
Design inspiration is crucial for establishing the direction of a design as well as evoking feelings and conveying meanings during the conceptual design process. Many practice designers use text-based searches on platforms like Pinterest to gather image ideas, followed by sketching on paper or using digital tools to develop concepts. Emerging generative AI techniques, such as diffusion models, offer a promising avenue to streamline these processes by swiftly generating design concepts based on text and image inspiration inputs, subsequently using the AI generated design concepts as fresh sources of inspiration for further concept development. However, applying these generative AI techniques directly within a design context has challenges. Firstly, generative AI tools may exhibit a bias towards particular styles, resulting in a lack of diversity of design outputs. Secondly, these tools may struggle to grasp the nuanced meanings of texts or images in a design context. Lastly, the lack of integration with established design processes within design teams can result in fragmented use scenarios. Focusing on these challenges, we conducted workshops, surveys, and data augmentation involving teams of experienced automotive designers to investigate their current practices in generating concepts inspired by texts and images, as well as their preferred interaction modes for generative AI systems to support the concept generation workflow. Finally, we developed a novel generative AI system based on diffusion models to assist conceptual automotive design.
Benchmark Data Contamination of Large Language Models: A Survey
Xu, Cheng, Guan, Shuhao, Greene, Derek, Kechadi, M-Tahar
The rapid development of Large Language Models (LLMs) like GPT-4, Claude-3, and Gemini has transformed the field of natural language processing. However, it has also resulted in a significant issue known as Benchmark Data Contamination (BDC). This occurs when language models inadvertently incorporate evaluation benchmark information from their training data, leading to inaccurate or unreliable performance during the evaluation phase of the process. This paper reviews the complex challenge of BDC in LLM evaluation and explores alternative assessment methods to mitigate the risks associated with traditional benchmarks. The paper also examines challenges and future directions in mitigating BDC risks, highlighting the complexity of the issue and the need for innovative solutions to ensure the reliability of LLM evaluation in real-world applications.
Intersectional Unfairness Discovery
Xu, Gezheng, Chen, Qi, Ling, Charles, Wang, Boyu, Shui, Changjian
AI systems have been shown to produce unfair results for certain subgroups of population, highlighting the need to understand bias on certain sensitive attributes. Current research often falls short, primarily focusing on the subgroups characterized by a single sensitive attribute, while neglecting the nature of intersectional fairness of multiple sensitive attributes. This paper focuses on its one fundamental aspect by discovering diverse high-bias subgroups under intersectional sensitive attributes. Specifically, we propose a Bias-Guided Generative Network (BGGN). By treating each bias value as a reward, BGGN efficiently generates high-bias intersectional sensitive attributes. Experiments on real-world text and image datasets demonstrate a diverse and efficient discovery of BGGN. To further evaluate the generated unseen but possible unfair intersectional sensitive attributes, we formulate them as prompts and use modern generative AI to produce new texts and images. The results of frequently generating biased data provides new insights of discovering potential unfairness in popular modern generative AI systems. Warning: This paper contains generative examples that are offensive in nature.
GenAI Arena: An Open Evaluation Platform for Generative Models
Jiang, Dongfu, Ku, Max, Li, Tianle, Ni, Yuansheng, Sun, Shizhuo, Fan, Rongqi, Chen, Wenhu
Generative AI has made remarkable strides to revolutionize fields such as image and video generation. These advancements are driven by innovative algorithms, architecture, and data. However, the rapid proliferation of generative models has highlighted a critical gap: the absence of trustworthy evaluation metrics. Current automatic assessments such as FID, CLIP, FVD, etc often fail to capture the nuanced quality and user satisfaction associated with generative outputs. This paper proposes an open platform GenAI-Arena to evaluate different image and video generative models, where users can actively participate in evaluating these models. By leveraging collective user feedback and votes, GenAI-Arena aims to provide a more democratic and accurate measure of model performance. It covers three arenas for text-to-image generation, text-to-video generation, and image editing respectively. Currently, we cover a total of 27 open-source generative models. GenAI-Arena has been operating for four months, amassing over 6000 votes from the community. We describe our platform, analyze the data, and explain the statistical methods for ranking the models. To further promote the research in building model-based evaluation metrics, we release a cleaned version of our preference data for the three tasks, namely GenAI-Bench. We prompt the existing multi-modal models like Gemini, GPT-4o to mimic human voting. We compute the correlation between model voting with human voting to understand their judging abilities. Our results show existing multimodal models are still lagging in assessing the generated visual content, even the best model GPT-4o only achieves a Pearson correlation of 0.22 in the quality subscore, and behaves like random guessing in others.
Generative AI-in-the-loop: Integrating LLMs and GPTs into the Next Generation Networks
Zhang, Han, Sediq, Akram Bin, Afana, Ali, Erol-Kantarci, Melike
In recent years, machine learning (ML) techniques have created numerous opportunities for intelligent mobile networks and have accelerated the automation of network operations. However, complex network tasks may involve variables and considerations even beyond the capacity of traditional ML algorithms. On the other hand, large language models (LLMs) have recently emerged, demonstrating near-human-level performance in cognitive tasks across various fields. However, they remain prone to hallucinations and often lack common sense in basic tasks. Therefore, they are regarded as assistive tools for humans. In this work, we propose the concept of "generative AI-in-the-loop" and utilize the semantic understanding, context awareness, and reasoning abilities of LLMs to assist humans in handling complex or unforeseen situations in mobile communication networks. We believe that combining LLMs and ML models allows both to leverage their respective capabilities and achieve better results than either model alone. To support this idea, we begin by analyzing the capabilities of LLMs and compare them with traditional ML algorithms. We then explore potential LLM-based applications in line with the requirements of next-generation networks. We further examine the integration of ML and LLMs, discussing how they can be used together in mobile networks. Unlike existing studies, our research emphasizes the fusion of LLMs with traditional ML-driven next-generation networks and serves as a comprehensive refinement of existing surveys. Finally, we provide a case study to enhance ML-based network intrusion detection with synthesized data generated by LLMs. Our case study further demonstrates the advantages of our proposed idea.
The Researcher Trying to Glimpse the Future of AI
Imagine if the world's response to climate change relied solely on speculative predictions from pundits and CEOs, rather than the rigorous--though still imperfect--models of climate science. "Two degrees of warming will arrive soon-ish but will change the world less than we all think," one might say. "Two degrees of warming is not just around the corner. This is going to take a long time," another could counter. This is more or less the world we're in with artificial intelligence, with OpenAI CEO Sam Altman saying that AI systems that can do any task a human can will be developed in the "reasonably close-ish future," while Yann LeCun, Chief AI Scientist at Facebook, argues that human-level AI systems are "going to take a long time."
OpenAI Is Just Facebook Now
Investors led by Microsoft pressured OpenAI to reinstate Altman, which it did within days, alongside vague promises to be more responsible. Then, last month, the company disbanded the internal group tasked with safety research, known as the "superalignment team." Some of the team's most prominent members publicly resigned, including its head, Jan Leike, who posted on X that "over the past years, safety culture and processes have taken a backseat to shiny products." Fortune reported that OpenAI did not provide anywhere near the resources it had initially, publicly promised for safety research. Saunders, who also worked on superalignment, said he resigned when he "lost hope a few months before Jan did."
'The Stakes Are Incredibly High.' Two Former OpenAI Employees On the Need for Whistleblower Protections
This could be a costly interview for William Saunders. The former safety researcher resigned from OpenAI in February, and--like many other departing employees--signed a non-disparagement agreement in order to keep the right to sell his equity in the company. Although he says OpenAI has since told him that it does not intend to enforce the agreement, and has made similar public commitments, he is still taking a risk by speaking out. "By speaking to you I might never be able to access vested equity worth millions of dollars," he tells TIME. "But I think it's more important to have a public dialogue about what is happening at these AGI companies."
The Near Future of Deepfakes Just Got Way Clearer
Before the start of India's general election in April, a top candidate looking to unseat Prime Minister Narendra Modi was not out wooing voters on the campaign trail. Arvind Kejriwal, the chief minister of Delhi and the head of a political party known for its anti-corruption platform, was arrested in late March for, yes, alleged corruption. His supporters hit the streets in protest, decrying the arrest as a politically motivated move by Modi aimed at weakening a rival. Soon after the arrest, Kejriwal implored his supporters to stay strong. "There are some forces who are trying to weaken our country and its democracy," he said in a 34-second audio clip posted to social media by a fellow party member.