Goto

Collaborating Authors

 Generative AI


Using generative AI to diversify virtual training grounds for robots

Robohub

Chatbots like ChatGPT and Claude have experienced a meteoric rise in usage over the past three years because they can help you with a wide range of tasks. Whether you're writing Shakespearean sonnets, debugging code, or need an answer to an obscure trivia question, artificial intelligence systems seem to have you covered. Those data aren't enough to teach a robot to be a helpful household or factory assistant, though. To understand how to handle, stack, and place various arrangements of objects across diverse environments, robots need demonstrations. You can think of robot training data as a collection of how-to videos that walk the systems through each motion of a task.


Sora 2 and the Limits of Digital Narcissism

The New Yorker

What we enjoy about generative A.I. may also be its ultimate limitation: we want to see ourselves. During the past few weeks, I've seen a proliferation of A.I.-generated video in my social-media feeds and group texts. The more impressive--or, at least, more personalized--of these have been the work of Sora 2, the updated version of OpenAI's video-generation platform, which the company released on an invitation-only basis at the end of September. This iteration of Sora comes with a socially networked app, and it appears to be much better at integrating you and your friends, say, into a stock scene. What this means is that, when you open up Sora 2, you'll likely see a video of someone you know winning a Nobel Prize, getting drafted into the N.B.A., or flying a bomber plane in the Second World War.


Breakdance Video classification in the age of Generative AI

arXiv.org Artificial Intelligence

Large Vision Language models have seen huge application in several sports use-cases recently. Most of these works have been targeted towards a limited subset of popular sports like soccer, cricket, basketball etc; focusing on generative tasks like visual question answering, highlight generation. This work analyzes the applicability of the modern video foundation models (both encoder and decoder) for a very niche but hugely popular dance sports - breakdance. Our results show that Video Encoder models continue to outperform state-of-the-art Video Language Models for prediction tasks. We provide insights on how to choose the encoder model and provide a thorough analysis into the workings of a finetuned decoder model for breakdance video classification.


The Verification-Value Paradox: A Normative Critique of Gen AI in Legal Practice

arXiv.org Artificial Intelligence

It is often claimed that machine learning-based generative AI products will drastically streamline and reduce the cost of legal practice. This enthusiasm assumes lawyers can effectively manage AI's risks. Cases in Australia and elsewhere in which lawyers have been reprimanded for submitting inaccurate AI-generated content to courts suggest this paradigm must be revisited. This paper argues that a new paradigm is needed to evaluate AI use in practice, given (a) AI's disconnection from reality and its lack of transparency, and (b) lawyers' paramount duties like honesty, integrity, and not to mislead the court. It presents an alternative model of AI use in practice that more holistically reflects these features (the verification-value paradox). That paradox suggests increases in efficiency from AI use in legal practice will be met by a correspondingly greater imperative to manually verify any outputs of that use, rendering the net value of AI use often negligible to lawyers. The paper then sets out the paradox's implications for legal practice and legal education, including for AI use but also the values that the paradox suggests should undergird legal practice: fidelity to the truth and civic responsibility.


A Framework for the Adoption and Integration of Generative AI in Midsize Organizations and Enterprises (FAIGMOE)

arXiv.org Artificial Intelligence

Generative Artificial Intelligence (GenAI) presents transformative opportunities for organizations, yet both midsize organizations and larger enterprises face distinctive adoption challenges. Midsize organizations encounter resource constraints and limited AI expertise, while enterprises struggle with organizational complexity and coordination challenges. Existing technology adoption frameworks, including TAM (Technology Acceptance Model), TOE (Technology Organization Environment), and DOI (Diffusion of Innovations) theory, lack the specificity required for GenAI implementation across these diverse contexts, creating a critical gap in adoption literature. This paper introduces FAIGMOE (Framework for the Adoption and Integration of Generative AI in Midsize Organizations and Enterprises), a conceptual framework addressing the unique needs of both organizational types. FAIGMOE synthesizes technology adoption theory, organizational change management, and innovation diffusion perspectives into four interconnected phases: Strategic Assessment, Planning and Use Case Development, Implementation and Integration, and Operationalization and Optimization. Each phase provides scalable guidance on readiness assessment, strategic alignment, risk governance, technical architecture, and change management adaptable to organizational scale and complexity. The framework incorporates GenAI specific considerations including prompt engineering, model orchestration, and hallucination management that distinguish it from generic technology adoption frameworks. As a perspective contribution, FAIGMOE provides the first comprehensive conceptual framework explicitly addressing GenAI adoption across midsize and enterprise organizations, offering actionable implementation protocols, assessment instruments, and governance templates requiring empirical validation through future research.


A new wave of vehicle insurance fraud fueled by generative AI

arXiv.org Artificial Intelligence

Generative AI is supercharging insurance fraud by making it easier to falsify accident evidence at scale and in rapid time. Insurance fraud is a pervasive and costly problem, amounting to tens of billions of dollars in losses each year. In the vehicle insurance sector, fraud schemes have traditionally involved staged accidents, exaggerated damage, or forged documents. The rise of generative AI, including deepfake image and video generation, has introduced new methods for committing fraud at scale. Fraudsters can now fabricate highly realistic crash photos, damage evidence, and even fake identities or documents with minimal effort, exploiting AI tools to bolster false insurance claims. Insurers have begun deploying countermeasures such as AI-based deepfake detection software and enhanced verification processes to detect and mitigate these AI-driven scams. However, current mitigation strategies face significant limitations. Detection tools can suffer from false positives and negatives, and sophisticated fraudsters continuously adapt their tactics to evade automated checks. This cat-and-mouse arms race between generative AI and detection technology, combined with resource and cost barriers for insurers, means that combating AI-enabled insurance fraud remains an ongoing challenge. In this white paper, we present UVeye layered solution for vehicle fraud, representing a major leap forward in the ability to detect, mitigate and deter this new wave of fraud.


OpenAI Removed Safeguards Before Teen's Suicide, Amended Lawsuit Claims

TIME - Tech

OpenAI Removed Safeguards Before Teen's Suicide, Amended Lawsuit Claims OpenAI relaxed safeguards that would have prevented ChatGPT from engaging in conversations about self-harm in the months leading up to the suicide of Adam Raine, an amended complaint filed by the family in the San Francisco County Superior Court on Wednesday alleges. The amendment changes the theory of the case from reckless indifference to intentional misconduct, according to the family's lawyers, which could raise the damages awarded to the family. The Raine family's lawyers will have to prove that OpenAI was aware of the risks posed by ChatGPT and disregarded them. The family has asked for a jury trial. In an interview with TIME, Jay Edelson, one of the Raine family's lawyers, says OpenAI relaxed safeguards in an "intentional decision" to "prioritize engagement."


OpenAI launches its own free 'Atlas' browser with ChatGPT built-in

PCWorld

When you purchase through links in our articles, we may earn a small commission. OpenAI launches its own free'Atlas' browser with ChatGPT built-in It's yet another Chromium fork, except this one comes integrated with ChatGPT. It even has agentic features for paid users. OpenAI recently launched ChatGPT Atlas, which is "a new web browser built with ChatGPT at its core." It's based on Chromium--which is true of pretty much all browsers these days except Firefox and Safari--and its unique selling point is that it integrates ChatGPT right into the browser, allowing users to chat with their search results and use a side panel that automatically provides ChatGPT with on-screen context. ChatGPT Atlas also has access to your browsing history, allowing the AI assistant to customize its responses based on your activity.


ChatGPT's Horny Era Could Be Its Stickiest Yet

WIRED

ChatGPT's Horny Era Could Be Its Stickiest Yet OpenAI will soon let adults create erotic content in ChatGPT. Experts say that could lead to "emotional commodification," or horniness as a revenue stream. In May of 2024, while I was combing through OpenAI's "Model Spec" laying out how ChatGPT should act, one comment buried in the document struck me as peculiar. It said OpenAI was "exploring" how to let adult ChatGPT users generate content with mature themes such as "erotica, extreme gore, slurs, and unsolicited profanity." Seems like the exploration phase is over.


To Use or to Refuse? Re-Centering Student Agency with Generative AI in Engineering Design Education

arXiv.org Artificial Intelligence

This pilot study traces students' reflections on the use of AI in a 13-week foundational design course enrolling over 500 first-year engineering and architecture students at the Singapore University of Technology and Design. The course was an AI-enhanced design course, with several interventions to equip students with AI based design skills. Students were required to reflect on whether the technology was used as a tool (instrumental assistant), a teammate (collaborative partner), or neither (deliberate non-use). By foregrounding this three-way lens, students learned to use AI for innovation rather than just automation and to reflect on agency, ethics, and context rather than on prompt crafting alone. Evidence stems from coursework artefacts: thirteen structured reflection spreadsheets and eight illustrated briefs submitted, combined with notes of teachers and researchers. Qualitative coding of these materials reveals shared practices brought about through the inclusion of Gen-AI, including accelerated prototyping, rapid skill acquisition, iterative prompt refinement, purposeful "switch-offs" during user research, and emergent routines for recognizing hallucinations. Unexpectedly, students not only harnessed Gen-AI for speed but (enabled by the tool-teammate-neither triage) also learned to reject its outputs, invent their own hallucination fire-drills, and divert the reclaimed hours into deeper user research, thereby transforming efficiency into innovation. The implications of the approach we explore shows that: we can transform AI uptake into an assessable design habit; that rewarding selective non-use cultivates hallucination-aware workflows; and, practically, that a coordinated bundle of tool access, reflection, role tagging, and public recognition through competition awards allows AI based innovation in education to scale without compromising accountability.