AITopics | Personal

Collaborating Authors

Personal

Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey

Miyai, Atsuyuki, Yang, Jingkang, Zhang, Jingyang, Ming, Yifei, Lin, Yueqian, Yu, Qing, Irie, Go, Joty, Shafiq, Li, Yixuan, Li, Hai, Liu, Ziwei, Yamasaki, Toshihiko, Aizawa, Kiyoharu

arXiv.org Artificial IntelligenceJul-31-2024

Detecting out-of-distribution (OOD) samples is crucial for ensuring the safety of machine learning systems and has shaped the field of OOD detection. Meanwhile, several other problems are closely related to OOD detection, including anomaly detection (AD), novelty detection (ND), open set recognition (OSR), and outlier detection (OD). To unify these problems, a generalized OOD detection framework was proposed, taxonomically categorizing these five problems. However, Vision Language Models (VLMs) such as CLIP have significantly changed the paradigm and blurred the boundaries between these fields, again confusing researchers. In this survey, we first present a generalized OOD detection v2, encapsulating the evolution of AD, ND, OSR, OOD detection, and OD in the VLM era. Our framework reveals that, with some field inactivity and integration, the demanding challenges have become OOD detection and AD. In addition, we also highlight the significant shift in the definition, problem settings, and benchmarks; we thus feature a comprehensive review of the methodology for OOD detection, including the discussion over other related tasks to clarify their relationship to OOD detection. Finally, we explore the advancements in the emerging Large Vision Language Model (LVLM) era, such as GPT-4V. We conclude this survey with open challenges and future directions.

anomaly detection, detection, ood detection, (12 more...)

arXiv.org Artificial Intelligence

2407.21794

Country:

North America > United States > Wisconsin > Dane County > Madison (0.14)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
Asia > Singapore (0.04)
(8 more...)

Genre:

Overview (1.00)
Personal > Honors (0.46)

Industry: Information Technology (0.93)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Generative Learning for Simulation of Vehicle Faults

Kuiper, Patrick, Lin, Sirui, Blanchet, Jose, Tarokh, Vahid

arXiv.org Machine LearningJul-30-2024

We focus this analysis on the United States' Department of Defense (DoD), where the US Army alone is projected to spend an estimated $5 billion per year (in 2020 dollar terms through 2050), developing and acquiring ground vehicles, where ground vehicles are any vehicles other than aircraft and ships (CBO 2021). Maintaining this enormous investment is critical to ensuring combat readiness across the DoD, where the department spent $90 billion in 2022 on maintaining vehicles across domains: ground, air, and sea (GAO 2022). Predicting requirements is critical to an effective maintenance program. The application of statistics towards vehicle maintenance prediction is often referred to as predictive maintenance. Recognizing the importance of predictive maintenance, in the 2022 National Defense Authorization Act (NDAA) Congress required the DoD Inspector General Office to review predictive maintenance practices, originally established by DoD directives in 2002 and 2007 (DoDIG 2023).

covariate, representation, vehicle, (15 more...)

arXiv.org Machine Learning

2407.17654

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > United States > North Carolina > Durham County > Durham (0.04)
North America > United States > Massachusetts (0.04)
(4 more...)

Genre:

Research Report (0.64)
Personal (0.46)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Government > Military (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)

Add feedback

Engaging with Children's Artwork in Mixed Visual-Ability Families

Chheda-Kothary, Arnavi, Wobbrock, Jacob O., Froehlich, Jon E.

arXiv.org Artificial IntelligenceJul-30-2024

We present two studies exploring how blind or low-vision (BLV) family members engage with their sighted children's artwork, strategies to support understanding and interpretation, and the potential role of technology, such as AI, therein. Our first study involved 14 BLV individuals, and the second included five groups of BLV individuals with their children. Through semi-structured interviews with AI descriptions of children's artwork and multi-sensory design probes, we found that BLV family members value artwork engagement as a bonding opportunity, preferring the child's storytelling and interpretation over other nonvisual representations. Additionally, despite some inaccuracies, BLV family members felt that AI-generated descriptions could facilitate dialogue with their children and aid self-guided art discovery. We close with specific design considerations for supporting artwork engagement in mixed visual-ability families, including enabling artwork access through various methods, supporting children's corrections of AI output, and distinctions in context vs. content and interpretation vs. description of children's artwork.

artwork, computing machinery, family member, (15 more...)

arXiv.org Artificial Intelligence

2407.18874

Country:

North America > United States > Washington > King County > Seattle (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.06)
(29 more...)

Genre:

Research Report (1.00)
Questionnaire & Opinion Survey (0.87)
Personal > Interview (0.34)

Industry:

Education (1.00)
Health & Medicine > Therapeutic Area > Ophthalmology/Optometry (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Human Computer Interaction > Interfaces (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)

Add feedback

Canadian Olympic Committee says spying scandal 'could tarnish' women's Tokyo gold medal

FOX NewsJul-26-2024, 15:23:09 GMT

The drone scandal surrounding the Canadian women's soccer team could have bigger implications than just this year's Games in Paris. Head coach Bev Priestman was removed from her position on Thursday night after two staff members were sent home from Paris after an investigation found that analyst Joseph Lombardi had used a drone to spy on New Zealand's practice sessions. Head coach Beverly Priestman reacts during the Women's Gold Medal match between Canada and Sweden on day 14 of the Tokyo 2020 Olympic Games at International Stadium Yokohama on Aug. 6, 2021 in Yokohama, Kanagawa, Japan. "Over the past 24 hours, additional information has come to our attention regarding previous drone use against opponents, predating the Paris 2024 Olympic Games," Canada Soccer CEO Kevin Blue said in a statement. "In light of these new revelations, Canada Soccer has made the decision to suspend Women's National Soccer Team Head Coach, Bev Priestman for the remainder of the Paris 2024 Olympic Games, and until the completion of our recently announced independent external review."

canadian olympic committee, olympic game, woman, (13 more...)

FOX News

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.69)
Asia > Japan > Honshū > Kantō > Kanagawa Prefecture > Yokohama (0.51)
Oceania > New Zealand (0.27)
(2 more...)

Genre:

Personal > Honors (0.42)
Research Report (0.40)
Press Release (0.39)

Industry:

Leisure & Entertainment > Sports > Soccer (1.00)
Leisure & Entertainment > Sports > Olympic Games (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.37)

Add feedback

Congratulations to the #ICML2024 award winners

AIHubJul-25-2024, 09:25:58 GMT

VideoPoet employs a decoder-only transformer architecture that processes multimodal inputs – including images, videos, text, and audio. The training protocol follows that of Large Language Models (LLMs), consisting of two stages: pretraining and task-specific adaptation. During pretraining, VideoPoet incorporates a mixture of multimodal generative objectives within an autoregressive Transformer framework. The pretrained LLM serves as a foundation that can be adapted for a range of video generation tasks. We present empirical results demonstrating the model's state-of-the-art capabilities in zero-shot video generation, specifically highlighting the ability to generate high-fidelity motions.

dataset, information, language model, (14 more...)

AIHub

Country: Europe > Austria > Vienna (0.14)

Genre:

Research Report > New Finding (0.49)
Personal > Honors > Award (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Self-Directed Synthetic Dialogues and Revisions Technical Report

Lambert, Nathan, Schoelkopf, Hailey, Gokaslan, Aaron, Soldaini, Luca, Pyatkin, Valentina, Castricato, Louis

arXiv.org Artificial IntelligenceJul-25-2024

Synthetic data has become an important tool in the fine-tuning of language models to follow instructions and solve complex problems. Nevertheless, the majority of open data to date is often lacking multi-turn data and collected on closed models, limiting progress on advancing open fine-tuning methods. We introduce Self Directed Synthetic Dialogues (SDSD), an experimental dataset consisting of guided conversations of language models talking to themselves. The dataset consists of multi-turn conversations generated with DBRX, Llama 2 70B, and Mistral Large, all instructed to follow a conversation plan generated prior to the conversation. We also explore including principles from Constitutional AI and other related works to create synthetic preference data via revisions to the final conversation turn. We hope this work encourages further exploration in multi-turn data and the use of open models for expanding the impact of synthetic data.

arxiv preprint arxiv, assistant response, self-directed synthetic dialogue, (12 more...)

arXiv.org Artificial Intelligence

2407.18421

Country: Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre:

Research Report (0.64)
Personal (0.46)
Workflow (0.46)

Industry:

Health & Medicine (1.00)
Law > Civil Rights & Constitutional Law (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Why Colin Kaepernick Is Starting an AI Company

TIME - TechJul-24-2024, 16:00:00 GMT

When NFL quarterback Colin Kaepernick began kneeling during the national anthem to protest police brutality and racial injustice in 2016, he soon found himself out of a job, eventually moving onto other ventures in media and entertainment. Today, he's entering the AI industry by launching a project he says he hopes will allow others to bypass "gatekeeping:" an artificial intelligence platform called Lumi. The new subscription-based platform aims to provide tools for storytellers to create, illustrate, publish and monetize their ideas. The company has raised 4 million in funding led by Alexis Ohanian's Seven Seven Six, and its product went live today, July 24. In an interview with TIME, Kaepernick says this project can be viewed as an extension of his activism.

colin kaepernick, kaepernick, platform, (9 more...)

TIME - Tech

Genre: Personal > Interview (0.36)

Industry:

Media (0.52)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.31)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)

Add feedback

AMONGAGENTS: Evaluating Large Language Models in the Interactive Text-Based Social Deduction Game

Chi, Yizhou, Mao, Lingjun, Tang, Zineng

arXiv.org Artificial IntelligenceJul-24-2024

Strategic social deduction games serve as valuable testbeds for evaluating the understanding and inference skills of language models, offering crucial insights into social science, artificial intelligence, and strategic gaming. This paper focuses on creating proxies of human behavior in simulated environments, with Among Us utilized as a tool for studying simulated human behavior. The study introduces a text-based game environment, named AmongAgents, that mirrors the dynamics of Among Us. Players act as crew members aboard a spaceship, tasked with identifying impostors who are sabotaging the ship and eliminating the crew. Within this environment, the behavior of simulated language agents is analyzed. The experiments involve diverse game sequences featuring different configurations of Crewmates and Impostor personality archetypes. Our work demonstrates that state-of-the-art large language models (LLMs) can effectively grasp the game rules and make decisions based on the current context. This work aims to promote further exploration of LLMs in goal-oriented games with incomplete information and complex action spaces, as these settings offer valuable opportunities to assess language model performance in socially driven scenarios.

agent, crewmate, impostor, (17 more...)

arXiv.org Artificial Intelligence

2407.16521

Genre:

Personal > Interview (0.93)
Research Report (0.82)

Industry: Leisure & Entertainment > Games (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Carvalho, who unplugged school AI chatbot, wants task force to tell him what went wrong

Los Angeles TimesJul-23-2024, 10:00:10 GMT

Alberto Carvalho, who remains determined to bring artificial intelligence into district classrooms despite the collapse of the technology company leading the effort, will appoint a task force to examine what went wrong and how to move forward. The schools chief announced the task force in an interview with The Times in advance of Tuesday's annual address to administrators, which is akin to a state-of-the-schools speech. In his public address, Carvalho is expected to highlight academic progress and L.A. Unified School District initiatives. In a recent appearance, he said he was hopeful that standardized test scores would rise at all grade levels in math and English. Although school districts throughout the state have received results -- and can make them public if they wish -- the state has not yet released local or statewide scores.

carvalho, chatbot, task force, (15 more...)

Los Angeles Times

Country: North America > United States > California > Los Angeles County > Los Angeles (0.05)

Genre: Personal > Interview (0.35)

Industry:

Information Technology > Security & Privacy (0.73)
Education > Assessment & Standards > Student Performance (0.55)

Technology: Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.93)

Add feedback

The Contribution of XAI for the Safe Development and Certification of AI: An Expert-Based Analysis

Fresz, Benjamin, Göbels, Vincent Philipp, Omri, Safa, Brajovic, Danilo, Aichele, Andreas, Kutz, Janika, Neuhüttler, Jens, Huber, Marco F.

arXiv.org Artificial IntelligenceJul-22-2024

Developing and certifying safe - or so-called trustworthy - AI has become an increasingly salient issue, especially in light of upcoming regulation such as the EU AI Act. In this context, the black-box nature of machine learning models limits the use of conventional avenues of approach towards certifying complex technical systems. As a potential solution, methods to give insights into this black-box - devised in the field of eXplainable AI (XAI) - could be used. In this study, the potential and shortcomings of such methods for the purpose of safe AI development and certification are discussed in 15 qualitative interviews with experts out of the areas of (X)AI and certification. We find that XAI methods can be a helpful asset for safe AI development, as they can show biases and failures of ML-models, but since certification relies on comprehensive and correct information about technical systems, their impact is expected to be limited.

certification, explanation, xai, (16 more...)

arXiv.org Artificial Intelligence

2408.02379

Country:

North America > United States > New York > New York County > New York City (0.05)
Europe > Germany > Baden-Württemberg > Stuttgart Region > Stuttgart (0.04)
North America > United States > Virginia > Fairfax County > Reston (0.04)
(5 more...)

Genre:

Research Report (1.00)
Questionnaire & Opinion Survey (0.93)
Personal > Interview (0.46)

Industry: Transportation > Air (0.55)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)

Add feedback