Segment Anything Model (SAM) Meets Glass: Mirror and Transparent Objects Cannot Be Easily Detected

Han, Dongsheng; Zhang, Chaoning; Qiao, Yu; Qamar, Maryam; Jung, Yuna; Lee, SeungKyu; Bae, Sung-Ho; Hong, Choong Seon

Apr-29-2023–arXiv.org Artificial Intelligence

A key factor driving the development of generative AI is the foundation model [Bommasani et al., 2021], which at inference can generalize to tasks and data distributions different from those seen in training. With the success of ChatGPT [Zhang et al., 2023b], GPT-3 [Brown et al., 2020] has become one of the most widely recognized foundation models for NLP. Very recently, the Meta AI research team released the Segment Anything project [Kirillov et al., 2023], which introduces a promptable segmentation task for training a vision foundation model. The resulting Segment Anything Model (SAM) has been described as the GPT-3 moment for vision. The model was trained on over 1 billion masks on 11 million licensed and privacy-respecting images. It represents a significant step towards cognitive recognition of all objects in the world, aiming to handle interactive segmentation tasks while addressing real-world constraints.
