AITopics | sam

Collaborating Authors

sam

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

FundamentalConvergenceAnalysisof Sharpness-AwareMinimization

Neural Information Processing SystemsFeb-8-2026, 14:25:26 GMT

Additionally, it is evident that the results in (ii) do not implytheconvergenceof f(xk) to0.

artificial intelligence, justification, machine learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Belmont (0.04)
Europe > Switzerland (0.04)
Asia > China (0.04)

Genre: Research Report (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

ExplicitEigenvalueRegularizationImproves Sharpness-AwareMinimization

Neural Information Processing SystemsFeb-7-2026, 13:06:05 GMT

Sharpness-Aware Minimization (SAM) has attracted significant attention for its effectiveness in improving generalization across various tasks. However, its underlying principles remain poorly understood.

artificial intelligence, justification, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Oceania > Australia (0.04)
Europe > Spain > Andalusia > Granada Province > Granada (0.04)
Europe > Austria (0.04)
(2 more...)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Segment Anything in 3D with NeRFs

Neural Information Processing SystemsOct-10-2025, 23:14:04 GMT

We refer to the proposed solution as SA3D, for Segment Anything in 3D. It is only required to provide a manual segmentation prompt ( e.g., rough points) for the target object in a single view, which is used to generate its 2D mask in this view with SAM.

machine learning, natural language, segmentation, (19 more...)

Neural Information Processing Systems

Country:

Asia > China > Shanghai > Shanghai (0.04)
Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
North America > United States > Oklahoma > Beaver County (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.68)
Information Technology > Artificial Intelligence > Natural Language (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Segment Anything without Supervision

Neural Information Processing SystemsMay-27-2025, 21:47:41 GMT

The Segmentation Anything Model (SAM) requires labor-intensive data labeling. We present Unsupervised SAM (UnSAM) for promptable and automatic whole-image segmentation that does not require human annotations. UnSAM utilizes a divide-and-conquer strategy to "discover" the hierarchical structure of visual scenes. For all pixels within a segment, a bottom-up clustering method is employed to iteratively merge them into larger groups, thereby forming a hierarchical structure. These unsupervised multi-granular masks are then utilized to supervise model training.

hierarchical structure, sam, supervision, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.77)

Add feedback

Segment Any Change

Neural Information Processing SystemsMay-27-2025, 09:17:23 GMT

Visual foundation models have achieved remarkable results in zero-shot image classification and segmentation, but zero-shot change detection remains an open problem. In this paper, we propose the segment any change models (AnyChange), a new type of change detection model that supports zero-shot prediction and generalization on unseen change types and data distributions.AnyChange is built on the segment anything model (SAM) via our training-free adaptation method, bitemporal latent matching.By revealing and exploiting intra-image and inter-image semantic similarities in SAM's latent space, bitemporal latent matching endows SAM with zero-shot change detection capabilities in a training-free way. We also propose a point query mechanism to enable AnyChange's zero-shot object-centric change detection capability.We perform extensive experiments to confirm the effectiveness of AnyChange for zero-shot change detection.AnyChange sets a new record on the SECOND benchmark for unsupervised change detection, exceeding the previous SOTA by up to 4.4\% F _1 score, and achieving comparable accuracy with negligible manual annotations (1 pixel per image) for supervised change detection.

anychange, change detection, detection, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Changing the Training Data Distribution to Reduce Simplicity Bias Improves In-distribution Generalization

Neural Information Processing SystemsMay-27-2025, 06:38:21 GMT

Can we modify the training data distribution to encourage the underlying optimization method toward finding solutions with superior generalization performance on in-distribution data? In this work, we approach this question for the first time by comparing the inductive bias of gradient descent (GD) with that of sharpness-aware minimization (SAM). By studying a two-layer CNN, we rigorously prove that SAM learns different features more uniformly, particularly in early epochs. That is, SAM is less susceptible to simplicity bias compared to GD. We also show that examples constraining features that are learned early are separable from the rest based on the model's output.

generalization performance, in-distribution generalization, training data distribution, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Sam's Club is adding AI to the shopping experience. Why are privacy advocacy groups worried?

Los Angeles TimesMay-2-2025, 17:58:56 GMT

Sam's Club is going register-free and introducing an all-digital, AI-powered shopping experience for its customers, a move that has privacy advocates worried that the new AI tool could be used to unfairly target some customers with higher-priced items based on their shopping habits. The all-digital approach started with the reconstruction of a Sam's Club in Grapevine, a suburb of Dallas, that was severely damaged in 2022 by a tornado. When the retail location opened two years later it was the first of its kind to ditch its registers for a "Scan and Go" program that allowed customers to scan each item placed in their physical cart and pay through a mobile app. This program has since been piloted in nine Dallas metro locations and one store in Missouri, Retail Dive reported. Instead of handing a receipt to a Sam's Club employee to review before leaving the store, customers walk through an arch that's equipped with AI-powered cameras to capture images of the items in the cart and electronically match them with the items paid for through the app. Sam's Club did not disclose when the AI technology would be coming to California stores but Sam's Club has outlets in Torrance, Fountain Valley, El Monte and Riverside.

artificial intelligence, customer, sam, (15 more...)

Los Angeles Times

Country:

North America > United States > California (0.62)
North America > United States > Missouri (0.25)

Industry:

Retail (1.00)
Information Technology > Security & Privacy (0.36)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

Attention-Guided Integration of CLIP and SAM for Precise Object Masking in Robotic Manipulation

Muttaqien, Muhammad A., Motoda, Tomohiro, Hanai, Ryo, Yukiyasu, Domae

arXiv.org Artificial IntelligenceFeb-27-2025

Attention-Guided Integration of CLIP and SAM for Precise Object Masking in Robotic Manipulation 1 st Muhammad A. Muttaqien Automation Research T eam National Institute of AIST Tokyo, Japan muha.muttaqien@aist.go.jp 2 nd Tomohiro Motoda Automation Research T eam National Institute of AIST Tokyo, Japan tomohiro.motoda@aist.go.jp 3 rd Ryo Hanai Automation Research T eam National Institute of AIST Tokyo, Japan ryo.hanai@aist.go.jp 4 th Domae Y ukiyasu Automation Research T eam National Institute of AIST Tokyo, Japan domae.yukiyasu@aist.go.jp Abstract --This paper introduces a novel pipeline to enhance the precision of object masking for robotic manipulation within the specific domain of masking products in convenience stores. The approach integrates two advanced AI models, CLIP and SAM, focusing on their synergistic combination and the effective use of multimodal data (image and text). Emphasis is placed on utilizing gradient-based attention mechanisms and customized datasets to fine-tune performance. While CLIP, SAM, and Grad-CAM are established components, their integration within this structured pipeline represents a significant contribution to the field. The resulting segmented masks, generated through this combined approach, can be effectively utilized as inputs for robotic systems, enabling more precise and adaptive object manipulation in the context of convenience store products. I NTRODUCTION In recent years, the ability to recognize and manipulate specific objects within well-defined domains, such as products in convenience stores, has become increasingly important in the field of robotic manipulation [1] [2] [3]. As robots are expected to perform more complex tasks in diverse environments, the need for precise object identification and interaction grows, particularly in domains where a high level of accuracy is crucial. For instance, in convenience stores (Figure 1), robots must reliably identify and handle a wide variety of products, each with unique visual characteristics, to automate tasks such as stocking, sorting, and customer assistance.

attention map, clip, grad-cam, (17 more...)

arXiv.org Artificial Intelligence

2502.18842

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (1.00)
Asia > Japan > Honshū > Kantō > Ibaraki Prefecture > Tsukuba (0.04)
South America > Uruguay > Maldonado > Maldonado (0.04)
North America > Canada (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Monge SAM: Robust Reparameterization-Invariant Sharpness-Aware Minimization Based on Loss Geometry

Jacobsen, Albert Kjøller, Arvanitidis, Georgios

arXiv.org Machine LearningFeb-12-2025

Recent studies on deep neural networks show that flat minima of the loss landscape correlate with improved generalization. Sharpness-aware minimization (SAM) efficiently finds flat regions by updating the parameters according to the gradient at an adversarial perturbation. The perturbation depends on the Euclidean metric, making SAM non-invariant under reparametrizations, which blurs sharpness and generalization. We propose Monge SAM (M-SAM), a reparametrization invariant version of SAM by considering a Riemannian metric in the parameter space induced naturally by the loss surface. Compared to previous approaches, M-SAM works under any modeling choice, relies only on mild assumptions while being as computationally efficient as SAM. We theoretically argue that M-SAM varies between SAM and gradient descent (GD), which increases robustness to hyperparameter selection and reduces attraction to suboptimal equilibria like saddle points. We demonstrate this behavior both theoretically and empirically on a multi-modal representation alignment task.

m-sam, perturbation, sam, (12 more...)

arXiv.org Machine Learning

2502.08448

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Denmark > Capital Region > Kongens Lyngby (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes

Neural Information Processing SystemsMar-12-2024, 10:16:28 GMT

Neural networks augmented with external memory have the ability to learn algorithmic solutions to complex tasks. These models appear promising for applications such as language modeling and machine translation. However, they scale poorly in both space and time as the amount of memory grows -- limiting their applicability to real-world domains. Here, we present an end-to-end differentiable memory access scheme, which we call Sparse Access Memory (SAM), that retains the representational power of the original approaches whilst training efficiently with very large memories. We show that SAM achieves asymptotic lower bounds in space and time complexity, and find that an implementation runs 1,000 faster and with 3,000 less physical memory than non-sparse models. SAM learns with comparable data efficiency to existing models on a range of synthetic tasks and one-shot Omniglot character recognition, and can scale to tasks requiring 100,000s of time steps and memories. As well, we show how our approach can be adapted for models that maintain temporal associations between memories, as with the recently introduced Differentiable Neural Computer.

sam, sequence, time step, (14 more...)

Neural Information Processing Systems

Country: Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Genre: Research Report (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)

Add feedback