Vision
Peek-a-boo, Big Tech sees you: Expert warns just 20 cloud images can make an AI deepfake video of your child
Texas high school student Elliston Berry joins 'Fox & Friends' to discuss the House's passage of a new bill that criminalizes the sharing of non-consensual intimate images, including content created with artificial intelligence.

Parents love capturing their kids' big moments, from first steps to birthday candles. But a new study out of the U.K. shows many of those treasured images may be scanned, analyzed and turned into data by cloud storage services, and nearly half of parents don't even realize it. A survey of 2,019 U.K. parents, conducted by Perspectus Global and commissioned by Swiss privacy tech company Proton, found that 48% of parents were unaware that providers like Google Photos, Apple iCloud, Amazon Photos and Dropbox can access and analyze the photos they upload.

First lady Melania Trump, joined by President Donald Trump, delivers remarks before President Trump signed the Take It Down Act into law in the Rose Garden of the White House May 19, 2025, in Washington, D.C. (Chip Somodevilla/Getty Images)

These companies use artificial intelligence to sort images into albums, recognize faces and locations, and suggest memories.
Sharing Key Semantics in Transformer Makes Efficient Image Restoration
Image Restoration (IR), a classic low-level vision task, has witnessed significant advancements through deep models that effectively capture global information. Notably, the emergence of Vision Transformers (ViTs) has further propelled these advancements. When computing attention, the self-attention mechanism, a cornerstone of ViTs, tends to encompass all global cues, even those from semantically unrelated objects or regions. This inclusivity introduces computational inefficiency, particularly noticeable at high input resolutions, since irrelevant information must still be processed. Additionally, for IR it is commonly observed that small segments of a degraded image, especially those closely aligned semantically, provide the most relevant information for restoration, as they contribute contextual cues crucial for accurate reconstruction. To address these challenges, in this paper we propose boosting IR's performance by sharing key semantics within the Transformer (i.e., SemanIR).
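To make the efficiency argument concrete, the sketch below restricts each query to its top-k most similar keys, a generic sparse-attention approximation of the key-sharing idea. It is not SemanIR's actual implementation; the function name and the fixed top_k parameter are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def topk_sparse_attention(q, k, v, top_k=64):
    """Attention in which each query attends only to its top_k most
    similar keys instead of all N tokens (illustrative sketch).

    q, k, v: (batch, tokens, dim) tensors.
    """
    d = q.size(-1)
    scores = q @ k.transpose(-2, -1) / d ** 0.5            # (B, N, N)
    top_scores, top_idx = scores.topk(top_k, dim=-1)       # (B, N, top_k)
    attn = F.softmax(top_scores, dim=-1)                   # softmax over kept keys only
    # Gather the value vectors of the selected keys for every query.
    idx = top_idx.unsqueeze(-1).expand(-1, -1, -1, v.size(-1))
    v_all = v.unsqueeze(1).expand(-1, q.size(1), -1, -1)   # (B, N, N, dim) view
    v_sel = v_all.gather(2, idx)                           # (B, N, top_k, dim)
    return (attn.unsqueeze(-1) * v_sel).sum(dim=2)         # (B, N, dim)
```

The softmax and value aggregation touch only top_k entries per query, so the cost scales with N·top_k rather than N²; the published SemanIR design additionally shares the computed key relationships across Transformer layers, which this sketch omits.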
Learning to Orient Surfaces by Self-supervised Spherical CNNs, Federico Stella, Luciano Silva
Defining and reliably finding a canonical orientation for 3D surfaces is key to many Computer Vision and Robotics applications. This task is commonly addressed by handcrafted algorithms exploiting geometric cues deemed as distinctive and robust by the designer. Yet, one might conjecture that humans learn the notion of the inherent orientation of 3D objects from experience and that machines may do so alike. In this work, we show the feasibility of learning a robust canonical orientation for surfaces represented as point clouds. Based on the observation that the quintessential property of a canonical orientation is equivariance to 3D rotations, we propose to employ Spherical CNNs, a recently introduced machinery that can learn equivariant representations defined on the Special Orthogonal group SO(3). Specifically, spherical correlations compute feature maps whose elements define 3D rotations. Our method learns such feature maps from raw data by a self-supervised training procedure and robustly selects a rotation to transform the input point cloud into a learned canonical orientation. Thereby, we realize the first end-to-end learning approach to define and extract the canonical orientation of 3D shapes, which we aptly dub Compass. Experiments on several public datasets prove its effectiveness at orienting local surface patches as well as whole objects.
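As a minimal sketch of the final selection step, assume the network has already produced one score per candidate rotation on a discretized SO(3) grid; the names canonicalize, rotation_grid, and scores are illustrative assumptions, not Compass's actual API.

```python
import numpy as np

def canonicalize(points, rotation_grid, scores):
    """Rotate a point cloud into a learned canonical pose.

    points:        (N, 3) input point cloud.
    rotation_grid: (M, 3, 3) rotation matrices discretizing SO(3).
    scores:        (M,) network responses, one per candidate rotation.
    """
    R = rotation_grid[np.argmax(scores)]   # rotation with the peak response
    # Row-vector convention: points @ R applies R^T (= R^{-1} for a
    # rotation matrix) to each point, mapping the cloud into the
    # canonical frame.
    return points @ R
```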
Urgent warning to Americans over 'dangerous' technology quietly rolled out in 80 airports
Within seconds, you've been scanned, stored and tracked, all before even reaching airport security. Without ever handing over your ID, the Transportation Security Administration (TSA) already knows exactly who you are. This is happening at 84 airports across the US. And chances are, you didn't even notice. Marketed as a tool to enhance security, TSA's facial recognition system is drawing criticism for its potential to track Americans from the terminal entrance to their final destination.
It's now a federal crime to publish AI deepfake porn
The Take It Down Act, a controversial bipartisan bill recently hailed by First Lady Melania Trump as a tool to build a safer internet, is officially law, as President Donald Trump took to the White House Rose Garden today to put ink to legislative paper. It's the first high-profile tech legislation to pass under the new administration. "With the rise of AI image generation, countless women have been harassed with deepfakes and other explicit images distributed against their will. This is wrong, so horribly wrong, and it's a very abusive situation," said Trump at the time of signing. "This will be the first ever federal law to combat the distribution of explicit, imaginary, posted without subject's consent... We've all heard about deepfakes. I have them all the time, but nobody does anything. I ask Pam [Bondi], 'Can you help me, Pam?' She says, 'No, I'm too busy doing other things.' But a lot of people don't survive, that's true and so horrible... Today, we're making it totally illegal."
Deep love or deepfake: Dating in the time of AI
Beth Hyland thought she had met the love of her life on Tinder. In reality, the Michigan-based administrative assistant had been manipulated by an online scam artist who posed as a French man named "Richard," used deepfake video on Skype calls and posted photos of another man to pull off his con. Deepfakes, manipulated video or audio made using artificial intelligence to look and sound real, are often difficult to detect without specialized tools.
A Appendix
A.1 Conventional Test-Time Augmentation

Center-Crop is the standard test-time augmentation for most computer vision tasks [56, 29, 5, 7, 18, 26, 52]. Center-Crop first resizes an image to a fixed size and then crops the central area to produce a predefined input size. For ResNet-50 in the ImageNet experiment, we resize an image to 256 pixels and crop the central 224 pixels, in the same way as [18, 26, 52]. In the case of CIFAR, all images in the dataset are 32 by 32 pixels; we use the original images without any modification at test time.

Horizontal-Flip is an ensemble method using the original image and the horizontally flipped image.
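A minimal torchvision sketch of the two baselines described above (normalization omitted for brevity; the helper name is our own):

```python
import torch
from torchvision import transforms

# Center-Crop for ResNet-50 on ImageNet: resize the short side to 256,
# then crop the central 224x224 region.
center_crop = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
])

def horizontal_flip_ensemble(model, image_tensor):
    """Average the logits of an image and its horizontal mirror."""
    batch = torch.stack([image_tensor, torch.flip(image_tensor, dims=[-1])])
    with torch.no_grad():
        return model(batch).mean(dim=0)
```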
Learning Loss for Test-Time Augmentation
Data augmentation has been actively studied for building robust neural networks. Most recent data augmentation methods focus on augmenting datasets during the training phase. At test time, simple transformations are still widely used for test-time augmentation. This paper proposes a novel instance-level test-time augmentation that efficiently selects suitable transformations for a test input. Our proposed method involves an auxiliary module that predicts the loss of each possible transformation given the input. Then, the transformations having lower predicted losses are applied to the input.
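A minimal sketch of that selection step, assuming features for the input have already been extracted; the module layout, hidden width, and k are illustrative choices, not the paper's exact architecture.

```python
import torch
import torch.nn as nn

class LossPredictor(nn.Module):
    """Auxiliary head that predicts the task loss the network would incur
    under each candidate test-time transformation (illustrative sizes)."""
    def __init__(self, feat_dim, n_transforms):
        super().__init__()
        self.head = nn.Sequential(
            nn.Linear(feat_dim, 128),
            nn.ReLU(),
            nn.Linear(128, n_transforms),
        )

    def forward(self, features):            # features: (B, feat_dim)
        return self.head(features)          # (B, n_transforms) predicted losses

def select_transforms(predicted_losses, candidates, k=2):
    """Keep, per input, the k candidate transformations with the lowest
    predicted loss."""
    idx = predicted_losses.topk(k, dim=-1, largest=False).indices
    return [[candidates[j] for j in row] for row in idx.tolist()]
```

Only the selected transformations are then applied to the input and their predictions aggregated, so the inference cost grows with k rather than with the full transformation pool.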