AITopics | computer vision dataset

Collaborating Authors

computer vision dataset

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

How hard are computer vision datasets? Calibrating dataset difficulty to viewing time

Neural Information Processing SystemsDec-24-2025, 05:38:14 GMT

Humans outperform object recognizers despite the fact that models perform well on current datasets, including those explicitly designed to challenge machines with debiased images or distribution shift. This problem persists, in part, because we have no guidance on the absolute difficulty of an image or dataset making it hard to objectively assess progress toward human-level performance, to cover the range of human abilities, and to increase the challenge posed by a dataset. We develop a dataset difficulty metric MVT, Minimum Viewing Time, that addresses these three problems. Subjects view an image that flashes on screen and then classify the object in the image. Images that require brief flashes to recognize are easy, those which require seconds of viewing are hard. We compute the ImageNet and ObjectNet image difficulty distribution, which we find significantly undersamples hard images.

calibrating dataset difficulty, dataset, dataset difficulty, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Vision (0.35)

Add feedback

Review for NeurIPS paper: Principal Neighbourhood Aggregation for Graph Nets

Neural Information Processing SystemsJan-26-2025, 21:51:41 GMT

Weaknesses: Methodological: The work here places importance on topology/structure. For example, the message scaling is dependent on node degree. Thus this method is apt for applications where the structure is paramount, e.g. one such application mentioned is reasoning about social networks where the degree of the nodes/users provides a lot of information about that node/user. Though useful in many domains, there are domains where GNNs are useful but topology is not important. This is reflected empirically for regular grid graph of the computer vision datasets where PNA does not significantly improve over other methods.

dataset, graph, principal neighbourhood aggregation, (9 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.42)

Add feedback

How hard are computer vision datasets? Calibrating dataset difficulty to viewing time

Neural Information Processing SystemsOct-10-2024, 12:10:16 GMT

calibrating dataset difficulty, dataset, dataset difficulty, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Vision (0.70)

Add feedback

Master Data Integrity to Clean Your Computer Vision Datasets

#artificialintelligenceDec-23-2022, 12:15:34 GMT

Data integrity is one of the biggest concerns for companies and engineers in the latest period. The amount of data we have to process and understand only gets more significant, and manually looking at millions of samples is not sustainable. Thus, we need tools that can help us navigate our datasets. This tutorial will present how to clean, visualize and understand Computer Vision datasets, such as videos or images. We will be working on a video of the most precious thing in my house, my cat.

computer vision dataset, dataset, video, (6 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Technology:

Information Technology > Data Science > Data Quality (0.64)
Information Technology > Data Science > Data Mining > Big Data (0.64)
Information Technology > Artificial Intelligence > Vision (0.64)

Add feedback

DeepSportradar-v1: Computer Vision Dataset for Sports Understanding with High Quality Annotations

Van Zandycke, Gabriel, Somers, Vladimir, Istasse, Maxime, Del Don, Carlo, Zambrano, Davide

arXiv.org Artificial IntelligenceAug-17-2022

With the recent development of Deep Learning applied to Computer Vision, sport video understanding has gained a lot of attention, providing much richer information for both sport consumers and leagues. This paper introduces DeepSportradar-v1, a suite of computer vision tasks, datasets and benchmarks for automated sport understanding. The main purpose of this framework is to close the gap between academic research and real world settings. To this end, the datasets provide high-resolution raw images, camera parameters and high quality annotations. DeepSportradar currently supports four challenging tasks related to basketball: ball 3D localization, camera calibration, player instance segmentation and player re-identification. For each of the four tasks, a detailed description of the dataset, objective, performance metrics, and the proposed baseline method are provided. To encourage further research on advanced methods for sport understanding, a competition is organized as part of the MMSports workshop from the ACM Multimedia 2022 conference, where participants have to develop state-of-the-art methods to solve the above tasks. The four datasets, development kits and baselines are publicly available.

dataset, proceedings, segmentation, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3552437.3555699

2208.0819

Country:

Europe > Portugal > Lisbon > Lisbon (0.05)
Europe > Switzerland (0.05)
Europe > France > Pays de la Loire > Loire-Atlantique > Nantes (0.04)
(4 more...)

Genre: Research Report > Promising Solution (0.34)

Industry: Leisure & Entertainment > Sports > Basketball (0.69)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

Why Adversarial Image Attacks Are No Joke

#artificialintelligenceNov-29-2021, 16:10:10 GMT

Attacking image recognition systems with carefully-crafted adversarial images has been considered an amusing but trivial proof-of-concept over the last five years. However, new research from Australia suggests that the casual use of highly popular image datasets for commercial AI projects could create an enduring new security problem. For a couple of years now, a group of academics at the University of Adelaide have been trying to explain something really important about the future of AI-based image recognition systems. It's something that would be difficult (and very expensive) to fix right now, and which would be unconscionably costly to remedy once the current trends in image recognition research have been fully developed into commercialized and industrialized deployments in 5-10 years' time. Before we get into it, let's have a look at a flower being classified as President Barack Obama, from one of the six videos that the team has published on the project page: In the above image, a facial recognition system that clearly knows how to recognize Barack Obama is fooled into 80% certainty that an anonymized man holding a crafted, printed adversarial image of a flower is also Barack Obama.

architecture, dataset, recognition system, (12 more...)

#artificialintelligence

Country:

Oceania > Australia (0.24)
Asia > China > Jiangsu Province > Nanjing (0.04)

Genre: Research Report > New Finding (0.34)

Industry:

Information Technology > Security & Privacy (1.00)
Government (0.76)

Technology:

Information Technology > Artificial Intelligence > Vision > Face Recognition (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)

Add feedback

Model Rectification via Unknown Unknowns Extraction from Deployment Samples

Abrahao, Bruno, Wang, Zheng, Ahmed, Haider, Zhu, Yuchen

arXiv.org Artificial IntelligenceFeb-8-2021

Model deficiency that results from incomplete training data is a form of structural blindness that leads to costly errors, oftentimes with high confidence. During the training of classification tasks, underrepresented class-conditional distributions that a given hypothesis space can recognize results in a mismatch between the model and the target space. To mitigate the consequences of this discrepancy, we propose Random Test Sampling and Cross-Validation (RTSCV) as a general algorithmic framework that aims to perform a post-training model rectification at deployment time in a supervised way. RTSCV extracts unknown unknowns (u.u.s), i.e., examples from the class-conditional distributions that a classifier is oblivious to, and works in combination with a diverse family of modern prediction models. RTSCV augments the training set with a sample of the test set (or deployment data) and uses this redefined class layout to discover u.u.s via cross-validation, without relying on active learning or budgeted queries to an oracle. We contribute a theoretical analysis that establishes performance guarantees based on the design bases of modern classifiers. Our experimental evaluation demonstrates RTSCV's effectiveness, using 7 benchmark tabular and computer vision datasets, by reducing a performance gap as large as 41% from the respective pre-rectification models. Last we show that RTSCV consistently outperforms state-of-the-art approaches.

classifier, model rectification, unknown unknown extraction, (10 more...)

arXiv.org Artificial Intelligence

2102.04145

Country:

North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > United States > New York > New York County > New York City (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.93)
(2 more...)

Add feedback

Microsoft partners with Team Gleason to build a computer vision dataset for ALS

#artificialintelligenceOct-12-2020, 15:15:33 GMT

Microsoft and Team Gleason, the nonprofit organization founded by NFL player Steve Gleason, today launched Project Insight to create an open dataset of facial imagery of people with amyotrophic lateral sclerosis (ALS). The organizations hope to foster innovation in computer vision and broaden the potential for connectivity and communication for people with accessibility challenges. Microsoft and Team Gleason assert that existing machine learning datasets don't represent the diversity of people with ALS, a condition that affects as many as 30,000 people in the U.S. Project Insight will investigate how to use data and AI with the front-facing camera already present in many assistive devices to predict where a person is looking on a screen. Team Gleason will work with Microsoft's Health Next Enable team to gather images of people with ALS looking at their computer so it can train AI models more inclusively. Participants will be given a brief medical history questionnaire and be prompted through an app to submit images of themselves using their computer.

artificial intelligence, computer vision dataset, team gleason, (6 more...)

#artificialintelligence

Country: North America > United States (0.26)

Industry:

Health & Medicine > Therapeutic Area > Rheumatology (1.00)
Health & Medicine > Therapeutic Area > Pulmonary/Respiratory Diseases (1.00)
Health & Medicine > Therapeutic Area > Neurology > Amyotrophic Lateral Sclerosis (ALS) (1.00)
Health & Medicine > Therapeutic Area > Musculoskeletal (1.00)

Technology: Information Technology > Artificial Intelligence > Vision (0.73)

Add feedback

VisualData: A Search Engine for Computer Vision Datasets

#artificialintelligenceJun-23-2019, 21:17:23 GMT

Algorithms, computation and visual data are the three pillars of computer vision (CV). Researchers, institutions and open source communities have produced sophisticated algorithms and open-sourced code; while global tech giants' supercharged cloud platforms provide all the computational power CV researchers require. However, efficiently sourcing visual data -- particularly images with high-quality annotations -- remains a challenge. Building large datasets is a time-consuming and labor-intensive task which challenges entities with limited budgets. There are hundreds of open visual datasets out there, but searching across them and their millions of entries is not a simple task.

computer vision dataset, dataset, image dataset, (4 more...)

#artificialintelligence

Industry: Information Technology > Services (0.43)

Technology:

Information Technology > Artificial Intelligence > Vision (0.99)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.43)

Add feedback

How to Develop and Demonstrate Competence With Deep Learning for Computer Vision

#artificialintelligenceMar-12-2019, 10:47:46 GMT

Computer vision is perhaps one area that has been most impacted by developments in deep learning. It can be difficult to both develop and to demonstrate competence with deep learning for problems in the field of computer vision. It is not clear how to get started, what the most important techniques are, and the types of problems and projects that can best highlight the value that deep learning can bring to the field. On approach is to systematically develop, and at the same time demonstrate competence with, data handling, modeling techniques, and application domains and present your results in a public portfolio of completed projects. This approach allows you to compound your skills from project to project.

artificial intelligence, deep learning, machine learning, (10 more...)

#artificialintelligence

Genre: Instructional Material (0.30)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback