Multi-Label Plant Species Classification with Self-Supervised Vision Transformers

Gustineli, Murilo, Miyaguchi, Anthony, Stalter, Ian

Jul-8-2024–arXiv.org Artificial Intelligence

We present a transfer learning approach using a self-supervised Vision Transformer (DINOv2) for the PlantCLEF 2024 competition, focusing on the multi-label plant species classification. Our method leverages both base and fine-tuned DINOv2 models to extract generalized feature embeddings. We train classifiers to predict multiple plant species within a single image using these rich embeddings. To address the computational challenges of the large-scale dataset, we employ Spark for distributed data processing, ensuring efficient memory management and processing across a cluster of workers. Our data processing pipeline transforms images into grids of tiles, classifying each tile, and aggregating these predictions into a consolidated set of probabilities. Our results demonstrate the efficacy of combining transfer learning with advanced data processing techniques for multi-label image classification tasks.

classification, classifier, dataset, (13 more...)

arXiv.org Artificial Intelligence

Jul-8-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Georgia > Fulton County > Atlanta (0.04)
- Europe > France
  - Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)

Genre:
- Research Report > New Finding (0.54)

Industry:
- Information Technology (1.00)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (1.00)
  - Artificial Intelligence
    - Machine Learning > Transfer Learning (0.55)
    - Vision > Image Understanding (0.48)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found