Pruning Distorted Images in MNIST Handwritten Digits
–arXiv.org Artificial Intelligence
Recognizing handwritten digits is a challenging task primarily due to the diversity of writing styles and the presence of noisy images. The widely used MNIST dataset, which is commonly employed as a benchmark for this task, includes distorted digits with irregular shapes, incomplete strokes, and varying skew in both the training and testing datasets. Consequently, these factors contribute to reduced accuracy in digit recognition. To overcome this challenge, we propose a two-stage deep learning approach. In the first stage, we create a simple neural network to identify distorted digits within the training set. This model serves to detect and filter out such distorted and ambiguous images. In the second stage, we exclude these identified images from the training dataset and proceed to retrain the model using the filtered dataset. This process aims to improve the classification accuracy and confidence levels while mitigating issues of underfitting and overfitting. Our experimental results demonstrate the effectiveness of the proposed approach, achieving an accuracy rate of over 99.5% on the testing dataset. In our future work, we intend to explore the scalability of this approach and investigate techniques to further enhance accuracy by reducing the size of the training data. NTRODUCTION Handwritten digit recognition is a complex task that finds applications in various fields, including computer vision and machine learning. It involves the identification and classification of digits written by hand, enabling tasks such as character recognition and digit analysis.
arXiv.org Artificial Intelligence
May-26-2023
- Country:
- Africa > Mali (0.04)
- Asia
- Europe
- Estonia > Harju County
- Tallinn (0.04)
- Ireland > Munster
- County Kerry > Killarney (0.04)
- Spain
- Andalusia > Cádiz Province
- Cadiz (0.04)
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Andalusia > Cádiz Province
- Estonia > Harju County
- North America > United States
- California
- Los Angeles County > Long Beach (0.04)
- San Francisco County > San Francisco (0.14)
- Santa Clara County > San Jose (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Rhode Island > Providence County
- Providence (0.04)
- Utah > Salt Lake County
- Salt Lake City (0.04)
- California
- Genre:
- Overview (1.00)
- Research Report > New Finding (1.00)
- Industry:
- Information Technology (0.46)
- Technology: