PhishSnap: Image-Based Phishing Detection Using Perceptual Hashing
Minhaz, Md Abdul Ahad, Meem, Zannatul Zahan, Hossain, Md. Shohrab
–arXiv.org Artificial Intelligence
Phishing remains one of the most prevalent online threats, exploiting human trust to harvest sensitive credentials. Existing URL- and HTML-based detection systems struggle against obfuscation and visual deception. This paper presents \textbf{PhishSnap}, a privacy-preserving, on-device phishing detection system leveraging perceptual hashing (pHash). Implemented as a browser extension, PhishSnap captures webpage screenshots, computes visual hashes, and compares them against legitimate templates to identify visually similar phishing attempts. A \textbf{2024 dataset of 10,000 URLs} (70\%/20\%/10\% train/validation/test) was collected from PhishTank and Netcraft. Due to security takedowns, a subset of phishing pages was unavailable, reducing dataset diversity. The system achieved \textbf{0.79 accuracy}, \textbf{0.76 precision}, and \textbf{0.78 recall}, showing that visual similarity remains a viable anti-phishing measure. The entire inference process occurs locally, ensuring user privacy and minimal latency.
arXiv.org Artificial Intelligence
Dec-9-2025
- Country:
- Asia > Bangladesh
- Dhaka Division > Dhaka District > Dhaka (0.06)
- Europe > Austria
- Upper Austria (0.05)
- Asia > Bangladesh
- Genre:
- Research Report (0.50)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology: