Adversarial Networks and Machine Learning for File Classification
Germain, Ken St., Angichiodo, Josh
–arXiv.org Artificial Intelligence
Correctly identifying the type of file under examination is a critical part of a forensic investigation. The file type alone suggests the embedded content, such as a picture, video, manuscript, spreadsheet, etc. In cases where a system owner might desire to keep their files inaccessible or file type concealed, we propose using an adversarially-trained machine learning neural network to determine a file's true type even if the extension or file header is obfuscated to complicate its discovery. Our semi-supervised generative adversarial network (SGAN) achieved 97.6% accuracy in classifying files across 11 different types. We also compared our network against a traditional standalone neural network and three other machine learning algorithms. The adversarially-trained network proved to be the most precise file classifier especially in scenarios with few supervised samples available. Our implementation of a file classifier using an SGAN is implemented on GitHub (https://ksaintg.github.io/SGAN-File-Classier).
arXiv.org Artificial Intelligence
Feb-2-2023
- Country:
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- Hawaii (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Maryland > Anne Arundel County
- Annapolis (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- New York (0.04)
- Europe > United Kingdom
- Genre:
- Research Report (1.00)
- Industry:
- Government (0.68)
- Information Technology > Security & Privacy (0.93)
- Technology: