Adversarial Networks and Machine Learning for File Classification

Feb-2-2023, arXiv.org Artificial Intelligence

Correctly identifying the type of file under examination is a critical part of a forensic investigation. The file type alone suggests the embedded content, such as a picture, video, manuscript, or spreadsheet. In cases where a system owner wants to keep their files inaccessible or conceal their file types, we propose using an adversarially trained neural network to determine a file's true type even if the extension or file header has been obfuscated to complicate discovery. Our semi-supervised generative adversarial network (SGAN) achieved 97.6% accuracy in classifying files across 11 different types. We also compared our network against a traditional standalone neural network and three other machine learning algorithms. The adversarially trained network proved to be the most precise file classifier, especially in scenarios with few labeled samples available. Our SGAN file classifier implementation is available on GitHub (https://ksaintg.github.io/SGAN-File-Classier).
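To make the semi-supervised idea concrete, here is a minimal sketch of how a single SGAN discriminator output can serve both as a file-type classifier and as a real/fake detector. It assumes one common SGAN formulation, in which the discriminator emits one logit per class and the "real" probability is derived from the same logits; the abstract does not say which variant the authors use, so this may differ from their implementation. The function names and the example logits are hypothetical; only the class count of 11 comes from the abstract.

```python
import math

NUM_CLASSES = 11  # the abstract reports 11 file types


def logsumexp(logits):
    """Numerically stable log(sum(exp(logit))) over the class logits."""
    m = max(logits)
    return m + math.log(sum(math.exp(v - m) for v in logits))


def class_probabilities(logits):
    """Supervised head: softmax over the file-type logits."""
    lse = logsumexp(logits)
    return [math.exp(v - lse) for v in logits]


def real_probability(logits):
    """Unsupervised head (assumed formulation): D(x) = Z / (Z + 1),
    where Z = sum(exp(logit)). This equals sigmoid(logsumexp(logits))."""
    lse = logsumexp(logits)
    if lse >= 0:
        return 1.0 / (1.0 + math.exp(-lse))
    e = math.exp(lse)
    return e / (1.0 + e)


if __name__ == "__main__":
    # Hypothetical discriminator output for one file fragment.
    logits = [0.0] * NUM_CLASSES
    logits[3] = 4.0  # the network favors class index 3
    probs = class_probabilities(logits)
    print("predicted class:", probs.index(max(probs)))
    print("probability the sample is real:", real_probability(logits))
```

In this formulation, labeled samples train the softmax head with ordinary cross-entropy, while unlabeled and generated samples train only the real/fake output. That is one plausible reason such a network can do well when few labeled samples are available.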

artificial intelligence, file type, machine learning

Country:
- North America > United States
  - New York (0.04)
  - Hawaii (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
  - Maryland > Anne Arundel County
    - Annapolis (0.04)
  - Louisiana > Orleans Parish
    - New Orleans (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report (1.00)

Industry:
- Information Technology > Security & Privacy (0.93)
- Government (0.68)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
