Automated Attacks on Compression-Based Classifiers

Burago, Igor

Automated Attacks on Compression-Based Classifiers

Files

Burago_oregon_0171N_11060.pdf (234.96 KB)

Date

2014-09-29

Authors

Burago, Igor

Publisher

University of Oregon

Abstract

Methods of compression-based text classification have proven their usefulness for various applications. However, in some classification problems, such as spam filtering, a classifier confronts one or many adversaries willing to induce errors in the classifier's judgment on certain kinds of input. In this thesis, we consider the problem of finding thrifty strategies for character-based text modification that allow an adversary to revert classifier's verdict on a given family of input texts. We propose three statistical statements of the problem that can be used by an attacker to obtain transformation models which are optimal in some sense. Evaluating these three techniques on a realistic spam corpus, we find that an adversary can transform a spam message (detectable as such by an entropy-based text classifier) into a legitimate one by generating and appending, in some cases, as few additional characters as 20% of the original length of the message.

Keywords

Adversarial machine learning, Compression-based classification

URI

https://hdl.handle.net/1794/18439

Collections

Theses and Dissertations
Computer Science Theses and Dissertations

Full item page

Scholars' Bank

Automated Attacks on Compression-Based Classifiers

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By