Robustness of Neural Networks for Discrete Input: An Adversarial Perspective
Loading...
Date
2019-04-30
Authors
Ebrahimi, Javid
Journal Title
Journal ISSN
Volume Title
Publisher
University of Oregon
Abstract
In the past few years, evaluating on adversarial examples has become a standard
procedure to measure robustness of deep learning models. Literature on adversarial
examples for neural nets has largely focused on image data, which are represented as
points in continuous space. However, a vast proportion of machine learning models
operate on discrete input, and thus demand a similar rigor in understanding their
vulnerabilities and robustness. We study robustness of neural network architectures
for textual and graph inputs, through the lens of adversarial input perturbations.
We will cover methods for both attacks and defense; we will focus on 1) addressing
challenges in optimization for creating adversarial perturbations for discrete data;
2) evaluating and contrasting white-box and black-box adversarial examples; and 3)
proposing efficient methods to make the models robust against adversarial attacks.
Description
Keywords
Adversarial machine learning, Graph neural networks, Machine translation