

Poster

Attack-free Evaluating and Enhancing Adversarial Robustness on Categorical Data

Yujun Zhou · Yufei Han · Haomin Zhuang · Hongyan Bao · Xiangliang Zhang


Abstract:

Research on adversarial robustness has predominantly focused on continuous inputs, leaving categorical inputs, especially tabular attributes, less examined. To address this gap, our work aims to evaluate and enhance the robustness of classification over categorical attributes against adversarial perturbations. We propose a robustness evaluation metric named Integrated Gradient-Smoothed Gradient (IGSG). It is designed to evaluate the attributional sensitivity of each feature and the decision boundary of the classifier, two aspects that, according to our theoretical analysis, significantly influence adversarial risk. Leveraging this metric, we develop an IGSG-based regularization to reduce adversarial risk by suppressing the sensitivity of categorical attributes. We conduct an extensive empirical study over categorical datasets from various application domains. The results affirm the efficacy of both IGSG and the IGSG-based regularization. Notably, IGSG-based regularization surpasses state-of-the-art robust training methods by a margin of approximately 0.4% to 12.2% on average in terms of adversarial accuracy, especially on high-dimensional datasets.
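The paper defines IGSG precisely; purely as a rough illustration of the idea, the Python/PyTorch sketch below combines Integrated Gradients (Sundararajan et al., 2017) with a SmoothGrad-style averaged gradient (Smilkov et al., 2017) into a per-attribute sensitivity score over a one-hot encoding of categorical inputs. Every name and choice here (igsg_score, the product-of-magnitudes combination, n_steps, sigma) is an assumption for illustration, not the authors' formulation.

import torch

def integrated_gradients(model, x, baseline, target, n_steps=32):
    """Riemann-sum approximation of Integrated Gradients along the
    straight-line path from `baseline` to `x` for the `target` logit."""
    total = torch.zeros_like(x)
    for k in range(1, n_steps + 1):
        point = (baseline + (k / n_steps) * (x - baseline)).clone().requires_grad_(True)
        logit = model(point)[0, target]
        grad, = torch.autograd.grad(logit, point)
        total += grad
    return (x - baseline) * total / n_steps

def smoothed_gradient(model, x, target, n_samples=32, sigma=0.1):
    """Average input gradient under Gaussian perturbations (SmoothGrad)."""
    total = torch.zeros_like(x)
    for _ in range(n_samples):
        noisy = (x + sigma * torch.randn_like(x)).requires_grad_(True)
        logit = model(noisy)[0, target]
        grad, = torch.autograd.grad(logit, noisy)
        total += grad
    return total / n_samples

def igsg_score(model, x, baseline, target):
    """Combine the two attributions into one sensitivity score per
    categorical attribute; a simple product of magnitudes is used here
    only for illustration. `x` is a (1, n_attrs, n_categories) one-hot
    encoding of a record; `baseline` is, e.g., the all-zeros encoding."""
    ig = integrated_gradients(model, x, baseline, target)
    sg = smoothed_gradient(model, x, target)
    return (ig.abs() * sg.abs()).sum(dim=-1)  # shape: (1, n_attrs)

Under the same assumptions, an IGSG-style regularizer could add, for example, the mean of these per-attribute scores to the training loss, penalizing attributional sensitivity of the categorical attributes during training.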
