Timezone: »

Towards Understanding Generalization of Macro-AUC in Multi-label Learning
Guoqiang Wu · Chongxuan Li · Yilong Yin

Thu Jul 27 01:30 PM -- 03:00 PM (PDT) @ Exhibit Hall 1 #423

Macro-AUC is the arithmetic mean of the class-wise AUCs in multi-label learning and is commonly used in practice. However, its theoretical understanding is far lacking. Toward solving it, we characterize the generalization properties of various learning algorithms based on the corresponding surrogate losses w.r.t. Macro-AUC. We theoretically identify a critical factor of the dataset affecting the generalization bounds: the label-wise class imbalance. Our results on the imbalance-aware error bounds show that the widely-used univariate loss-based algorithm is more sensitive to the label-wise class imbalance than the proposed pairwise and reweighted loss-based ones, which probably implies its worse performance. Moreover, empirical results on various datasets corroborate our theory findings. To establish it, technically, we propose a new (and more general) McDiarmid-type concentration inequality, which may be of independent interest.

Author Information

Guoqiang Wu (Shandong University)
Chongxuan Li (Tsinghua University)
Yilong Yin (Shandong University)

More from the Same Authors