Abstract:
Monotonic Linear Interpolation (MLI) refers to the peculiar phenomenon that the error between the initial and the converged model decreases monotonically along the linear interpolation path (1−α)θ₀ + αθ_F. Previous works focus on paired initial and converged points, relating MLI to the smoothness of the optimization trajectory. In this paper, we find the surprising fact that the error curves still decrease monotonically when θ₀ is replaced with noise or even zero values, implying that the decreasing curve may be primarily related to a property of the converged model rather than of the optimization trajectory. We further explore the relationship between αθ_F and θ_F and propose scale invariance properties in various cases, including Generalized Scale Invariance (GSI), Rectified Scale Invariance (RSI), and Normalized Scale Invariance (NSI). From an inverse perspective, the MLI formula is essentially an equation that adds varying levels of noise, (1−α)ε, to a nearly scale-invariant network, αθ_F, resulting in an error that increases monotonically as the noise level rises. MLI is the special case where ε equals θ₀.
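The interpolation path and its "noise" reading can be sketched in a few lines of NumPy. This is an illustrative sketch, not the paper's implementation: `theta_0` and `theta_f` are hypothetical stand-ins (random arrays) for the initial and converged weights θ₀ and θ_F.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-ins for the initial (theta_0) and converged
# (theta_f) model weights; in practice these are full parameter vectors.
theta_0 = rng.normal(size=100)
theta_f = rng.normal(size=100)

def interpolate(alpha, start, end):
    """Point on the linear path (1 - alpha) * start + alpha * end."""
    return (1 - alpha) * start + alpha * end

# MLI evaluates the model's error at each point along the path.
# The same point can be read as the scaled converged weights
# alpha * theta_f plus noise (1 - alpha) * eps, with eps = theta_0.
for alpha in (0.0, 0.25, 0.5, 0.75, 1.0):
    path_point = interpolate(alpha, theta_0, theta_f)
    noise_view = alpha * theta_f + (1 - alpha) * theta_0  # eps = theta_0
    assert np.allclose(path_point, noise_view)
```

Replacing `theta_0` with a fresh noise draw (or zeros) in the loop above corresponds to the paper's observation that the monotone error curve does not depend on using the true initialization as the starting point.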