ICML Poster Mediated Uncoupled Learning: Learning Functions without Direct Input-output Correspondences

Abstract: Ordinary supervised learning is useful when we have paired training data of input $X$ and output $Y$. However, such paired data can be difficult to collect in practice. In this paper, we consider the task of predicting $Y$ from $X$ when we have no paired data of them, but we have two separate, independent datasets of $X$ and $Y$, each observed with some mediating variable $U$; that is, we have two datasets $S_X = \{(X_i, U_i)\}$ and $S_Y = \{(U'_j, Y'_j)\}$. A naive approach is to predict $U$ from $X$ using $S_X$ and then $Y$ from $U$ using $S_Y$, but we show that this is not statistically consistent. Moreover, predicting $U$ can be more difficult than predicting $Y$ in practice, e.g., when $U$ has higher dimensionality. To circumvent the difficulty, we propose a new method that avoids predicting $U$ but directly learns $Y = f(X)$ by training $f(X)$ with $S_X$ to predict $h(U)$, which is trained with $S_Y$ to approximate $Y$. We prove statistical consistency and error bounds of our method and experimentally confirm its practical usefulness.
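As a rough illustration of the idea in the abstract, the following is a minimal two-step sketch, not the authors' implementation. It assumes linear models fit by ordinary least squares, synthetic Gaussian data, and that $Y$ depends on $X$ only through $U$. The helper names `fit_linear` and `two_step_mediated_fit` are hypothetical and chosen only for this example.

```python
import numpy as np


def fit_linear(inputs, targets):
    """Least-squares fit with a bias term; returns a prediction function."""
    design = np.hstack([inputs, np.ones((inputs.shape[0], 1))])
    weights, *_ = np.linalg.lstsq(design, targets, rcond=None)

    def predict(new_inputs):
        new_design = np.hstack([new_inputs, np.ones((new_inputs.shape[0], 1))])
        return new_design @ weights

    return predict


def two_step_mediated_fit(x_sx, u_sx, u_sy, y_sy):
    """Hypothetical helper: fit h on S_Y, then fit f on S_X to predict h(U)."""
    h = fit_linear(u_sy, y_sy)        # h(U) approximates Y, using S_Y only
    f = fit_linear(x_sx, h(u_sx))     # f(X) predicts h(U), using S_X only
    return f, h


# Synthetic data (an assumption for illustration): U is a noisy,
# higher-dimensional function of X, and Y is a noisy function of U.
rng = np.random.default_rng(0)
dim_x, dim_u, n = 3, 20, 2000
mix = rng.normal(size=(dim_x, dim_u))
w = rng.normal(size=dim_u)


def sample_u(x):
    return x @ mix + 0.1 * rng.normal(size=(x.shape[0], dim_u))


# S_X = {(X_i, U_i)}: inputs observed together with the mediating variable.
x_sx = rng.normal(size=(n, dim_x))
u_sx = sample_u(x_sx)

# S_Y = {(U'_j, Y'_j)}: drawn independently; the underlying X is never seen.
x_hidden = rng.normal(size=(n, dim_x))
u_sy = sample_u(x_hidden)
y_sy = u_sy @ w + 0.1 * rng.normal(size=n)

f, h = two_step_mediated_fit(x_sx, u_sx, u_sy, y_sy)

# Compare f(X) with the true conditional mean E[Y | X] on fresh inputs.
x_test = rng.normal(size=(500, dim_x))
target = x_test @ mix @ w
mse = float(np.mean((f(x_test) - target) ** 2))
baseline = float(np.var(target))
print(f"MSE of f: {mse:.4f}, variance of target: {baseline:.4f}")
```

Note that $f$ is never asked to reconstruct the 20-dimensional $U$; it only regresses onto the scalar $h(U)$, which is the property the abstract highlights.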
