Poster
Single Pass Entrywise-Transformed Low Rank Approximation
Yifei Jiang · Yi Li · Yiming Sun · Jiaxin Wang · David Woodruff
Keywords: [ Semi-supervised learning ] [ Algorithms ] [ Regression ] [ Algorithms -> Missing Data; Algorithms ] [ Dimensionality Reduction ]
Abstract:
In applications such as natural language processing or computer vision, one is given a large n×n matrix A=(ai,j) and would like to compute a matrix decomposition, e.g., a low rank approximation, of a function f(A)=(f(ai,j)) applied entrywise to A. A very important special case is the likelihood function f(A)=log(|aij|+1). A natural way to do this would be to simply apply f to each entry of A, and then compute the matrix decomposition, but this requires storing all of A as well as multiple passes over its entries. Recent work of Liang et al. shows how to find a rank-k factorization to f(A) using only n⋅\poly(\eps−1klogn) words of memory, with overall error 10‖f(A)−[f(A)]k‖2F+\poly(ϵ/k)‖f(A)‖21,2, where [f(A)]k is the best rank-k approximation to f(A) and ‖f(A)‖21,2 is the square of the sum of Euclidean lengths of rows of f(A). Their algorithm uses 3 passes over the entries of A. The authors pose the open question of obtaining an algorithm with n⋅\poly(\eps−1klogn) words of memory using only a single pass over the entries of A.
In this paper we resolve this open question, obtaining the first single-pass algorithm for this problem and for the same class of functions f studied by Liang et al. Moreover, our error is ‖f(A)−[f(A)]k‖2F+\poly(ϵ/k)‖f(A)‖2F, where ‖f(A)‖2F is the sum of squares of Euclidean lengths of rows of f(A). Thus our error is significantly smaller, as it removes the factor of 10 and also ‖f(A)‖2F≤‖f(A)‖21,2.
Chat is not available.