DIMACS Theory of Computing Seminar


Title: Matrix Sketching over Streams

Speaker: Mina Ghashami, Rutgers University

Date: Wednesday, February 7, 2018, 11:00am-12:00pm

Location: CoRE Bldg, Room 301, Rutgers University, Busch Campus, Piscataway, NJ


Abstract:

It is common to represent data in the form of a matrix, and many data analysis tasks rely on obtaining a low-rank approximation of the data matrix. Such approximations can be computed using the singular value decomposition (SVD). In many scenarios, however, data matrices are extremely large and computing their SVD exactly is infeasible. Efficient approximate solutions exist for the distributed setting and for other settings in which data access is limited. In the data streaming model, the data points are presented to the algorithm one by one in an arbitrary order. The algorithm is tasked with processing the stream in a single pass while being severely restricted in its memory footprint. At the end of the stream, the algorithm must provide a sketch matrix that is a good approximation of the original data.

In this talk, we will discuss two recent matrix sketching methods over data streams.
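
For readers new to the streaming model described above, the following minimal Python sketch illustrates the general setting. It is not taken from the talk, and the abstract does not name the two methods to be discussed. The example uses a generic random sign projection as a stand-in technique, and the names RandomSignSketch, sketch_stream, ell, and d are hypothetical, chosen only for illustration. Each incoming row is folded into an ell-by-d sketch B in one pass, so memory does not grow with the length of the stream.

```python
import math
import random


class RandomSignSketch:
    """One-pass sketch B = S^T A, where S has i.i.d. entries +-1/sqrt(ell).

    Illustrative only: a generic random projection, not a method from the talk.
    Memory is ell * d numbers, independent of the stream length.
    """

    def __init__(self, ell, d, seed=0):
        self.ell = ell                      # number of sketch rows (assumed parameter)
        self.d = d                          # dimension of each data point
        self.rng = random.Random(seed)      # seeded for reproducibility
        self.B = [[0.0] * d for _ in range(ell)]

    def update(self, row):
        """Fold one incoming data point (a length-d row) into the sketch."""
        if len(row) != self.d:
            raise ValueError("row has wrong dimension")
        scale = 1.0 / math.sqrt(self.ell)
        for j in range(self.ell):
            # Draw a fresh random sign for each (data point, sketch row) pair.
            sign = scale if self.rng.random() < 0.5 else -scale
            sketch_row = self.B[j]
            for k, value in enumerate(row):
                sketch_row[k] += sign * value

    def gram(self):
        """Return B^T B, which equals A^T A in expectation."""
        return [
            [sum(self.B[j][p] * self.B[j][q] for j in range(self.ell))
             for q in range(self.d)]
            for p in range(self.d)
        ]


def sketch_stream(rows, ell, d, seed=0):
    """Process the stream once, keeping only the ell-by-d sketch."""
    sketch = RandomSignSketch(ell, d, seed)
    for row in rows:
        sketch.update(row)
    return sketch
```

A call such as sketch_stream(rows, ell=8, d=2) returns a sketch with 8 rows no matter how many data points the stream contains. The quality of the approximation depends on ell, and methods designed specifically for low-rank approximation may give stronger guarantees than this simple projection.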

See: https://sites.google.com/view/dimacs-theory-seminar/home