Skip to main content
Skip to sub nav
Menu

News & Events Navigation

ISyE Seminar - Madeleine Udell

Title: Big Data is Low Rank

Abstract: Matrices of low rank are pervasive in big data, appearing in recommender systems, movie preferences, topic models, medical records, and genomics.

While there is a vast literature on how to exploit low rank structure in these datasets, there is less attention on explaining why low rank structure appears in the first place.

In this talk, we explain the abundance of low rank matrices in big data by proving that certain latent variable models associated to piecewise analytic functions are of log-rank. Any large matrix from such a latent variable model can be approximated, up to a small error, by a low rank matrix.

Armed with this theorem, we show how to use a low rank modeling framework to exploit low rank structure even for datasets that are not numeric, with applications in the social sciences, medicine, retail, and machine learning.

Event Details

Wednesday, 31 October 2018 - 1:30pm to 2:30pm

Groseclose 402

ISyE location map

Georgia Tech Supply Chain and
Logistics Institute
H. Milton Stewart School of
Industrial & Systems Engineering
765 Ferst Drive, NW, Suite 228
Atlanta, GA 30332
Phone: 404.894.2343