Provable Methods for Non-negative Matrix Factorization
Electronic Theses of Indian Institute of Science
View Archive InfoField | Value | |
Title |
Provable Methods for Non-negative Matrix Factorization
|
|
Creator |
Pani, Jagdeep
|
|
Subject |
Non-negative Matrix Factorization (NMF)
Data Analysis Non-Negative Matrix Factorization Uniqueness NMF Algorithms Singular Value Decompositon (SVD) Algorithm UTSVD NMF Algorithm Machine Learning Computer Science |
|
Description |
Nonnegative matrix factorization (NMF) is an important data-analysis problem which concerns factoring a given d n matrix A with nonnegative entries into matrices B and C where B and C are d k and k n with nonnegative entries. It has numerous applications including Object recognition, Topic Modelling, Hyper-spectral imaging, Music transcription etc. In general, NMF is intractable and several heuristics exists to solve the problem of NMF. Recently there has been interest in investigating conditions under which NMF can be tractably recovered. We note that existing attempts make unrealistic assumptions and often the associated algorithms tend to be not scalable. In this thesis, we make three major contributions: First, we formulate a model of NMF with assumptions which are natural and is a substantial weakening of separability. Unlike requiring a bound on the error in each column of (A BC) as was done in much of previous work, our assumptions are about aggregate errors, namely spectral norm of (A BC) i.e. jjA BCjj2 should be low. This is a much weaker error assumption and the associated B; C would be much more resilient than existing models. Second, we describe a robust polynomial time SVD-based algorithm, UTSVD, with realistic provable error guarantees and can handle higher levels of noise than previous algorithms. Indeed, experimentally we show that existing NMF models, which are based on separability assumptions, degrade much faster than UTSVD, in the presence of noise. Furthermore, when the data has dominant features, UTSVD significantly outperforms existing models. On real life datasets we again see a similar outperformance of UTSVD on clustering tasks. Finally, under a weaker model, we prove a robust version of uniqueness of NMF, where again, the word \robust" refers to realistic error bounds. |
|
Contributor |
Bhattacharyya, Chiranjib
|
|
Date |
2017-10-31T07:00:49Z
2017-10-31T07:00:49Z 2017-10-31 2016 |
|
Type |
Thesis
|
|
Identifier |
http://hdl.handle.net/2005/2739
http://etd.ncsi.iisc.ernet.in/abstracts/3566/G27836-Abs.pdf |
|
Language |
en_US
|
|
Relation |
G27836
|
|