Cornell University
Library
Cornell UniversityLibrary

eCommons

Help
Log In(current)
  1. Home
  2. Cornell University Graduate School
  3. Cornell Theses and Dissertations
  4. Methods For High Dimensional Matrix Computation And Diagnostics Of Distributed System

Methods For High Dimensional Matrix Computation And Diagnostics Of Distributed System

File(s)
wc438.pdf (1.4 MB)
Permanent Link(s)
https://hdl.handle.net/1813/37139
Collections
Cornell Theses and Dissertations
Author
Chen, Wei
Abstract

Big data provides opportunities, but also brings new challenges to modern scientific computing. In this thesis, we conduct sparse principal component analysis (SPCA) on high dimensional matrices. We propose a modified curvilinear algorithm to solve eigenvalue optimization with orthogonal constraints, and combine it with an augmented Lagrangian method to improve its computational efficiency. We compare our algorithm against standard PCA on the recovery of low-rank tensors and a mean-reverted statistical arbitrage strategy. The explosion of big data has also influenced the development on distributed computing systems. For debugging purposes, we are interested in predicting server run-time based on system data early in the process. We study discriminative models in functional data analysis, and introduce generative models that capture server regime-change behaviors. We also design computational methods, including a blocked Gibbs sampler, to improve the accuracy and efficiency of model estimation.

Date Issued
2014-05-25
Keywords
sparse principal component analysis
•
generative models
•
discriminative models
Committee Chair
Wells, Martin Timothy
Committee Member
Tang, Ao
Turnbull, Bruce William
Degree Discipline
Operations Research
Degree Name
Ph. D., Operations Research
Degree Level
Doctor of Philosophy
Type
dissertation or thesis

Site Statistics | Help

About eCommons | Policies | Terms of use | Contact Us

copyright © 2002-2026 Cornell University Library | Privacy | Web Accessibility Assistance