eCommons

 

Data Dependent Random Projections

dc.contributor.authorKang, Keegan
dc.contributor.chairHooker, Giles J
dc.contributor.committeeMemberSridharan, Karthik
dc.contributor.committeeMemberMimno, David
dc.date.accessioned2017-07-07T12:48:32Z
dc.date.available2019-06-08T06:00:54Z
dc.date.issued2017-05-30
dc.description.abstractRandom projections is a technique used primarily in dimension reduction, in order to estimate distances in data. They can be thought of a linear transformation mapping a data matrix X to a lower dimensional space, where distances are preserved in expectation. However, the preservation of distances can be thought of a stepping stone to some eventual goal, such as classification, hypothesis testing, information retrieval, or even reconstructing principal components of data. In this thesis, I will give a background of the basic random projection algorithm. Next, I then look at the structure of random projection matrices and propose modifications to result in a more accurate estimation of distances, which would help in information retrieval and reconstruction of principal components. Finally, I show that it is possible to juxtapose the use of Monte Carlo variance reduction methods with random projections to improve the accuracy of distance estimates, which can then be used in an algorithm or procedure of the users' choice. Theoretical justifications are given, and empirical results are shown with synthetic data, and experiments from publicly available datasets.
dc.identifier.doihttps://doi.org/10.7298/X4XG9P8R
dc.identifier.otherKang_cornellgrad_0058F_10216
dc.identifier.otherhttp://dissertations.umi.com/cornellgrad:10216
dc.identifier.otherbibid: 9948798
dc.identifier.urihttps://hdl.handle.net/1813/51575
dc.language.isoen_US
dc.rightsAttribution-ShareAlike 4.0 International*
dc.rights.urihttps://creativecommons.org/licenses/by-sa/4.0/*
dc.subjectinformation retrieval
dc.subjectrandom projections
dc.subjectComputer science
dc.subjectStatistics
dc.subjectcontrol variates
dc.titleData Dependent Random Projections
dc.typedissertation or thesis
dcterms.licensehttps://hdl.handle.net/1813/59810
thesis.degree.disciplineStatistics
thesis.degree.grantorCornell University
thesis.degree.levelDoctor of Philosophy
thesis.degree.namePh. D., Statistics

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Kang_cornellgrad_0058F_10216.pdf
Size:
8.01 MB
Format:
Adobe Portable Document Format