Show simple item record

dc.contributor.authorStorey, Grant Justin
dc.date.accessioned2019-10-15T15:32:10Z
dc.date.available2019-10-15T15:32:10Z
dc.date.issued2019-05-30
dc.identifier.otherStorey_cornell_0058O_10491
dc.identifier.otherhttp://dissertations.umi.com/cornell:10491
dc.identifier.otherbibid: 11050423
dc.identifier.urihttps://hdl.handle.net/1813/67441
dc.description.abstractOne commonly recognized feature of the Ancient Greek corpus is that some later texts imitate and allude to model texts from earlier time periods, but analysis of this phenomenon is mostly done for specific author pairs based on close reading and highly visible instances of imitation. In this work, we use computational techniques to examine the similarity of a wide range of Ancient Greek authors, with a particular focus on similarity between authors writing many centuries apart. We represent texts and authors based on their usage of high-frequency words to capture author signatures rather than document topics. We propose the Jensen-Shannon Similarity metric for measuring similarity between authors and show that it outperforms other common metrics for vector comparison. We then use this similarity metric to analyze author similarity across distances in time, finding high similarity between specific authors and across the corpus that is not common to all languages. We analyze these similar author pairs more closely and find the similarity is the result of similar usage of many different words rather than just a few.
dc.language.isoen_US
dc.subjectClassical literature
dc.subjectComputer science
dc.subjectStylometry
dc.subjectAncient languages
dc.subjectdigital humanities
dc.subjectAncient Greek
dc.titleLike Two Pis in a Pod: Author Similarity in the Ancient Greek Corpus
dc.typedissertation or thesis
thesis.degree.disciplineComputer Science
thesis.degree.grantorCornell University
thesis.degree.levelMaster of Science
thesis.degree.nameM.S., Computer Science
dc.contributor.chairMimno, David
dc.contributor.committeeMemberRusten, Jeffrey S.
dcterms.licensehttps://hdl.handle.net/1813/59810
dc.identifier.doihttps://doi.org/10.7298/dehe-z448


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Statistics