Like Two Pis in a Pod: Author Similarity in the Ancient Greek Corpus

dc.contributor.authorStorey, Grant Justin
dc.contributor.chairMimno, David
dc.contributor.committeeMemberRusten, Jeffrey S.
dc.description.abstractOne commonly recognized feature of the Ancient Greek corpus is that some later texts imitate and allude to model texts from earlier time periods, but analysis of this phenomenon is mostly done for specific author pairs based on close reading and highly visible instances of imitation. In this work, we use computational techniques to examine the similarity of a wide range of Ancient Greek authors, with a particular focus on similarity between authors writing many centuries apart. We represent texts and authors based on their usage of high-frequency words to capture author signatures rather than document topics. We propose the Jensen-Shannon Similarity metric for measuring similarity between authors and show that it outperforms other common metrics for vector comparison. We then use this similarity metric to analyze author similarity across distances in time, finding high similarity between specific authors and across the corpus that is not common to all languages. We analyze these similar author pairs more closely and find the similarity is the result of similar usage of many different words rather than just a few.
dc.identifier.otherbibid: 11050423
dc.subjectClassical literature
dc.subjectComputer science
dc.subjectAncient languages
dc.subjectdigital humanities
dc.subjectAncient Greek
dc.titleLike Two Pis in a Pod: Author Similarity in the Ancient Greek Corpus
dc.typedissertation or thesis
dcterms.license Science University of Science, Computer Science


Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
2.94 MB
Adobe Portable Document Format