The Population Genetic Variation of Interspersed and Tandem Repeats
No Access Until
Permanent Link(s)
Collections
Other Titles
Author(s)
Abstract
Despite making up large and essential portions of eukaryotic genomes repeat DNA has largely eluded study due to limitations in alignment and assembly of Next Generation short-read sequencing. Standard technologies and methods are largely inadequate to study the abundance and variation of repeat sequences at the population scale. Therefore, I have developed and applied new methods and algorithms to study the population variation of interspersed and tandem repeats, specifically transposable elements and simple satellites. Firstly, using hierarchical clustering and population genetic theory I have developed a method to infer transposable element clades and their age, circumventing difficult problems of phasing and mapping. This method resulted in a discovery that host piRNA regulatory mechanisms are turning over to regulate newly emerging transposable element variants shedding light on an evolutionary arms race. Secondly, I applied k-mer counting methods to human population genomics data to quantify abundance and variation of simple satellites discovering undescribed telomeric and centromeric satellites. I then applied a mixed modelling framework to find associations between centromeric ancestry and simple satellite abundances which lead to the discovery of an expansion of a centromeric satellite copy number in a cluster of African and Latin American individuals with shared centromeric ancestry. Overall, this dissertation represents an analysis framework for the development of computational strategies to mine population genomics data for repeat variation. The approaches I have developed and deployed have allowed me to make inferences about repeat abundance and variation in large population genomic datasets that were previously unfeasible.
Journal / Series
Volume & Issue
Description
Supplemental file(s) description: None.
Sponsorship
Date Issued
Publisher
Keywords
Location
Effective Date
Expiration Date
Sector
Employer
Union
Union Local
NAICS
Number of Workers
Committee Chair
Committee Co-Chair
Committee Member
Feschotte, Cedric
Mezey, Jason