I am generating big data sets related to genome regulation and applying statistical learning techniques to investigate association between ethnicity and non-genome variation in regulatory regions. This is a PDF of my thesis about the Willow software tool I developed to examine phylogenetic structure.