Software Development for Genome Sequence Analysis

Abstract

The cost of genome sequencing has decreased rapidly, expanding its availability for many biological applications (Muir 2016). For example, researchers can now obtain genome sequences from multiple populations under different types of selection. Comparing these sequences allows identification of chromosome regions and specific genes associated with adaptive evolution (Kelly 2013). As an increasing number of researchers engage in this type of inquiry, many have written in-house computer scripts to analyze the raw sequence data (e.g., Kelly 2013), creating a gap in both continuity and standardization. Using a test dataset and preliminary results from an ongoing artificial selection experiment in Mimulus guttatus (yellow monkeyflower), I translated, verified, and expanded five software programs, each representing a stage of a single analysis, into one software package written in the C# programming language. This program is helping researchers streamline their analysis and increase precision, while remaining flexible enough to be extended to any comparable dataset, regardless of species.
