Show simple item record

FieldValueLanguage
dc.contributor.authorWright, Belinda
dc.contributor.authorFarquharson, Katherine A.
dc.contributor.authorMcLennan, Elspeth A.
dc.contributor.authorBelov, Katherine
dc.contributor.authorHogg, Carolyn J.
dc.contributor.authorGrueber, Catherine E.
dc.date.accessioned2020-08-18
dc.date.available2020-08-18
dc.date.issued2019-01-01en_AU
dc.identifier.urihttps://hdl.handle.net/2123/23115
dc.description.abstractBackground: Recent advances in genomics have greatly increased research opportunities for non-model species. For wildlife, a growing availability of reference genomes means that population genetics is no longer restricted to a small set of anonymous loci. When used in conjunction with a reference genome, reduced-representation sequencing (RRS) provides a cost-effective method for obtaining reliable diversity information for population genetics. Many software tools have been developed to process RRS data, though few studies of non-model species incorporate genome alignment in calling loci. A commonly-used RRS analysis pipeline, Stacks, has this capacity and so it is timely to compare its utility with existing software originally designed for alignment and analysis of whole genome sequencing data. Here we examine population genetic inferences from two species for which reference-aligned reduced-representation data have been collected. Our two study species are a threatened Australian marsupial (Tasmanian devil Sarcophilus harrisii; declining population) and an Arctic-circle migrant bird (pink-footed goose Anser brachyrhynchus; expanding population). Analyses of these data are compared using Stacks versus two widely-used genomics packages, SAM tools and GATK. We also introduce a custom R script to improve the reliability of single nucleotide polymorphism (SNP) calls in all pipelines and conduct population genetic inferences for non-model species with reference genomes. Results: Although we identified orders of magnitude fewer SNPs in our devil data set than for goose, we found remarkable symmetry between the two species in our assessment of software performance. For both datasets, all three methods were able to delineate population structure, even with varying numbers of loci. For both species, population structure inferences were influenced by the percent of missing data. Conclusions: For studies of non-model species with a reference genome, we recommend combining Stacks output with further filtering (as included in our R pipeline) for population genetic studies, paying particular attention to potential impact of missing data thresholds. We recognise SAM tools as a viable alternative for researchers more familiar with this software. We caution against the use of GATK in studies with limited computational resources or time. Keywords: Population genomics, DArTseq, Reference genome, Tasmanian devil, Pink-footed goose, Population differentiation, Stacks, SAM tools, GATKen_AU
dc.language.isoenen_AU
dc.publisherBioMed Centralen_AU
dc.relation.ispartofBMC Genomicsen_AU
dc.rightsCreative Commons Attribution 4.0en_AU
dc.subjectPopulation Genomics Wildlifeen_AU
dc.titleFrom reference genomes to population genomics: comparing three reference aligned reduced-representation sequencing pipelines in two wildlife speciesen_AU
dc.typeArticleen_AU
dc.subject.asrc06 Biological Sciencesen_AU
dc.subject.asrc0604 Geneticsen_AU
dc.identifier.doi10.1186/s12864-019-5806-y
dc.relation.arcLP140100508
usyd.facultySeS faculties schools::Faculty of Science::School of Life and Environmental Sciencesen_AU
usyd.citation.volume20en_AU
usyd.citation.spage453en_AU
workflow.metadata.onlyNoen_AU


Show simple item record

Associated file/s

Associated collections

Show simple item record

There are no previous versions of the item available.