|MAJOROS, WILLIAM - Duke University|
|CAMPBELL, MICHAEL - Cold Spring Harbor Laboratory|
|HOLT, CARSON - University Of Utah|
|DENARDO, ERIN - Cold Spring Harbor Laboratory|
|ALLEN, ANDREW - Duke University|
|YANDELL, MARK - University Of Utah|
|REDDY, TIMOTHY - Duke University|
Submitted to: Bioinformatics
Publication Type: Peer Reviewed Journal
Publication Acceptance Date: 12/12/2016
Publication Date: 12/30/2016
Citation: Majoros, W.H., Campbell, M.S., Holt, C., Denardo, E., Ware, D., Allen, A.S., Yandell, M., Reddy, T. 2016. High-throughput interpretation of gene structure changes in human and nonhuman resequencing data, using ACE. Bioinformatics. doi: 10.1093/bioinformatics/btw799.
Interpretive Summary: The accurate interpretation of genetic variants is critical for characterizing genotype-phenotype associations. Because the effects of genetic variants can depend strongly on their local genomic context, accurate genome annotations are essential. Furthermore, as some variants have the potential to disrupt or alter gene structure, variant interpretation efforts stand to gain from the use of individualized annotations that account for differences in gene structure between individuals or strains.
Technical Abstract: We describe a suite of software tools for identifying possible functional changes in gene structure that may result from sequence variants. ACE (“Assessing Changes to Exons”) converts phased genotype calls to a collection of explicit haplotype sequences, maps transcript annotations onto them, detects gene-structure changes and their possible repercussions, and identifies several classes of possible loss of function. Novel transcripts predicted by ACE are commonly supported by spliced RNA-seq reads, and can be used to improve read alignment and transcript quantification when an individual-specific genome sequence is available. Using publicly-available RNA-seq data, we show that ACE predictions confirm earlier results regarding the quantitative effects of nonsense-mediated decay, and we show that predicted loss-of-function events are highly concordant with patterns of intolerance to mutations across the human population. ACE can be readily applied to diverse species including animals and plants, making it a broadly useful tool for use in eukaryotic population-based resequencing projects, particularly for assessing the joint impact of all variants at a locus.