Location: Plant, Soil and Nutrition Research
Title: A multiple genome alignment workflow shows the impact of repeat masking and parameter tuning on alignment of functional regions in plantsAuthor
WU, YAOYAO - Cornell University | |
JOHNSON, LYNN - Cornell University | |
SONG, BAOXING - Cornell University | |
ROMAY, MARIA CINTA - Cornell University | |
STITZER, MICHELLE - Cornell University | |
SIEPEL, ADAM - Cold Spring Harbor Laboratory | |
Buckler, Edward - Ed | |
SCHEBEN, ARMIN - Cold Spring Harbor Laboratory |
Submitted to: bioRxiv
Publication Type: Other Publication Acceptance Date: 6/2/2021 Publication Date: 6/21/2021 Citation: Wu, Y., Johnson, L., Song, B., Romay, M., Stitzer, M., Siepel, A., Buckler IV, E.S., Scheben, A. 2021. A multiple genome alignment workflow shows the impact of repeat masking and parameter tuning on alignment of functional regions in plants. bioRxiv. https://doi.org/10.1101/2021.06.01.446647. DOI: https://doi.org/10.1101/2021.06.01.446647 Interpretive Summary: In order to identify the underlying genetic causes of differences between species, one must first compare their genome sequences. Existing tools can accomplish this, but require specialist knowledge to implement. . The many requirements and types of software involved can make the seemingly straightforward task of multiple sequence comparison technically challenging for individual researchers. We developed the msa_pipeline workflow (https://bitbucket.org/bucklerlab/msa_pipeline) to allow comparison of diverged plant genomes with minimal user inputs. The msa_pipeline leverages existing tools to provide a practical solution for rapid multiple alignment of genomes with minimal user effort. As the pace of genome sequencing and assembly accelerates, comparison of the genomes of tens to hundreds of species will drive biological discovery in plants. Our workflow presented here provides a practical first step to perform these comparisons. Technical Abstract: In order to identify the underlying genetic causes of differences between species, one must first compare their genome sequences. Existing tools can accomplish this, but require specialist knowledge to implement. . The many requirements and types of software involved can make the seemingly straightforward task of multiple sequence comparison technically challenging for individual researchers. We developed the msa_pipeline workflow (https://bitbucket.org/bucklerlab/msa_pipeline) to allow comparison of diverged plant genomes with minimal user inputs. The msa_pipeline leverages existing tools to provide a practical solution for rapid multiple alignment of genomes with minimal user effort. As the pace of genome sequencing and assembly accelerates, comparison of the genomes of tens to hundreds of species will drive biological discovery in plants. Our workflow presented here provides a practical first step to perform these comparisons. |