Location: Arthropod-Borne Animal Diseases Research
Project Number: 5430-32000-003-08
Start Date: May 25, 2012
End Date: Jun 30, 2015
De novo sequencing, assembly and annotation of transcriptomes for female Culicoides sonorensis including: (1) teneral state (2) blood fed (3) sugar fed (4) blue tongue virus infected (5) blue tongue virus recovered. Two-day-old unmated females will remain unfed (teneral), or will be fed blood, sugar water, or blue tongue virus (in blood). Fed flies will be killed for RNA extraction 3 days post feeding, and for virus infection studies, virus presence or absence (“infected” vs. “recovered”) will be determined before pooling midges accordingly. A paired-end (PE) library will be constructed from RNA of whole, pooled female midges (minimum n=10) subjected to each condition above, with two biological replicates for each. In the absence of a reference genome for transcript mapping, trimmed PE sequences will be subject to de novo clustering and assembly into a set of contigs and singletons with suitable software such as Oases, Trinity, Velvet, Trans-ABySS, or RNnotator. This assembly will represent the unigene for each sample. Each individual sample/treatment will be analyzed for unique functional signatures; the unigenes will be assigned gene ontology (GO) terms, and the enzyme codes and pathways determined as follows. The unigene assembly will be the subject for functional annotation using REFseq gene models. Annotation will be performed using the Blast2GO software package, where unigene sequences will be BLASTed to the non-redundant protein database (GenBank), mapped to GO term database, and a GO annotation returned. Enzyme codes will be obtained by mapping GO terms associated with each unigene, and functional protein signatures determined by using the InterProScan service. Enzyme codes will be highlighted on KEGG biochemical pathway maps. Thus, from the transcriptome sequencing we will identify proteins involved in the biological processes of interest, and their function in cellular-biochemical pathways (i.e., functional transcriptomics). Comparative transcriptomics will be used to compare (1) teneral to blood to sugar feeding processes and (2) blood to virus infected to virus recovered processes. These analyses will allow us to look at the differential expression of genes across conditions compared to their appropriate control state. For comparative transcriptomics, all biological replicates will be binned, normalized, and compared to identify and quantify differentially expressed transcripts as a result of various treatments. Quantitative differential expression analysis, and determination of spliceforms and paralogs will be used to assess genes and cellular processes involved in the biology of this vector when comparing the conditions/states described above. Gene expression levels will be quantified and labeled as upregulated, downregulated, or no change when compared to control level gene expression (teneral for blood and sugar fed; blood fed for virus infected or recovered) using EdgeR software. Statistical assessment of enrichment of GO terms between the sequence datasets will be determined with the Gossip software, which employs a Fisher’s exact test.