Show simple item record

dc.contributor.authorMORRIS, DEREK
dc.contributor.authorANNEY, RICHARD JAMES LEON
dc.contributor.authorCORVIN, AIDEN PETER
dc.contributor.authorGILL, MICHAEL
dc.date.accessioned2013-07-12T08:21:40Z
dc.date.available2013-07-12T08:21:40Z
dc.date.issued2013
dc.date.submitted2013en
dc.identifier.citationQuinn EM, Cormican P, Kenny EM, Hill M, Anney R, Gill M, Corvin AP, Morris DW, Development of Strategies for SNP Detection in RNA-Seq Data: Application to Lymphoblastoid Cell Lines and Evaluation Using 1000 Genomes Data., PloS one, 8, 3, 2013, e58815en
dc.identifier.otherY
dc.descriptionPUBLISHEDen
dc.description.abstractNext-generation RNA sequencing (RNA-seq) maps and analyzes transcriptomes and generates data on sequence variation in expressed genes. There are few reported studies on analysis strategies to maximize the yield of quality RNA-seq SNP data. We evaluated the performance of different SNP-calling methods following alignment to both genome and transcriptome by applying them to RNA-seq data from a HapMap lymphoblastoid cell line sample and comparing results with sequence variation data from 1000 Genomes. We determined that the best method to achieve high specificity and sensitivity, and greatest number of SNP calls, is to remove duplicate sequence reads after alignment to the genome and to call SNPs using SAMtools. The accuracy of SNP calls is dependent on sequence coverage available. In terms of specificity, 89% of RNA-seq SNPs calls were true variants where coverage is >10X. In terms of sensitivity, at >10X coverage 92% of all expected SNPs in expressed exons could be detected. Overall, the results indicate that RNA-seq SNP data are a very useful by-product of sequence-based transcriptome analysis. If RNA-seq is applied to disease tissue samples and assuming that genes carrying mutations relevant to disease biology are being expressed, a very high proportion of these mutations can be detecteden
dc.description.sponsorshipNext generation sequencing was performed in TrinSeq (Trinity Genome Sequencing Laboratory; http://www.medicine.tcd.ie/sequencing), a core fac ility funded by Science Foundation Ireland (SFI) under Grant No. [SFI/07/RFP/GEN/F327/EC07] to Dr. Morris. Ms. Quinn?s PhD studentship is funded by the We llcome Trust and SFI. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.en
dc.format.extente58815en
dc.language.isoenen
dc.relation.ispartofseriesPloS one;
dc.relation.ispartofseries8;
dc.relation.ispartofseries3;
dc.rightsYen
dc.subjectSNPs SAMtoolsen
dc.subject.lcshSNPs SAMtoolsen
dc.titleDevelopment of Strategies for SNP Detection in RNA-Seq Data: Application to Lymphoblastoid Cell Lines and Evaluation Using 1000 Genomes Data.en
dc.typeJournal Articleen
dc.type.supercollectionscholarly_publicationsen
dc.type.supercollectionrefereed_publicationsen
dc.identifier.peoplefinderurlhttp://people.tcd.ie/acorvin
dc.identifier.peoplefinderurlhttp://people.tcd.ie/morrisdw
dc.identifier.peoplefinderurlhttp://people.tcd.ie/anneyr
dc.identifier.peoplefinderurlhttp://people.tcd.ie/mgill
dc.identifier.rssinternalid85030
dc.contributor.sponsorScience Foundation Ireland (SFI)en
dc.contributor.sponsorGrantNumber07/RFP/GEN/F327/EC07en
dc.identifier.urihttp://hdl.handle.net/2262/66708


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record