The X-linked dystrophin gene is well known for its involvement in Duchenne/Becker muscular dystrophies and for its exceptional megabase size. This locus at Xp21 is prone to frequent random molecular changes, including large deletions and duplications, but also smaller variations. To cope with such huge sequence analysis requirements in forthcoming diagnostic applications, we employed the power of the parallel 454 GS-FLX pyrosequencer to the dystrophin locus. We enriched the genomic region of interest by the robust amplification of 62 fragments under universal conditions by the long-PCR protocol yielding 244,707 bp of sequence. Pooled PCR products were fragmented and used for library preparation and DNA sequencing. To evaluate the entire procedure we analyzed four male DNA samples for sequence coverage and accuracy in DNA sequence variation and for any potential bias. We identified 562 known variations and 55 additional variants not yet reported, among which we detected a causative Arg1844Stop mutation in one sample. Sanger sequencing confirmed all changes. Unexpectedly, only 3 x coverage was sufficient for 99.9993% accuracy. Our results show that long PCR combined to massive pyrosequencing is very reliable for the analysis of the biggest gene of the human genome and open the doors to other demanding applications in molecular diagnostics.

Reliable resequencing of the human dystrophin locus by universal long polymerase chain reaction and massive pyrosequencing

Condorelli G;
2010-01-01

Abstract

The X-linked dystrophin gene is well known for its involvement in Duchenne/Becker muscular dystrophies and for its exceptional megabase size. This locus at Xp21 is prone to frequent random molecular changes, including large deletions and duplications, but also smaller variations. To cope with such huge sequence analysis requirements in forthcoming diagnostic applications, we employed the power of the parallel 454 GS-FLX pyrosequencer to the dystrophin locus. We enriched the genomic region of interest by the robust amplification of 62 fragments under universal conditions by the long-PCR protocol yielding 244,707 bp of sequence. Pooled PCR products were fragmented and used for library preparation and DNA sequencing. To evaluate the entire procedure we analyzed four male DNA samples for sequence coverage and accuracy in DNA sequence variation and for any potential bias. We identified 562 known variations and 55 additional variants not yet reported, among which we detected a causative Arg1844Stop mutation in one sample. Sanger sequencing confirmed all changes. Unexpectedly, only 3 x coverage was sufficient for 99.9993% accuracy. Our results show that long PCR combined to massive pyrosequencing is very reliable for the analysis of the biggest gene of the human genome and open the doors to other demanding applications in molecular diagnostics.
2010
Next Generation Sequencing; Genetics; Muscle Distrophy
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11699/4198
Citazioni
  • ???jsp.display-item.citation.pmc??? 6
  • Scopus 15
  • ???jsp.display-item.citation.isi??? 12
social impact