Pengfeng Xiao

Find an error

Name:
Organization: Southeast University
Department: State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering
Title:
Co-reporter:Dan Pu
RSC Advances (2011-Present) 2017 vol. 7(Issue 64) pp:40141-40151
Publication Date(Web):2017/08/16
DOI:10.1039/C7RA06202H
A real-time decoding sequencing developed by our group offers long read length, high sequencing accuracy and high compatibility, making it have great potential in high throughput sequencing (HTS) platforms. Here, we first discuss its potential advantages in HTS in terms of read length, sequencing accuracy, and turnaround time. We then discuss its disadvantages including homopolymers and chain decoding mistakes. How to handle these two major disadvantages is also discussed with respect to resequencing and de novo sequencing. We also provide the characteristics of this technology for HTS in terms of error-correcting and discriminating SNP/deletion/insertion. Finally, the existing sequencing platforms with which this technology is compatible are discussed. This technology is not only compatible with the first-generation sequencing platform, but also the second-generation and even the third-generation sequencing platforms. It will further improve the advantages of existing sequencing platforms (read length of PGM and 454 system) and compensate some disadvantages of other next generation sequencing (NGS) platforms (sequencing accuracy of PGM sequencer). We fully hope it will provide a new promising technology for researchers and customers to extend applications of the current and upcoming platforms in almost every area in life and biomedical sciences.
Co-reporter:Changchang Cao;Rongfang Pan;Jun Tan;Xiao Sun
Molecular Genetics and Genomics 2017 Volume 292( Issue 5) pp:1069-1081
Publication Date(Web):13 June 2017
DOI:10.1007/s00438-017-1332-2
Identifying single nucleotide polymorphism (SNPs) from pooled samples is critical for many studies and applications. SNPs determined by next-generation sequencing results may suffer from errors in both base calling and read mapping. Taking advantage of dual mononucleotide addition-based pyrosequencing, we present Epds, a method to efficiently identify SNPs from pooled DNA samples. On the basis of only five patterns of non-synchronistic extensions between the wild and mutant sequences using dual mononucleotide addition-based pyrosequencing, we employed an enumerative algorithm to infer the mutant locus and estimate the proportion of mutant sequence. According to the profiles resulting from three runs with distinct dual mononucleotide additions, Epds could recover the mutant bases. Results showed that our method had a false-positive rate of less than 3%. Series of simulations revealed that Epds outperformed the current method (PSM) in many situations. Finally, experiments based on profiles produced by real sequencing proved that our method could be successfully applied for the identification of mutants from pooled samples. The software for implementing this method and the experimental data are available at http://bioinfo.seu.edu.cn/Epds.
Co-reporter:Rongfang Pan
Analytical and Bioanalytical Chemistry 2016 Volume 408( Issue 29) pp:8263-8271
Publication Date(Web):2016 November
DOI:10.1007/s00216-016-9936-7
Molecular haplotyping is becoming increasingly important for studying the disease association of a specific allele because of its ability of providing more information than any single nucleotide polymorphism (SNP). Computational analysis and experimental techniques are usually performed for haplotypic determination. However, established methods are not suitable for analyzing haplotypes of massive natural DNA samples. Here we present a simple molecular approach to analyze haplotypes of conventional polymerase chain reaction (PCR) products quantitatively in a single sequencing run. In this approach, specific types and proportions of haplotypes in both individual and pooled samples could be determined by solving equations constructed from nonsynchronous pyrosequencing with di-base addition. Two SNPs (rs11176013 and rs11564148) in the gene for leucine-rich repeat kinase 2 (LRRK2) related to Parkinson’s disease were selected as experimental sites. A series of DNA samples, including these two heterozygous loci, were investigated. This approach could accurately identify multiple DNA samples indicating that the approach is likely to be applied for haplotyping of unrestricted conventional PCR products from natural samples, and be especially applicable for analyzing short sequences in clinical diagnosis.
Co-reporter:Dan Pu;Chengguang Mao;Lunbiao Cui;Zhiyang Shi
Analytical and Bioanalytical Chemistry 2016 Volume 408( Issue 12) pp:3113-3123
Publication Date(Web):2016 May
DOI:10.1007/s00216-016-9359-5
We develop color code-based pyrosequencing with di-base addition for analysis of single nucleotide polymorphisms (SNPs). When a di-base is added into the polymerization, one or several two-color code(s) containing the type and the number of incorporated nucleotides will be produced. The code information obtained in a single run is useful to genotype SNPs as each allelic variant will give a specific pattern compared to the two other variants. Special care has to be taken while designing the di-base dispensation order. Here, we present a detailed protocol for establishing sequence-specific di-base addition to avoid nonsynchronous extension at the SNP sites. By using this technology, as few as 50 copies of DNA templates were accurately sequenced. Higher signals were produced and thus a relatively lower sample amount was required. Furthermore, the read length of per flow was increased, making simultaneous identification of multiple SNPs in a single sequencing run possible. Validation of the method was performed by using templates with two SNPs covering 37 bp and with three SNPs covering 58 bp as well as 82 bp. These SNPs were successfully genotyped by using only a sequencing primer in a single PCR/sequencing run. Our results demonstrated that this technology could be potentially developed into a powerful methodology to accurately determine SNPs so as to diagnose clinical settings.
Co-reporter:Dan Pu, Yuhua Qi, Lunbiao Cui, Pengfeng Xiao, Zuhong Lu
Analytica Chimica Acta 2014 Volume 852() pp:274-283
Publication Date(Web):10 December 2014
DOI:10.1016/j.aca.2014.09.009
•Templates are determined without directly measuring the base sequence in this method.•Templates are sequenced with the incorporation of AG/CT, AC/GT or AT/CG.•Templates will be sequentially decoded by two sets of encodings.•This method applies fewer cycles to obtain longer read length.•This method is able to be applied to differentiate nucleic acid sequences.We propose a real-time decoding sequencing strategy in which a template is determined without directly measuring base sequence but by decoding two sets of encodings obtained from two parallel sequencing runs. This strategy relies on adding a mixture of different two-base pair, A + G, C + T, A + C, G + T, A + T or C + G (abbreviated as AG, CT, AC, GT, AT, or CG), into the reaction each time. When a template is cyclically interrogated twice with any two kinds of dual mononucleotide addition (AG/CT, AC/GT, and AT/CG), two sets of encodings are obtained sequentially. The two sets of encodings allow for the bases to be sequentially decoded, moving from first to last, in a deterministic manner. This strategy applies fewer cycles to obtain longer read length compared to the traditional real-time sequencing strategy [1]. Partial rnpB gene was applied to verify the applicability of the decoding strategy via pyrosequencing. The results indicated that the sequence could be reconstructed by decoding two sets of encodings. Moreover, streptococcal strains could be differentiated by comparing signal intensity in each cycle and encoding size of each template. This strategy is likely to be applied to differentiate nucleic acid sequence as encoding size and signal intensity in each cycle vary with the base size and composition. Furthermore, it has the potential in building a promising strategy that could be utilized as an alternative to conventional sequencing systems.
Co-reporter:Yanqiang Li, Zhiqiang Pan, Jing Tang, Dan Pu, Pengfeng Xiao and Zuhong Lu  
Analyst 2012 vol. 137(Issue 19) pp:4421-4424
Publication Date(Web):24 Jul 2012
DOI:10.1039/C2AN35595G
Sequencing-by-ligation (SBL) is one of the next-generation sequencing methods for massive parallel sequencing. The ligated probes used in SBL should be accurately cleaved for a better ligation in the next cycle. Here, a novel kind of oligonucleotide probe that could be accurately cleaved at the given position was proposed. Deoxynucleoside phosphorothioates were introduced into the deoxyoxanosine-containing oligonucleotide probes in order to increase the cleavage accuracy of endonuclease V on double-stranded DNA templates. The results illustrated that incorporating deoxynucleoside phosphorothioates could greatly reduce the effect of the nonsynchronous sequencing primer, and the queried bases of the DNA templates were unambiguously identified with 5 cycles of sequencing ligations. Additionally, the read length can reach up to 25 bp with high accuracy. The SBL-based method is inexpensive, has high-throughput, and is easy to operate allowing massive scale-up, miniaturization and automation.
Co-reporter:Jing Tang, Pengfeng Xiao
Biosensors and Bioelectronics 2009 Volume 24(Issue 7) pp:1817-1824
Publication Date(Web):15 March 2009
DOI:10.1016/j.bios.2008.09.018
The immobilization of nucleic acids on solid supports has been widely used in the detection of DNA and other biomolecules in sensor technology. Because three dimensional (3-D) hydrogel matrixes offer significant advantages for capturing probes over more conventional two dimensional (2-D) rigid substrates and the ability to provide a solution-mimicking environment, they are becoming increasingly attractive as desired supports for bio-analysis. Acrylamide-modified nucleic acids and acrylamide monomers being polymerized directly to immobilize nucleic acids is only one-step chemical process which is not interfered by exterior surroundings, and the 3-D polyacrylamide gel fabricated by this method is not required to be activated by some labile chemical treatments. Moreover, the attachment is extremely stable to withstand the cycling process involved in the polymerase chain reaction (PCR). In this paper, the development of polymerizing immobilization of acrylamide-modified nucleic acids is reviewed, and its applications in DNA sequence high-throughput analysis including mutation analysis and the whole genome sequencing are summarized.
3H-Indolium, 2-[5-[1-[6-[(2,5-dioxo-1-pyrrolidinyl)oxy]-6-oxohexyl]-1,3-dihydro-3,3-dimethyl-5-sulfo-2H-indol-2-ylidene]-1,3-pentadien-1-yl]-1-ethyl-3,3-
[5-(2-amino-6-oxo-3h-purin-9-yl)-3,4-dihydroxyoxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-4-hydroxy-2-(hydroxymethyl)oxolan-3-yl] Hydrogen Phosphate