July 19, 2019

SMRT Sequencing of long tandem nucleotide repeats in SCA10 reveals unique insight of repeat expansion structure.

A large, non-coding ATTCT repeat expansion causes the neurodegenerative disorder, spinocerebellar ataxia type 10 (SCA10). In a subset of SCA10 patients, interruption motifs are present at the 5′ end of the expansion and strongly correlate with epileptic seizures. Thus, interruption motifs are a predictor of the epileptic phenotype and are hypothesized to act as a phenotypic modifier in SCA10. Yet, the exact internal sequence structure of SCA10 expansions remains unknown due to limitations in current technologies for sequencing across long extended tracts of tandem nucleotide repeats. We used the third generation sequencing technology, Single Molecule Real Time (SMRT) sequencing, to obtain full-length contiguous expansion sequences, ranging from 2.5 to 4.4 kb in length, from three SCA10 patients with different clinical presentations. We obtained sequence spanning the entire length of the expansion and identified the structure of known and novel interruption motifs within the SCA10 expansion. The exact interruption patterns in expanded SCA10 alleles will allow us to further investigate the potential contributions of these interrupting sequences to the pathogenic modification leading to the epilepsy phenotype in SCA10. Our results also demonstrate that SMRT sequencing is useful for deciphering long tandem repeats that pose as “gaps” in the human genome sequence.
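As a rough illustration of what locating interruption motifs in a full-length repeat read involves, the sketch below walks a sequence one ATTCT unit at a time and reports any stretch that breaks the pure repeat. It is a minimal sketch, not the authors' method: the function name `find_interruptions` and the toy sequence are invented for this example, and real SMRT reads would first need consensus calling and error handling.

```python
from typing import List, Tuple


def find_interruptions(tract: str, unit: str = "ATTCT") -> List[Tuple[int, str]]:
    """Return (offset, segment) for every stretch that is not a pure repeat unit.

    Hypothetical helper for illustration only; it assumes an error-free sequence.
    """
    interruptions: List[Tuple[int, str]] = []
    i = 0
    start = None  # offset where the current non-repeat stretch began
    while i < len(tract):
        if tract.startswith(unit, i):
            # A clean repeat unit closes any open interruption.
            if start is not None:
                interruptions.append((start, tract[start:i]))
                start = None
            i += len(unit)
        else:
            if start is None:
                start = i
            i += 1
    if start is not None:
        interruptions.append((start, tract[start:]))
    return interruptions


# Toy tract (invented): three pure units, one altered unit, two pure units.
toy_tract = "ATTCT" * 3 + "ATGCT" + "ATTCT" * 2
print(find_interruptions(toy_tract))
```

The unit-by-unit walk keeps the reported offsets in the coordinates of the original read, which makes it easy to say where in the expansion (for example, near the 5′ end) an interruption sits.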


July 19, 2019

HLA Class-II associated HIV polymorphisms predict escape from CD4+ T Cell responses.

Antiretroviral therapy, antibody and CD8+ T cell-mediated responses targeting human immunodeficiency virus-1 (HIV-1) exert selection pressure on the virus necessitating escape; however, the ability of CD4+ T cells to exert selective pressure remains unclear. Using a computational approach on HIV gag/pol/nef sequences and HLA-II allelic data, we identified 29 HLA-II associated HIV sequence polymorphisms or adaptations (HLA-AP) in an African cohort of chronically HIV-infected individuals. Epitopes encompassing the predicted adaptation (AE) or its non-adapted (NAE) version were evaluated for immunogenicity. Using a CD8-depleted IFN-γ ELISpot assay, we determined that the magnitude of CD4+ T cell responses to the predicted epitopes in controllers was higher compared to non-controllers (p<0.0001). However, regardless of the group, the magnitude of responses to AE was lower as compared to NAE (p<0.0001). CD4+ T cell responses in patients with acute HIV infection (AHI) demonstrated poor immunogenicity towards AE as compared to NAE encoded by their transmitted founder virus. Longitudinal data in AHI off antiretroviral therapy demonstrated sequence changes that were biologically confirmed to represent CD4+ escape mutations. These data demonstrate an innovative application of HLA-associated polymorphisms to identify biologically relevant CD4+ epitopes and suggest CD4+ T cells are active participants in driving HIV evolution.


July 19, 2019

SMRT sequencing only de novo assembly of the sugar beet (Beta vulgaris) chloroplast genome.

Third generation sequencing methods, like SMRT (Single Molecule, Real-Time) sequencing developed by Pacific Biosciences, offer much longer read length in comparison to Next Generation Sequencing (NGS) methods. Hence, they are well suited for de novo- or re-sequencing projects. Sequences generated for these purposes will not only contain reads originating from the nuclear genome, but also a significant amount of reads originating from the organelles of the target organism. These reads are usually discarded but they can also be used for an assembly of organellar replicons. The long read length supports resolution of repetitive regions and repeats within the organelle genome which might be problematic when just using short read data. Additionally, SMRT sequencing is less influenced by GC rich areas and by long stretches of the same base. We describe a workflow for a de novo assembly of the sugar beet (Beta vulgaris ssp. vulgaris) chloroplast genome sequence only based on data originating from a SMRT sequencing dataset targeted on its nuclear genome. We show that the data obtained from such an experiment are sufficient to create a high quality assembly with a higher reliability than assemblies derived from e.g. Illumina reads only. The chloroplast genome is especially challenging for de novo assembling as it contains two large inverted repeat (IR) regions. We also describe some limitations that still apply even though long reads are used for the assembly. SMRT sequencing reads extracted from a dataset created for nuclear genome (re)sequencing can be used to obtain a high quality de novo assembly of the chloroplast of the sequenced organism. Even with a relatively small overall coverage for the nuclear genome it is possible to collect more than enough reads to generate a high quality assembly that outperforms short read based assemblies. However, even with long reads it is not always possible to reliably clarify the order of elements of a chloroplast genome sequence, as we demonstrated with Fosmid End Sequences (FES) generated with Sanger technology. Nevertheless, this limitation also applies to short read sequencing data but is reached in this case at a much earlier stage during finishing.


July 19, 2019

Heterosexual transmission of subtype C HIV-1 selects consensus-like variants without increased replicative capacity or interferon-α resistance.

Heterosexual transmission of HIV-1 is characterized by a genetic bottleneck that selects a single viral variant, the transmitted/founder (TF), during most transmission events. To assess viral characteristics influencing HIV-1 transmission, we sequenced 167 near full-length viral genomes and generated 40 infectious molecular clones (IMC) including TF variants and multiple non-transmitted (NT) HIV-1 subtype C variants from six linked heterosexual transmission pairs near the time of transmission. Consensus-like genomes sensitive to donor antibodies were selected for during transmission in these six transmission pairs. However, TF variants did not demonstrate increased viral fitness in terms of particle infectivity or viral replicative capacity in activated peripheral blood mononuclear cells (PBMC) and monocyte-derived dendritic cells (MDDC). In addition, resistance of the TF variant to the antiviral effects of interferon-α (IFN-α) was not significantly different from that of non-transmitted variants from the same transmission pair. Thus neither in vitro viral replicative capacity nor IFN-α resistance discriminated the transmission potential of viruses in the quasispecies of these chronically infected individuals. However, our findings support the hypothesis that within-host evolution of HIV-1 in response to adaptive immune responses reduces viral transmission potential.


July 19, 2019

Emergence of Ebola virus escape variants in infected nonhuman primates treated with the MB-003 antibody cocktail.

MB-003, a plant-derived monoclonal antibody cocktail used effectively in treatment of Ebola virus infection in non-human primates, was unable to protect two of six animals when initiated 1 or 2 days post-infection. We characterized a mechanism of viral escape in one of the animals, after observation of two clusters of genomic mutations that resulted in five nonsynonymous mutations in the monoclonal antibody target sites. These mutations were linked to a reduction in antibody binding and later confirmed to be present in a viral isolate that was not neutralized in vitro. Retrospective evaluation of a second independent study allowed the identification of a similar case. Four SNPs in previously identified positions were found in this second fatality, suggesting that genetic drift could be a potential cause for treatment failure. These findings highlight the importance of selecting different target domains for each component of the cocktail to minimize the potential for viral escape. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.


July 19, 2019

The dentin phosphoprotein repeat region and inherited defects of dentin.

Nonsyndromic dentin defects classified as type II dentin dysplasia and types II and III dentinogenesis imperfecta are caused by mutations in DSPP (dentin sialophosphoprotein). Most reported disease-causing DSPP mutations occur within the repetitive DPP (dentin phosphoprotein) coding sequence. We characterized the DPP sequences of five probands with inherited dentin defects using single molecule real-time (SMRT) DNA sequencing. Eight of the 10 sequences matched previously reported DPP length haplotypes and two were novel. Alignment with known DPP sequences showed 32 indels arranged in 36 different patterns. Sixteen of the 32 indels were not represented in more than one haplotype. The 25 haplotypes with confirmed indels were aligned to generate a tree that describes how the length variations might have evolved. Some indels were independently generated in multiple lines. A previously reported disease-causing DSPP mutation in Family 1 was confirmed and its position clarified (c.3135delC; p.Ser1045Argfs*269). A novel frameshift mutation (c.3504_3508dup; p.Asp1170Alafs*146) caused the dentin defects in Family 2. A COL1A2 (c.2027G>A or p.Gly676Asp) missense mutation, discovered by whole-exome sequencing, caused the dentin defects in Family 3. We conclude that SMRT sequencing characterizes the DPP repeat region without cloning and can improve our understanding of normal and pathological length variations in DSPP alleles.


July 19, 2019

Heterogeneous composition of key metabolic gene clusters in a vent mussel symbiont population.

Chemosynthetic symbiosis is one of the successful systems for adapting to a wide range of habitats including extreme environments, and the metabolic capabilities of symbionts enable host organisms to expand their habitat ranges. However, our understanding of the adaptive strategies that enable symbiotic organisms to expand their habitats is still fragmentary. Here, we report that a single-ribotype endosymbiont population in an individual of the host vent mussel, Bathymodiolus septemdierum, has heterogeneous genomes with regard to the composition of key metabolic gene clusters for hydrogen oxidation and nitrate reduction. The host individual harbours heterogeneous symbiont subpopulations that either possess or lack the gene clusters encoding hydrogenase or nitrate reductase. The proportions of the different symbiont subpopulations in a host appeared to vary with the environment or with the host’s development. Furthermore, the symbiont subpopulations were distributed in patches to form a mosaic pattern in the gill. Genomic heterogeneity in an endosymbiont population may enable differential utilization of diverse substrates and confer metabolic flexibility. Our findings open a new chapter in our understanding of how symbiotic organisms alter their metabolic capabilities and expand their range of habitats.


July 19, 2019

Single molecule real-time sequencing of Xanthomonas oryzae genomes reveals a dynamic structure and complex TAL (transcription activator-like) effector gene relationships.

Pathogen-injected, direct transcriptional activators of host genes, TAL (transcription activator-like) effectors play determinative roles in plant diseases caused by Xanthomonas spp. A large domain of nearly identical, 33-35 aa repeats in each protein mediates DNA recognition. This modularity makes TAL effectors customizable and thus important also in biotechnology. However, the repeats render TAL effector (tal) genes nearly impossible to assemble using next-generation, short reads. Here, we demonstrate that long-read, single molecule real-time (SMRT) sequencing solves this problem. Taking an ensemble approach to first generate local, tal gene contigs, we correctly assembled de novo the genomes of two strains of the rice pathogen X. oryzae completed previously using the Sanger method and even identified errors in those references. Sequencing two more strains revealed a dynamic genome structure and a striking plasticity in tal gene content. Our results pave the way for population-level studies to inform resistance breeding, improve biotechnology and probe TAL effector evolution.


July 19, 2019

Lineage-specific methyltransferases define the methylome of the globally disseminated Escherichia coli ST131 clone.

Escherichia coli sequence type 131 (ST131) is a clone of uropathogenic E. coli that has emerged rapidly and disseminated globally in both clinical and community settings. Members of the ST131 lineage from across the globe have been comprehensively characterized in terms of antibiotic resistance, virulence potential, and pathogenicity, but to date nothing is known about the methylome of these important human pathogens. Here we used single-molecule real-time (SMRT) PacBio sequencing to determine the methylome of E. coli EC958, the most-well-characterized completely sequenced ST131 strain. Our analysis of 52,081 methylated adenines in the genome of EC958 discovered three (m6)A methylation motifs that have not been described previously. Subsequent SMRT sequencing of isogenic knockout mutants identified the two type I methyltransferases (MTases) and one type IIG MTase responsible for (m6)A methylation of novel recognition sites. Although both type I sites were rare, the type IIG sites accounted for more than 12% of all methylated adenines in EC958. Analysis of the distribution of MTase genes across 95 ST131 genomes revealed their prevalence is highly conserved within the ST131 lineage, with most variation due to the presence or absence of mobile genetic elements on which individual MTase genes are located. DNA modification plays a crucial role in bacterial regulation. Despite several examples demonstrating the role of methyltransferase (MTase) enzymes in bacterial virulence, investigation of this phenomenon on a whole-genome scale has remained elusive until now. Here we used single-molecule real-time (SMRT) sequencing to determine the first complete methylome of a strain from the multidrug-resistant E. coli sequence type 131 (ST131) lineage. By interrogating the methylome computationally and with further SMRT sequencing of isogenic mutants representing previously uncharacterized MTase genes, we defined the target sequences of three novel ST131-specific MTases and determined the genomic distribution of all MTase target sequences. Using a large collection of 95 previously sequenced ST131 genomes, we identified mobile genetic elements as a major factor driving diversity in DNA methylation patterns. Overall, our analysis highlights the potential for DNA methylation to dramatically influence gene regulation at the transcriptional level within a well-defined E. coli clone. Copyright © 2015 Forde et al.


July 19, 2019

The power of Single Molecule Real-Time sequencing technology in the de novo assembly of a eukaryotic genome.

Second-generation sequencers (SGS) have been game-changing, achieving cost-effective whole genome sequencing in many non-model organisms. However, a large portion of the genomes still remains unassembled. We reconstructed the azuki bean (Vigna angularis) genome using single molecule real-time (SMRT) sequencing technology and achieved the best contiguity and coverage among currently assembled legume crops. The SMRT-based assembly produced contigs 100 times longer, with 100 times fewer gaps, than the SGS-based assemblies. A detailed comparison between the assemblies revealed that the SMRT-based assembly enabled a more comprehensive gene annotation than the SGS-based assemblies, in which thousands of genes were missing or fragmented. A chromosome-scale assembly was generated based on the high-density genetic map, covering 86% of the azuki bean genome. We demonstrated that SMRT technology, although it still needed the support of SGS data, achieved a near-complete assembly of a eukaryotic genome.


July 19, 2019

Precision methylome characterization of Mycobacterium tuberculosis complex (MTBC) using PacBio single-molecule real-time (SMRT) technology.

Tuberculosis (TB) remains one of the most common infectious diseases caused by Mycobacterium tuberculosis complex (MTBC). To panoramically analyze MTBC’s genomic methylation, we completed the genomes of 12 MTBC strains (Mycobacterium bovis; M. bovis BCG; M. microti; M. africanum; M. tuberculosis H37Rv; H37Ra; and 6 M. tuberculosis clinical isolates) belonging to different lineages and characterized their methylomes using single-molecule real-time (SMRT) technology. We identified three (m6)A sequence motifs and their corresponding methyltransferase (MTase) genes, including the reported mamA, hsdM and a newly discovered mamB. We also experimentally verified the methylated motifs and functions of HsdM and MamB. Our analysis indicated the MTase activities varied among the 12 strains due to mutations/deletions. Furthermore, through measuring ‘the methylated-motif-site ratio’ and ‘the methylated-read ratio’, we explored the methylation status of each modified site and sequence-read to obtain the ‘precision methylome’ of the MTBC strains, which enabled intricate analysis of MTase activity at whole-genome scale. Most unmodified sites overlapped with transcription-factor binding-regions, which might protect these sites from methylation. Overall, our findings show enormous potential for the SMRT platform to investigate the precise character of the methylome, and significantly enhance our understanding of the function of DNA MTase. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.


July 19, 2019

Major improvements to the Heliconius melpomene genome assembly used to confirm 10 chromosome fusion events in 6 million years of butterfly evolution.

The Heliconius butterflies are a widely studied adaptive radiation of 46 species spread across Central and South America, several of which are known to hybridize in the wild. Here, we present a substantially improved assembly of the Heliconius melpomene genome, developed using novel methods that should be applicable to improving other genome assemblies produced using short read sequencing. First, we whole-genome-sequenced a pedigree to produce a linkage map incorporating 99% of the genome. Second, we incorporated haplotype scaffolds extensively to produce a more complete haploid version of the draft genome. Third, we incorporated ~20x coverage of Pacific Biosciences sequencing, and scaffolded the haploid genome using an assembly of this long-read sequence. These improvements result in a genome of 795 scaffolds, 275 Mb in length, with an N50 length of 2.1 Mb, an N50 number of 34, and with 99% of the genome placed, and 84% anchored on chromosomes. We use the new genome assembly to confirm that the Heliconius genome underwent 10 chromosome fusions since the split with its sister genus Eueides, over a period of about 6 million yr. Copyright © 2016 Davey et al.
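The N50 figures quoted above can be illustrated with a short calculation. The sketch below assumes the usual definitions: the N50 length is the scaffold length at which the cumulative sum of lengths, sorted from longest to shortest, first reaches half the assembly size, and the "N50 number" is taken here to be the count of scaffolds needed to get there (often called L50). The function name `n50_stats` and the toy lengths are invented for this example and are not data from the assembly.

```python
from typing import Iterable, Tuple


def n50_stats(lengths: Iterable[int]) -> Tuple[int, int]:
    """Return (N50 length, number of scaffolds needed to reach half the total)."""
    ordered = sorted(lengths, reverse=True)
    total = sum(ordered)
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        # Stop at the first scaffold that brings the running sum to >= 50% of the total.
        if running * 2 >= total:
            return length, count
    raise ValueError("lengths must be non-empty")


# Toy scaffold lengths (invented), total = 30.
print(n50_stats([10, 8, 6, 4, 2]))
```

Read this way, an N50 length of 2.1 Mb with an N50 number of 34 means that the 34 longest scaffolds, each at least 2.1 Mb, together cover at least half of the assembly.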


July 19, 2019

Phase variation of a Type IIG restriction-modification enzyme alters site-specific methylation patterns and gene expression in Campylobacter jejuni strain NCTC11168.

Phase-variable restriction-modification systems are a feature of a diverse range of bacterial species. Stochastic, reversible switches in expression of the methyltransferase produce variation in methylation of specific sequences. Phase-variable methylation by both Type I and Type III methyltransferases is associated with altered gene expression and phenotypic variation. One phase-variable gene of Campylobacter jejuni encodes a homologue of an unusual Type IIG restriction-modification system in which the endonuclease and methyltransferase are encoded by a single gene. Using both inhibition of restriction and PacBio-derived methylome analyses of mutants and phase-variants, the cj0031c allele in C. jejuni strain NCTC11168 was demonstrated to specifically methylate adenine in 5’CCCGA and 5’CCTGA sequences. Alterations in the levels of specific transcripts were detected using RNA-Seq in phase-variants and mutants of cj0031c but these changes did not correlate with observed differences in phenotypic behaviour. Alterations in restriction of phage growth were also associated with phase variation (PV) of cj0031c and correlated with presence of sites in the genomes of these phages. We conclude that PV of a Type IIG restriction-modification system causes changes in site-specific methylation patterns and gene expression patterns that may indirectly change adaptive traits. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
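To make the idea of "presence of sites" in a genome concrete, the sketch below lists every occurrence of the two motifs named above (CCCGA and CCTGA) on both strands of a sequence. It is only a sequence search under stated assumptions: it says nothing about whether a site is actually methylated, and the function names and the toy sequence are invented for illustration.

```python
import re
from typing import List, Sequence, Tuple

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def motif_sites(
    genome: str, motifs: Sequence[str] = ("CCCGA", "CCTGA")
) -> List[Tuple[int, str, str]]:
    """Return sorted (forward-strand start, strand, motif) for every motif hit."""
    sites = []
    for strand, seq in (("+", genome), ("-", reverse_complement(genome))):
        for motif in motifs:
            # The lookahead lets overlapping occurrences be reported too.
            for match in re.finditer(f"(?={motif})", seq):
                if strand == "+":
                    start = match.start()
                else:
                    # Convert a reverse-strand offset back to forward coordinates.
                    start = len(genome) - match.start() - len(motif)
                sites.append((start, strand, motif))
    return sorted(sites)


# Toy sequence (invented): one CCCGA on the forward strand, one CCTGA on the reverse.
print(motif_sites("AACCCGATTTCAGGTT"))
```

Counting such candidate sites per phage genome is one simple way to compare genomes that carry many target sequences with genomes that carry few.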


July 19, 2019

Genome analysis of the fruiting body forming myxobacterium Chondromyces crocatus reveals high potential for natural product biosynthesis.

Here we report the first complete genome sequence of the type strain of the myxobacterial genus Chondromyces – Chondromyces crocatus Cm c5. It presents one of the largest prokaryotic genomes featuring a single circular chromosome and no plasmids. Analysis revealed an enlarged set of tRNA genes, along with reduced pressure on preferred codon usage compared to other bacterial genomes. The large coding capacity and the plethora of encoded secondary metabolite biosynthetic gene clusters are in line with the capability of Cm c5 to produce an arsenal of anti-bacterial, anti-fungal and cytotoxic compounds. Known pathways of the ajudazol, chondramide, chondrochloren, crocacin, crocapeptin and thuggacin compound families are complemented by many more natural compound biosynthetic gene clusters in the chromosome. Whole-genome comparison of the fruiting-body forming type-strain (Cm c5 = DSM 14714) to an accustomed laboratory strain which has lost this ability (Cm c5 fr-) revealed genetic changes in three loci. In addition to the low synteny found with the closest sequenced representative of the same family, Sorangium cellulosum, extensive genetic information duplication, and broad application of eukaryotic-type signal transduction systems are hallmarks of this 11.3 Mbp prokaryotic genome. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 19, 2019

The complete genome sequence of the murine pathobiont Helicobacter typhlonius.

Immuno-compromised mice infected with Helicobacter typhlonius are used to model microbially induced inflammatory bowel disease (IBD). The specific mechanism through which H. typhlonius induces and promotes IBD is not fully understood. Access to the genome sequence is essential to examine emergent properties of this organism, such as its pathogenicity. To this end, we present the complete genome sequence of H. typhlonius MIT 97-6810, obtained through single-molecule real-time sequencing. The genome was assembled into a single circularized contig measuring 1.92 Mbp with an average GC content of 38.8%. In total 2,117 protein-encoding genes and 43 RNA genes were identified. Numerous pathogenic features were found, including a putative pathogenicity island (PAI) containing components of type IV secretion system, virulence-associated proteins and cag PAI protein. We compared the genome of H. typhlonius to those of the murine pathobiont H. hepaticus and human pathobiont H. pylori. H. typhlonius resembles H. hepaticus most with 1,594 (75.3%) of its genes being orthologous to genes in H. hepaticus. Determination of the global methylation state revealed eight distinct recognition motifs for adenine and cytosine methylation. H. typhlonius shares four of its recognition motifs with H. pylori. The complete genome sequence of H. typhlonius MIT 97-6810 enabled us to identify many pathogenic features suggesting that H. typhlonius can act as a pathogen. Follow-up studies are necessary to evaluate the true nature of its pathogenic capabilities. We found many methylated sites and a plethora of restriction-modification systems. The genome, together with the methylome, will provide an essential resource for future studies investigating gene regulation, host interaction and pathogenicity of H. typhlonius. In turn, this work can contribute to unraveling the role of Helicobacter in enteric disease.

