Menu
April 21, 2020

Long-read assembly of the Chinese rhesus macaque genome and identification of ape-specific structural variants.

We present a high-quality de novo genome assembly (rheMacS) of the Chinese rhesus macaque (Macaca mulatta) using long-read sequencing and multiplatform scaffolding approaches. Compared to the current Indian rhesus macaque reference genome (rheMac8), rheMacS increases sequence contiguity 75-fold, closing 21,940 of the remaining assembly gaps (60.8 Mbp). We improve gene annotation by generating more than two million full-length transcripts from ten different tissues by long-read RNA sequencing. We sequence resolve 53,916 structural variants (96% novel) and identify 17,000 ape-specific structural variants (ASSVs) based on comparison to ape genomes. Many ASSVs map within ChIP-seq predicted enhancer regions where apes and macaque show diverged enhancer activity and gene expression. We further characterize a subset that may contribute to ape- or great-ape-specific phenotypic traits, including taillessness, brain volume expansion, improved manual dexterity, and large body size. The rheMacS genome assembly serves as an ideal reference for future biomedical and evolutionary studies.


April 21, 2020

CRISPR/CAS9 targeted CAPTURE of mammalian genomic regions for characterization by NGS.

The robust detection of structural variants in mammalian genomes remains a challenge. It is particularly difficult in the case of genetically unstable Chinese hamster ovary (CHO) cell lines with only draft genome assemblies available. We explore the potential of the CRISPR/Cas9 system for the targeted capture of genomic loci containing integrated vectors in CHO-K1-based cell lines followed by next generation sequencing (NGS), and compare it to popular target-enrichment sequencing methods and to whole genome sequencing (WGS). Three different CRISPR/Cas9-based techniques were evaluated; all of them allow for amplification-free enrichment of target genomic regions in the range from 5 to 60 fold, and for recovery of ~15 kb-long sequences with no sequencing artifacts introduced. The utility of these protocols has been proven by the identification of transgene integration sites and flanking sequences in three CHO cell lines. The long enriched fragments helped to identify Escherichia coli genome sequences co-integrated with vectors, and were further characterized by Whole Genome Sequencing (WGS). Other advantages of CRISPR/Cas9-based methods are the ease of bioinformatics analysis, potential for multiplexing, and the production of long target templates for real-time sequencing.


April 21, 2020

Strain-level metagenomic assignment and compositional estimation for long reads with MetaMaps.

Metagenomic sequence classification should be fast, accurate and information-rich. Emerging long-read sequencing technologies promise to improve the balance between these factors but most existing methods were designed for short reads. MetaMaps is a new method, specifically developed for long reads, capable of mapping a long-read metagenome to a comprehensive RefSeq database with >12,000 genomes in <16?GB or RAM on a laptop computer. Integrating approximate mapping with probabilistic scoring and EM-based estimation of sample composition, MetaMaps achieves >94% accuracy for species-level read assignment and r2?>?0.97 for the estimation of sample composition on both simulated and real data when the sample genomes or close relatives are present in the classification database. To address novel species and genera, which are comparatively harder to predict, MetaMaps outputs mapping locations and qualities for all classified reads, enabling functional studies (e.g. gene presence/absence) and detection of incongruities between sample and reference genomes.


April 21, 2020

Genome analysis of the rice coral Montipora capitata.

Corals comprise a biomineralizing cnidarian, dinoflagellate algal symbionts, and associated microbiome of prokaryotes and viruses. Ongoing efforts to conserve coral reefs by identifying the major stress response pathways and thereby laying the foundation to select resistant genotypes rely on a robust genomic foundation. Here we generated and analyzed a high quality long-read based ~886 Mbp nuclear genome assembly and transcriptome data from the dominant rice coral, Montipora capitata from Hawai’i. Our work provides insights into the architecture of coral genomes and shows how they differ in size and gene inventory, putatively due to population size variation. We describe a recent example of foreign gene acquisition via a bacterial gene transfer agent and illustrate the major pathways of stress response that can be used to predict regulatory components of the transcriptional networks in M. capitata. These genomic resources provide insights into the adaptive potential of these sessile, long-lived species in both natural and human influenced environments and facilitate functional and population genomic studies aimed at Hawaiian reef restoration and conservation.


April 21, 2020

Interspecies conservation of organisation and function between nonhomologous regional centromeres.

Despite the conserved essential function of centromeres, centromeric DNA itself is not conserved. The histone-H3 variant, CENP-A, is the epigenetic mark that specifies centromere identity. Paradoxically, CENP-A normally assembles on particular sequences at specific genomic locations. To gain insight into the specification of complex centromeres, here we take an evolutionary approach, fully assembling genomes and centromeres of related fission yeasts. Centromere domain organization, but not sequence, is conserved between Schizosaccharomyces pombe, S. octosporus and S. cryophilus with a central CENP-ACnp1 domain flanked by heterochromatic outer-repeat regions. Conserved syntenic clusters of tRNA genes and 5S rRNA genes occur across the centromeres of S. octosporus and S. cryophilus, suggesting conserved function. Interestingly, nonhomologous centromere central-core sequences from S. octosporus and S. cryophilus are recognized in S. pombe, resulting in cross-species establishment of CENP-ACnp1 chromatin and functional kinetochores. Therefore, despite the lack of sequence conservation, Schizosaccharomyces centromere DNA possesses intrinsic conserved properties that promote assembly of CENP-A chromatin.


April 21, 2020

Arcobacter cryaerophilus Isolated From New Zealand Mussels Harbor a Putative Virulence Plasmid.

A wide range of Arcobacter species have been described from shellfish in various countries but their presence has not been investigated in Australasia, in which shellfish are a popular delicacy. Since several arcobacters are considered to be emerging pathogens, we undertook a small study to evaluate their presence in several different shellfish, including greenshell mussels, oysters, and abalone (paua) in New Zealand. Arcobacter cryaerophilus, a species associated with human gastroenteritis, was the only species isolated, from greenshell mussels. Whole-genome sequencing revealed a range of genomic traits in these strains that were known or associated virulence factors. Furthermore, we describe the first putative virulence plasmid in Arcobacter, containing lytic, immunoavoidance, adhesion, antibiotic resistance, and gene transfer traits, among others. Complete genome sequence determination using a combination of long- and short-read genome sequencing strategies, was needed to identify the plasmid, clearly identifying its benefits. The potential for plasmids to disseminate virulence traits among Arcobacter and other species warrants further consideration by researchers interested in the risks to public health from these organisms.


April 21, 2020

Associated Bacteria Affect Sexual Reproduction by Altering Gene Expression and Metabolic Processes in a Biofilm Inhabiting Diatom.

Diatoms are unicellular algae with a fundamental role in global biogeochemical cycles as major primary producers at the base of aquatic food webs. In recent years, chemical communication between diatoms and associated bacteria has emerged as a key factor in diatom ecology, spurred by conceptual and technological advancements to study the mechanisms underlying these interactions. Here, we use a combination of physiological, transcriptomic, and metabolomic approaches to study the influence of naturally co-existing bacteria, Maribacter sp. and Roseovarius sp., on the sexual reproduction of the biofilm inhabiting marine pennate diatom Seminavis robusta. While Maribacter sp. severely reduces the reproductive success of S. robusta cultures, Roseovarius sp. slightly enhances it. Contrary to our expectation, we demonstrate that the effect of the bacterial exudates is not caused by altered cell-cycle regulation prior to the switch to meiosis. Instead, Maribacter sp. exudates cause a reduced production of diproline, the sexual attraction pheromone of S. robusta. Transcriptomic analyses show that this is likely an indirect consequence of altered intracellular metabolic fluxes in the diatom, especially those related to amino acid biosynthesis, oxidative stress response, and biosynthesis of defense molecules. This study provides the first insights into the influence of bacteria on diatom sexual reproduction and adds a new dimension to the complexity of a still understudied phenomenon in natural diatom populations.


April 21, 2020

Genomic Analyses Reveal Evidence of Independent Evolution, Demographic History, and Extreme Environment Adaptation of Tibetan Plateau Agaricus bisporus.

Agaricus bisporus distributed in the Tibetan Plateau of China has high-stress resistance that is valuable for breeding improvements. However, its evolutionary history, specialization, and adaptation to the extreme Tibetan Plateau environment are largely unknown. Here, we performed de novo genome sequencing of a representative Tibetan Plateau wild strain ABM and comparative genomic analysis with the reported European strain H97 and H39. The assembled ABM genome was 30.4 Mb in size, and comprised 8,562 protein-coding genes. The ABM genome shared highly conserved syntenic blocks and a few inversions with H97 and H39. The phylogenetic tree constructed by 1,276 single-copy orthologous genes in nine fungal species showed that the Tibetan Plateau and European A. bisporus diverged ~5.5 million years ago. Population genomic analysis using genome resequencing of 29 strains revealed that the Tibetan Plateau population underwent significant differentiation from the European and American populations and evolved independently, and the global climate changes critically shaped the demographic history of the Tibetan Plateau population. Moreover, we identified key genes that are related to the cell wall and membrane system, and the development and defense systems regulated A. bisporus adapting to the harsh Tibetan Plateau environment. These findings highlight the value of genomic data in assessing the evolution and adaptation of mushrooms and will enhance future genetic improvements of A. bisporus.


April 21, 2020

Multi-platform discovery of haplotype-resolved structural variation in human genomes.

The incomplete identification of structural variants (SVs) from whole-genome sequencing data limits studies of human genetic diversity and disease association. Here, we apply a suite of long-read, short-read, strand-specific sequencing technologies, optical mapping, and variant discovery algorithms to comprehensively analyze three trios to define the full spectrum of human genetic variation in a haplotype-resolved manner. We identify 818,054 indel variants (<50?bp) and 27,622 SVs (=50?bp) per genome. We also discover 156 inversions per genome and 58 of the inversions intersect with the critical regions of recurrent microdeletion and microduplication syndromes. Taken together, our SV callsets represent a three to sevenfold increase in SV detection compared to most standard high-throughput sequencing studies, including those from the 1000 Genomes Project. The methods and the dataset presented serve as a gold standard for the scientific community allowing us to make recommendations for maximizing structural variation sensitivity for future genome sequencing studies.


April 21, 2020

Occurrence and Characterization of mcr-1-Positive Escherichia coli Isolated From Food-Producing Animals in Poland, 2011-2016.

The emergence of plasmid-mediated colistin resistance (mcr genes) threatens the effectiveness of polymyxins, which are last-resort drugs to treat infections by multidrug- and carbapenem-resistant Gram-negative bacteria. Based on the occurrence of colistin resistance the aims of the study were to determine possible resistance mechanisms and then characterize the mcr-positive Escherichia coli. The research used material from the Polish national and EU harmonized antimicrobial resistance (AMR) monitoring programs. A total of 5,878 commensal E. coli from fecal samples of turkeys, chickens, pigs, and cattle collected in 2011-2016 were screened by minimum inhibitory concentration (MIC) determination for the presence of resistance to colistin (R) defined as R > 2 mg/L. Strains with MIC = 2 mg/L isolated in 2014-2016 were also included. A total of 128 isolates were obtained, and most (66.3%) had colistin MIC of 2 mg/L. PCR revealed mcr-1 in 80 (62.5%) isolates recovered from 61 turkeys, 11 broilers, 2 laying hens, 1 pig, and 1 bovine. No other mcr-type genes (including mcr-2 to -5) were detected. Whole-genome sequencing (WGS) of the mcr-1-positive isolates showed high diversity in the multi-locus sequence types (MLST) of E. coli, plasmid replicons, and AMR and virulence genes. Generally mcr-1.1 was detected on the same contig as the IncX4 (76.3%) and IncHI2 (6.3%) replicons. One isolate harbored mcr-1.1 on the chromosome. Various extended-spectrum beta-lactamase (blaSHV-12, blaCTX-M-1, blaCTX-M-15, blaTEM-30, blaTEM-52, and blaTEM-135) and quinolone resistance genes (qnrS1, qnrB19, and chromosomal gyrA, parC, and parE mutations) were present in the mcr-1.1-positive E. coli. A total of 49 sequence types (ST) were identified, ST354, ST359, ST48, and ST617 predominating. One isolate, identified as ST189, belonged to atypical enteropathogenic E. coli. Our findings show that mcr-1.1 has spread widely among production animals in Poland, particularly in turkeys and appears to be transferable mainly by IncX4 and IncHI2 plasmids spread across diverse E. coli lineages. Interestingly, most of these mcr-1-positive E. coli would remain undetected using phenotypic methods with the current epidemiological cut-off value (ECOFF). The appearance and spread of mcr-1 among various animals, but notably in turkeys, might be considered a food chain, and public health hazard.


April 21, 2020

Efomycins K and L From a Termite-Associated Streptomyces sp. M56 and Their Putative Biosynthetic Origin.

Two new elaiophylin derivatives, efomycins K (1) and L (2), and five known elaiophylin derivatives (3-7) were isolated from the termite-associated Streptomyces sp. M56. The structures were determined by 1D and 2D NMR and HR-ESIMS analyses and comparative CD spectroscopy. The putative gene cluster responsible for the production of the elaiophylin and efomycin derivatives was identified based on significant homology to related clusters. Phylogenetic analysis of gene cluster domains was used to provide a biosynthetic rational for these new derivatives and to demonstrate how a single biosynthetic pathway can produce diverse structures.


April 21, 2020

Comparative Genomics of Marine Sponge-Derived Streptomyces spp. Isolates SM17 and SM18 With Their Closest Terrestrial Relatives Provides Novel Insights Into Environmental Niche Adaptations and Secondary Metabolite Biosynthesis Potential.

The emergence of antibiotic resistant microorganisms has led to an increased need for the discovery and development of novel antimicrobial compounds. Frequent rediscovery of the same natural products (NPs) continues to decrease the likelihood of the discovery of new compounds from soil bacteria. Thus, efforts have shifted toward investigating microorganisms and their secondary metabolite biosynthesis potential, from diverse niche environments, such as those isolated from marine sponges. Here we investigated at the genomic level two Streptomyces spp. strains, namely SM17 and SM18, isolated from the marine sponge Haliclona simulans, with previously reported antimicrobial activity against clinically relevant pathogens; using single molecule real-time (SMRT) sequencing. We performed a series of comparative genomic analyses on SM17 and SM18 with their closest terrestrial relatives, namely S. albus J1074 and S. pratensis ATCC 33331 respectively; in an effort to provide further insights into potential environmental niche adaptations (ENAs) of marine sponge-associated Streptomyces, and on how these adaptations might be linked to their secondary metabolite biosynthesis potential. Prediction of secondary metabolite biosynthetic gene clusters (smBGCs) indicated that, even though the marine isolates are closely related to their terrestrial counterparts at a genomic level; they potentially produce different compounds. SM17 and SM18 displayed a better ability to grow in high salinity medium when compared to their terrestrial counterparts, and further analysis of their genomes indicated that they possess a pool of 29 potential ENA genes that are absent in S. albus J1074 and S. pratensis ATCC 33331. This ENA gene pool included functional categories of genes that are likely to be related to niche adaptations and which could be grouped based on potential biological functions such as osmotic stress, defense; transcriptional regulation; symbiotic interactions; antimicrobial compound production and resistance; ABC transporters; together with horizontal gene transfer and defense-related features.


April 21, 2020

Complete Assembly of the Genome of an Acidovorax citrulli Strain Reveals a Naturally Occurring Plasmid in This Species.

Acidovorax citrulli is the causal agent of bacterial fruit blotch (BFB), a serious threat to cucurbit crop production worldwide. Based on genetic and phenotypic properties, A. citrulli strains are divided into two major groups: group I strains have been generally isolated from melon and other non-watermelon cucurbits, while group II strains are closely associated with watermelon. In a previous study, we reported the genome of the group I model strain, M6. At that time, the M6 genome was sequenced by MiSeq Illumina technology, with reads assembled into 139 contigs. Here, we report the assembly of the M6 genome following sequencing with PacBio technology. This approach not only allowed full assembly of the M6 genome, but it also revealed the occurrence of a ~53 kb plasmid. The M6 plasmid, named pACM6, was further confirmed by plasmid extraction, Southern-blot analysis of restricted fragments and obtention of M6-derivative cured strains. pACM6 occurs at low copy numbers (average of ~4.1 ± 1.3 chromosome equivalents) in A. citrulli M6 and contains 63 open reading frames (ORFs), most of which (55.6%) encoding hypothetical proteins. The plasmid contains several genes encoding type IV secretion components, and typical plasmid-borne genes involved in plasmid maintenance, replication and transfer. The plasmid also carries an operon encoding homologs of a Fic-VbhA toxin-antitoxin (TA) module. Transcriptome data from A. citrulli M6 revealed that, under the tested conditions, the genes encoding the components of this TA system are among the highest expressed genes in pACM6. Whether this TA module plays a role in pACM6 maintenance is still to be determined. Leaf infiltration and seed transmission assays revealed that, under tested conditions, the loss of pACM6 did not affect the virulence of A. citrulli M6. We also show that pACM6 or similar plasmids are present in several group I strains, but absent in all tested group II strains of A. citrulli.


April 21, 2020

A Newly Isolated Bacillus subtilis Strain Named WS-1 Inhibited Diarrhea and Death Caused by Pathogenic Escherichia coli in Newborn Piglets.

Bacillus subtilis is recognized as a safe and reliable human and animal probiotic and is associated with bioactivities such as production of vitamin and immune stimulation. Additionally, it has great potential to be used as an alternative to antimicrobial drugs, which is significant in the context of antibiotic abuse in food animal production. In this study, we isolated one strain of B. subtilis, named WS-1, from apparently healthy pigs growing with sick cohorts on one Escherichia coli endemic commercial pig farm in Guangdong, China. WS-1 can strongly inhibit the growth of pathogenic E. coli in vitro. The B. subtilis strain WS-1 showed typical Bacillus characteristics by endospore staining, biochemical test, enzyme activity analysis, and 16S rRNA sequence analysis. Genomic analysis showed that the B. subtilis strain WS-1 shares 100% genomic synteny with B. subtilis with a size of 4,088,167 bp. Importantly, inoculation of newborn piglets with 1.5 × 1010 CFU of B. subtilis strain WS-1 by oral feeding was able to clearly inhibit diarrhea (p < 0.05) and death (p < 0.05) caused by pathogenic E. coli in piglets. Furthermore, histopathological results showed that the WS-1 strain could protect small intestine from lesions caused by E. coli infection. Collectively, these findings suggest that the probiotic B. subtilis strain WS-1 acts as a potential biocontrol agent protecting pigs from pathogenic E. coli infection. Importance: In this work, one B. subtilis strain (WS-1) was successfully isolated from apparently healthy pigs growing with sick cohorts on one E. coli endemic commercial pig farm in Guangdong, China. The B. subtilis strain WS-1 was identified to inhibit the growth of pathogenic E. coli both in vitro and in vivo, indicating its potential application in protecting newborn piglets from diarrhea caused by E. coli infections. The isolation and characterization will help better understand this bacterium, and the strain WS-1 can be further explored as an alternative to antimicrobial drugs to protect human and animal health.


April 21, 2020

Deep convolutional neural networks for accurate somatic mutation detection.

Accurate detection of somatic mutations is still a challenge in cancer analysis. Here we present NeuSomatic, the first convolutional neural network approach for somatic mutation detection, which significantly outperforms previous methods on different sequencing platforms, sequencing strategies, and tumor purities. NeuSomatic summarizes sequence alignments into small matrices and incorporates more than a hundred features to capture mutation signals effectively. It can be used universally as a stand-alone somatic mutation detection method or with an ensemble of existing methods to achieve the highest accuracy.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.