April 21, 2020

Improved annotation of the domestic pig genome through integration of Iso-Seq and RNA-seq data.

Our understanding of the pig transcriptome is limited. RNA transcript diversity among nine tissues was assessed using poly(A) selected single-molecule long-read isoform sequencing (Iso-Seq) and Illumina RNA sequencing (RNA-seq) from a single White cross-bred pig. Across tissues, a total of 67,746 unique transcripts were observed, including 60.5% predicted protein-coding, 36.2% long non-coding RNA and 3.3% nonsense-mediated decay transcripts. On average, 90% of the splice junctions were supported by RNA-seq within tissue. A large proportion (80%) represented novel transcripts, mostly produced by known protein-coding genes (70%), while 17% corresponded to novel genes. On average, four transcripts per known gene (tpg) were identified; an increase over current EBI (1.9 tpg) and NCBI (2.9 tpg) annotations and closer to the number reported for the human genome (4.2 tpg). Our new pig genome annotation extended more than 6000 known gene borders (5′ end extension, 3′ end extension, or both) compared to EBI or NCBI annotations. We validated a large proportion of these extensions by independent pig poly(A) selected 3′-RNA-seq data, or human FANTOM5 Cap Analysis of Gene Expression data. Further, we detected 10,465 novel genes (81% non-coding) not reported in current pig genome annotations. More than 80% of these novel genes had transcripts detected in >1 tissue. In addition, more than 80% of novel intergenic genes with at least one transcript detected in liver tissue had H3K4me3 or H3K36me3 peaks mapping to their promoter and gene body, respectively, in independent liver chromatin immunoprecipitation data. These validated results show significant improvement over current pig genome annotations.


April 21, 2020

The transcriptome of Darwin’s bark spider silk glands predicts proteins contributing to dragline silk toughness.

Darwin’s bark spider (Caerostris darwini) produces giant orb webs from dragline silk that can be twice as tough as other silks, making it the toughest biological material. This extreme toughness comes from increased extensibility relative to other draglines. We show C. darwini dragline-producing major ampullate (MA) glands highly express a novel silk gene transcript (MaSp4) encoding a protein that diverges markedly from closely related proteins and contains abundant proline, known to confer silk extensibility, in a unique GPGPQ amino acid motif. This suggests C. darwini evolved distinct proteins that may have increased its dragline’s toughness, enabling giant webs. Caerostris darwini’s MA spinning ducts also appear unusually long, potentially facilitating alignment of silk proteins into extremely tough fibers. Thus, a suite of novel traits from the level of genes to spinning physiology to silk biomechanics are associated with the unique ecology of Darwin’s bark spider, presenting innovative designs for engineering biomaterials.


April 21, 2020

Hybrid-Transcriptome Sequencing and Associated Metabolite Analysis Reveal Putative Genes Involved in Flower Color Difference in Rose Mutants.

Gene mutation is a common phenomenon in nature that often leads to phenotype differences, such as the variations in flower color that frequently occur in roses. With the aim of revealing the genomic information and inner mechanisms, the differences in the levels of both transcription and secondary metabolism between a pair of natural rose mutants were investigated by using hybrid RNA-sequencing and metabolite analysis. Metabolite analysis showed that glycosylated derivatives of pelargonidin, e.g., pelargonidin 3,5-diglucoside and pelargonidin 3-glucoside, which were not detected in white flowers (Rosa ‘White Margo Koster’), constituted the major pigments in pink flowers. Conversely, petal flavonol contents, such as kaempferol-3-glucoside, quercetin 3-glucoside, and rutin, were higher in white flowers. Hybrid RNA-sequencing obtained a total of 107,280 full-length transcripts in rose petal which were annotated in major databases. Differentially expressed gene (DEG) analysis showed that the expression of genes involved in the flavonoid biosynthesis pathway was significantly different, e.g., CHS, FLS, DFR, LDOX, which was verified by qRT-PCR during flowering. Additionally, two MYB transcription factors were found and named RmMYBAN2 and RmMYBPA1, and their expression patterns during flowering were also analyzed. These findings indicate that these genes may be involved in the flower color difference in the rose mutants, and competition between anthocyanin and flavonol biosynthesis is a primary cause of flower color variation, with its regulation reflected by transcriptional and secondary metabolite levels.


April 21, 2020

Full-length transcriptome sequencing and methyl jasmonate-induced expression profile analysis of genes related to patchoulol biosynthesis and regulation in Pogostemon cablin.

Pogostemon cablin (Blanco) Benth. (Patchouli) is an important aromatic and medicinal plant that is widely used in traditional Chinese medicine as well as in the perfume industry. Patchoulol is the primary bioactive component in P. cablin, and its biosynthesis has attracted widespread interest. Previous studies have surveyed the putative genes involved in patchoulol biosynthesis using next-generation sequencing methods; however, technical limitations of short-read sequencing restrict the yield of full-length genes. Additionally, little is known about the expression pattern of genes, especially patchoulol biosynthesis related genes, in response to methyl jasmonate (MeJA). Our understanding of the patchoulol biosynthetic pathway remains largely incomplete. In this study, we analyzed the morphological character and volatile chemical compounds of P. cablin cv. ‘Zhanxiang’, and 39 volatile chemical components were detected in the patchouli leaf using GC-MS, most of which were sesquiterpenes. Furthermore, high-quality RNA isolated from leaves and stems of P. cablin was used to generate the first full-length transcriptome of P. cablin using PacBio isoform sequencing (Iso-Seq). In total, 9.7 Gb clean data and 82,335 full-length UniTransModels were captured. A total of 102 transcripts were annotated as encoding 16 enzymes involved in patchouli alcohol biosynthesis. Consistent with the increase in patchoulol content, the vast majority of genes related to patchoulol biosynthesis were up-regulated after MeJA treatment, indicating that MeJA increased the synthesis of patchoulol by activating the expression of genes involved in its biosynthesis pathway. Moreover, expression pattern analysis also revealed that transcription factors participating in JA regulation of patchoulol biosynthesis were differentially expressed. The current study comprehensively reported the morphological specificity, volatile chemical compositions and transcriptome characterization of the Chinese-cultivated P. cablin cv. ‘Zhanxiang’; these results contribute to a better understanding of the physiological and molecular features of patchouli, especially the molecular mechanism of patchoulol biosynthesis. Our full-length transcriptome data also provide a valuable genetic resource for further studies in patchouli.


April 21, 2020

Analysis of Transcriptome and Epitranscriptome in Plants Using PacBio Iso-Seq and Nanopore-Based Direct RNA Sequencing.

Nanopore sequencing from Oxford Nanopore Technologies (ONT) and Pacific Biosciences (PacBio) single-molecule real-time (SMRT) long-read isoform sequencing (Iso-Seq) are revolutionizing the way transcriptomes are analyzed. These methods offer many advantages over the most widely used high-throughput short-read RNA sequencing (RNA-Seq) approaches and allow a comprehensive analysis of transcriptomes in identifying full-length splice isoforms and several other post-transcriptional events. In addition, direct RNA-Seq provides valuable information about RNA modifications, which are lost during the PCR amplification step in other methods. Here, we present a comprehensive summary of important applications of these technologies in plants, including identification of complex alternative splicing (AS), full-length splice variants, fusion transcripts, and alternative polyadenylation (APA) events. Furthermore, we discuss the impact of the newly developed nanopore direct RNA-Seq in advancing epitranscriptome research in plants. Additionally, we summarize computational tools for identifying and quantifying full-length isoforms and other co/post-transcriptional events and discuss some of the limitations of these methods. Sequencing of transcriptomes using these new single-molecule long-read methods will unravel many aspects of transcriptome complexity in unprecedented ways as compared to previous short-read sequencing approaches. Analysis of plant transcriptomes with these new powerful methods that require minimum sample processing is likely to become the norm and is expected to uncover novel co/post-transcriptional gene regulatory mechanisms that control biological outcomes during plant development and in response to various stresses.


April 21, 2020

Transcriptome analysis based on a combination of sequencing platforms provides insights into leaf pigmentation in Acer rubrum.

Red maple (Acer rubrum L.) is one of the most common and widespread trees with colorful leaves. We found a mutant with red, yellow, and green leaf phenotypes in different branches, which provided ideal materials with the same genetic relationship, and little interference from the environment, for the study of complex metabolic networks that underlie variations in the coloration of leaves. We applied a combination of NGS and SMRT sequencing to various red maple tissues. A total of 125,448 unigenes were obtained, of which 46 and 69 were thought to be related to the synthesis of anthocyanins and carotenoids, respectively. In addition, 88 unigenes were presumed to be involved in the chlorophyll metabolic pathway. Based on a comprehensive analysis of the pigment gene expression network, the mechanisms of leaf color were investigated. The massive accumulation of Cy led to its higher content and proportion than other pigments, which caused the redness of leaves. Yellow coloration was the result of the complete decomposition of chlorophyll pigments, the unmasking of carotenoid pigments, and a slight accumulation of Cy. This study provides a systematic analysis of color variations in the red maple. Moreover, mass sequence data obtained by deep sequencing will provide references for the controlled breeding of red maple.


April 21, 2020

Iso-Seq analysis of the Taxus cuspidata transcriptome reveals the complexity of Taxol biosynthesis.

Taxus cuspidata is well known worldwide for its ability to produce Taxol, one of the top-selling natural anticancer drugs. However, current Taxol production cannot match the increasing needs of the market, and novel strategies should be considered to increase the supply of Taxol. Since the biosynthetic mechanism of Taxol remains largely unknown, elucidating this pathway in detail will be very helpful in exploring alternative methods for Taxol production. Here, we sequenced Taxus cuspidata transcriptomes with next-generation sequencing (NGS) and third-generation sequencing (TGS) platforms. After correction with Illumina reads and removal of redundant reads, more than 180,000 nonredundant transcripts were generated from the raw Iso-Seq data. Using Cogent software and an alignment-based method, we identified a total of 139 cytochrome P450s (CYP450s), 31 BAHD acyltransferases (ACTs) and 1940 transcription factors (TFs). Based on phylogenetic and coexpression analysis, we identified 9 CYP450s and 7 BAHD ACTs as potential lead candidates for Taxol biosynthesis and 6 TFs that are possibly involved in the regulation of this process. Using coexpression analysis of genes known to be involved in Taxol biosynthesis, we elucidated the stem biosynthetic pathway. In addition, we analyzed the expression patterns of 12 characterized genes in the Taxol pathway and speculated that the isoprene precursors for Taxol biosynthesis were mainly synthesized via the MEP pathway. In addition, we found and confirmed that the alternative splicing patterns of some genes varied in different tissues, which may be an important tissue-specific method of posttranscriptional regulation. A strategy was developed to generate corrected full-length or nearly full-length transcripts without assembly to ensure sequence accuracy, thus greatly improving the reliability of coexpression and phylogenetic analysis and greatly facilitating gene cloning and characterization. This strategy was successfully utilized to elucidate the Taxol biosynthetic pathway, which will greatly contribute to the goals of improving the Taxol content in Taxus spp. using molecular breeding or plant management strategies and synthesizing Taxol in microorganisms using synthetic biological technology.


April 21, 2020

De novo transcriptome assembly of the cubomedusa Tripedalia cystophora, including the analysis of a set of genes involved in peptidergic neurotransmission.

The phyla Cnidaria, Placozoa, Ctenophora, and Porifera emerged before the split of proto- and deuterostome animals, about 600 million years ago. These early metazoans are interesting, because they can give us important information on the evolution of various tissues and organs, such as eyes and the nervous system. Generally, cnidarians have simple nervous systems, which use neuropeptides for their neurotransmission, but some cnidarian medusae belonging to the class Cubozoa (box jellyfishes) have advanced image-forming eyes, probably associated with a complex innervation. Here, we describe a new transcriptome database from the cubomedusa Tripedalia cystophora. Based on the combined use of the Illumina and PacBio sequencing technologies, we produced a highly contiguous transcriptome database from T. cystophora. We then developed a software program to discover neuropeptide preprohormones in this database. This script enabled us to annotate seven novel T. cystophora neuropeptide preprohormone cDNAs: One coding for 19 copies of a peptide with the structure pQWLRGRFamide; one coding for six copies of a different RFamide peptide; one coding for six copies of pQPPGVWamide; one coding for eight different neuropeptide copies with the C-terminal LWamide sequence; one coding for thirteen copies of a peptide with the RPRAamide C-terminus; one coding for four copies of a peptide with the C-terminal GRYamide sequence; and one coding for seven copies of a cyclic peptide, of which the most frequent one has the sequence CTGQMCWFRamide. We could also identify orthologs of these seven preprohormones in the cubozoans Alatina alata, Carybdea xaymacana, Chironex fleckeri, and Chiropsalmus quadrumanus. Furthermore, using TBLASTN screening, we could annotate four bursicon-like glycoprotein hormone subunits, five opsins, and 52 other family-A G protein-coupled receptors (GPCRs), which also included two leucine-rich repeats containing G protein-coupled receptors (LGRs) in T. cystophora. The two LGRs are potential receptors for the glycoprotein hormones, while the other GPCRs are candidate receptors for the above-mentioned neuropeptides. By combining Illumina and PacBio sequencing technologies, we have produced a new high-quality de novo transcriptome assembly from T. cystophora that should be a valuable resource for identifying the neuronal components that are involved in vision and other behaviors in cubomedusae.


April 21, 2020

Pentatricopeptide repeat poly(A) binding protein KPAF4 stabilizes mitochondrial mRNAs in Trypanosoma brucei.

In Trypanosoma brucei, most mitochondrial mRNAs undergo editing, and 3′ adenylation and uridylation. The internal sequence changes and terminal extensions are coordinated: pre-editing addition of the short (A) tail protects the edited transcript against 3′-5′ degradation, while post-editing A/U-tailing renders mRNA competent for translation. Participation of a poly(A) binding protein (PABP) in coupling of editing and 3′ modification processes has been inferred, but its identity and mechanism of action remained elusive. We report identification of KPAF4, a pentatricopeptide repeat-containing PABP which sequesters the A-tail and impedes mRNA degradation. Conversely, KPAF4 inhibits uridylation of A-tailed transcripts and, therefore, premature A/U-tailing of partially-edited mRNAs. This quality check point likely prevents translation of incompletely edited mRNAs. We also find that RNA editing substrate binding complex (RESC) mediates the interaction between the 5′ end-bound pyrophosphohydrolase MERS1 and 3′ end-associated KPAF4 to enable mRNA circularization. This event appears to be critical for edited mRNA stability.


April 21, 2020

Comparative transcriptome analysis identified candidate genes involved in mycelium browning in Lentinula edodes.

Lentinula edodes is one of the most popular edible mushroom species in the world and contains useful medicinal components, such as lentinan. The light-induced formation of brown film on the vegetative mycelial tissues of L. edodes is an important process for ensuring the quantity and quality of this edible mushroom. To understand the molecular mechanisms underlying this critical developmental process in L. edodes, we characterized the morphological phenotypic changes in a strain, Chamaram, associated with abnormal brown film formation and compared its genome-wide transcriptional features. In the present study, we performed genome-wide transcriptome analyses of different vegetative mycelium growth phenotypes, namely, early white, normal brown, and defective dark yellow partial brown film phenotypes, which were exposed to different light conditions. The analysis identified clusters of genes specific to the light-induced brown film phenotypes. These genes were significantly associated with light sensing via photoreceptors such as FMN- and FAD-bindings, signal transduction by kinases and GPCRs, melanogenesis via activation of tyrosinases, and cell wall degradation by glucanases, chitinases, and laccases, which suggests these processes are involved in the formation of mycelial browning in L. edodes. Interestingly, hydrophobin genes such as SC1 and SC3 exhibited divergent expression levels in the normal and abnormal brown mycelial films, indicating the ability of these genes to act in fruiting body initiation and formation of dikaryotic mycelia. Furthermore, we identified the up-regulation of glycoside hydrolase domain-containing genes in the normal brown film but not in the abnormal film phenotype, suggesting that cell wall degradation in the normal brown film phenotype is crucial in the developmental processes related to the initiation and formation of fruiting bodies. This study systematically analysed the expression patterns of light-induced browning-related genes in L. edodes. Our findings provide information for further investigations of browning formation mechanisms in L. edodes and a foundation for future L. edodes breeding.


April 21, 2020

The genome of the soybean cyst nematode (Heterodera glycines) reveals complex patterns of duplications involved in the evolution of parasitism genes.

Heterodera glycines, commonly referred to as the soybean cyst nematode (SCN), is an obligatory and sedentary plant parasite that causes over a billion-dollar yield loss to soybean production annually. Although there are genetic determinants that render soybean plants resistant to certain nematode genotypes, resistant soybean cultivars are increasingly ineffective because their multi-year usage has selected for virulent H. glycines populations. The parasitic success of H. glycines relies on the comprehensive re-engineering of an infection site into a syncytium, as well as the long-term suppression of host defense to ensure syncytial viability. At the forefront of these complex molecular interactions are effectors, the proteins secreted by H. glycines into host root tissues. The mechanisms of effector acquisition, diversification, and selection need to be understood before effective control strategies can be developed, but the lack of an annotated genome has been a major roadblock. Here, we use PacBio long-read technology to assemble a H. glycines genome of 738 contigs into 123 Mb with annotations for 29,769 genes. The genome contains significant numbers of repeats (34%), tandem duplicates (18.7 Mb), and horizontal gene transfer events (151 genes). A large number of putative effectors (431 genes) were identified in the genome, many of which were found in transposons. This advance provides a glimpse into the host and parasite interplay by revealing a diversity of mechanisms that give rise to virulence genes in the soybean cyst nematode, including: tandem duplications containing over a fifth of the total gene count, virulence genes hitchhiking in transposons, and 107 horizontal gene transfers not reported in other plant parasitic nematodes thus far. Through extensive characterization of the H. glycines genome, we provide new insights into H. glycines biology and shed light onto the mystery underlying complex host-parasite interactions. This genome sequence is an important prerequisite to enable work towards generating new resistance or control measures against H. glycines.


April 21, 2020

Characterization of a male specific region containing a candidate sex determining gene in Atlantic cod.

The genetic mechanisms determining sex in teleost fishes are highly variable and the master sex determining gene has only been identified in a few species. Here we characterize a male-specific region of 9 kb on linkage group 11 in Atlantic cod (Gadus morhua) harboring a single gene named zkY for zinc knuckle on the Y chromosome. Diagnostic PCR tests of phenotypically sexed males and females confirm the sex-specific nature of the Y-sequence. We identified twelve highly similar autosomal gene copies of zkY, of which eight code for proteins containing the zinc knuckle motif. 3D modeling suggests that the amino acid changes observed in six copies might influence the putative RNA-binding specificity. Cod zkY and the autosomal proteins zk1 and zk2 possess an identical zinc knuckle structure, but only the Y-specific gene zkY was expressed at high levels in the developing larvae before the onset of sex differentiation. Collectively these data suggest zkY as a candidate master masculinization gene in Atlantic cod. PCR amplification of Y-sequences in Arctic cod (Arctogadus glacialis) and Greenland cod (Gadus macrocephalus ogac) suggests that the male-specific region emerged in codfishes more than 7.5 million years ago.


April 21, 2020

Using Pan RNA-Seq Analysis to Reveal the Ubiquitous Existence of 5′ and 3′ End Small RNAs.

In this study, we used pan RNA-seq analysis to reveal the ubiquitous existence of both 5′ and 3′ end small RNAs (5′ and 3′ sRNAs). 5′ and 3′ sRNAs alone can be used to annotate nuclear non-coding and mitochondrial genes at 1-bp resolution and identify new steady RNAs, which are usually transcribed from functional genes. We then provide a simple and cost-effective way to annotate nuclear non-coding and mitochondrial genes and to identify new steady RNAs, particularly long non-coding RNAs (lncRNAs). Using 5′ and 3′ sRNAs, the annotation of human mitochondrial genes was corrected and a novel ncRNA named non-coding mitochondrial RNA 1 (ncMT1) was reported for the first time in this study. We also found that most human tRNA genes have downstream lncRNA genes, such as lncTRS-TGA1-1, and corrected the misunderstanding of them in previous studies. Using 5′, 3′, and intronic sRNAs, we reported for the first time that enzymatic double-stranded RNA (dsRNA) cleavage and RNA interference (RNAi) might be involved in the RNA degradation and gene expression regulation of U1 snRNA in humans. We provided a different perspective on the regulation of gene expression in U1 snRNA. We also provided a novel view on cancer and virus-induced diseases, which may lead to diagnostic or therapeutic targets from the ribonuclease III (RNase III) family and its related pathways. Our findings pave the way toward a rediscovery of dsRNA cleavage and RNAi, challenging classical theories.


April 21, 2020

The tech for the next decade: promises and challenges in genome biology.

The 19th Annual Advances in Genome Biology and Technology (AGBT) meeting came back to Marco Island, Florida, and was held in the renovated venue from 27 February to 2 March 2019. The meeting showed a variety of new technology, both in wet lab and in bioinformatics. This year’s themes included single-cell technology and applications, spatially resolved gene expression measurements, new sequencing platforms, genome assembly and variation, and long and linked reads.


April 21, 2020

Transcriptomic profiles of 33 opium poppy samples in different tissues, growth phases, and cultivars.

Opium poppy is one of the most important medicinal plants and remains the only commercial resource of morphinan-based painkillers. However, little is known about the regulatory mechanisms involved in benzylisoquinoline alkaloid (BIA) biosynthesis in opium poppy. Herein, the full-length transcriptome dataset of opium poppy was constructed for the first time, accompanied by 33 samples of Illumina transcriptome data from different tissues, growth phases and cultivars. The long-read sequencing produced 902,140 raw reads with 55,114 high-quality transcripts, and short-read sequencing produced 1,923,679,864 clean reads with an average Q30 rate of 93%. The high-quality transcripts were subsequently quantified using the short reads, and the expression of each unigene among different samples was calculated as reads per kilobase per million mapped reads (RPKM). These data provide a foundation for opium poppy transcriptomic analysis, which may aid in capturing splice variants and some non-coding RNAs involved in the regulation of BIA biosynthesis. The data can also be used for genome assembly and annotation, which will aid the identification of new transcripts.
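
For readers unfamiliar with the RPKM unit mentioned above, the following minimal sketch shows the standard definition: read count scaled by transcript length in kilobases and by library size in millions of mapped reads. The function name `rpkm` and the example numbers are illustrative assumptions, not taken from the study, and the authors' actual quantification pipeline is not described in the abstract.

```python
def rpkm(read_count: int, transcript_length_bp: int, total_mapped_reads: int) -> float:
    """Reads per kilobase of transcript per million mapped reads.

    Standard definition:
        RPKM = read_count * 1e9 / (transcript_length_bp * total_mapped_reads)
    The 1e9 factor combines the per-kilobase (1e3) and per-million (1e6) scalings.
    """
    if transcript_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("transcript length and total mapped reads must be positive")
    return read_count * 1e9 / (transcript_length_bp * total_mapped_reads)


# Hypothetical example: 500 reads on a 2,000 bp transcript
# in a library of 10 million mapped reads.
example_value = rpkm(500, 2000, 10_000_000)
print(example_value)
```

Because RPKM normalizes for both transcript length and sequencing depth, values can be compared across transcripts within a sample and, with caution, across samples of different depth.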

