April 21, 2020

Analysis of transcripts and splice isoforms in Medicago sativa L. by single-molecule long-read sequencing.

The full-length transcriptome of alfalfa was analyzed with PacBio single-molecule long-read sequencing technology. The transcriptome data provided full-length sequences and gene isoforms of transcripts in alfalfa, which will improve genome annotation and enhance our understanding of the gene structure of alfalfa. Alfalfa (Medicago sativa L.) is an important forage crop planted worldwide. Because of the complexity of its genome and the lack of a finished whole-genome sequence, the sequences and complete structures of mRNA transcripts remain unclear in alfalfa. In this study, single-molecule long-read sequencing was applied to investigate the alfalfa transcriptome using the Pacific Biosciences platform, and a total of 113,321 transcripts were obtained from young, mature and senescent leaves. We identified 72,606 open reading frames (ORFs), including 46,616 full-length ORFs, 1670 transcription factors (TFs) from 54 TF families and 44,040 simple sequence repeats from 30,797 sequences. A total of 7568 alternative splicing events were identified, the majority of which were intron retention events. In addition, we identified 17,740 long non-coding RNAs. Our results show the feasibility of deep sequencing of full-length RNA from the alfalfa transcriptome at the single-molecule level.


April 21, 2020

Alternative polyadenylation coordinates embryonic development, sexual dimorphism and longitudinal growth in Xenopus tropicalis.

RNA alternative polyadenylation contributes to the complexity of information transfer from genome to phenome, thus amplifying gene function. Here, we report the first X. tropicalis resource with 127,914 alternative polyadenylation (APA) sites derived from embryos and adults. Overall, APA networks play central roles in coordinating the maternal-zygotic transition (MZT) in embryos, sexual dimorphism in adults and longitudinal growth from embryos to adults. APA sites coordinate reprogramming in embryos before the MZT, but developmental events after the MZT due to zygotic genome activation. The APA transcriptomes of young adults are more variable than growing adults and male frog APA transcriptomes are more divergent than females. The APA profiles of young females were similar to embryos before the MZT. Enriched pathways in developing embryos were distinct across the MZT and noticeably segregated from adults. Briefly, our results suggest that the minimal functional units in genomes are alternative transcripts as opposed to genes.


April 21, 2020

SMRT sequencing of the full-length transcriptome of the Sunda pangolin (Manis javanica).

It is widely known that transcriptional diversity contributes greatly to biological regulation in eukaryotes. With the development of next-generation sequencing (NGS) technologies, several studies on RNA sequencing have considerably improved our understanding of transcriptome complexity. However, obtaining full-length (FL) transcripts remains a considerable challenge because of difficulties in short read-based assembly. In the present study, single-molecule real-time (SMRT) sequencing and NGS were combined to generate the complete and FL transcriptome of Manis javanica. The results provide a comprehensive set of reference transcripts and hence contribute to the improved annotation of the M. javanica genome. We obtained 45,530 high-confidence transcripts from 19,109 genic loci, of which 8014 genes have not yet been annotated within the M. javanica genome. Furthermore, we revealed 8824 long-chain noncoding RNAs (lncRNAs). A total of 30,199 alternative splicing (AS) and 11,184 alternative polyadenylation (APA) events were identified in the sequencing data. The structure and expression level of 59 digestive enzyme genes, including 13 carbohydrase genes, 28 lipase genes and 18 protease genes, were analyzed, which might provide original data for further research on M. javanica. Copyright © 2019 Elsevier B.V. All rights reserved.


April 21, 2020

The developmental dynamics of the Populus stem transcriptome.

The Populus shoot undergoes primary growth (longitudinal growth) followed by secondary growth (radial growth), which produces biomass that is an important source of energy worldwide. We adopted joint PacBio Iso-Seq and RNA-seq analysis to identify differentially expressed transcripts along a developmental gradient from the shoot apex to the fifth internode of Populus Nanlin895. We obtained 87 150 full-length transcripts, including 2081 new isoforms and 62 058 new alternatively spliced isoforms, most of which were produced by intron retention, that were used to update the Populus annotation. Among these novel isoforms, there are 1187 long non-coding RNAs and 356 fusion genes. Using this annotation, we found 15 838 differentially expressed transcripts along the shoot developmental gradient, of which 1216 were transcription factors (TFs). Only a few of these genes were reported previously. The differential expression of these TFs suggests that they may play important roles in primary and secondary growth. AP2, ARF, YABBY and GRF TFs are highly expressed in the apex, whereas NAC, bZIP, PLATZ and HSF TFs are likely to be important for secondary growth. Overall, our findings provide evidence that long-read sequencing can complement short-read sequencing for cataloguing and quantifying eukaryotic transcripts and increase our understanding of the vital and dynamic process of shoot development. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.


April 21, 2020

Full-Length Transcriptome Analysis of the Genes Involved in Tocopherol Biosynthesis in Torreya grandis.

The seeds of Torreya grandis (Cephalotaxaceae) are rich in tocopherols, which are essential components of the human diet as a result of their function in scavenging reactive oxygen and free radicals. Different T. grandis cultivars (10 cultivars selected in this study were researched, and their information is shown in Table S1 of the Supporting Information) vary enormously in their tocopherol contents (0.28-11.98 mg/100 g). However, little is known about the molecular basis and regulatory mechanisms of tocopherol biosynthesis in T. grandis kernels. Here, we applied single-molecule real-time (SMRT) sequencing to T. grandis (X08 cultivar) for the first time and obtained a total of 97,211 full-length transcripts. We proposed the biosynthetic pathway of tocopherol and identified eight full-length transcripts encoding enzymes potentially involved in tocopherol biosynthesis in T. grandis. The results of the correlation analysis between the tocopherol content and gene expression level in the 10 selected cultivars and different kernel developmental stages of the X08 cultivar suggested that homogentisate phytyltransferase coding gene (TgVTE2b) and γ-tocopherol methyltransferase coding gene (TgVTE4) may be key players in tocopherol accumulation in the kernels of T. grandis. Subcellular localization assays showed that both TgVTE2b and TgVTE4 were localized to the chloroplast. We also identified candidate regulatory genes similar to WRI1 and DGAT1 in Arabidopsis that may be involved in the regulation of tocopherol biosynthesis. Our findings provide valuable genetic information for T. grandis using full-length transcriptomic analysis, elucidating the candidate genes and key regulatory genes involved in tocopherol biosynthesis. This information will be critical for further molecular-assisted screening and breeding of T. grandis genotypes with high tocopherol contents.


April 21, 2020

Midrib Sucrose Accumulation and Sugar Transporter Gene Expression in YCS-Affected Sugarcane Leaves

Sucrose accumulation and decreased photosynthesis are early symptoms of yellow canopy syndrome (YCS) in sugarcane (Saccharum spp.), and precede the visual yellowing of the leaves. To investigate broad-scale gene expression changes during YCS-onset, transcriptome analyses coupled to metabolome analyses were performed. Across leaf tissues, the greatest number of differentially expressed genes related to the chloroplast, and the metabolic processes relating to nitrogen and carbohydrates. Five genes represented 90% of the TPM (Transcripts Per Million) associated with the downregulation of transcription during YCS-onset, which included PSII D1 (PsbA). This differential expression was consistent with a feedback regulatory effect upon photosynthesis. Broad-scale gene expression analyses did not reveal a cause for leaf sugar accumulation during YCS-onset. Interestingly, the midrib showed the greatest accumulation of sugars, followed by symptomatic lamina. To investigate if phloem loading/reloading may be compromised on a gene expression level – to lead to leaf sucrose accumulation – sucrose transport-related proteins of SWEETs, Sucrose Transporters (SUTs), H+-ATPases and H+-pyrophosphatases (H+-PPases) were characterised from a sugarcane transcriptome and expression analysed. Two clusters of Type I H+-PPases, with one upregulated and the other downregulated, were evident. Although less pronounced, a similar pattern of change was observed for the H+-ATPases. The disaccharide transporting SWEETs were downregulated after visual symptoms were present, and a monosaccharide transporting SWEET upregulated preceding, as well as after, symptom development. SUT gene expression was the least responsive to YCS development. The results are consistent with a reduction of photoassimilate movement through the phloem leading to sucrose build-up in the leaf.


April 21, 2020

Full-Length Transcriptome Sequencing and the Discovery of New Transcripts in the Unfertilized Eggs of Zebrafish (Danio rerio).

Understanding early gene expression in zebrafish embryos is a prerequisite for developmental biology research. In this study, 1,629,447 polymerase reads were first obtained from the unfertilized eggs of zebrafish via full-length transcriptome sequencing using the PacBio RS II platform. Then, 102,920 unique isoforms were obtained by correction, clustering and comparison with the zebrafish genome. A total of 12,782 genes in the genome were captured, accounting for 39.71% of all annotated genes. Approximately 62.27% of the 12,782 genes were alternatively spliced. GO and KEGG annotations revealed that the unfertilized eggs primarily stored genes that participate in RNA processing and nuclear protein complex composition. Based on the PacBio data aligned with the genome, 3,970 fusion genes, 819 ncRNAs, and 84 new transcripts were predicted. Illumina RNA-seq and RT-qPCR detection found that the expression of two new transcripts, PB.5289.1 and PB.10209.1, was significantly up-regulated at the 2-cell stage and down-regulated rapidly thereafter, suggesting their involvement in minor ZGA during early embryonic development. This study indicated that the unfertilized eggs of zebrafish may have retained genes directly related to cell division and development to initiate the subsequent development in a limited space and time. On the other hand, NTRs or new transcriptome regions in the genome were discovered, which provided new clues regarding ZGA of MZT during early embryonic development in fish. Copyright © 2019 Mehjabin et al.


April 21, 2020

Genome and transcriptome sequencing of the astaxanthin-producing green microalga, Haematococcus pluvialis.

Haematococcus pluvialis is a freshwater species of Chlorophyta, family Haematococcaceae. It is well known for its capacity to synthesize high amounts of astaxanthin, which is a strong antioxidant that has been utilized in aquaculture and cosmetics. To improve astaxanthin yield and to establish genetic resources for H. pluvialis, we performed whole-genome sequencing, assembly, and annotation of this green microalga. A total of 83.1 Gb of raw reads were sequenced. After filtering the raw reads, we subsequently generated a draft assembly with a genome size of 669.0 Mb, a scaffold N50 of 288.6 kb, and predicted 18,545 genes. We also established a robust phylogenetic tree from 14 representative algae species. With additional transcriptome data, we revealed some novel potential genes that are involved in the synthesis, accumulation, and regulation of astaxanthin production. In addition, we generated an isoform-level reference transcriptome set of 18,483 transcripts with high confidence. Alternative splicing analysis demonstrated that intron retention is the most frequent mode. In summary, we report the first draft genome of H. pluvialis. These genomic resources along with transcriptomic data provide a solid foundation for the discovery of the genetic basis for theoretical and commercial astaxanthin enrichment.


April 21, 2020

Characterization of Mauritian cynomolgus macaque FcγR alleles using long-read sequencing.

The FcγRs are immune cell surface proteins that bind IgG and facilitate cytokine production, phagocytosis, and Ab-dependent, cell-mediated cytotoxicity. FcγRs play a critical role in immunity; variation in these genes is implicated in autoimmunity and other diseases. Cynomolgus macaques are an excellent animal model for many human diseases, and Mauritian cynomolgus macaques (MCMs) are particularly useful because of their restricted genetic diversity. Previous studies of MCM immune gene diversity have focused on the MHC and killer cell Ig-like receptor. In this study, we characterize FcγR diversity in 48 MCMs using PacBio long-read sequencing to identify novel alleles of each of the four expressed MCM FcγR genes. We also developed a high-throughput FcγR genotyping assay, which we used to determine allele frequencies and identify FcγR haplotypes in more than 500 additional MCMs. We found three alleles for FcγR1A, seven each for FcγR2A and FcγR2B, and four for FcγR3A; these segregate into eight haplotypes. We also assessed whether different FcγR alleles confer different Ab-binding affinities by surface plasmon resonance and found minimal difference in binding affinities across alleles for a panel of wild type and Fc-engineered human IgG. This work suggests that although MCMs may not fully represent the diversity of FcγR responses in humans, they may offer highly reproducible results for mAb therapy and toxicity studies. Copyright © 2018 by The American Association of Immunologists, Inc.


April 21, 2020

Full-length transcriptome sequences obtained by a combination of sequencing platforms applied to heat shock proteins and polyunsaturated fatty acids biosynthesis in Pyropia haitanensis

Pyropia haitanensis is a high-yield commercial seaweed in China. Pyropia haitanensis farms often suffer from problems such as severe germplasm degeneration, while the mechanisms underlying resistance to abiotic stresses remain unknown because genomic information is lacking. Although many previous studies focused on using next-generation sequencing (NGS) technologies, the short-read sequences generated by NGS generally prevent the assembly of full-length transcripts and thus limit the screening of functional genes. In the present study, which was based on hybrid sequencing (NGS and single-molecule real-time sequencing) of the P. haitanensis thallus transcriptome, we obtained high-quality full-length transcripts with a mean length of 2998 bp and an N50 value of 3366 bp. A total of 14,773 unigenes (93.52%) were annotated in at least one database, while approximately 60% of all unigenes were assembled by short Illumina reads. Moreover, we suggest that the genes involved in the biosynthesis of polyunsaturated fatty acids and heat shock proteins play an important role in the process of development and resistance to abiotic stresses in P. haitanensis. The present study, together with previously published ones, may facilitate seaweed transcriptome research.


April 21, 2020

Label-free quantitative proteomic analysis of Panax ginseng leaves upon exposure to heat stress.

Ginseng is one of the well-known medicinal plants, exhibiting diverse medicinal effects. Its roots possess anticancer and antiaging properties and are being used in the medical systems of East Asian countries. It is grown in low-light and low-temperature conditions, and its growth is strongly inhibited at temperatures above 25°C. However, the molecular responses of ginseng to heat stress are currently poorly understood, especially at the protein level. We used a shotgun proteomics approach to investigate the effect of heat stress on ginseng leaves. We monitored their photosynthetic efficiency to confirm physiological responses to a high-temperature stress. The results showed a reduction in photosynthetic efficiency on heat treatment (35°C) starting at 48 h. Label-free quantitative proteome analysis led to the identification of 3,332 proteins, of which 847 were differentially modulated in response to heat stress. The MapMan analysis showed that the proteins with increased abundance were mainly associated with antioxidant and translation-regulating activities, whereas the proteins related to the receptor and structural-binding activities exhibited decreased abundance. Several other proteins including chaperones, G-proteins, calcium-signaling proteins, transcription factors, and transfer/carrier proteins were specifically downregulated. These results increase our understanding of heat stress responses in the leaves of ginseng at the protein level, for the first time providing a resource for the scientific community.


April 21, 2020

NCF1 (p47phox)-deficient chronic granulomatous disease: comprehensive genetic and flow cytometric analysis.

Mutations in NCF1 (p47phox) cause autosomal recessive chronic granulomatous disease (CGD) with abnormal dihydrorhodamine (DHR) assay and absent p47phox protein. Genetic identification of NCF1 mutations is complicated by adjacent highly conserved (>98%) pseudogenes (NCF1B and NCF1C). NCF1 has GTGT at the start of exon 2, whereas the pseudogenes each delete 1 GT (ΔGT). In p47phox CGD, the most common mutation is ΔGT in NCF1 (c.75_76delGT; p.Tyr26fsX26). Sequence homology between NCF1 and its pseudogenes precludes reliable use of standard Sanger sequencing for NCF1 mutations and for confirming carrier status. We first established by flow cytometry that neutrophils from p47phox CGD patients had negligible p47phox expression, whereas those from p47phox CGD carriers had ~60% of normal p47phox expression, independent of the specific mutation in NCF1. We developed a droplet digital polymerase chain reaction (ddPCR) with 2 distinct probes, recognizing either the wild-type GTGT sequence or the ΔGT sequence. A second ddPCR established copy number by comparison with the single-copy telomerase reverse transcriptase gene, TERT. We showed that 84% of p47phox CGD patients were homozygous for ΔGT NCF1. The ddPCR assay also enabled determination of carrier status of relatives. Furthermore, only 79.2% of normal volunteers had 2 copies of GTGT per 6 total (NCF1/NCF1B/NCF1C) copies, designated 2/6; 14.7% had 3/6, and 1.6% had 4/6 GTGT copies. In summary, flow cytometry for p47phox expression quickly identifies patients and carriers of p47phox CGD, and genomic ddPCR identifies patients and carriers of ΔGT NCF1, the most common mutation in p47phox CGD.


April 21, 2020

Identification and characterisation of anti-Pseudomonas aeruginosa proteins in mucus of the brown garden snail, Cornu aspersum.

Background: Novel antimicrobial treatments are urgently needed. Previous work has shown that the mucus of the brown garden snail (Cornu aspersum) has antimicrobial properties, in particular against type culture collection strains of Pseudomonas aeruginosa. We hypothesised that it would also be effective against clinical isolates of the bacterium and that investigation of fractions of the mucus would identify one or more proteins with anti-pseudomonal properties, which could be further characterised. Materials and methods: Mucus was extracted from snails collected from the wild. Antimicrobial activity against laboratory and clinical isolates of Ps. aeruginosa was determined in disc diffusion assays. Mucus was purified using size exclusion chromatography and fractions containing anti-pseudomonal activity identified. Mass spectroscopy and high performance liquid chromatography analysis of these fractions yielded partial peptide sequences. These were used to interrogate an RNA transcriptome generated from whole snails. Results: Mucus from C. aspersum inhibited growth of type collection strains and clinical isolates of Ps. aeruginosa. Four novel C. aspersum proteins were identified; at least three are likely to have antimicrobial properties. The most interesting is a 37.4 kDa protein whilst smaller proteins, one 17.5 kDa and one 18.6 kDa, also appear to have activity against Ps. aeruginosa. Conclusions: The study has identified novel proteins with antimicrobial properties which could be used to develop treatments for use in human medicine.


April 21, 2020

Hybrid sequencing reveals insight into heat sensing and signaling of bread wheat.

Wheat (Triticum aestivum L.), a globally important crop, is challenged by increasing temperatures (heat stress, HS). However, its polyploid nature, the incompleteness of its genome sequences and annotation, the lack of comprehensive HS-responsive transcriptomes and the unexplored heat sensing and signaling of wheat hinder our full understanding of its adaptations to HS. The recently released genome sequences of wheat, as well as emerging single-molecule sequencing technologies, provide an opportunity to thoroughly investigate the molecular mechanisms of the wheat response to HS. We generated a high-resolution spatio-temporal transcriptome map of wheat flag leaves and filling grain under HS at 0 min, 5 min, 10 min, 30 min, 1 h and 4 h by combining full-length single-molecule sequencing and Illumina short-read sequencing. This hybrid sequencing newly discovered 4947 loci and 70 285 transcripts, generating a comprehensive and dynamic list of HS-responsive full-length transcripts and complementing the recently released wheat reference genome. Large-scale analysis revealed a global landscape of heat adaptations, uncovering unexpected rapid heat sensing and signaling, significant changes of more than half of HS-responsive genes within 30 min, heat shock factor-dependent and -independent heat signaling, and metabolic alterations in early HS-responses. Integrated analysis also demonstrated the differential responses and partitioned functions between organs and subgenomes, and suggested a differential pattern of transcriptional and alternative splicing regulation in the HS response. This study provided comprehensive data for dissecting molecular mechanisms of early HS responses in wheat and highlighted the genomic plasticity and evolutionary divergence of polyploid wheat. © 2019 The Authors. The Plant Journal published by John Wiley & Sons Ltd and Society for Experimental Biology.


April 21, 2020

Plant ISOform sequencing database (PISO): a comprehensive repertory of full-length transcripts in plants.

In higher eukaryotes, alternative splicing (AS) and alternative polyadenylation (APA) events can produce multiple transcript isoforms in the majority of genes, which significantly increases the protein-coding potential of a genome (Pan et al., 2008; Anvar et al., 2018). Different transcript isoforms might encode proteins with different functions or affect mRNA stability and translational capacity; in this sense, AS and APA events can dramatically increase the complexity and flexibility of the entire transcriptome and proteome (Yang et al., 2016; Feng et al., 2015; Li et al., 2017a; Wang et al., 2017a). Many databases containing AS events and transcripts in animals are available as public resources, such as ASTD and MAASE (Zheng et al., 2005), whereas there has been no database containing full-length transcripts and AS events in plants up to now. Next-generation sequencing (NGS) technology has limitations for identifying AS and APA events due to short reads and low accuracy. In recent years, isoform sequencing (Iso-Seq) using the PacBio single-molecule real-time (SMRT) sequencing platform has been able to generate full-length sequences and provide accurate information about AS and transcriptional start sites (Li et al., 2017a). In this study, we collected the plant Iso-Seq data sequenced on the PacBio platform from the NCBI database up to the end of 2017, and employed unified pipelines to process all the full-length transcripts in different species. Based on these data, we constructed the Plant ISOform sequencing database (PISO, http://cbi.hzau.edu.cn/piso/).

