Menu

Scientific publications

Publications featuring PacBio long-read + short-read sequencing data

BMC Genomics  |  2024

Sequencing by binding rivals SMOR error-corrected sequencing by synthesis technology for accurate detection and quantification of minor subpopulation variants

Christopher J. Allender, Candice L. Wike, W. Tanner Porter, Dean Ellis, Darrin Lemmer, Stephanie J. K. Pond & David M. Engelthaler

In the FIRST ever peer-reviewed paper published on Onso and SBB technology, researchers demonstrate the superior accuracy and precision of Onso when compared with ILMN SBS technology.
bioRxiv  |  2024

Experimental and computational methods for allelic imbalance analysis from single nucleus RNA-seq data

Sean K. Simmons1,2,3*, Xian Adiconis1,2,3, Nathan Haywood1,2,3, Jacob Parker1,4, Zechuan Lin1,4, Zhixiang Liao1,5, Idil Tuncali1,5, Aziz M. Al'Khafaji6, Asa Shin6, Karthik Jagadeesh2, Kirk Gosik2, Michael Gatzen7, Jonathan T. Smith7, Daniel N. El Kodsi1,5, Yuliya Kuras1,5, Clare Baecher-Allan8, Geidy E. Serrano1,9, Thomas G. Beach1,9, Kiran Garimella7, Orit RozenblattRosen2,10, Aviv Regev2,10, Xianjun Dong1,4, Clemens R. Scherzer1,4, Joshua Z. Levin1,2,3 1Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase,MD 20815, USA 2 Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA 3 Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA 4 Stephen & Denise Adams Center for Parkinson’s Disease Research of Yale School of Medicine, New Haven, CT 06510, USA 5 Precision Neurology Program of Brigham & Women's Hospital, Harvard Medical School, Boston, MA 02115, USA 6 Broad Clinical Labs, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA 7 Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA 8 Dept. of Neurology, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA 9 Banner Sun Health Research Institute, Sun City, AZ 85351, USA 10Present address: Genentech, South San Francisco, CA 94080, USA.

In a recent preprint, researchers from Aligning Science Across Parkinson’s MD, Broad, Yale, Harvard, Banner Sun AZ, Genentech conducted a detailed of analysis of protocols for “allele-specific expression (ASE) analysis to better understand how variation in the human genome affects RNA expression at the single-cell level”. The study employed single-nucleus RNA-Seq (snRNA-Seq) as introns in pre-mRNA that were enriched for heterozygous variants, thus facilitating allelic assignments for ASE, concluding that allele-specific expression (ASE) analysis is better with HiFi – more accurate, cost-effective, and enabling isoform resolution vs. ILMN short-reads.
bioRxiv  |  2024

A familial, telomere-to-telomere reference for human de novomutation and recombination from a four-generation pedigree

David Porubsky, Harriet Dashnow, Thomas A. Sasani, Glennis A. Logsdon, Pille Hallast, Michelle D. Noyes, Zev N. Kronenberg, Tom Mokveld, Nidhi Koundinya, Cillian Nolan, Cody J. Steely, Andrea Guarracino, Egor Dolzhenko, William T. Harvey, William J. Rowell, Kirill Grigorev, Thomas J. Nicholas, Keisuke K. Oshima, Jiadong Lin, Peter Ebert, W. Scott Watkins, Tiffany Y. Leung, Vincent C.T. Hanlon, Sean McGee, Brent S. Pedersen, Michael E. Goldberg, Hannah C. Happ, Hyeonsoo Jeong, Katherine M. Munson, Kendra Hoekzema, Daniel D. Chan, Yanni Wang, Jordan Knuth, Gage H. Garcia, Cairbre Fanslow, Christine Lambert, Charles Lee, Joshua D. Smith, Shawn Levy, Christopher E. Mason, Erik Garrison, Peter M. Lansdorp, Deborah W. Neklason, Lynn B. Jorde, Aaron R. Quinlan, Michael A. Eberle, Evan E. Eichler

In a preprint led by UW and 18 additional institutions, researchers unlock the secrets of genetic inheritance across four generations using HiFi sequencing. This study marks “the most comprehensive, publicly available “truth set” of all classes of genomic variants”
bioRxiv  |  2024

Complete sequencing of ape genomes

DongAhn Yoo, Arang Rhie, Prajna Hebbar, Francesca Antonacci, Glennis A. Logsdon, Steven J. Solar, Dmitry Antipov, Brandon D. Pickett, Yana Safonova, Francesco Montinaro, Yanting Luo, Joanna Malukiewicz, Jessica M. Storer, Jiadong Lin, Abigail N. Sequeira, Riley J. Mangan, Glenn Hickey, Graciela Monfort Anez, Parithi Balachandran, Anton Bankevich, Christine R. Beck, Arjun Biddanda, Matthew Borchers, Gerard G. Bouffard, Emry Brannan, Shelise Y. Brooks, Lucia Carbone, Laura Carrel, Agnes P. Chan, Juyun Crawford, Mark Diekhans, Eric Engelbrecht, Cedric Feschotte, Giulio Formenti, Gage H. Garcia, Luciana de Gennaro, David Gilbert, Richard E. Green, Andrea Guarracino, Ishaan Gupta, Diana Haddad, Junmin Han, Robert S. Harris, Gabrielle A. Hartley, William T. Harvey, Michael Hiller, Kendra Hoekzema, Marlys L. Houck, Hyeonsoo Jeong, Kaivan Kamali, Manolis Kellis, Bryce Kille, Chul Lee, Youngho Lee, William Lees, Alexandra P. Lewis, Qiuhui Li, Mark Loftus, Yong Hwee Eddie Loh, Hailey Loucks, Jian Ma, Yafei Mao, Juan F. I. Martinez, Patrick Masterson, Rajiv C. McCoy, Barbara McGrath, Sean McKinney, Britta S. Meyer, Karen H. Miga, Saswat K. Mohanty, Katherine M. Munson, Karol Pal, Matt Pennell, Pavel A. Pevzner, David Porubsky, Tamara Potapova, Francisca R. Ringeling, Joana L. Rocha, Oliver A. Ryder, Samuel Sacco, Swati Saha, Takayo Sasaki, Michael C. Schatz, Nicholas J. Schork, Cole Shanks, Linnéa Smeds, Dongmin R. Son, Cynthia Steiner, Alexander P. Sweeten, Michael G. Tassia, Françoise Thibaud-Nissen, Edmundo Torres-González, Mihir Trivedi, Wenjie Wei, Julie Wertz, Muyu Yang, Panpan Zhang, Shilong Zhang, Yang Zhang, Zhenmiao Zhang, Sarah A. Zhao, Yixin Zhu, Erich D. Jarvis, Jennifer L. Gerton, Iker Rivas-González, Benedict Paten, Zachary A. Szpiech, Christian D. Huber, Tobias L. Lenz, Miriam K. Konkel, Soojin V. Yi, Stefan Canzar, Corey T. Watson, Peter H. Sudmant, Erin Molloy, Erik Garrison, Craig B. Lowe, Mario Ventura, Rachel J. O’Neill, Sergey Koren, Kateryna D. Makova, Adam M. Phillippy, Evan E. Eichler

a preprint led by UW in combination with 59 other institutions offers a major leap in our understanding of what it means to be human. Through this study, researchers walk away with “a definitive baseline for all future evolutionary studies of humans and our closest living ape relatives”
bioRxiv  |  2024

TRGT-denovo: accurate detection of de novo tandem repeat mutations

T. Mokveld, E. Dolzhenko, H. Dashnow, T. J. Nicholas, T. Sasani, B. van der Sanden, B. Jadhav, B. Pedersen, Z. Kronenberg, A. Tucci, A. J. Sharp, A. R. Quinlan, C. Gilissen, A. Hoischen, M. A. Eberle

In this preprint, researchers from PacBio, U UT, Radboud Netherlands, Mt. Sinai, Genomics England UK found that an expansion of TRGT identifies “all types of de novo TR mutations (including expansions, contractions, and compositional changes) within family trios”, allows detection of subtle variations often overlooked in vcf files, and “improves precision and specificity of de novo mutation (DNM) identification, reducing the number of de novo candidates by an order of magnitude compared to genotype-based approaches”
bioRxiv  |  2024

Resolving the chromatin impact of mosaic variants with targeted Fiber-seq

Stephanie C. Bohaczuk, Zachary J. Amador, Chang Li, Benjamin J. Mallory, Elliott G. Swanson, Jane Ranchalis, Mitchell R. Vollger, Katherine M. Munson, Tom Walsh, Morgan O. Hamm, Yizi Mao, Andre Lieber, Andrew B. Stergachis

In this preprint, authors find that “targeted Fiber-seq enables the production of targeted long-read sequencing chromatin maps to resolve the heterogeneity of genetic and chromatin architectures with single-molecule precision”
Biorxiv  |  2024

Microflora Danica: the atlas of Danish environmental microbiomes

Singleton, Jensen, Delogu,Sørensen, Jørgensen, Karst, Yang, Knudsen, Sereika, Petriglieri, Knutsson, Dall, Kirkegaard, Kristensen, Woodcroft, Speth, Aroney, The Microflora Danica Consortium, Wagner, Dueholm, Nielsen, Albertsen

In this preprint, scientists from Denmark, Australia, and Austria, conducted a study where HiFi sequencing was used for rRNA operon sequencing for 449 (multiplexed in pools of 92 samples) microbiome samples (14.9 million bacterial (median 4,528 bp, containing both 16S and 23S) and 13.4 million eukaryotic rRNA operon sequences (median 4,035 bp, containing both 18S and 28S)). “This dataset is an order of magnitude larger than the current most comprehensive database SILVA 138.1”, and provides “an unprecedented resource and the foundation for answering fundamental questions underlying microbial ecology: what drives microbial diversity, distribution and function.”
Biorxiv  |  2024

A haplotype-resolved view of human gene regulation

Mitchell R. Vollger1*, Elliott G. Swanson2*, Shane J. Neph1, Jane Ranchalis1, Katherine M. Munson2, Ching-Huang Ho3, Adriana E. Sedeño-Cortés1, William E. Fondrie4, Stephanie C. Bohaczuk1, Yizi Mao1, Nancy L. Parmalee5, Benjamin J. Mallory2, William T. Harvey2, Younjun Kwon2, Gage H. Garcia2, Kendra Hoekzema2, Jeffrey G. Meyer6, Mine Cicek6, Evan E. Eichler2,7, William S. Noble2,8, Daniela M. Witten9, James T. Bennett10, John P. Ray2,3,11, Andrew B. Stergachis1,2,12,†

Most human cells contain two non-identical genomes, and differences in their regulation underlie human development and disease. We demonstrate that Fiber-seq Inferred Regulatory Elements (FIREs) enable the accurate quantification of chromatin accessibility across the 6 Gbp diploid human genome with single-molecule and single-nucleotide precision. We find that cells can harbor >1,000 regulatory elements with haplotype-selective chromatin accessibility (HSCA) and show that these elements preferentially localize to genomic loci containing the most human genetic diversity, with the human leukocyte antigen (HLA) locus showing the largest amount of HSCA genome-wide in immune cells. Furthermore, we uncover HSCA elements with sequence non-deterministic chromatin accessibility, representing likely somatic epimutations, and show that productive transcription from the inactive X chromosome is buttressed by clustered promoter-proximal elements that escape X chromosome inactivation.
Biorxiv  |  2024

Structural polymorphism and diversity of human segmental duplications

Hyeonsoo Jeong, Philip C. Dishuck, DongAhn Yoo, William T. Harvey, Katherine M. Munson, Alexandra P. Lewis, Jennifer Kordosky, Gage H. Garcia, Human Genome Structural Variation Consortium (HGSVC), Feyza Yilmaz, Pille Hallast, Charles Lee, Tomi Pastinen, Evan E. Eichler

In this preprint, HiFi reveals segment duplications (SDs) unresolvable by short reads. This missing info is essential for understanding human disease, evolution and diversity. Researchers from HGSVC, UW, Altos Labs, JAX, CMKC conducted a study including a “population genetics survey of SDs by analyzing 170 [all HiFi] human genome assemblies where the majority of SDs are fully resolved using long-read sequence assembly.”
Nature  |  2024

The complex polyploid genome architecture of sugarcane

A. L. Healey, O. Garsmeur, J. T. Lovell, S. Shengquiang, A. Sreedasyam, J. Jenkins, C. B. Plott, N. Piperidis, N. Pompidor, V. Llaca, C. J. Metcalfe, J. Doležel, P. Cápal, J. W. Carlson, J. Y. Hoarau, C. Hervouet, C. Zini, A. Dievart, A. Lipzen, M. Williams, L. B. Boston, J. Webber, K. Keymanesh, S. Tejomurthula, S. Rajasekar, R. Suchecki, A. Furtado, G. May, P. Parakkal, B. A. Simmons, K. Barry, R. J. Henry, J. Grimwood, K. S. Aitken, J. Schmutz & A. D’Hont

Sugarcane, the world’s most harvested crop by tonnage, has shaped global history, trade and geopolitics, and is currently responsible for 80% of sugar production worldwide1. While traditional sugarcane breeding methods have effectively generated cultivars adapted to new environments and pathogens, sugar yield improvements have recently plateaued2. The cessation of yield gains may be due to limited genetic diversity within breeding populations, long breeding cycles and the complexity of its genome, the latter preventing breeders from taking advantage of the recent explosion of whole-genome sequencing that has benefited many other crops. Thus, modern sugarcane hybrids are the last remaining major crop without a reference-quality genome. Here we take a major step towards advancing sugarcane biotechnology by generating a polyploid reference genome for R570, a typical modern cultivar derived from interspecific hybridization between the domesticated species (Saccharum officinarum) and the wild species (Saccharum spontaneum). In contrast to the existing single haplotype (‘monoploid’) representation of R570, our 8.7 billion base assembly contains a complete representation of unique DNA sequences across the approximately 12 chromosome copies in this polyploid genome. Using this highly contiguous genome assembly, we filled a previously unsized gap within an R570 physical genetic map to describe the likely causal genes underlying the single-copy Bru1 brown rust resistance locus. This polyploid genome assembly with fine-grain descriptions of genome architecture and molecular targets for biotechnology will help accelerate molecular and transgenic breeding and adaptation of sugarcane to future environmental conditions.
Biorxiv  |  2024

Genome-wide profiling of highly similar paralogous genes using HiFi sequencing

Xiao Chen, Daniel Baker, Egor Dolzhenko, Joseph M Devaney, Jessica Noya, April S Berlyoung, Rhonda Brandon, Kathleen S Hruska, Lucas Lochovsky, Paul Kruszka, Scott Newman, Emily Farrow, Isabelle Thiffault, Tomi Pastinen, Dalia Kasperaviciute, Christian Gilissen, Lisenka Vissers, Alexander Hoischen, Seth Berger, Eric Vilain, Emmanuèle Délot, Genomics Research to Elucidate the Genetics of Rare Diseases (GREGoR) Consortium, Michael A Eberle

Variant calling is hindered in segmental duplications by sequence homology. We developed Paraphase, a HiFi-based informatics method that resolves highly similar genes by phasing all haplotypes of a gene family. We applied Paraphase to 160 long (>10 kb) segmental duplication regions across the human genome with high (>99%) sequence similarity, encoding 316 genes. Analysis across five ancestral populations revealed highly variable copy numbers of these regions. We identified 23 families with exceptionally low within-family diversity, where extensive gene conversion and unequal-crossing over have resulted in highly similar gene copies. Furthermore, our analysis of 36 trios identified 7 de novo SNVs and 4 de novo gene conversion events, 2 of which are non-allelic. Finally, we summarized extensive genetic diversity in 9 medically relevant genes previously considered challenging to genotype. Paraphase provides a framework for resolving gene paralogs, enabling accurate testing in medically relevant genes and population-wide studies of previously inaccessible genes.
Nature  |  2024

Single-cell long-read sequencing-based mapping reveals specialized splicing patterns in developing and adult mouse and human brain

Anoushka Joglekar, Wen Hu, Bei Zhang, Oleksandr Narykov, Mark Diekhans, Jordan Marrocco, Jennifer Balacco, Lishomwa C. Ndhlovu, Teresa A. Milner, Olivier Fedrigo, Erich D. Jarvis, Gloria Sheynkman, Dmitry Korkin, M. Elizabeth Ross & Hagen U. Tilgner

RNA isoforms influence cell identity and function. However, a comprehensive brain isoform map was lacking. We analyze single-cell RNA isoforms across brain regions, cell subtypes, developmental time points and species. For 72% of genes, full-length isoform expression varies along one or more axes. Splicing, transcription start and polyadenylation sites vary strongly between cell types, influence protein architecture and associate with disease-linked variation. Additionally, neurotransmitter transport and synapse turnover genes harbor cell-type variability across anatomical regions. Regulation of cell-type-specific splicing is pronounced in the postnatal day 21-to-postnatal day 28 adolescent transition. Developmental isoform regulation is stronger than regional regulation for the same cell type. Cell-type-specific isoform regulation in mice is mostly maintained in the human hippocampus, allowing extrapolation to the human brain. Conversely, the human brain harbors additional cell-type specificity, suggesting gain-of-function isoforms. Together, this detailed single-cell atlas of full-length isoform regulation across development, anatomical regions and species reveals an unappreciated degree of isoform variability across multiple axes.
Quick search

Quick search is faster but may return fewer results.

Advanced search

Advanced search allows you to search more fields but may take longer.

Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.