Menu
April 21, 2020

A quick guide for student-driven community genome annotation.

Authors: Hosmani, Prashant S and Shippy, Teresa and Miller, Sherry and Benoit, Joshua B and Munoz-Torres, Monica and Flores-Gonzalez, Mirella and Mueller, Lukas A and Wiersma-Koch, Helen and D'Elia, Tom and Brown, Susan J and Saha, Surya

High quality gene models are necessary to expand the molecular and genetic tools available for a target organism, but these are available for only a handful of model organisms that have undergone extensive curation and experimental validation over the course of many years. The majority of gene models present in biological databases today have been identified in draft genome assemblies using automated annotation pipelines that are frequently based on orthologs from distantly related model organisms and usually have minor or major errors. Manual curation is time consuming and often requires substantial expertise, but is instrumental in improving gene model structure and identification. Manual annotation may seem to be a daunting and cost-prohibitive task for small research communities but involving undergraduates in community genome annotation consortiums can be mutually beneficial for both education and improved genomic resources. We outline a workflow for efficient manual annotation driven by a team of primarily undergraduate annotators. This model can be scaled to large teams and includes quality control processes through incremental evaluation. Moreover, it gives students an opportunity to increase their understanding of genome biology and to participate in scientific research in collaboration with peers and senior researchers at multiple institutions.

Journal: PLoS computational biology
DOI: 10.1371/journal.pcbi.1006682
Year: 2019

Read publication

Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.