Archive

  • Visit JGI.DOE.GOV
Our Science
Home › Science Highlights › Overcoming short-read genome sequence assembly challenges

April 20, 2012

Overcoming short-read genome sequence assembly challenges

The genome assembly challenges posed by short sequence reads from sequencing platforms such as 454/Roche and Illumina are well-documented; the lack of reference genome data can hinder attempts to put together the myriad of short DNA sequences. Researchers from the Georgia Institute of Technology and the DOE JGI wanted to determine the impact of short reads from next-generation sequencing (NGS) platforms on assembling individual genomes from complex microbial communities.


Kayaking at Sunset on Lake Lanier by troutkarmarv, on Flickr
Sunset on Georgia’s Lake Lanier, which was a source of data samples used in this study. 

Creative Commons Attribution-Noncommercial-Share Alike 2.0 Generic License  by  troutkarmarv 




As described in an article published in the April issue of The ISME Journal, the team used Illumina data consisting of 100-basepair paired-end sequences from soil and freshwater metagenomes, as well as simulated datasets for their studies. For example, they “spiked” a dataset from a freshwater planktonic community sampled from a lake in Atlanta, Georgia with a reference bacterial genome and then compared the results of deriving that reference genome against assembling it from the genome reads alone.

The team reported that they were able to accurately assemble a single genome from the complex community when the NGS platform used had at least 20X coverage. At less coverage, they added, they found more errors and problems with individual genome assembly.

“The results presented here reveal the errors and limitations as well as the strengths of metagenomics for population analysis, and provided practical standards and guidelines for experimental design and analysis,” the team concluded. “Some of our results should be independent of the NGS platform used and therefore broadly applicable to short-read sequencing.”

Share this:

  • Click to share on Facebook (Opens in new window)
  • Click to share on LinkedIn (Opens in new window)
  • Click to share on Pinterest (Opens in new window)
  • Click to share on Twitter (Opens in new window)
  • Click to print (Opens in new window)

Filed Under: Science Highlights Tagged With: metagenomics

More from the JGI archives:

  • Software Tools
  • Science Highlights
  • News Releases
  • Blog
  • User Proposals
  • 2018-24 Strategic Plan
  • Progress Reports
  • Historical Primers
  • Legacy Projects
  • Past Events
  • JGI.DOE.GOV
  • Disclaimer
  • Accessibility / Section 508
Lawrence Berkeley National Lab Biosciences Area
A project of the US Department of Energy, Office of Science

JGI is a DOE Office of Science User Facility managed by Lawrence Berkeley National Laboratory

© 1997-2025 The Regents of the University of California