Assembly

DNA sequence data has become an indispensable tool for Molecular Biology & Evolutionary Biology. Study in these fields now require a genome sequence to work from. We call this a ‘Reference Sequence.’ We need to build a reference for each species. We do this by Genome Assembly. De novo Genome Assembly is the process of reconstructing the original DNA sequence from the fragment reads alone.

Requirements

Before diving into this topic, we recommend you to have a look at:

Material

You can view the tutorial materials in different languages by clicking the dropdown icon next to the slides (slides) and tutorial (tutorial) buttons below.
Lesson Slides Hands-on Recordings Input dataset Workflows
An Introduction to Genome Assembly
An introduction to get started in genome assembly and annotation
Assembly of metagenomic sequencing data
Assembly of the mitochondrial genome from PacBio HiFi reads
Chloroplast genome assembly
De Bruijn Graph Assembly
Decontamination of a genome assembly
Deeper look into Genome Assembly algorithms
ERGA post-assembly QC
Genome Assembly Quality Control
Genome Assembly of MRSA from Oxford Nanopore MinION data (and optionally Illumina data)
Genome Assembly of a bacterial genome (MRSA) sequenced using Illumina MiSeq Data
Genome assembly using PacBio data
Large genome assembly and polishing
Making sense of a newly assembled genome
Unicycler Assembly
Unicycler assembly of SARS-CoV-2 genome with preprocessing to remove human genome reads
Using the VGP workflows to assemble a vertebrate genome with HiFi and Hi-C data
Vertebrate genome assembly using HiFi, Bionano and Hi-C data - Step by Step

Frequently Asked Questions

Common questions regarding this topic have been collected on a dedicated FAQ page . Common questions related to specific tutorials can be accessed from the tutorials themselves.

Follow topic updates rss-feed with our RSS Feed

Community Resources

Community Home Maintainer Home

Editorial Board

This material is reviewed by our Editorial Board:

orcid logoSimon Gladman avatar Simon GladmanAnton Nekrutenko avatar Anton Nekrutenkoorcid logoDelphine Lariviere avatar Delphine Lariviereorcid logoCristóbal Gallardo avatar Cristóbal Gallardo

Contributors

This material was contributed to by:

orcid logoNate Coraor avatar Nate Coraororcid logoSimon Gladman avatar Simon Gladmanorcid logoAlex Ostrovsky avatar Alex Ostrovskyorcid logoStéphanie Robin avatar Stéphanie Robinorcid logoSaskia Hiltemann avatar Saskia Hiltemannorcid logoPolina Polunina avatar Polina PoluninaMarcella Sozzoni avatar Marcella SozzoniBazante Sanders avatar Bazante Sandersorcid logoHelena Rasche avatar Helena RascheGiulio Formenti avatar Giulio FormentiDeepti Varshney avatar Deepti VarshneyMatúš Kalaš avatar Matúš KalašFabian Recktenwald avatar Fabian Recktenwaldorcid logoLaura Leroi avatar Laura Leroiorcid logoMiaomiao Zhou avatar Miaomiao ZhouBrandon Pickett avatar Brandon Pickettorcid logoWolfgang Maier avatar Wolfgang Maierorcid logoTom Brown avatar Tom Brownorcid logoCristóbal Gallardo avatar Cristóbal Gallardoorcid logoAnthony Bretaudeau avatar Anthony BretaudeauAnton Nekrutenko avatar Anton Nekrutenkoorcid logoErwan Corre avatar Erwan Correorcid logoBérénice Batut avatar Bérénice Batutorcid logoLinelle Abueg avatar Linelle Abuegorcid logoAnna Syme avatar Anna SymeTeresa Müller avatar Teresa Müllerorcid logoDelphine Lariviere avatar Delphine Lariviereorcid logoBjörn Grüning avatar Björn Grüningorcid logoAlexandre Cormier avatar Alexandre Cormier

Funding

These individuals or organisations provided funding support for the development of this resource