New Tutorial: VGP assembly pipeline

new tutorial assembly pacbio vgp

Posted on: 14 March 2022

We are proud to announce that, as a result of our collaboration with the Vertebrate Genomes Project (VGP), a new tutorial describing the VGP assembly pipeline is now available in the Galaxy Training Network. The VGP aims to generate high-quality, near-error-free, gap-free, chromosome-level, haplotype-phased, annotated reference genome assemblies for every vertebrate species.

Figure 1: VGP Pipeline 2.0. The pipeline starts by assembling the HiFi reads into contigs, yielding the primary and alternate assemblies. Duplicated and erroneously assigned contigs are then removed with purge_dups. Finally, Bionano optical maps and Hi-C data are used to generate a scaffolded primary assembly.

The tutorial is organized in four sections: genome profile, HiFi phased assembly, post-assembly processing, and hybrid scaffolding. During the genome profiling stage, several tools based on the analysis of k-mer frequencies are used to infer the properties of the genome. Next, a draft assembly is generated from high-accuracy PacBio HiFi long reads. In the third stage, the initial assembly is processed to identify and reassign allelic contigs. Finally, the assembled contigs are joined into scaffolds using two additional technologies: Bionano optical maps and Hi-C data.
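To give a feel for what "analysis of k-mer frequencies" means in the genome profiling stage, here is a minimal Python sketch. It is not the tooling used in the tutorial, and the function names (`count_kmers`, `kmer_spectrum`, `estimate_genome_size`) and the `min_multiplicity` cutoff are illustrative assumptions. It counts k-mers in a set of reads, builds the k-mer spectrum, and derives a crude genome size estimate by dividing the total number of k-mers by the most common multiplicity. Real profiling tools fit a statistical model to the spectrum and also estimate heterozygosity and repeat content, which this toy version ignores.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every k-mer occurring in the reads, skipping k-mers that contain N."""
    counts = Counter()
    for read in reads:
        seq = read.upper()
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            if "N" not in kmer:
                counts[kmer] += 1
    return counts


def kmer_spectrum(counts):
    """Map each multiplicity to the number of distinct k-mers seen that many times."""
    return Counter(counts.values())


def estimate_genome_size(spectrum, min_multiplicity=2):
    """Crude estimate: total k-mers divided by the peak multiplicity.

    k-mers seen fewer than min_multiplicity times are treated as likely
    sequencing errors and ignored (an assumption of this sketch).
    """
    solid = {m: n for m, n in spectrum.items() if m >= min_multiplicity}
    if not solid:
        return 0
    peak = max(solid, key=solid.get)  # multiplicity shared by the most k-mers
    total = sum(m * n for m, n in solid.items())
    return total // peak


# Toy input: the same 8-base sequence "sequenced" three times.
reads = ["ACGTACGT"] * 3
spectrum = kmer_spectrum(count_kmers(reads, k=4))
print(dict(spectrum))                  # {6: 1, 3: 3}
print(estimate_genome_size(spectrum))  # 5
```

In the toy input the peak multiplicity (3) plays the role of sequencing coverage, and the estimate of 5 matches the number of 4-mer positions in the 8-base sequence.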
