Assembly of heterozygous genomes
Abstract
Heterozygous genome assembly remains a complex task for which no satisfying solution exists. Phasing chromosomes in diploid or polyploid species or in metagenomes is still an open problem, despite the emergence of new... [ view full abstract ]
Heterozygous genome assembly remains a complex task for which no
satisfying solution exists. Phasing chromosomes in diploid or
polyploid species or in metagenomes is still an open problem,
despite the emergence of new long-read technologies and other
dedicated approaches. In particular, intra-species or
inter-species variations are usually discarded and/or result in
highly fragmented assemblies.
In this work we propose Bwise, a novel de Bruijn graph-based
assembler that fully takes advantage of paired-end relationships
between reads, thereby allowing correct paths to be determined
while resolving haplotypes and genomic repeats. Preliminary
results on simulated datasets show that Bwise prevails the
Spades state-of-the-art assembler in particular on
hard problems: when heterozygosity rate is medium (~1%). Results
also show that Bwise uses limited ressources (a human genome may
be assembled using 50GB memory).
Beyond paired-end reads, the proposed framework allows in
principle the integration of long-range information (mate pairs,
3Cseq/Hi-C, Single Molecule Real-Time (SMRT) as well as Nanopore
reads) to determine accurate long paths in the assembly graph,
with the ultimate goal of generating a single high-quality contig
per chromosome.
Availability: https://github.com/Malfoy/BWISE
Authors
-
Antoine Limasset
(IRISA/INRIA, Rennes)
-
Camille Marchet
(IRISA/INRIA, Rennes)
-
Jean-François Flot
(Université Libre de Bruxelles)
-
Pierre Peterlongo
(IRISA/INRIA, Rennes)
Topic Areas
Whole genome assemblers and integration of next generation dataTopic #1 , De novo assemblers for short reads, hybrid assemblers
Session
OS-4 » Informatics, Assembly & Analysis (11:20 - Wednesday, 17th May, La Fonda Ballroom)
Presentation Files
The presenter has not uploaded any presentation files.