Skip to content

Assembly size seems to double between contigger and polishing stage #766

@davidecarlson

Description

@davidecarlson

I'm assembling a somewhat low coverage (~20x) human genome using ONT R10.4.1 reads with flye v2.9.5-b1801.

During the contigger stage, the contigs.fasta file was ~2.9 GB in size. However, the final assembly.fasta file ended up being nearly double that size, ~5.8 GB.

I don't have any particular reason to think that the heterozygosity of this sample is especially high. Is it possible that alternative haplotypes were not collapsed?

In case it's useful, here is the command I ran:

flye \
--nano-hq ./ONT_data_combined.fastq.gz \
-o flye_out \
-t 96 \
--resume \
--scaffold

Thanks!
Dave

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions