Skip to content

Resolving Oversequencing with Flye #774

@gillianpatton

Description

@gillianpatton

Hi! I am currently using ONT long read to identify plasmids in Escherichia coli clinical isolates. I am getting decent coverage of my genomes (50-500x coverage) and am having minimal assembly problems. However, other people in my lab sequencing C. diff clinical isolates have been able to recover assemblies that failed assembly with flye by subsampling their ONT reads. They are currently subsampling to 50% of the reads present using filtlong, but we are interested in doing this filtering with the --asm-coverage flag in flye instead to help preserve our plasmids.

Any advice on whether oversequencing is a problem with long reads, and whether anyone has used the --asm-coverage flag to resolve oversequencing? We are also interested in achieving optimal basecalling accuracy for possible SNP analysis in the future. We are also polishing with short reads with polypolish.

Also, the --asm-coverage flag requires the use of the --genome-size flag as well. What happens if we slightly over or under-estimate the genome size? What exactly does flye use this flag to do on the back end?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions