Supported files

Sequence files

A sequence file is required. Supported sequence inputs are:

  • FASTA

  • GenBank

  • EMBL

  • GFF3 with embedded sequence (##FASTA)

Without a sequence file, seqhiker cannot open a genome view. If a file with sequence and annotation is used, the annotation will be loaded together with the sequence.

This means a standalone GFF3 file with an embedded FASTA section behaves like a GenBank or EMBL file: it loads both sequence and annotations from one file.

These same sequence inputs can also be added to comparison view.

Annotation files

Supported annotation input:

  • GFF3 (with or without sequence)

If a reference is already loaded and you then drop a GFF3 with embedded sequence, seqhiker checks that the embedded sequence matches the loaded genome. If it matches, the file is treated as annotation input and the annotations are merged into the current genome instead of replacing it.

In comparison view, dropping a FASTA together with its matching GFF3 loads one comparison genome with annotations.

Compressed files

Sequence/annotation files can be uncompressed, or compressed with any of gz, xz, zstd, bzip2.

Read alignment files

Supported read alignment input:

  • BAM

BAM files must be:

  • coordinate-sorted

  • indexed, with .bam.bai or .bai present

Variant files

Supported variant input:

  • VCF

  • compressed VCF (.vcf.gz, .vcf.xz, .vcf.zst)

Variant files are loaded into a shared VCF browser track with one row per sample.

Download by accession

seqhiker can also download genome data by accession from inside the app.

Use this when you want to fetch a reference and annotations without preparing local files first. The downloaded data can be loaded into either:

  • browser view

  • comparison view

depending on which view is currently active.

Comparison session files

Comparison view can save and load self-contained comparison session files:

  • .seqhikercmp

These store the loaded comparison genomes and the comparison state so you can reopen the session later.

Typical combinations

Common ways to use seqhiker:

  • FASTA + GFF3

  • GFF3 with embedded sequence

  • FASTA + BAM

  • FASTA + VCF

  • FASTA + GFF3 + VCF

  • FASTA + GFF3 + BAM

  • FASTA + GFF3 + BAM + VCF

  • GenBank alone

  • EMBL alone

  • comparison of multiple FASTA/GFF3 genomes loaded one-by-one