# Supported files ## Sequence files A sequence file is required. Supported sequence inputs are: - FASTA - GenBank - EMBL - GFF3 with embedded sequence (`##FASTA`) Without a sequence file, `seqhiker` cannot open a genome view. If a file with sequence and annotation is used, the annotation will be loaded together with the sequence. This means a standalone GFF3 file with an embedded FASTA section behaves like a GenBank or EMBL file: it loads both sequence and annotations from one file. These same sequence inputs can also be added to comparison view. ## Annotation files Supported annotation input: - GFF3 (with or without sequence) If a reference is already loaded and you then drop a GFF3 with embedded sequence, `seqhiker` checks that the embedded sequence matches the loaded genome. If it matches, the file is treated as annotation input and the annotations are merged into the current genome instead of replacing it. In comparison view, dropping a FASTA together with its matching GFF3 loads one comparison genome with annotations. ## Compressed files Sequence/annotation files can be uncompressed, or compressed with any of gz, xz, zstd, bzip2. ## Read alignment files Supported read alignment input: - BAM BAM files must be: - coordinate-sorted - indexed, with .bam.bai or .bai present ## Variant files Supported variant input: - VCF - compressed VCF (`.vcf.gz`, `.vcf.xz`, `.vcf.zst`) Variant files are loaded into a shared `VCF` browser track with one row per sample. ## Download by accession `seqhiker` can also download genome data by accession from inside the app. Use this when you want to fetch a reference and annotations without preparing local files first. The downloaded data can be loaded into either: - browser view - comparison view depending on which view is currently active. ## Comparison session files Comparison view can save and load self-contained comparison session files: - `.seqhikercmp` These store the loaded comparison genomes and the comparison state so you can reopen the session later. ## Typical combinations Common ways to use `seqhiker`: - FASTA + GFF3 - GFF3 with embedded sequence - FASTA + BAM - FASTA + VCF - FASTA + GFF3 + VCF - FASTA + GFF3 + BAM - FASTA + GFF3 + BAM + VCF - GenBank alone - EMBL alone - comparison of multiple FASTA/GFF3 genomes loaded one-by-one