3. Glossary
- BAI
- The index file for a file generated in the BAM format. (This is a non-standard file type.)
- BAM
- Binary version of the Sequence Alignment Map (SAM) format.
- BED
- Format that defines the data lines displayed in an annotation track.
- DSRC
- A compression tool dedicated to FASTQ files.
- FASTA
- FASTA-formatted sequence files contain either nucleic acid sequence (such as DNA) or protein sequence information. A single FASTA file can store multiple sequences.
- GFF
- General Feature Format, used for describing genes and other features associated with DNA, RNA, and protein sequences.
- JSON
- A human-readable data serialization language commonly used in configuration files. See https://en.wikipedia.org/wiki/JSON
- SAM
- Sequence Alignment Map, a generic nucleotide alignment format that describes the alignment of query sequences or sequencing reads to a reference sequence or assembly.
- VCF
- Variant Call Format, for use with the variant calling pipeline.
- YAML
- A human-readable data serialization language commonly used in configuration files. See https://en.wikipedia.org/wiki/YAML
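
As an illustration of the FASTA entry above, the following is a minimal Python sketch of how several sequences can be read from a single FASTA file. It assumes the common convention that each record starts with a header line beginning with `>` and is followed by one or more sequence lines. The function name `parse_fasta` and the example records are hypothetical and are not part of this documentation's pipeline.

```python
from io import StringIO


def parse_fasta(handle):
    """Yield (header, sequence) pairs from a FASTA-formatted text stream.

    Hypothetical helper for illustration only; it assumes each record
    begins with a '>' header line followed by sequence lines.
    """
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            # Sequence data may be wrapped over several lines.
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


# Made-up example data: two records stored in one FASTA stream.
example = StringIO(
    ">seq1 hypothetical DNA record\n"
    "ACGT\n"
    "TTGA\n"
    ">seq2 hypothetical protein record\n"
    "MKV\n"
)
records = list(parse_fasta(example))
```

Here `records` holds one `(header, sequence)` tuple per record, with wrapped sequence lines joined into a single string. For real data, an established library is likely a better choice than a hand-written parser.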