Overview of analytical stepsΒΆ
During a typical analyses, the Phylogenetic pipeline will proceed through the following steps.
Genotyping
- SNPs are called on the provided regions using SAMTools, and the resulting SNPs are filtered using the 'paleomix vcf_filter' tool.
- FASTA sequences are constructed from for the regions of interest, using the filtered SNPs generated above, one FASTA file per set of regions and per sample.
Multiple sequence alignment
- Per-sample files generated in step 1 are collected, and used to build unaligned multi-FASTA files, one per region of interest.
- If enabled, multiple-sequence alignment is carried out on these files using MAFFT, to generate aligned multi-FASTA files.
Phylogenetic inference
Following construction of (aligned) multi-FASTA sequences, phylogenetic inference may be carried out using a partioned maximum likelihood approach via ExaML.