Overview of analytical stepsΒΆ
During a typical analyses, the Phylogenetic pipeline will proceed through the following steps.
Genotyping
SNPs are called on the provided regions using SAMTools, and the resulting SNPs are filtered using the 'paleomix vcf_filter' tool.
FASTA sequences are constructed from for the regions of interest, using the filtered SNPs generated above, one FASTA file per set of regions and per sample.
Multiple sequence alignment
Per-sample files generated in step 1 are collected, and used to build unaligned multi-FASTA files, one per region of interest.
If enabled, multiple-sequence alignment is carried out on these files using MAFFT, to generate aligned multi-FASTA files.
Phylogenetic inference
Following construction of (aligned) multi-FASTA sequences, phylogenetic inference may be carried out using a partioned maximum likelihood approach via ExaML.