SNPAAMapper-Python

SNPAAMapper is a downstream variant annotation program that can effectively classify variants by region (e.g. exon, intron, etc.), predict amino acid change type (e.g. synonymous, non-synonymous mutation, etc.), and prioritize mutation effects (e.g. CDS versus 5'UTR, etc.).

Major Features

Instructions

Please download and dump all files in the same directory on a Unix or Mac machine. The user can simply type
  ./run_SNPAAMapper-Python.sh config.txt
or run the following steps in sequential order (Note: the first step was compiled for the human hg19 genome and output files have already been generated):

  1. Process exon annotation files and generate feature start and gene mapping files
  2. python3 Algorithm_preprocessing_exon_annotation_RR.py ChrAll_knownGene.txt.exon
  3. Classify variants by regions (CDS, Upstream, Downstream Intron, UTRs...)
  4. python3 Algorithm_mapping_variants_reporting_class_intronLocation_updown.py ChrAll_knownGene.txt.exon VCF_input_file_in_tab_delimited_format.vcf
    (python3 Algorithm_mapping_variants_reporting_class_intronLocation_updown.py ChrAll_knownGene.txt.exon 007_crop.vcf)
    OR
    python3 Algorithm_mapping_variants_reporting_class_intronLocation_updown.py ChrAll_knownGene.txt.exon VCF_input_file_in_tab_delimited_format.vcf IntronExon_boundary_in_bp
    (python3 Algorithm_mapping_variants_reporting_class_intronLocation_updown.py ChrAll_knownGene.txt.exon 007_crop.vcf 6)
  5. Predict amino acid change type
  6. python3 Algorithm_predicting_full_AA_change_samtools_updown.py VCF_input_file_in_tab_delimited_format.vcf.append kgXref.txt hg19_CDSIntronWithSign.txt.out ChrAll_knownGene.txt > VCF_input_file_in_tab_delimited_format.vcf.out.txt
    (python3 Algorithm_predicting_full_AA_change_samtools_updown.py 007_crop.vcf.append kgXref.txt hg19_CDSIntronWithSign.txt.out ChrAll_knownGene.txt > 007_crop.vcf.out.txt)
  7. Prioritize mutation effects
  8. python3 Algorithm_prioritizing_mutation_headerTop_updown.py VCF_input_file_in_tab_delimited_format.vcf.append.out.txt
    (python3 Algorithm_prioritizing_mutation_headerTop_updown.py 007_crop.vcf.append.out.txt)
The final output file is *.append.out.txt.prioritized_out.

Source Code Download

Github

References