Skip to content

Tutorial

Sebastian Keller edited this page Dec 13, 2021 · 26 revisions

Step 1: Input preparation

The standard input format of StructMAn is called simple mutation list format (.smlf). It is a tab or space separated text file format containing the proteomic positions to be mapped and analyzed by StructMAn.

Examples would look like:

P04637  annotate_whole_protein
P04637-3  annotate_isoform
Q96HA4 A31 annotate_specific_position
Q96HA4 P50 annotate_another_specific_position
Q96HA4 77 annotate_position_without_giving_amino_acid_type_also_possible
Q96HA4 P200V annotate_SNV
Q96HA4 P200A annotate_another_SNV
Q96HA4 200D also_possible
Q16637 P210_P241del deletion_example
P04439 S337_D338insGGEGVK insertion_example
P04637 R333_R337delinsKGKEK substitution_example
1JOO:A  PDB_style_input_annotates whole chain
1DPO:A M180 specific_position_PDB_style
1DPO:A V181_A SNVs_need_an_underscore_in_PDB_style
1DPO:A K188A this_annotates_position_188_with_insertion_code_A
1DPO:A K188A_A this_codes_the_SNV_of_lysine_position_188_with_insertion_code_A_to_alanine
1DPO:A G188_A this_codes_the_SNV_of_glycine_position_188_to_alanine

StructMAn can also process whole proteins given by a file in fasta-format.

Step 2: Running StructMAn

StructMAn is a fully automated pipeline. Only one command is needed in order to process one input dataset placed in the designated input folder:

 structman

or, if the input file is not in that folder:

 structman -i [path to the input file]

If you want to perform a simple run that would take only one line of a .smlf-file as input, there is no need for creating a separate input file. Instead of giving a path to an input file, simply give the input with the '-i' flag:

 structman -i Q9Q288

There are multiple levels of verbosity available by adding --verbosity <0-5> to the command (default is level 1).

Step 3: Enjoying the Results

The results are written into the designated output folder. There are two default output files:

  1. the classification table
  2. the statistics file
  3. the feature table

Step 4: Advanced usage

Database mode

StructMAn is able to maintain once processed results in a local database. More info about this feature can be found here

Deploying local resources

Most results provided by StructMAn are derived from structural information gained from the PDB. Hence a local instance of it can significantly reduce the computation times of StructMAn. Here is explained how to create local resources.