This is an old revision of the document!
When opening a Mascot identification file (dat file), user should specifiy parameters for Mascot parse. Indeed, the first mandatory step is to parse the dat file using free of charge parser distributed by Matrix Science.
A dialog box allow user to define parse setting. These parameters are the same as the one you can specify in Mascot Search engine. Mascot will use these informations in order to build the report.
Ion with score less than the ion score cutoff are ignored.
Subset threshold definition: fractions core for a protein to be counted as a subset. Its score must be equal or greater than
Master_protein_score * (1-subset threshold)
If subset threshold is set to 1
*Score for a protein to be counted as a subset must be ≥ Master_protein_score * (1-1)
*Score for a protein to be counted as a subset must be ≥ Master_protein_score * (0)
*Score for a protein to be counted as a subset must be ≥ 0
All proteins sharing at least one peptide with the Master protein are counted as subsets
If subset threshold is set to 0
*Score for a protein to be counted as a subset must be ≥ Master_protein_score * (1-0)
*Score for a protein to be counted as a subset must be ≥ Master_protein_score * (1)
*Score for a protein to be counted as a subset must be ≥ Master_protein_score
No protein appears as a subset
This option can be problematical in case of further protein grouping
If subset threshold is set to 0.5
Score for a protein to be counted as a subset must be ≥ Master_protein_score * (1-0.5)
Score for a protein to be counted as a subset must be ≥ Master_protein_score * (0.5)
Score for a protein to be counted as a subset must be ≥ Master_protein_score/2
Only proteins whose score is at least equal to half of Master protein score are counted as subsets.
This option allows to limit the list of subset proteins to those which can be considered are the “more likely to be actually present in the sample”
By default the protein groups are represented by a master protein. This master protein is the same as the one defined by mascot. If it doesn't suit your needs you have the possibility to change the master proteins one by one in the interface OR, in one step for all the protein group, by using the associated filter.
This filter using rules to determine which protein in the same set must be set as master. To have sufficient complexity you can composed multiple rules together.
In a protein group the FIRST protein matches the composition of rules will be set as master.
If no protein in the protein group matches the composition of rules, the old one is kept.
The rules are composed as following :
Example of rule :
Only the proteins with their accession number beginning with “Q70” will match this rule.
Nota : you can defined up to 9 rules.
You can composed the different rules you defined to have a fine filter to set as master the proteins you want. The composition can contains :
As seen below the composition part of the filter window show you the translation of the composition you maked, to verify you don't do mistakes.
In this example, will be set as master the first protein of each protein group which have :
AND
⇒ You want to set as master only the proteins which belong to HUMAN and which have their accession number begins with 'Q70'
In this example, will be set as master the first protein of each protein group which have :
AND
OR
AND
⇒ You want to set as master only protein belong to HUMAN or MOUSE but never a particular protein which have an accession containing 'P5098'.