User Tools

Site Tools


userguide:filtres

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
Next revision Both sides next revision
userguide:filtres [2009/05/18 15:13]
132.168.73.247 créée
userguide:filtres [2010/01/12 15:08]
132.168.73.247
Line 1: Line 1:
-====== Parse and filetring ​======+====== Parse and filtering ​======
  
 When opening a Mascot identification file (dat file), user should specifiy parameters for Mascot parse. Indeed, the first mandatory step is to parse the dat file using free of charge parser distributed by Matrix Science. When opening a Mascot identification file (dat file), user should specifiy parameters for Mascot parse. Indeed, the first mandatory step is to parse the dat file using free of charge parser distributed by Matrix Science.
Line 5: Line 5:
 ===== Mascot Parse ===== ===== Mascot Parse =====
  
-A dialog box allow user to define parse+A dialog box allow user to define parse setting. These parameters are the same as the one you can specify in Mascot Search engine. Mascot will use these informations in order to build the report.
  
 +== Report Top ==
 +  * Absolute : absolute number of hits to identify ​
 +  * Auto : return hits with significant scores
  
 +== Peptides cutoff ==
 + Ion with score less than the ion score cutoff are ignored.
 +
 +== Subset ==
 +Subset threshold definition: fractions core for a protein to be counted as a subset. Its score must be equal or greater than \\
 +''​Master_protein_score * (1-subset threshold)''​
 +
 +**__If subset threshold is set to 1__**   \\
 +*Score for a protein to be counted as a subset must be ≥ Master_protein_score * (1-1)\\
 +*Score for a protein to be counted as a subset must be ≥ Master_protein_score * (0) \\
 +*Score for a protein to be counted as a subset must be ≥ 0 \\
 +\\
 +**All proteins sharing at least one peptide with the Master protein are counted as subsets** \\
 +\\
 +
 +**__If subset threshold is set to 0__** \\
 +*Score for a protein to be counted as a subset must be ≥ Master_protein_score * (1-0) \\
 +*Score for a protein to be counted as a subset must be ≥ Master_protein_score * (1) \\
 +*Score for a protein to be counted as a subset must be ≥ Master_protein_score \\
 +\\
 +**No protein appears as a subset**\\
 +This option can be problematical in case of further protein grouping
 +\\
 +
 +**__If subset threshold is set to 0.5__**
 +Score for a protein to be counted as a subset must be ≥ Master_protein_score * (1-0.5) \\
 +Score for a protein to be counted as a subset must be ≥ Master_protein_score * (0.5)\\
 +Score for a protein to be counted as a subset must be ≥ Master_protein_score/​2\\
 +\\
 +**Only proteins whose score is at least equal to half of Master protein score are counted as subsets.**\\
 +This option allows to limit the list of subset proteins to those which can be considered are the “more likely to be actually present in the sample”
 +
 +===== Filtering =====
 +
 +==== Master Protein Filter ====
 +By default the protein groups are represented by a master protein. This master protein is the same as the one defined by mascot.
 +If it doesn'​t suit your needs you have the possibility to change the master proteins one by one in the interface OR, in one step for all the protein groups, by using the associated filter.
 +
 +This filter use **rules** to determine which protein in the same set must be set as master. To have sufficient complexity you can **composed** multiple rules together.\\
 +:!: **In a protein group the FIRST protein which matches the rules composition will be set as master.**
 +:!: If no protein in the protein group matches the rules composition,​ the old one is kept.
 +
 +=== The rules ===
 +The rules are composed as following :
 +  * The field on which the filter will work : Accession or Description of the protein
 +  * The operation the filter will applied : Contains, Not contains, Begins with, Ends with
 +  * The text to search in the given field (**Case sensitive**)
 +
 +Example of rule :\\
 +{{:​userguide:​master-filter_rule-window.png|}}\\
 +Only the proteins with their accession number beginning with "​Q70"​ will match this rule.
 +
 +//**Nota : you can defined up to 9 rules.**//
 +
 +=== The composition ===
 +You can composed the different rules you defined to have a fine filter to set as master the proteins you want.
 +The composition can contains :
 +  * Rules names : R1, R2, ...
 +  * Parenthesis
 +  * Composition operators : and, &, &&, or, |, ||, ... See [[http://​www.opensymphony.com/​ognl/​html/​LanguageGuide/​apa.html#​operators|OGNL operator page]] to see all the possibilities.
 +
 +As seen below the composition part of the filter window show you the translation of the composition you maked, to verify you don't do mistakes.
 +{{:​userguide:​master-filter_composition-window.png|}}
 +
 +=== Examples ===
 +== Example 1 ==
 +{{:​userguide:​master-filter-window.png|}}\\
 +In this example, will be set as master the first protein of each protein group which have :
 +  * Its Accession number Begins with '​Q70'​
 +  * **AND**
 +  * Its Description Contains '​HUMAN'​
 +
 +=> You want to set as master only the proteins which belong to HUMAN and which have their accession number begins with '​Q70'​
 +
 +== Example 2 ==
 +{{:​userguide:​master-filter-window-2.png|}}\\
 +In this example, will be set as master the first protein of each protein group which have :
 +  * Its Accession number not containing '​P5098'​
 +  * **AND**
 +  * Its Description Contains '​HUMAN'​
 +**OR**
 +  * Its Accession number not containing '​P5098'​
 +  * **AND**
 +  * Its Description Contains '​MOUSE'​
 +
 +=> You want to set as master only protein belong to HUMAN or MOUSE but never a particular protein which have an accession containing '​P5098'​.
userguide/filtres.txt · Last modified: 2011/12/19 16:32 by 132.168.72.130