Skip to content

Commit

Permalink
add sample categories, update example
Browse files Browse the repository at this point in the history
  • Loading branch information
tivdnbos authored Jul 28, 2021
1 parent 6748296 commit 0d4a462
Showing 1 changed file with 5 additions and 4 deletions.
9 changes: 5 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -24,15 +24,16 @@ pip3 install pout2prot

The signature of the script is
```
pout2prot [-h] [--occam] [--decoy_flag DECOY_FLAG] [--fdr_threshold FDR_THRESHOLD] input_folder groups_output_file subgroups_output_file
pout2prot [-h] [--sample_categories] [--occam] [--decoy_flag DECOY_FLAG] [--fdr_threshold FDR_THRESHOLD] input_folder groups_output_file subgroups_output_file
```

Three positional arguments are always required for the script to function properly:
* `input_folder`: This argument should point to a folder that contains one or more `*.pout`-files. All of these input files will be processed by the package. TODO write something about the replicates here
* `groups_output_file`: Pointer to a location on the filesystem where the result file with all protein groups should be stored.
* `input_folder`: This argument should point to a folder that contains (i) one or more `*.pout` files, or (ii) subfolders containing `*.pout` files in the case there are different sample categories. In the latter case, the `--sample_categories` option should be used.
* * `groups_output_file`: Pointer to a location on the filesystem where the result file with all protein groups should be stored.
* `subgroups_output_file`: Pointer to a location on the filesystem where the result file with all protein subgroups should be stored.

Next to these mandatory arguments, the script can also be further modified by providing a value for these optional arguments:
* `sample_categories`: If this option is provided, Pout2Prot will use the name of the subfolders (containing the .pout files) as sample category. If not, only `*.pout` files in the `input_folder` will be considered, and the sample category name will be identical to the sample name.
* `occam`: Should Occam's razor be enabled while determining protein groups? If this option is not provided, Occam's razor will be disabled.
* `decoy_flag`: If a value is provided for this parameter, all proteins that contain this value as a substring will be considered as decoy proteins and will not be taken into account during the analyses of the input files. This filter is disabled by default.
* `fdr_threshold`: Filter out all proteins that have a FDR-threshold that's higher than the value provided here. The default FDR-threshold that's used by this package is 0.01.
Expand All @@ -41,7 +42,7 @@ Next to these mandatory arguments, the script can also be further modified by pr
An example of using the script can be seen here:

```shell
pout2prot data/toy-examples/grouping-examples/SimplestGroupingCases.pout SimplestGroupingCases_out.txt
pout2prot data/toy-examples/grouping-examples/ groups_out.tsv subgroups_out.tsv
```

## Which protein grouping strategy to use?
Expand Down

0 comments on commit 0d4a462

Please sign in to comment.