Skip to content

Hon-lab/nf-pySpade

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

nf-pySpade

The Nextflow pipeline for pySpade. Details about pySpade: https://github.com/Hon-lab/pySpade

Introduction

Over-view

To run the pySpade pipeline, please prepare the folloing input files:

  1. Mapped transcriptome matrix: provide the Cell Ranger path that contains "filtered_feature_bc_matrix.h5".
  2. Mapped sgRNA matrix: the column is cell ID, and the index is sgRNA name. The sgRNA matrix should represent if the sgRNA is present in the cell. More than 1 is considered presented, 0 is not presented.
  3. sgRNA dictionary: the reference file of perturbation regions and the targeting sgRNA. Example of first line (separated by tab): chr1:1234567-1235067 sg1;sg2;sg3;sg4;sg5
  4. positive control file: the file is used in the fc function in order to see if positive controls have good repression/activation. Example of first line (separated by tab): chr1:1234567-1235067 ACTB

The defalult parameters show here, please change them if needed.

  1. output directory: current folder
  2. FDR (false discovery rate): 0.1. 10% of false discovery rate.
  3. fold change cutoff: 0.2. Fold change need to be more than 20% (up-regulation and down-regulation) to be considered as hit.
  4. expression level cutoff: 0.05. Genes need to be expressed in more than 5% of cells to be considered as hit.

Output

  1. Manhattan_plots/filtered_df.csv: the global hits (trans regulated genes) after FDR, fold change and expression level filtering.
  2. Manhattan_plots/filtered_local_df.csv: the local hits (cis regulated genes) after FDR, fold change and expression level filtering.
  3. Manhattan plots: Manhattan plots of individual perturbation regions (pdf file).

About

The Nextflow pipeline for pySpade.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published