perturbseq.pp.compute_TPT

perturbseq.pp.compute_TPT(gbcs_dataset)

Compute transcript per transcript

input: pandas data frame with the columns “cbc”, “umi”, “gbc”, “r2” where every row is a read output: pandas data frame with the columns “gbc”, “cbc”, “umi”, “cbc-umi-r2-count”, “cbc-umi-count”, “TPT” NOTE: for the input, multiple reads corresponding to the same cbc-umi combination should be listed as separate lines!