Truth

Contents

Truth#

missionbio.demultiplex.dna.truth.Truth

class Truth(*args: Any, **kwargs: Any)#

Truth data for GT demultiplexing

The truth data is a dataframe with the NGT for each cell and variant

Input and Output#

read_csv(csv_file)

Load dna truth from a csv file

as_input_file()

Convert the truth to the input file format used by pipelines

add_doublets()

Create truth for doublets.

Filtering#

keep_diff_variants()

returns:

DataFrame of truth

remove_non_snp_variants()

Removes indels and spanning deletions from the assignment

remove_filtered_variants(dna)

Removes poor quality variants

remove_correlated_variants(dna)

Remove correlated variants on the same amplicon

filter_variants(dna)

Remove poor quality variants