Dna.find_relevant_variants

Dna.find_relevant_variants(variant_corr_thresh=0.15, variant_cluster_thresh=0.3)

Variants that vary across clusters.

Parameters
variant_corr_threshfloat [0, 1]

The variant is labeled as relevant if the correlation coeffecient with any other variant is greater than this threshold.

variant_cluster_threshfloat [0, 1]

The variant is labeled as relevant if the difference in median VAF of two clusters is larger than the threshold.

Returns
variant_keepnp.ndarray

The list of variants that vary across clusters.