grelu.interpret.modisco#

grelu.interpret.modisco contains functions that enable the user to run TF-MoDISco (Shrikumar et al. 2018) on trained models. Many of the functions here are based on jmschrei/tfmodisco-lite.

Functions#

`_ism_attrs`(model, seqs, one_hot, prediction_transform, ...)	Perform ISM and format the results for TF-Modisco.
`_add_tomtom_to_modisco_report`(→ None)	Modified from jmschrei/tfmodisco-lite
`_tomtom_on_modisco`(out_dir, h5_file, meme_file[, ...])	Run tomtom on motifs in a modisco report
`run_modisco`(→ None)	Run TF-Modisco to get relevant motifs for a set of inputs, and optionally score the

Module Contents#

grelu.interpret.modisco._ism_attrs(model, seqs: List[str], one_hot: torch.tensor, prediction_transform: Callable | None, start: int, end: int, devices: str | int, num_workers: int, batch_size: int, genome: str)[source]#: Perform ISM and format the results for TF-Modisco.

grelu.interpret.modisco._add_tomtom_to_modisco_report(modisco_dir: str, tomtom_results: pandas.DataFrame, meme_file: str, top_n_matches: int) → None[source]#: Modified from jmschrei/tfmodisco-lite

grelu.interpret.modisco._tomtom_on_modisco(out_dir: str, h5_file: str, meme_file: str, top_n_matches: int = 10, trim_threshold: float = 0.3)[source]#: Run tomtom on motifs in a modisco report

grelu.interpret.modisco.run_modisco(model, seqs: pandas.DataFrame | numpy.array | List[str], genome: str | None = None, prediction_transform: Callable | None = None, window: int = None, meme_file: str = None, out_dir: str = 'outputs', devices: str | int = 'cpu', num_workers: int = 1, batch_size: int = 64, n_shuffles: int = 10, seed=None, method: str = 'deepshap', correct_grad: bool = False, **kwargs) → None[source]#

Run TF-Modisco to get relevant motifs for a set of inputs, and optionally score the motifs against a reference set of motifs using TOMTOM

Parameters:

model – A trained deep learning model
seqs – Input DNA sequences as genomic intervals, strings, or integer-encoded form.
genome – Name of the genome to use. Only used if genomic intervals are provided.
prediction_transform – A module to transform the model output
window – Sequence length over which to consider attributions
meme_file – Path to a MEME file containing reference motifs for TOMTOM.
out_dir – Output directory
devices – Indices of devices to use for model inference
num_workers – Number of workers to use for model inference
batch_size – Batch size to use for model inference
n_shuffles – Number of times to shuffle the background sequences for deepshap.
seed – Random seed
method – Either “deepshap”, “saliency” or “ism”.
correct_grad – If True, gradients will be corrected using the method of Majdandzic et al. (PMID: 37161475). Only used with method=’saliency’.
**kwargs – Additional arguments to pass to TF-Modisco.

Raises:

NotImplementedError – if the method is neither “deepshap” nor “ism”

grelu.interpret.modisco#

Functions#

Module Contents#

This Page