grelu.transforms.seq_transforms#

grelu.transform.seq_transforms contains classes to assign each sequence a score based on its content.

All classes must define the forward function, which should take as input DNA sequences as a list of strings, and return a numpy array containing a scalar value for each sequence.

Classes#

`PatternScore`	A class that returns a weighted score based on the number of occurrences of given subsequences.
`MotifScore`	A scorer that returns a weighted score based on the number of occurrences of given subsequences.

Module Contents#

class grelu.transforms.seq_transforms.PatternScore(patterns: List[str], weights: List[float])[source]#

A class that returns a weighted score based on the number of occurrences of given subsequences.

Parameters:

patterns – List of subsequences
weights – List of weights for each subsequence. If None, all patterns will receive a weight of 1.

patterns[source]#

weights[source]#

forward(seqs: List[str]) → List[float][source]#

Compute scores.

Parameters:: seqs – A list of input sequences as strings.

__call__(seqs: List[str]) → List[float][source]#

class grelu.transforms.seq_transforms.MotifScore(motifs: str | Dict[str, numpy.ndarray] = None, names: List[str] | None = None, weights: List[float] | None = None, pthresh: float = 0.001, rc: bool = True)[source]#

A scorer that returns a weighted score based on the number of occurrences of given subsequences.