imspy.utility package¶
Submodules¶
imspy.utility.sequence module¶
- imspy.utility.sequence.tokenize_unimod_sequence(unimod_sequence)¶
Tokenizes a sequence of modified amino acids. :type unimod_sequence:
str
:param unimod_sequence: A string representing the sequence of amino acids with modifications.- Return type:
List
[str
]- Returns:
A list of tokenized amino acids.
imspy.utility.utilities module¶
- class imspy.utility.utilities.ExponentialGaussianDistribution(μ=-3, σ=1, λ=0.25)¶
Bases:
object
- class imspy.utility.utilities.NormalDistribution(μ, σ)¶
Bases:
object
- class imspy.utility.utilities.TokenSequence(sequence_tokenized=None, jsons=None)¶
Bases:
object
- property jsons¶
- imspy.utility.utilities.exp_distribution(x, λ=1)¶
Exponential function :type x: :param x: :type λ:
float
:param λ: :return:
- imspy.utility.utilities.exp_gaussian(x, μ=-3, σ=1, λ=0.25)¶
laplacian distribution with exponential decay :type x: :param x: :type μ:
float
:param μ: :type σ:float
:param σ: :type λ:float
:param λ: :return:
- imspy.utility.utilities.gaussian(x, μ=0, σ=1)¶
Gaussian function :type x: :param x: :type μ:
float
:param μ: :type σ:float
:param σ: :return:
- imspy.utility.utilities.get_aa_num_proforma_sequence(sequence)¶
get number of amino acids in sequence
- Parameters:
sequence (str) – proforma formatted aa sequence
- Returns:
Number of amino acids
- Return type:
int
- imspy.utility.utilities.is_unimod_end(char)¶
Tests if char is end of unimod bracket
- Parameters:
char (str) – Character of a proForma formatted aa sequence
- Returns:
Whether char is end of unimod bracket
- Return type:
bool
- imspy.utility.utilities.is_unimod_start(char)¶
Tests if char is start of unimod bracket
- Parameters:
char (str) – Character of a proForma formatted aa sequence
- Returns:
Whether char is start of unimod bracket
- Return type:
bool
- imspy.utility.utilities.normal_pdf(x, mass, s=0.001, inv_sqrt_2pi=0.3989422804014327, normalize=False)¶
- Parameters:
x (
Union
[_SupportsArray
[dtype
[Any
]],_NestedSequence
[_SupportsArray
[dtype
[Any
]]],bool
,int
,float
,complex
,str
,bytes
,_NestedSequence
[Union
[bool
,int
,float
,complex
,str
,bytes
]]])mass (
float
)s (
float
)inv_sqrt_2pi (
float
)normalize (
bool
)
- imspy.utility.utilities.re_index_indices(ids)¶
Re-index indices, i.e. replace gaps in indices with consecutive numbers. Can be used, e.g., to re-index frame IDs from precursors for visualization. :type ids: :param ids: Indices.
- Returns:
Indices.
- imspy.utility.utilities.tokenize_proforma_sequence(sequence)¶
Tokenize a ProForma formatted sequence string.
- Parameters:
sequence (str) – Sequence string (ProForma formatted)
- Returns:
List of tokens
- Return type:
List
- imspy.utility.utilities.tokenizer_from_json(path)¶
load a pre-fit tokenizer from a json file :type path:
str
:param path: path to tokenizer as json file :return: a keras tokenizer loaded from json
- imspy.utility.utilities.tokenizer_to_json(tokenizer, path)¶
save a fit keras tokenizer to json for later use :type tokenizer:
Tokenizer
:param tokenizer: fit keras tokenizer to save :type path:str
:param path: path to save json to