biopython v1.71.0 Bio.HMM.Trainer.AbstractTrainer

Provide generic functionality needed in all trainers.

Link to this section Summary

Functions

Initialize

Get a maximum likelihood estimation of transition and emmission

Calculate the log likelihood of the training seqs

Calculate the maximum likelihood estimator

Link to this section Functions

Initialize.

Link to this function estimate_params()

Get a maximum likelihood estimation of transition and emmission.

Arguments:

  • transition_counts — A dictionary with the total number of counts of transitions between two states.
  • emissions_counts — A dictionary with the total number of counts of emmissions of a particular emission letter by a state letter.

This then returns the maximum likelihood estimators for the transitions and emissions, estimated by formulas 3.18 in Durbin et al::

a_{kl} = A_{kl} / sum(A_{kl'})
e_{k}(b) = E_{k}(b) / sum(E_{k}(b'))

Returns: Transition and emission dictionaries containing the maximum likelihood estimators.

Link to this function log_likelihood()

Calculate the log likelihood of the training seqs.

Arguments:

  • probabilities — A list of the probabilities of each training sequence under the current parameters, calculated using the forward algorithm.

Calculate the maximum likelihood estimator.

This can calculate maximum likelihoods for both transitions and emissions.

Arguments:

  • counts — A dictionary of the counts for each item.

See estimate_params for a description of the formula used for calculation.