hmm

integron_finder.hmm.read_hmm(replicon_id, prot_db, infile, cfg, evalue=1.0, coverage=0.5)[source]

Function that parse hmmer –out output and returns a pandas DataFrame filter output by evalue and coverage. (Being % of the profile aligned)

Parameters:
  • replicon_id (str) – the id of the replicon
  • prot_db (integron_finder.prot_db.ProteinDB object.) – The protein database corresponding to the replicon translation
  • infile (str) – the hmm output (in tabulated format) to parse
  • cfg (integron_finder.config.Config object.) – the config
  • evalue (float) – filter out hits with evalue greater tha evalue.
  • coverage (float) – filter out hits with coverage under coverage (% of the profile aligned)
Returns:

data Frame with columns:

”Accession_number”, “query_name”, “ID_query”, “ID_prot”, “strand”, “pos_beg”, “pos_end”, “evalue”
each row correspond to a hit.

Return type:

a pandas.DataFrame

integron_finder.hmm.scan_hmm_bank(path)[source]
Parameters:path (str) –
  • if the path is a dir: return all files ending with .hmm in the dir
  • if the path is a file: parse the file, each line must be an expression (glob) pointing to hmm files
Returns:lists of hmm files to consider for annotation
Return type:list of str
Raises:IOError – if the path does not exists