hmm¶
-
integron_finder.hmm.
read_hmm
(replicon_id, prot_db, infile, cfg, evalue=1.0, coverage=0.5)[source]¶ Function that parse hmmer –out output and returns a pandas DataFrame filter output by evalue and coverage. (Being % of the profile aligned)
Parameters: - replicon_id (str) – the id of the replicon
- prot_db (
integron_finder.prot_db.ProteinDB
object.) – The protein database corresponding to the replicon translation - infile (str) – the hmm output (in tabulated format) to parse
- cfg (
integron_finder.config.Config
object.) – the config - evalue (float) – filter out hits with evalue greater tha evalue.
- coverage (float) – filter out hits with coverage under coverage (% of the profile aligned)
Returns: data Frame with columns:
”Accession_number”, “query_name”, “ID_query”, “ID_prot”, “strand”, “pos_beg”, “pos_end”, “evalue”each row correspond to a hit.Return type: a
pandas.DataFrame
-
integron_finder.hmm.
scan_hmm_bank
(path)[source]¶ Parameters: path (str) – - if the path is a dir: return all files ending with .hmm in the dir
- if the path is a file: parse the file, each line must be an expression (glob) pointing to hmm files
Returns: lists of hmm files to consider for annotation Return type: list of str Raises: IOError – if the path does not exists