pyvdrm

Tools for interpreting drug resistance mutations in viral amino acid sequences

A DRM calling algorithm's Rules should be specified in a context free grammar. It is the intention of this package to supply the semantics for the most commonly used operators on sets of Mutations, but these can be overloaded.

For example, the syntax of the Stanford HIVdb ASI2 algorithm is specified in Backus-Naur form:

...
listitems -> comma residue
selectlist -> residue listitems*
...
scoreitem booleancondition mapper min? number | max l_par scorelist r_par

The semantics for these operators is implemented in the drm module.

With these syntax and semantics, we can define an algorithm as a set of rules that score a list of mutations.

Supported Operations

There are distinct operations one may want to apply to a rule:

Validation Malformed rules should be detected before the mutations are evaluated
Printing/Transformation Displaying rules with or without arguments is useful for debugging and reporting but additionally for translating a set of rules into other machine readable formats independent of semantics
Evaluation Given an set of positional mutations, a rule returns a score

API Synopsis

Algorithm: the domain specific implementation of a rule grammar
Rule: instantiated with a single rule belonging to the algorithm
Mutation: the vcf module represents mutations in familiar notation
Score: the result of evaluating a rule with a set of mutations

An environment is a set of Mutations on which a Rule is applied.

Application Notes

The three application domains are:

The clinician that evaluates mutations to determine drug resistance
The bioinformatician that determines drug resistant mutations
The data scientist that derives DRM algorithms

The intention of this package is to facilitate use of non-ambiguous human readable drug resistance rules but the rules are also machine readable. For example, a machine learning algorithm should be able to derive these rules and close the gap between research and clinical use.

Implementing a DRM algorithm

Define the rule syntax with pyparsing
Implement the semantics for the rule's operators
Define test cases

Creating rules for an existing algorithm

In the case where you are adding rules to an existing rule set for a new drug:

# import the DRM calling algorithm
from pyvdrm.asi2 import ASI2
from pyvdrm.vcf import Mutation

# define
rule = ASI2("SCORE FROM (MAX (L100T => 20, S282N => 15))")

# rule is callable and accepts a set of mutations as the interpretation
# environment:

score = rule(Mutation.from_string("S282N"))

The bioinformatics pipeline that reports drug resistance mutations would store the bank of rule strings for each drug and genotype.

Using pyvdrm to produce drug resistance calls

The vcf module has basic helper functions for construction lists of mutation calls:

from pyvdrm.vcf import VariantCalls
from pyvdrm.asi2 import ASI2

calls = VariantCalls(reference="APITAYAQQTRGLLGCIITSLTGRD",
                     sample=   "APITAYAQQTRGLLTCIITSLTGRD")
# mutation G15T ------------------------------^

rule = ASI2('SCORE FROM ( G15T => 5 )')
score = rule(calls)
print(score)  # => 5

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
.idea		.idea
pyvdrm		pyvdrm
.gitignore		.gitignore
.travis.yml		.travis.yml
README.md		README.md
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.idea

.idea

pyvdrm

pyvdrm

.gitignore

.gitignore

.travis.yml

.travis.yml

README.md

README.md

setup.cfg

setup.cfg

setup.py

setup.py

Repository files navigation

pyvdrm

Supported Operations

API Synopsis

Application Notes

Implementing a DRM algorithm

Creating rules for an existing algorithm

Using pyvdrm to produce drug resistance calls

About

Releases 7

Packages

Contributors 3

Languages

cfe-lab/pyvdrm

Folders and files

Latest commit

History

Repository files navigation

pyvdrm

Supported Operations

API Synopsis

Application Notes

Implementing a DRM algorithm

Creating rules for an existing algorithm

Using pyvdrm to produce drug resistance calls

About

Resources

Stars

Watchers

Forks

Languages