LigandMPNN API

The LigandMPNN API provides an interface to the LigandMPNN protein design tool. This tool is the latest version of the MPNN based design tools and is capable of running with the original Protein MPNN weights as well as the versions which support ligands. This tool takes as input a PDB file and rapidly generates new sequences predicted to fold to the backbone of the input PDB.

Command Line Interface

Examples

Predict a single new sequence for an input PDB using the (default) LigandMPNN weights

lev engine submit ligand-mpnn input.pdb

Predict a single new sequence for an input PDB using the original ProteinMPNN weights

lev engine submit ligand-mpnn input.pdb \
	--model-type protein_mpnn

Predict 1000 new sequences for an input PDB

lev engine submit ligand-mpnn input.pdb \
	--n-mpnn-designs 1000

Predict 1000 new sequences for an input PDB using an elevated temperature (default temperature is 0.1)

lev engine submit ligand-mpnn 1ubq.pdb \
	--n-mpnn-designs 1000 \
	--sampling-temperature 0.2

Predict a single new sequence for an input PDB where all the chains are symmetric and should have the same sequence

lev engine submit ligand-mpnn input.pdb \
	--homo-oligomer 1

Predict a single new sequence for an input PDB where specific residues should be treated symmetrically

lev engine submit ligand-mpnn input.pdb \
	--symmetry-residues 'A1,B1|B1,B2|A3,B3' \
	--symmetry-weights '0.5,0.5|0.5,0.5|0.5,0.5'

Predict a single new sequence for an input PDB where specific residues should be held fixed and not redesigned

lev engine submit ligand-mpnn input.pdb \
	--fixed-residue-positions "A1 A2 A3"

Predict a single new sequence for an input PDB where specific residues should be designed and all others held fixed

lev engine submit ligand-mpnn input.pdb \
	--redesigned-residues "A1 A2 A3"

Predict a single new sequence for an input PDB where specific chains should be designed and all others held fixed

lev engine submit ligand-mpnn input.pdb \
	--chains-to-design "A,B"

Predict a single new sequence for an input PDB where specific amino acids are not allowed in the designs

lev engine submit ligand-mpnn input.pdb \
	--omit-aa "CPG"

Predict a single new sequence for an input PDB where specific amino acids are biased in the designs. Higher values are more likely, negative values are less likely

lev engine submit ligand-mpnn input.pdb \
	--bias-aa 'A:10.0,Y:-10.0'

Flags

--bias-aa (str) (Optional)
- Bias the designs toward specific residues
--chains-to-design (str) (Default: all)
- Set which chains should be designed
--fixed-residue-positions (str) (Optional)
- The space separated list of fixed residue positions
--gpu-type (str) (Default: t4)
- Select the GPU type to use.
- Options:
  - t4
  - a100
--homo-oligomer (int) (Default: 0)
- Is the input a homo oligomer.
--ligand-mpnn-use-atom-context (int) (Default: 1)
- Include the ligand atoms when designing the sequence. (Used to determine the effect of the ligand on the designs)
--model-type (str) (Default: ligand_mpnn)
- Specify the model type to use. Options currently include:
  - protein_mpnn
  - ligand_mpnn
--n-mpnn-designs (int) (Default: 1)
- Number of sequences to design
--omit-aa (str) (Optional)
- Exclude specified amino acids from being used in designs
--parse-atoms-with-zero-occupancy (int) (Default: 1)
- Specify whether atoms with zero occupancy in the b factor column be parsed
--parse-these-chains-only (str) (Default: all)
- Only parse specified chains
--pdb-file (str) (Required)
- Path to a PDB File to design sequences for
--redesigned-residues (str) (Optional)
- The space separated list of residues to design
--sampling-temperature (float) (Default: 0.1)
- The sampling temperature
--symmetry-residues (str) (Optional)
- Define which residues should be treated symmetrically. These should be specified in comma seperated sets with each set divided by a |. For example if A1 is symmetrically tied to B1 and A2 is tied to B2, etc… you would put “A1,B1|A2,B2|A3,B3”. If the entire structure is a homo-oligomer and each residue on the chain is tied to the matching position on all the other chains use the homo-oligomer set 1 to assign all of residues to be treated symmetrically instead of this option.
--symmetry-weights (str) (Optional)
- Define the weights for which residue should be more important for determining the sequence when treated symmetrically. For most cases setting all weights to be equal is appropriate. The syntax is the same as for the symmetric residues substituting a value between 1 and 0 instead of a residue ID. Example: "0.5,0.5|0.5,0.5|0.5,0.5".

Python Interface

Examples

Predict a single new sequence for an input PDB using the (default) LigandMPNN weights

from engine import EngineClient

client = EngineClient()
client.authorize()

job_id = client.submit_ligand_mpnn(
	pdb_paths="input.pdb"
)

Flags

bias_aa (str) (Optional)
- Bias the designs toward specific residues
chains_to_design (str) (Default: all)
- Set which chains should be designed
fixed_residue_positions (str) (Optional)
- The space separated list of fixed residue positions
gpu_type (str) (Default: t4)
- Select the GPU type to use.
- Options:
  - t4
  - a100
homo_oligomer (int) (Default: 0)
- Is the input a homo oligomer.
ligand_mpnn_use_atom_context (int) (Default: 1)
- Include the ligand atoms when designing the sequence. (Used to determine the effect of the ligand on the designs)
model_type (str) (Default: ligand_mpnn)
- Specify the model type to use.
- Options:
  - protein_mpnn
  - ligand_mpnn
n_mpnn_designs (int) (Default: 1)
- Number of sequences to design
omit_aa (str) (Optional)
- Exclude specified amino acids from being used in designs
parse_atoms_with_zero_occupancy (int) (Default: 1)
- Specify whether atoms with zero occupancy in the b factor column be parsed
parse_these_chains_only (str) (Optional)
- Only parse specified chains
pdb_paths (str)
- Path to a PDB File to design sequences for
redesigned_residues (str) (Optional)
- The space separated list of residues to design
sampling_temperature (float) (Default: 0.1)
- The sampling temperature
symmetry_residues (str) (Optional)
- Define which residues should be treated symmetrically. These should be specified in comma seperated sets with each set divided by a |. For example if A1 is symmetrically tied to B1 and A2 is tied to B2, etc… you would put “A1,B1|A2,B2|A3,B3”. If the entire structure is a homo-oligomer and each residue on the chain is tied to the matching position on all the other chains use the homo-oligomer set 1 to assign all of residues to be treated symmetrically instead of this option.
symmetry_weights (str) (Optional)
- Define the weights for which residue should be more important for determining the sequence when treated symmetrically. For most cases setting all weights to be equal is appropriate. The syntax is the same as for the symmetric residues substituting a value between 1 and 0 instead of a residue ID. Example: "0.5,0.5|0.5,0.5|0.5,0.5".

Outputs

designed_sequences.fasta
- A FASTA file containing all designed sequences. The first record in the file is the native sequence of the protein in the PDB file. The headers of the FASTA file contain score and sequence recovery values for each designed sequence