Antibody HM API

The Antibody HM API is our highest-resolution structure prediction tool for antibody variable regions. It runs “NextGen”, the antibody structure prediction software developed by Levitate. NextGen is built on the RosettaCM “hybridize” algorithm from the Homology Modeling protocol and has been customized for antibody structures: it incorporates an antibody-specific database and sequence parsing, and it accounts for heavy-light chain orientation. On the AMA-II antibody structure dataset, it produced more accurate models than other leading software, including Schrödinger and older Rosetta algorithms.

To further validate its effectiveness, we partnered with BIOCAD and tested NextGen against other major software on a dataset of 26 antibody structures. NextGen consistently produced the most accurate models across several RMSD (root-mean-square deviation) metrics.

Examples

Command line examples

Submit an antibody HM job to model the variable domains of an antibody:

lev engine submit antibody-hm \
  --heavy-chain QVHLQESGPELVRPGASVKISCKTSGYVFSSSWMNWVKQRPGQGLKWIGRIYPGNGNTNYNEKFKGKATLTADKSSNTAYMQLSSLTSVDSAVYFCATSSAYWGQGTLLTVSAAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVPR \
  --light-chain DIQMTQTTSSLSASLGDRVTFSCSASQDISNYLNWYQQKPDGTIKLLIYYTSSLRSGVPSRFSGSGSGTDYSLTINNLEPEDIATYFCQQYSRLPFTFGSGTKLEIKRADAAPTVSIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC

Submit an antibody HM job to model an entire antibody:

lev engine submit antibody-hm \
  --model-full-antibody \
  --heavy-chain QVHLQESGPELVRPGASVKISCKTSGYVFSSSWMNWVKQRPGQGLKWIGRIYPGNGNTNYNEKFKGKATLTADKSSNTAYMQLSSLTSVDSAVYFCATSSAYWGQGTLLTVSAAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVPR \
  --light-chain DIQMTQTTSSLSASLGDRVTFSCSASQDISNYLNWYQQKPDGTIKLLIYYTSSLRSGVPSRFSGSGSGTDYSLTINNLEPEDIATYFCQQYSRLPFTFGSGTKLEIKRADAAPTVSIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC
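When many antibodies need to be submitted, the command above can be assembled programmatically. The following is a minimal sketch that only builds the argument list from the flags shown in these examples; the function name build_submit_command and the short sequences are illustrative placeholders, not part of the Levitate tooling.

```python
import shlex


def build_submit_command(heavy_chain, light_chain, model_full_antibody=False):
    """Return the argument list for `lev engine submit antibody-hm`.

    Hypothetical helper: it uses only the flags documented on this page
    and does not run anything itself.
    """
    cmd = ["lev", "engine", "submit", "antibody-hm"]
    if model_full_antibody:
        cmd.append("--model-full-antibody")
    cmd += ["--heavy-chain", heavy_chain, "--light-chain", light_chain]
    return cmd


if __name__ == "__main__":
    # Placeholder sequences, far shorter than a real antibody chain.
    cmd = build_submit_command("QVQLVESGG", "DIQMTQSPS", model_full_antibody=True)
    print(shlex.join(cmd))
```

The returned list can be passed to subprocess.run(cmd, check=True) on a machine where the lev command is installed.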

Python examples

Submit an antibody HM job to model the variable domains of an antibody:

from engine.antibody_hm.client import AntibodyHmClient

client = AntibodyHmClient()
job_id = client.submit(
  heavy_chain="QVHLQESGPELVRPGASVKISCKTSGYVFSSSWMNWVKQRPGQGLKWIGRIYPGNGNTNYNEKFKGKATLTADKSSNTAYMQLSSLTSVDSAVYFCATSSAYWGQGTLLTVSAAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVPR",
  light_chain="DIQMTQTTSSLSASLGDRVTFSCSASQDISNYLNWYQQKPDGTIKLLIYYTSSLRSGVPSRFSGSGSGTDYSLTINNLEPEDIATYFCQQYSRLPFTFGSGTKLEIKRADAAPTVSIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC")

Submit an antibody HM job to model an entire antibody:

from engine.antibody_hm.client import AntibodyHmClient

client = AntibodyHmClient()
job_id = client.submit(
  model_full_antibody=True,
  heavy_chain="QVHLQESGPELVRPGASVKISCKTSGYVFSSSWMNWVKQRPGQGLKWIGRIYPGNGNTNYNEKFKGKATLTADKSSNTAYMQLSSLTSVDSAVYFCATSSAYWGQGTLLTVSAAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVPR",
  light_chain="DIQMTQTTSSLSASLGDRVTFSCSASQDISNYLNWYQQKPDGTIKLLIYYTSSLRSGVPSRFSGSGSGTDYSLTINNLEPEDIATYFCQQYSRLPFTFGSGTKLEIKRADAAPTVSIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC")

Inputs

Heavy and light chain sequences to model (canonical amino acids only).

Options

  • --heavy-chain
    • a protein sequence
    • Only canonical amino acids are supported.
    • Please include only the CH domain sequence. Further heavy chain sequence will be ignored.
  • --light-chain
    • a protein sequence
    • Only canonical amino acids are supported.
    • Please include only the CL domain sequence. Further light chain sequence will be ignored.
  • --model-full-antibody
    • boolean
    • Defaults to false. If set to true, the entire antibody is modeled; if false, only the variable domains are modeled.
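Because only canonical amino acids are supported, it can help to check sequences locally before submitting a job. The sketch below is one possible pre-submission check, assuming one-letter codes and the 20 standard residues; the helper names are hypothetical and this is not the validation performed by the service itself.

```python
# The 20 canonical amino acids in one-letter code.
CANONICAL_AMINO_ACIDS = frozenset("ACDEFGHIKLMNPQRSTVWY")


def clean_sequence(raw):
    """Remove all whitespace (including line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def find_noncanonical(sequence):
    """Return (position, character) pairs, 1-based, for non-canonical characters."""
    return [
        (i, aa)
        for i, aa in enumerate(sequence, start=1)
        if aa not in CANONICAL_AMINO_ACIDS
    ]


def validate_chain(raw, name):
    """Return the cleaned sequence, or raise ValueError describing the problem."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError(f"{name} sequence is empty")
    bad = find_noncanonical(seq)
    if bad:
        listing = ", ".join(f"{aa} at {pos}" for pos, aa in bad)
        raise ValueError(f"{name} contains non-canonical characters: {listing}")
    return seq
```

For example, validate_chain(heavy, "heavy chain") returns a cleaned string suitable for --heavy-chain, and raises if the sequence contains characters such as X, B, or *.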

Output

  • models (directory)
    • 5 PDB files representing the centers of the top-scoring clusters of models generated during the homology modeling process.

Output file interpretation

Levitate’s Antibody HM tool returns 5 cluster centers (selected_models.tar.gz) after running a large number of HM trajectories. This clustering is balanced to return 5 models that have good energy within their structural cluster and represent different clusters.
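As a convenience, the archive can be unpacked with the Python standard library. This is a minimal sketch that assumes the archive is a gzip-compressed tar file containing files with a .pdb extension; the internal layout of the archive is not documented here, so the code flattens any directory structure. The function name extract_models is illustrative.

```python
import tarfile
from pathlib import Path


def extract_models(archive_path, dest_dir):
    """Extract every .pdb file from a .tar.gz archive into dest_dir.

    Only regular files ending in .pdb are written, and directory parts of
    member names are dropped, so nothing is written outside dest_dir.
    Returns the sorted list of written paths.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    written = []
    with tarfile.open(archive_path, "r:gz") as tar:
        for member in tar.getmembers():
            if not (member.isfile() and member.name.lower().endswith(".pdb")):
                continue
            target = dest / Path(member.name).name
            with tar.extractfile(member) as src:
                target.write_bytes(src.read())
            written.append(target)
    return sorted(written)
```

Calling extract_models("selected_models.tar.gz", "models") would then return the paths of the extracted cluster centers.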

If all 5 models are similar even after clustering, HM was highly converged and/or the template match was very high. This is a good sign: Rosetta has good confidence in this prediction.

If there are 5 distinct predictions, particularly models that vary outside the H3 loop, the default sampling may be insufficient, or this particular problem may be harder than this API can accommodate. Please let Levitate know and we can discuss other options for this type of modeling problem.

If your models are highly similar except for the H3 loop, consider using your preferred structure as input to loop modeling for further refinement.
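One way to quantify how similar the returned models are is a pairwise distance RMSD (dRMSD) over C-alpha atoms, which compares internal distances and therefore needs no structural superposition. The sketch below is an illustrative approach, not the clustering metric used by the tool. It assumes standard fixed-column PDB ATOM records and that all models contain the same residues in the same order.

```python
import itertools
import math


def read_ca_coords(pdb_text):
    """Return C-alpha (x, y, z) tuples from PDB-format text, in file order."""
    coords = []
    for line in pdb_text.splitlines():
        # ATOM records only, so calcium ions in HETATM records are skipped.
        if line.startswith("ATOM") and line[12:16].strip() == "CA":
            coords.append(
                (float(line[30:38]), float(line[38:46]), float(line[46:54]))
            )
    return coords


def distance_rmsd(a, b):
    """dRMSD: RMS difference between corresponding intra-model distances."""
    if len(a) != len(b) or len(a) < 2:
        raise ValueError("models need the same number (>= 2) of C-alpha atoms")
    total = 0.0
    n_pairs = 0
    for i, j in itertools.combinations(range(len(a)), 2):
        diff = math.dist(a[i], a[j]) - math.dist(b[i], b[j])
        total += diff * diff
        n_pairs += 1
    return math.sqrt(total / n_pairs)


def pairwise_drmsd(models):
    """models maps a name to its coordinates; returns {(name_a, name_b): dRMSD}."""
    return {
        (m, n): distance_rmsd(models[m], models[n])
        for m, n in itertools.combinations(sorted(models), 2)
    }
```

Small values for all pairs suggest a converged prediction. To see whether differences are confined to the H3 loop, the same functions can be applied to coordinate lists with the H3 residues left out.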

Depending on settings, you may also see full_selected_models.tar.gz. If you did not use --model-full-antibody, please ignore this file.

Notes

Modeling other types of antibodies

VHH or nanobody

VHH (nanobody) antibodies consist of a single chain and are therefore compatible with the single-chain HM API service. The antibody specializations in this tool focus on the VH/VL orientation and are thus irrelevant to VHH antibodies. Further loop refinement of CDRs can be performed with the loop remodel tool.

scFv

Often, the linker for an scFv is not an important part of the problem. Many scFvs have off-the-shelf, stock linkers designed to not interfere with the underlying Fv. If you wish to explicitly model the linker, use this tool to model the Fv, and use the resulting model as a custom template in the single-chain HM API service to fill in the linker. Further loop refinement of the linker can be performed with the loop remodel tool.
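When the linker is modeled explicitly, the single-chain step needs the full scFv sequence and the position of the linker within it. The sketch below assembles that sequence from the variable domain sequences. The default (GGGGS)x3 linker is only a commonly used example, and the function names are hypothetical; substitute the linker and domain order of your own construct.

```python
# A widely used flexible linker; an illustrative default, not a recommendation.
GLY_SER_LINKER = "GGGGS" * 3


def build_scfv_sequence(vh, vl, linker=GLY_SER_LINKER, vh_first=True):
    """Join the variable domains with a linker, in VH-linker-VL or VL-linker-VH order."""
    first, second = (vh, vl) if vh_first else (vl, vh)
    return first + linker + second


def linker_span(vh, vl, linker=GLY_SER_LINKER, vh_first=True):
    """Return the 1-based, inclusive residue range occupied by the linker."""
    start = len(vh if vh_first else vl) + 1
    return start, start + len(linker) - 1
```

The residue range returned by linker_span identifies the segment that a subsequent loop refinement of the linker would target.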

Antibody full structures

--model-full-antibody can be passed as a flag to this API to return full-length antibodies instead of just the Fv. The remaining chains are modeled with a multichain homology modeling tool.
