RFAllAtom API
The RFAllAtom API provides an interface to access the RoseTTAFoldAllAtom folding tool. This tool is used for structure prediction of complexes involving both protein and non-protein components. It supports Protein, DNA, post translational modifications of proteins, external ligands, and more. Currently RNA is not working properly in RFAllAtom so RFNucleicAcid should be used instead for RNA contaning complexes. Scientific Accuracy is comparable between RFNucleicAcid and RFAllAtom. The source code and additional documentation can be found on the github page.
Examples
Predict a complex which contains protein, dna, and a standalone ligand
lev engine submit rf-all-atom --fasta-file input.fasta --dna-fastas input_dna.fasta --ligand-sdf input_ligand.sdf
Predict a complex which contains multiple protein, dna, and ligand elements
lev engine submit rf-all-atom --fasta-file input1.fasta --fasta-file input2.fasta --dna-fasta input_dna1.fasta --dna-fasta input_dna2.fasta --ligand-sdf input_ligand1.sdf --ligand-sdf input_ligand2.sdf
Predict a complex which contains protein, dna, and a ligand which is covalently bound to the protein
lev engine submit rf-all-atom --fasta-file input.fasta --dna-fastas input_dna.fasta --ligand-sdf input_ligand.sdf --covalent-bonds [((\"$protein_chain\", \"$protein_res\", \"$atom_name\"), (\"$ligand_chain\", \"$ligand_atomnum\"), (\"$atom1_chirality\", \"$atom2_chirality\"))]
Inputs
--fasta-file
(str)- The path to the protein fasta file. Multiple files can be included by using the flag multiple times.
--dna-fasta
(str)- The path to the dna fasta file. Multiple files can be included by using the flag multiple times.
--ligand-sdf
(str)- The path to the ligand sdf file(s)
Options
--covalent-bonds
- A json string defining the covalent bonds. See the RoseTTAFold-All-Atom github page linked above for more details.
--gpu-type
- The type of GPU to use. If more than about 500 protein residues or about 50 nucleic acid residues are present a A100 gpu should be used. Otherwise the default t4 should be sufficient.
Outputs
The output is a tarball with the file labeled “results.pdb” being the predicted structure. The other files mostly represent intermediates and logs which are useful for debugging.