where δi is the distance between atom i and either a reference structure or the mean position of the N equivalent atoms. This is often calculated for the backbone heavy atoms C, N, O, and Cα or sometimes just the Cα atoms.
Normally a rigid superposition which minimizes the RMSD is performed, and this minimum is returned. Given two sets of n {\displaystyle n} points v {\displaystyle \mathbf {v} } and w {\displaystyle \mathbf {w} } , the RMSD is defined as follows:
An RMSD value is expressed in length units. The most commonly used unit in structural biology is the Ångström (Å) which is equal to 10−10 m.
Typically RMSD is used as a quantitative measure of similarity between two or more protein structures. For example, the CASP protein structure prediction competition uses RMSD as one of its assessments of how well a submitted structure matches the known, target structure. Thus the lower RMSD, the better the model is in comparison to the target structure.
Also some scientists who study protein folding by computer simulations use RMSD as a reaction coordinate to quantify where the protein is between the folded state and the unfolded state.
The study of RMSD for small organic molecules (commonly called ligands when they're binding to macromolecules, such as proteins, is studied) is common in the context of docking,7 as well as in other methods to study the configuration of ligands when bound to macromolecules. Note that, for the case of ligands (contrary to proteins, as described above), their structures are most commonly not superimposed prior to the calculation of the RMSD.
RMSD is also one of several metrics that have been proposed for quantifying evolutionary similarity between proteins, as well as the quality of sequence alignments.89
"Molecular docking, estimating free energies of binding, and AutoDock's semi-empirical force field". Sebastian Raschka's Website. 2014-06-26. Retrieved 2016-06-07. http://sebastianraschka.com/Articles/2014_autodock_energycomps.html ↩
Coutsias EA, Seok C, Dill KA (2004). "Using quaternions to calculate RMSD". J Comput Chem. 25 (15): 1849–1857. doi:10.1002/jcc.20110. PMID 15376254. S2CID 18224579. /wiki/Doi_(identifier) ↩
Kabsch W (1976). "A solution for the best rotation to relate two sets of vectors". Acta Crystallographica. 32 (5): 922–923. Bibcode:1976AcCrA..32..922K. doi:10.1107/S0567739476001873. /wiki/Bibcode_(identifier) ↩
Hurley JR, Cattell RB (1962). "The Procrustes Program: Producing direct rotation to test a hypothesized factor structure". Behavioral Science. 7 (2): 258–262. doi:10.1002/bs.3830070216. /wiki/Doi_(identifier) ↩
Petitjean M (1999). "On the Root Mean Square quantitative chirality and quantitative symmetry measures" (PDF). Journal of Mathematical Physics. 40 (9): 4587–4595. Bibcode:1999JMP....40.4587P. doi:10.1063/1.532988. https://hal.archives-ouvertes.fr/hal-02122820/file/PMP.JMP_1999.pdf ↩
Petitjean M (2002). "Chiral mixtures" (PDF). Journal of Mathematical Physics. 43 (8): 185–192. Bibcode:2002JMP....43.4147P. doi:10.1063/1.1484559. https://hal.archives-ouvertes.fr/hal-02122882/file/PMP.JMP_2002.pdf ↩
Jewett AI, Huang CC, Ferrin TE (2003). "MINRMS: an efficient algorithm for determining protein structure similarity using root-mean-squared-distance" (PDF). Bioinformatics. 19 (5): 625–634. doi:10.1093/bioinformatics/btg035. PMID 12651721.{{cite journal}}: CS1 maint: multiple names: authors list (link) https://academic.oup.com/bioinformatics/article-pdf/19/5/625/657994/btg035.pdf ↩
Armougom F, Moretti S, Keduas V, Notredame C (2006). "The iRMSD: a local measure of sequence alignment accuracy using structural information" (PDF). Bioinformatics. 22 (14): e35–39. doi:10.1093/bioinformatics/btl218. PMID 16873492. https://academic.oup.com/bioinformatics/article-pdf/22/14/e35/615011/btl218.pdf ↩