TY - JOUR
T1 - Validation of X-ray Crystal Structure Ensemble Representations of SARS-CoV-2 Main Protease by Solution NMR Residual Dipolar Couplings
AU - Shen, Yang
AU - Robertson, Angus
AU - Bax, Ad
PY - 2023/6/1
Y1 - 2023/6/1
N2 - Considerable debate has focused on whether sampling of molecular dynamics trajectories restrained by crystallographic data can be used to develop realistic ensemble models for proteins in their natural, solution state. For the SARS-CoV-2 main protease, Mpro, we evaluated agreement between solution residual dipolar couplings (RDCs) and various recently reported multi-conformer and dynamic-ensemble crystallographic models. Although Phenix-derived ensemble models showed only small improvements in crystallographic Rfree, substantially improved RDC agreement over fits to a conventionally refined 1.2-Å X-ray structure was observed, in particular for residues with above average disorder in the ensemble. For a set of six lower resolution (1.55–2.19 Å) Mpro X-ray ensembles, obtained at temperatures ranging from 100 to 310 K, no significant improvement over conventional two-conformer representations was found. At the residue level, large differences in motions were observed among these ensembles, suggesting high uncertainties in the X-ray derived dynamics. Indeed, combining the six ensembles from the temperature series with the two 1.2-Å X-ray ensembles into a single 381-member “super ensemble” averaged these uncertainties and substantially improved agreement with RDCs. However, all ensembles showed excursions that were too large for the most dynamic fraction of residues. Our results suggest that further improvements to X-ray ensemble refinement are feasible, and that RDCs provide a sensitive benchmark in such endeavors. Remarkably, a weighted ensemble of 350 PDB Mpro X-ray structures provided slightly better cross-validated agreement with RDCs than any individual ensemble refinement, implying that differences in lattice confinement also limit the fit of RDCs to X-ray coordinates.
AB - Considerable debate has focused on whether sampling of molecular dynamics trajectories restrained by crystallographic data can be used to develop realistic ensemble models for proteins in their natural, solution state. For the SARS-CoV-2 main protease, Mpro, we evaluated agreement between solution residual dipolar couplings (RDCs) and various recently reported multi-conformer and dynamic-ensemble crystallographic models. Although Phenix-derived ensemble models showed only small improvements in crystallographic Rfree, substantially improved RDC agreement over fits to a conventionally refined 1.2-Å X-ray structure was observed, in particular for residues with above average disorder in the ensemble. For a set of six lower resolution (1.55–2.19 Å) Mpro X-ray ensembles, obtained at temperatures ranging from 100 to 310 K, no significant improvement over conventional two-conformer representations was found. At the residue level, large differences in motions were observed among these ensembles, suggesting high uncertainties in the X-ray derived dynamics. Indeed, combining the six ensembles from the temperature series with the two 1.2-Å X-ray ensembles into a single 381-member “super ensemble” averaged these uncertainties and substantially improved agreement with RDCs. However, all ensembles showed excursions that were too large for the most dynamic fraction of residues. Our results suggest that further improvements to X-ray ensemble refinement are feasible, and that RDCs provide a sensitive benchmark in such endeavors. Remarkably, a weighted ensemble of 350 PDB Mpro X-ray structures provided slightly better cross-validated agreement with RDCs than any individual ensemble refinement, implying that differences in lattice confinement also limit the fit of RDCs to X-ray coordinates.
U2 - 10.1016/j.jmb.2023.168067
DO - 10.1016/j.jmb.2023.168067
M3 - Article
C2 - 37330294
SN - 1089-8638
VL - 435
SP - 168067
JO - Journal of Molecular Biology
JF - Journal of Molecular Biology
IS - 11
ER -