Energy decomposition analysis for large-scale first principles quantum mechanical simulations of biomolecules

Kohn-Sham density functional theory (DFT) is an extraordinarily powerful and versatile tool for calculating the properties of materials. In its conventional form, this approach scales cubically with the size of the system under study. This scaling becomes prohibitive when investigating larger arrang...

Full description

Bibliographic Details
Main Author: Phipps, Maximillian Joshua Sebastian
Other Authors: Skylaris, Chris-Kriton
Published: University of Southampton 2017
Subjects:
541
Online Access:https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.714602
id ndltd-bl.uk-oai-ethos.bl.uk-714602
record_format oai_dc
collection NDLTD
sources NDLTD
topic 541
spellingShingle 541
Phipps, Maximillian Joshua Sebastian
Energy decomposition analysis for large-scale first principles quantum mechanical simulations of biomolecules
description Kohn-Sham density functional theory (DFT) is an extraordinarily powerful and versatile tool for calculating the properties of materials. In its conventional form, this approach scales cubically with the size of the system under study. This scaling becomes prohibitive when investigating larger arrangements such as biomolecules and nanostructures. More recently linear-scaling approaches have been developed that overcome this limitation, allowing calculations to be performed on systems many thousands of atoms in size. An example of such an approach is the ONETEP code which uses a plane wave-like basis set and is based upon the use of spherically-localised orbitals. A simple yet common calculation performed using ab initio codes is the total (ground state) energy calculation. By comparing the energy of isolated parts of a system to the energy of the combined system, we are able to obtain the energy of interaction. This quantity is useful as it provides a relative measure of the enthalpic stability of an interaction which can be compared to other systems. Equally, however, this quantity gives little indication of the driving forces that lead to the interaction energy we observe. A number of approaches have been developed that aim to identify these driving forces. Energy decomposition analysis (EDA) refers to the set of methods that decompose the interaction energy into physically relevant energy components which add to the full interaction energy. Few studies have applied EDA approaches to larger systems in the thousand-atom regime, with the vast majority of investigations focussing on small system studies (less than 100 atoms in size). These methods have shown varying degrees of success. In this work, we have evaluated the suitability of a selection of popular EDA methods in decomposing the interaction energies of small biomolecule-like systems. Based on the results of this review, we developed a linear-scaling EDA approach in the ONETEP code that separates the intermolecular interaction energy into chemically distinct components (electrostatic, exchange, correlation, Pauli repulsion, polarisation, and charge transfer). The intermediate state used to calculate polarisation, also known as the absolutely localised molecular orbital (ALMO) state, has the key advantage of being fully antisymmetric and variationally optimised. The linear-scaling capability of the scheme is based on use of an adaptive purification approach and sparse matrix equations. We demonstrate the accuracy of this approach in reproducing the energy component values of its Gaussian basis counterparts, and present a remedy to the limitation of polarisation and charge transfer basis set dependence that is based on the property of strict localisation of the ONETEP orbitals. Additionally, we show the method to have mild exchange-correlation functional and atomic coordinate dependence. We have demonstrated the high value of our method by applying it to the thrombin protein interacting with a number of small binders. Here, we used our scheme in combination with electron density difference (EDD) plots to identify the key protein and ligand regions that contribute to polarisation and charge transfer. In our studies, we assessed convergence of the EDA components with protein truncation up to a total system size of 4975 atoms. Additionally, we applied our EDA to binders that had been partitioned into smaller fragments. Here, we accurately quantified the bonding contributions of key ligand moieties with particular regions of the protein cavity. We assessed how accurately the ligand binding components are reproduced by the fragment contributions using an additivity measure. Using this measure, we showed the fragment binding components to add up to the full ligand binding component with overall minimal additivity error. We also investigated the energy components of a series of small thrombin S1 pocket binders all less than 30 atoms in size. In this study, we demonstrate the EDA and EDD plots as tools for understanding the relative importance of different binder structural features and positionings within the pocket. Overall, we show our EDA method to be a stable and powerful approach for the analysis of interaction energies in systems of large size. The application of this method is not limited to biomolecular studies, and we expect that this approach can be readily applied to analyses within other fields, for example materials, catalysts, and nanostructures.
author2 Skylaris, Chris-Kriton
author_facet Skylaris, Chris-Kriton
Phipps, Maximillian Joshua Sebastian
author Phipps, Maximillian Joshua Sebastian
author_sort Phipps, Maximillian Joshua Sebastian
title Energy decomposition analysis for large-scale first principles quantum mechanical simulations of biomolecules
title_short Energy decomposition analysis for large-scale first principles quantum mechanical simulations of biomolecules
title_full Energy decomposition analysis for large-scale first principles quantum mechanical simulations of biomolecules
title_fullStr Energy decomposition analysis for large-scale first principles quantum mechanical simulations of biomolecules
title_full_unstemmed Energy decomposition analysis for large-scale first principles quantum mechanical simulations of biomolecules
title_sort energy decomposition analysis for large-scale first principles quantum mechanical simulations of biomolecules
publisher University of Southampton
publishDate 2017
url https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.714602
work_keys_str_mv AT phippsmaximillianjoshuasebastian energydecompositionanalysisforlargescalefirstprinciplesquantummechanicalsimulationsofbiomolecules
_version_ 1718730958190936064
spelling ndltd-bl.uk-oai-ethos.bl.uk-7146022018-09-05T03:36:04ZEnergy decomposition analysis for large-scale first principles quantum mechanical simulations of biomoleculesPhipps, Maximillian Joshua SebastianSkylaris, Chris-Kriton2017Kohn-Sham density functional theory (DFT) is an extraordinarily powerful and versatile tool for calculating the properties of materials. In its conventional form, this approach scales cubically with the size of the system under study. This scaling becomes prohibitive when investigating larger arrangements such as biomolecules and nanostructures. More recently linear-scaling approaches have been developed that overcome this limitation, allowing calculations to be performed on systems many thousands of atoms in size. An example of such an approach is the ONETEP code which uses a plane wave-like basis set and is based upon the use of spherically-localised orbitals. A simple yet common calculation performed using ab initio codes is the total (ground state) energy calculation. By comparing the energy of isolated parts of a system to the energy of the combined system, we are able to obtain the energy of interaction. This quantity is useful as it provides a relative measure of the enthalpic stability of an interaction which can be compared to other systems. Equally, however, this quantity gives little indication of the driving forces that lead to the interaction energy we observe. A number of approaches have been developed that aim to identify these driving forces. Energy decomposition analysis (EDA) refers to the set of methods that decompose the interaction energy into physically relevant energy components which add to the full interaction energy. Few studies have applied EDA approaches to larger systems in the thousand-atom regime, with the vast majority of investigations focussing on small system studies (less than 100 atoms in size). These methods have shown varying degrees of success. In this work, we have evaluated the suitability of a selection of popular EDA methods in decomposing the interaction energies of small biomolecule-like systems. Based on the results of this review, we developed a linear-scaling EDA approach in the ONETEP code that separates the intermolecular interaction energy into chemically distinct components (electrostatic, exchange, correlation, Pauli repulsion, polarisation, and charge transfer). The intermediate state used to calculate polarisation, also known as the absolutely localised molecular orbital (ALMO) state, has the key advantage of being fully antisymmetric and variationally optimised. The linear-scaling capability of the scheme is based on use of an adaptive purification approach and sparse matrix equations. We demonstrate the accuracy of this approach in reproducing the energy component values of its Gaussian basis counterparts, and present a remedy to the limitation of polarisation and charge transfer basis set dependence that is based on the property of strict localisation of the ONETEP orbitals. Additionally, we show the method to have mild exchange-correlation functional and atomic coordinate dependence. We have demonstrated the high value of our method by applying it to the thrombin protein interacting with a number of small binders. Here, we used our scheme in combination with electron density difference (EDD) plots to identify the key protein and ligand regions that contribute to polarisation and charge transfer. In our studies, we assessed convergence of the EDA components with protein truncation up to a total system size of 4975 atoms. Additionally, we applied our EDA to binders that had been partitioned into smaller fragments. Here, we accurately quantified the bonding contributions of key ligand moieties with particular regions of the protein cavity. We assessed how accurately the ligand binding components are reproduced by the fragment contributions using an additivity measure. Using this measure, we showed the fragment binding components to add up to the full ligand binding component with overall minimal additivity error. We also investigated the energy components of a series of small thrombin S1 pocket binders all less than 30 atoms in size. In this study, we demonstrate the EDA and EDD plots as tools for understanding the relative importance of different binder structural features and positionings within the pocket. Overall, we show our EDA method to be a stable and powerful approach for the analysis of interaction energies in systems of large size. The application of this method is not limited to biomolecular studies, and we expect that this approach can be readily applied to analyses within other fields, for example materials, catalysts, and nanostructures.541University of Southamptonhttps://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.714602https://eprints.soton.ac.uk/410305/Electronic Thesis or Dissertation