====== Protein Folding in Solution ======
In this exercise, you will calculate the protein folding free energy in aqueous solution using thermodynamic integration, a method based on molecular dynamics (MD). The protein will be described by the empirical force field, CHARMM, [[http://mackerell.umaryland.edu/charmm_ff.shtml]]
===== Background =====
A model protein you will have to deal with is the alanine decapeptide. The folding/unfolding will be achieved by stretching/compressing the chain and fixing the distance between the end carbon atoms in it: atoms 7 and 98. This distance is called a collective variable. At each distance one runs the MD simulation (constrained MD) to extract the time-averaged forces acting on the collective variable, $F(x)$. Then, a free energy difference can be calculated via thermodynamic integration (TI):
\begin{equation}
\Delta A = -\int_a^b F(x)\, dx
\end{equation}
Here $a$ and $b$ are the initial and the final values of the collective variable. TI is a general method, which can be applied to a variety of processes, e.g. phase transitions, electron transfer etc.
===== Task 1: Familiarize yourself =====
Download the files: {{ :exercises:2017_uzh_acpc2:deca_ala.tar.gz |}}
''deca_ala.pdb'' (protein data base) file contains the coordinates
''deca_ala.psf'' (protein structure file) file contains connectivity data
''par_all27_prot_lipid.inp'' contains the force field parameters
''md_1836.inp'' is the CP2K input file
Open the ''deca_ala.pdb'' protein data bank format file with **vmd**. Create a new representation for the protein, e.g. of type **Ribbon** to observe the alpha-helix.
{{ :exercises:2017_uzh_acpc2:deca_ala.gif?400 |}}
===== Task 2: Perform constrained MD simulations =====
For that you have to run MD for different values of the distance between atoms 7 and 98, in each run it will be constrained. In the original file ''md_1836.inp'' it is set to $18.36$ Å as is in the ''deca_ala.pdb'' file.
- Run CP2K with ''md_1836.inp''
- Copy ''md_1836.inp'' to smth. like ''md_1536.inp'';
- Modify the PROJECT_NAME and ''TARGET'' value in the ''CONSTRAINT'' section for a new value: here 15.36;
- Run CP2K with the new input file;
- Repeat for several values in the range $15$ to $20 $ Å.
* To avoid confusion, try to perfrom every task in a new directory
* You may increase or decrease the number of MD steps, which is set to 5000 in the file, to speed-up the calculation or else get a better statiscics.
==== Constraint section TO BE modified for constrained MD ====
&CONSTRAINT
&COLLECTIVE
COLVAR 1
INTERMOLECULAR
TARGET [angstrom] 18.36
&END COLLECTIVE
&LAGRANGE_MULTIPLIERS
COMMON_ITERATION_LEVELS 1
&END
&END CONSTRAINT
===== Task 3: Evaluate the free energy difference =====
⇒ Each constrained MD will produce a ''.LagrangeMultLog''-files, which look like this:
Shake Lagrangian Multipliers: -63.547262596
Rattle Lagrangian Multipliers: 63.240598387
Shake Lagrangian Multipliers: -0.326901815
Rattle Lagrangian Multipliers: -0.318145579
Make sure that you get the units right. The Largange multipliers are written in atomic units (Hartree/bohr), while the distances are in Angstrom.
* From these files you can calculate the average Lagrange multiplier of the Shake-algorithm like this:
grep Shake yourprojectname.LagrangeMultLog | awk '{c++ ; s=s+$4}END{print s/c}'
* The average Lagrange multiplier is the average force $F(x)$ required to constrain the atoms at the distance $x$.
* From these forces the free energy difference can be obtained via TI (see **Background**)
* Calculate $\Delta A$ numerically using the trapezoidal rule (or equivalent) with EXCEL, ORIGIN or any scripting language.