======= Path optimization using NEB =======
In the [[geometry_optimization|last exercise]] you have calculated the energy for Ethane for two slightly different geometries and noticed that the geometry optimization was not able to change one structure into the other with lower energy. As presented in the lecture, it may happen quiet often that a minimization algorithm gets stuck in a local minimum, respectively it is not guaranteed to find the global minimum.
In this exercise, we will therefore perform Nudged Elastic Band (NEB) calculations using the same molecule as before and investigate the energy path between the two geometries.
Following are three geometry files you should put/create in a new exercise directory:
8
i = 12, E = -14.9518242480
C 6.7709731556 5.9999999991 6.2147000005
C 5.2290258859 6.0000000007 6.2147000001
H 4.8193442852 6.0000000005 5.1955521086
H 4.8183461262 6.8823861870 6.7231540475
H 4.8183461248 5.1176138164 6.7231540484
H 7.1806551657 5.9999999994 5.1955522543
H 7.1816533264 6.8823860579 6.7231539750
H 7.1816533250 5.1176139384 6.7231539741
8
i = 76, E = -14.9559722838
C 6.7640435612 6.0000000003 5.9997401503
C 5.2359564388 5.9999999997 6.0002598497
H 4.8332682152 6.0000000001 7.0232923722
H 4.8330445407 5.1142018851 5.4887808109
H 4.8330445407 6.8857981148 5.4887808113
H 7.1667317848 5.9999999999 4.9767076278
H 7.1669554593 6.8857981149 6.5112191891
H 7.1669554593 5.1142018852 6.5112191887
8
i = 76, E = -14.9518421887
C 6.7713019119 5.9963482236 5.9999305287
C 5.2296433443 6.0033677689 6.0065640271
H 4.8226439717 6.0198987997 7.0258669051
H 4.8144385670 5.1135492476 5.5115353199
H 4.8205805610 6.8779314573 5.4838302216
H 7.1765327350 5.1402836770 5.4459664211
H 7.1832404413 6.9024073310 5.5339472993
H 7.1852801526 5.9447026992 7.0166786010
8
i = 76, E = -14.9559544815
C 6.7635882192 5.9976047386 6.0012623335
C 5.2356460989 6.0047571469 6.0004006617
H 4.8320044755 6.0067336493 7.0232527262
H 4.8290963082 5.1202363024 5.4891970991
H 4.8366693774 6.8919269855 5.4881514124
H 7.1628913027 5.1334052558 6.5507178317
H 7.1674882265 5.9517120047 4.9798028986
H 7.1709320051 6.9026197936 6.4738574951
You could in principle start from the geometries you already optimized. In fact, the files ''ethane_1_opt.xyz'' and ''ethane_s1.xyz'' are geometry optimizations resulting from the previous ''ethane1.xyz'' and ''ethane2.xyz'' with slightly different settings (more details below) and some modifications.
The input file you need then looks as follows:
&GLOBAL
PROJECT ethane_neb_aba
RUN_TYPE BAND
PRINT_LEVEL MEDIUM
&END GLOBAL
&FORCE_EVAL
METHOD Quickstep ! Electronic structure method (DFT,...)
&DFT
BASIS_SET_FILE_NAME BASIS_MOLOPT
POTENTIAL_FILE_NAME POTENTIAL
&POISSON ! Solver requested for non periodic calculations
PERIODIC XYZ
&END POISSON
&SCF ! Parameters controlling the convergence of the scf. This section should not be changed.
SCF_GUESS ATOMIC
EPS_SCF 1.0E-6
MAX_SCF 300
&END SCF
&XC ! Parameters needed to compute the electronic exchange potential
&XC_FUNCTIONAL PBE
&END XC_FUNCTIONAL
&END XC
&END DFT
&SUBSYS
&CELL
ABC 12. 12. 12.
PERIODIC XYZ
&END CELL
&TOPOLOGY ! Section used to center the atomic coordinates in the given box. Useful for big molecules
&CENTER_COORDINATES
&END
COORD_FILE_FORMAT xyz
COORD_FILE_NAME ./ethane_1_opt.xyz
&END
&KIND H
ELEMENT H
BASIS_SET DZVP-MOLOPT-GTH
POTENTIAL GTH-PBE-q1
&END KIND
&KIND C
ELEMENT C
BASIS_SET DZVP-MOLOPT-GTH
POTENTIAL GTH-PBE-q4
&END KIND
&END SUBSYS
&END FORCE_EVAL
&MOTION
&BAND
BAND_TYPE CI-NEB
NUMBER_OF_REPLICA 8
K_SPRING 0.05
&CONVERGENCE_CONTROL
MAX_FORCE 0.0010
RMS_FORCE 0.0050
&END
ROTATE_FRAMES TRUE
ALIGN_FRAMES TRUE
&CI_NEB
NSTEPS_IT 2
&END
&OPTIMIZE_BAND
OPT_TYPE DIIS
OPTIMIZE_END_POINTS FALSE
&DIIS
MAX_STEPS 1000
&END
&END
&PROGRAM_RUN_INFO
&END
&CONVERGENCE_INFO
&END
&REPLICA
COORD_FILE_NAME ./ethane_s1.xyz
&END
&REPLICA
COORD_FILE_NAME ./ethane_ts.xyz
&END
&REPLICA
COORD_FILE_NAME ./ethane_s2.xyz
&END
&END BAND
&END MOTION
One notable difference to the previous input files is the specification of the periodic boundary conditions in the ''&POISSON'' section and in the ''&CELL'' (and the increased size of the box -- now 12 Å instead of 10 Å). This is to use a different solver configuration which behaves better in combination with NEB and if the box is big enough it will not change the physics.
To run this simulation, we use a slightly different command which will return right away:
$ nohup mpirun -np 8 cp2k.popt -i ethane_neb_aba.inp -o ethane_neb_aba.out &
In the background, the process is still running, which you can verify by either watching the changes to the output file (exit this command with ''CTRL+c'') using
$ tail -f ethane_neb_aba.out
or by looking at the list of your processes:
$ ps uxf
We replaced the CP2K executable ''cp2k.sopt'' with ''cp2k.popt'', which is a parallel version of CP2K. By prefixing the command with ''mpirun -np 8'' we tell it to run it using the MPI system using 8 cores. And finally to have the command continue to run even if you log out, we prefixed everything with ''nohup''. The ampersand ''&'' at the end is to run everything in the background.
This may take a couple of hours. Continue with the exercises below once the calculation finishes.
====== Visualize the trajectory and plot the energy curve ======
When you take another peek at the input file you used to run the calculation, you will notice that we specified ''NUMBER_OF_REPLICA 8'', which means that CP2K will generate in total 8 beads (3 we specified in the ''&REPLICA'' sections, 5 will be generated automatically by interpolation).
You should therefore find 8 files named ''ethane_neb_aba-pos-Replica_nr_1-1.xyz''..''ethane_neb_aba-pos-Replica_nr_8-1.xyz'' in your exercise directory, containing the optimization of each bead. To get the trajectory over the band, we extract the last frame (see the tip below) and write it into a separate file named ''ethane_neb_aba_8r.xyz'':
$ for i in {1..8} ; do tail -n 10 "ethane_neb_aba-pos-Replica_nr_${i}-1.xyz" >> ethane_neb_aba_8r.xyz ; done
Look at the movement again using VMD.
The anatomy of a XYZ file:
A XYZ file consists of one or multiple blocks (frames) of the following:
8 <-- the number of atoms
i = 0, E = -14.9559722838 <-- some comment, CP2K writes the number of the iteration and the resulting energy here
C 6.7640435612 6.0000000003 5.9997401503 <-- the atomic symbol and position
C 5.2359564388 5.9999999997 6.0002598497 <-- .. for the specified number of atoms
H 4.8332682152 6.0000000001 7.0232923722
H 4.8330445407 5.1142018851 5.4887808109
H 4.8330445407 6.8857981148 5.4887808113
H 7.1667317848 5.9999999999 4.9767076278
H 7.1669554593 6.8857981149 6.5112191891
H 7.1669554593 5.1142018852 6.5112191887
Extracting the last frame (the optimized geometry in case of a geometry optimization) is therefore simple once you know the number of atoms using the command ''tail'', which can be used to get the last ''N'' lines of a file using the switch ''-n N''.
In our case of 8 atoms this is:
$ tail -n 10 geo_opt_output.xyz
Since CP2K writes the energy in the comment line of each frame in the XYZ file (see tip above), we can extract the energy values for each bead directly from the newly generated ''ethane_neb_aba_8r.xyz'':
awk '/E =/ {i=i+1; printf "%s %16.8f\n", i, $6}' ethane_neb_aba_8r.xyz
Create a plot for this energy curve.
====== Vibrational analysis ======
To verify whether the point at the highest energy is actually a transition state, we will be doing a vibrational analysis.
First identify the bead with the highest energy (see exercise above) and create a new XYZ file named ''ethane_neb_aba_TS.xyz'' with the respective coordinates (extracted from either the correct ''ethane_neb_aba-pos-Replica_nr_N-1.xyz'' file or the ''ethane_neb_aba_8r.xyz'').
Use the following input file and the same command as above (with different input and output file names of course) to generate the analysis.
&GLOBAL
PROJECT ethane_TS_va
RUN_TYPE NORMAL_MODES
PRINT_LEVEL MEDIUM
&END GLOBAL
&FORCE_EVAL
METHOD Quickstep ! Electronic structure method (DFT,...)
&DFT
BASIS_SET_FILE_NAME BASIS_MOLOPT
POTENTIAL_FILE_NAME POTENTIAL
&POISSON ! Solver requested for non periodic calculations
PERIODIC XYZ
&END POISSON
&SCF ! Parameters controlling the convergence of the scf. This section should not be changed.
SCF_GUESS ATOMIC
EPS_SCF 1.0E-7
MAX_SCF 300
&END SCF
&XC ! Parametes needed to compute the electronic exchange potential
&XC_FUNCTIONAL PBE
&END XC_FUNCTIONAL
&END XC
&END DFT
&SUBSYS
&CELL
ABC 12. 12. 12.
PERIODIC XYZ
&END CELL
&TOPOLOGY ! Section used to center the atomic coordinates in the given box. Useful for big molecules
&CENTER_COORDINATES
&END
COORD_FILE_FORMAT xyz
COORD_FILE_NAME ./ethane_neb_aba_TS.xyz
&END
&KIND H
ELEMENT H
BASIS_SET DZVP-MOLOPT-GTH
POTENTIAL GTH-PBE-q1
&END KIND
&KIND C
ELEMENT C
BASIS_SET DZVP-MOLOPT-GTH
POTENTIAL GTH-PBE-q4
&END KIND
&END SUBSYS
&END FORCE_EVAL
&VIBRATIONAL_ANALYSIS
NPROC_REP 1
DX 0.01
FULLY_PERIODIC
&PRINT
&MOLDEN_VIB
&END
&CARTESIAN_EIGS
&END
&PROGRAM_RUN_INFO
&EACH
REPLICA_EVAL 1
&END
&END
&END
&END
Once this run completes, you should find a file ''ethane_TS_va-VIBRATIONS-1.mol''.
Now we are going to use the application //molden// (which you can load using ''module load molden'') to visualize the vibrational modes:
$ molden ethane_TS_va-VIBRATIONS-1.mol
Click the //Norm. Mode// checkbox in the //Molden Control// window to list all the modes. What is the lowest frequency you get? By clicking on it you can visualize it.
The presence of a negative (imaginary) mode means that it is actually a transition state (and not stable).
Now repeat the same steps presented here for the bead with the lowest energy. What is now the first frequency you get in the list? Is this geometry stable?
Please note: while you should get only 18 different frequencies you get 21 instead. That means that 3 frequencies are global rotations instead of modes in the molecule and should be ignored when looking for negative frequencies to identify whether a conformer is stable or not.