Introduction
What is molecular Dynamics?
Molecular dynamics is a method in computational biochemistry used to simulate the behavior of atoms. It treats all atoms as balls connected by springs and completely ignores electrons and no bonds are created or destroyed. In a molecular dynamic simulation, the simulation time is divided up into time steps of equal length with the coordinates of the atoms changing with each time step the force is acting on the atoms and therefore their velocities are calculated from their energy. Atoms will move in a specific direction that best decreases their energy, imagine this as a three-dimensional field where the atom or imaginary balls roll down the direction of the gradient their energy is itself calculated using a potential energy function also called a force field that computes the energy of each atom based on coordinates and structural data as well as a set of defined parameters also called force-field parameters. Most force-field equations can be summed up as follows: total energy of an atom consists of a sum of its energy associated with its bonded interactions as well as the energy from non-bonded interactions. In a molecular dynamic simulation the total energy, i.e the sum of the potential and kinetic energy of the system should remain constant.
MOLECULAR DYNAMIC CYCLE
A basic molecular dynamic cycle can be described as follows. First your program reads the coordinate and structural information of your system as well as any initial velocities. Then the atomic energies are calculated using the potential energy function. From these energies, the force acting on each atom is computed; the forces can be used to calculate the acceleration of each atom in a pellet in a particular direction. These are then used to generate new coordinates for each of our atoms as well as their new velocities. The program checks if N, the specified number of time steps, has elapsed. If not, the program repeats the cycle and computes new coordinates for the system. This cycle is repeated until N number of time steps has elapsed at which point the program will write all of its computed data in an output file and terminate.
SYSTEM INFORMATION
Then this file is processed. System information is first specified usually in a group of files - a coordinate file such as a pdb or xyz file, a structural file such as a psf or grow file and usually a force field parameter file such as a prm file. Most molecular dynamic simulation programs use an input script where you specify the names of these files or even copy the file information into the actual input script. After having calculated the energy of an atom using a potential energy function, U(x), the force acting on it will be the negative derivative of U(x). Since atoms are not one but three dimensional, the force will be a combined partial derivative of each of the principal axes. Finally Newton's second law is used to calculate the acceleration acting on each atom. This will also depend on atomic mass and therefore the element that each atom is representing. Using the acceleration and an already specified time step duration, for example 1 femtosecond per step, a function is used to calculate the new atomic coordinates and velocity.
SOLVENT MODEL
The presence of a solvent greatly changes the behavior of a system and as such it is important to consider solvent models if you want to get the results that agree with experimental data. If ignored, inaccurate data will greatly change the properties of the molecules in our system. The next step up is an implicit solvent model, this is more computationally expensive because the program imagines there is a sort of solvent force that permeates our system and applies any relevant forces to our molecules to mimic the presence of a solvent. Finally, the explicit solvent model, which is the most computationally expensive but also the most accurate one, actually fills up our system with solvent molecules and does molecular dynamics calculations on them.
BOUNDARY CONDITIONS
These will govern the behavior of molecules when they come in contact with the simulation box edges or boundaries. Common approaches include a solid wall that deflects or repels atoms when they come in contact with the boundary. Periodic boundary conditions, by far the most common method as it most closely resembles real life, mean that if an atom goes into one boundary it comes out of the opposing boundary we can think of this as an ever repeating unit cell of our simulation space. Some simulation methods delete the atom or will simply crash when an atom goes into a boundary. Sometimes we even use a non-cubic simulation space such as a hexagonal spherical or cylindrical shape.
SIMULATION PROTOCOLS
There are also simulation protocols that can keep specific things in our system the same by altering different variables. We could refer to classical molecular dynamics simulation as a Microcanonical Ensemble (NVE)- Keeping constant the amount of substance, the volume and the total energy. Another method is the canonical ensemble (NVT)-Keeping constant the amount of substance, volume and temperature-which is why it is also referred to as constant temperature molecular dynamics(CTMD). We can refer to the implementation of this as a thermostat because the temperature is kept constant. The isothermal isobaric ensemble (NPT) - keeps the amount of substance, pressure and temperature constant. In the same way that we had a thermostat to keep the temperature constant, a barostat is used.
Why are we using MD?
Molecular dynamics (MD) simulation is a powerful computational technique used to study the interactions between biomolecules, including aptamers and proteins. When investigating aptamer-protein interactions, MD simulations offer several advantages and insights that make them a valuable tool:
Atomic-Level Detail: MD simulations provide a high-resolution view of the dynamics and interactions between individual atoms and molecules. This level of detail is crucial for understanding the binding mechanism of aptamers and proteins, as it reveals how specific amino acids and nucleotides interact during the complex formation.
Flexibility and Dynamics: Aptamer-protein interactions often involve conformational changes in both molecules upon binding. MD simulations can capture these dynamic processes, showing how the aptamer and protein adapt their structures over time as they come together or dissociate. This dynamic information is essential for understanding the kinetics and energetics of binding
Energetic Insights: MD simulations can provide information about the energy landscape of the interaction. By monitoring the potential energy, forces, and interactions between aptamer and protein over time, researchers can calculate binding affinities, identify key binding residues, and gain insights into the stability of the complex
Exploration of Binding Pathways: MD simulations can reveal potential binding pathways and intermediate states during aptamer-protein interactions. This information is valuable for understanding the various stages of complex formation, including the initial encounter complex, transition states, and final bound state.
Screening and Design: MD simulations can be used to screen a library of aptamer sequences for their potential to interact with a target protein. By simulating the binding of multiple aptamer candidates, researchers can identify the most promising candidates for experimental validation.
Drug Discovery: MD simulations are increasingly used in rational drug design. Understanding the aptamer-protein interaction at the atomic level can aid in the design of aptamer-based therapeutics or the optimization of existing aptamer drugs.
Experimental Hypothesis Testing: MD simulations can complement experimental studies by providing hypotheses and predictions that can guide further experiments. For example, they can suggest which amino acid residues are critical for binding or identify potential mutations to enhance binding affinity.
Reduced Cost and Time: While experimental studies of aptamer-protein interactions can be time-consuming and costly, MD simulations can provide insights at a fraction of the cost and time, making them a cost-effective and efficient research tool.
In summary, MD simulations are a valuable tool for studying aptamer-protein interactions because they offer atomic-level insights into the dynamics, energetics, and mechanisms of binding. By complementing experimental studies with computational simulations, researchers can gain a more comprehensive understanding of these interactions and accelerate the development of aptamer-based therapies and diagnostics.
The Four-Step Process
ENERGY MINIMIZATION (EM)
Minimization is an important process to carry out before our main simulation. It is designed to relax the system, distribute the energy around the system and get as many atoms or molecules out of their local minima and global minima.
EQUILIBRATION (NVT)
Equilibration is a process that we carry out before our main simulation where we equilibrate the kinetic and potential energies across the system usually to sort local groups of high energy or any other unnatural phenomena in our system that usually get created during the heating process. Where molecules are provided with energy and therefore the so-called temperature of the system is raised from zero kelvin to the specified temperature.
Dynamics(NPT)
NPT Molecular Dynamics is a computational simulation technique used in the field of molecular dynamics (MD) to study the behavior and interactions of atoms and molecules in a system under constant pressure (P), constant temperature (T), and constant number of particles (N). Each of these variables is held constant during the simulation, which allows researchers to mimic realistic conditions often encountered in experiments or real-world systems.
Applications
These include materials science, crystallography and polymer simulations, discovering protein folding pathways and protein dynamics, as well as protein protein-protein interactions, drug design, calculation of free binding energy, the study of protein ligands interactions, general study of biomolecules such as lipids, proteins, dna, water and other solvents, study of small biological structures such as viral capsids or ribosomes and that is by no means a comprehensive list. There are many more niche applications.
Software used -GROMACS
Why GROMACS?
Development of computer technology in chemistry brings many applications of chemistry, not only the application to visualize the structure of molecules but also to molecular dynamics simulation. One of them is Gromacs. Gromacs is an example of molecular dynamics application developed by Groningen University. This application is non-commercial and able to work in the operating system Linux. The main ability of Gromacs is to perform molecular dynamics simulation and minimization energy. In the molecular dynamics simulation, Gromacs does not work alone. Gromacs interacts with Pymol and Grace. Pymol is an application to visualize molecule structure and Grace is an application in Linux to display graphs. Both applications will support analysis of molecular dynamics simulation.
Other Software - AMBER, ROSETTA, NAMD, Desmond
Many other applications can be used depending on the two substances and binding interactions, parameters and Software availability. For DNA only simulations, NAMD is predominantly used, but AMBER and ROSETTA have high number of force fields which can be employed
What is CHARMM GUI? Why are we using it?
CHARMM-GUI is a user-friendly web-based interface for setting up, running, and analyzing molecular dynamics (MD) simulations of biomolecular systems. It was originally developed for the CHARMM (Chemistry at HARvard Macromolecular Mechanics) force field but has also been extended to support other popular force fields, including GROMACS.
The notable features of CHARMM-GUI:
User-Friendly Interface:
CHARMM-GUI provides an intuitive and user-friendly web interface that allows researchers, including those without extensive computational or programming experience, to set up complex MD simulations with ease.
The interface guides users through the step-by-step process of system preparation, from uploading molecular structures to specifying simulation parameters. It offers helpful explanations and tooltips for each input field, making it accessible to both novice and experienced users.
The interface supports various force fields, solvent models, and simulation packages, such as CHARMM, GROMACS, NAMD, and AMBER, allowing users to choose the software and parameters that best suit their research needs.
Automated System Setup:
CHARMM-GUI automates many aspects of system setup, saving users significant time and effort.
Users can input their molecular structures (e.g., proteins, nucleic acids, lipids) in various formats (e.g., PDB, Mol2) and specify simulation details (e.g., temperature, pressure, ion concentration).
CHARMM-GUI then generates the necessary input files (e.g., topology, coordinate, parameter files) tailored to the chosen force field and simulation software. This automation ensures consistency and accuracy in setting up simulations.
Advanced options, such as defining custom restraints, can also be configured through the interface.
Analysis and Visualization Tools:
CHARMM-GUI offers a suite of analysis and visualization tools for post-simulation data analysis.
Users can visualize simulation trajectories, generate plots and graphs of key properties (e.g., RMSD, RMSF, radial distribution functions), and calculate various structural and dynamic properties of the biomolecular system.
The web interface provides interactive visualization capabilities, allowing users to inspect the structure and behavior of their system in 3D and explore specific regions of interest.
CHARMM-GUI also supports the analysis of membrane systems, ligand binding, and other complex biomolecular processes, providing researchers with valuable insights into their simulation results.
Method used
A Buffer was made i.e. KCl at 0.15 M for simulation to take place The Water-Module used is TIP3. Gridbox for MD simulations was generated using PME-FFT, a Fourier transform method.
A waterbox was generated using CHARMM-GUI, for solvation to occur
Monte-Carlo Ion placing method was used to determine the ion positions in the solution
The Water-Module used is TIP3.
Gridbox for MD simulations was generated using PME-FFT, a Fourier transform method.
Results for CHARMM-GUI
We obtain the Input files for EM, NVT and NPT runs in .mdp format.
We also obtain the coordinates, i.e. topology file in .topol format.
Protein in solvated box structure is obtained in the .gro format
Force Fields:
What are Force Fields?
A force field encompasses two major components: a set of potential energy functions and associated parameters. Potential energy functions, derived from quantum mechanical principles or experimental data, define the energy landscapes governing atomic interactions. Parameters, on the other hand, are numerical values assigned to atoms or atom pairs, representing properties like bond lengths, angles, dihedral angles, and van der Waals radii. By calculating the total energy of a system using these functions and parameters, molecular dynamics simulations predict the system's evolution over time.
Force fields enable researchers to study complex phenomena such as protein folding, ligand binding, and chemical reactions, shedding light on drug design, material science, and biochemistry. However, force fields are not without limitations. The trade-off between accuracy and computational efficiency is a perpetual challenge, as simplified models might fail to capture nuanced interactions. Additionally, force fields are often parameterized using experimental data, which can lead to inaccuracies when extrapolating beyond the training data
Potential Energy Functions : It is also important to choose the right force field simulation. Some force fields such as charm or opls are specifically designed for biomolecules while others are better at simulating crystallographic or small molecule systems. A graph that rates different force fields on their accuracy - a lower number signifying greater agreement with experimental results
What is the need of Force Fields in Simulation?
Force fields are integral to molecular dynamics simulations due to their role in predicting atomic interactions, molecular motion, and thermodynamics. By defining forces between atoms and molecules, force fields guide trajectories, illuminate equilibrium states, and unveil reaction mechanisms. They prove indispensable for studying biological processes, designing drugs and materials, and conducting virtual experiments. Additionally, force fields offer quantitative insights, aiding in hypothesis testing and validation against experimental data. While vital, these models possess limitations stemming from their approximations, driving ongoing efforts to refine them and develop more accurate representations of molecular behavior.
CHARMM Force Field
According to the CHARMM force field the total energy of an atom consists of its energy associated with the bonding and the non-bonding interactions. The bonding energy is itself composed of the sum of the energy associated with the atomic bonds, the atomic angles, atomic dihedrals, atomic impropers and the Urey-Bradley term (described as the harmonic term in the distance between atoms 1 and 3 of some of the angle terms. The way that these manifest themselves is that, depending on the distance between two bonded atoms or the angle between three atoms or four atoms in an improper or dihedral, different energies will be assigned. There is an optimal length between two atoms and depending on the length, a different energy value will be added. Similarly for the non-bonded energy composed of van der waals interactions (also sometimes called the Leonard-Jones term) and the electrostatic interactions (which happen between charged particles). All of this combines into one equation where each contributing energy is given with some variables that are all taken from the input files. Even a small change in these could massively affect the behavior of the simulation..
Which one are we using? - CHARMM 36M
Renowned for its accuracy, CHARMM36m significantly refines protein structure and dynamics simulations. This refinement is a boon for comprehending the mechanistic underpinnings of biological processes. However, its high level of detail demands substantial computational resources. Drawing strength from its meticulous parameterization grounded in experimental data, CHARMM36m offers
Researchers harness CHARMM36m to unravel the mysteries of biomolecular systems. This force field's capability to depict atomic-level intricacies opens avenues to explore functional mechanisms and intermolecular interactions. Consequently, CHARMM36m remains an indispensable tool in the arsenal of scientists delving into the microscopic intricacies of life's building blocks.