Thursday, December 20, 2007

LEADFINDING.COM, ONLINE HIT-TO-LEAD OPTIMIZATION SERVICE

San Diego, California – December 17th, 2007 – ChemDiv Inc (San Diego) and Quantum Pharmaceuticals (Moscow) announce the launch of LEADFINDING.COM, their powerful new internet-based hit-to-lead optimization service and online chemistry store.
LEADFINDING.COM brings together, the industry leading computational chemistry software from Quantum Pharmaceuticals and ChemDiv’s world’s largest and most diverse small molecule collection for drug discovery. Fully automated web-based interface predicts binding affinities on the fly, facilitating basic hit-to-lead optimization tasks for a wide audience of researchers.
LEADFINDING.COM will be of special interest to academic and biotech researchers who have identified a small molecule hit and wish to deploy LEADFINDING.COM’s expertise in selecting candidate molecules for hit follow-up. We offer an online hit-to-lead service helping identify novel chemical lead series from ChemDiv world’s largest Discovery Collection of small molecules. LEADFINDING.COM provides unique online computational tools including hit analog searches, physiochemical properties filters, and predictions of biological activity (IC50). Availability of all selected molecules can be confirmed for online purchase and immediate delivery.
About LEADFINDING:
LEADFINDING is a joint project of ChemDiv, Inc. (ChemDiv) and Quantum Pharmaceuticals (Quantum). The project is resulting from Quantum’s effort in bringing the power of their computational models to the world life sciences community and ChemDiv’s conceptual approach to modern day discovery process. ChemDiv has been constantly contributing to the evolution of drug discovery roadmap, most recently by introducing the Chemistry Anywhere™ concept. Chemistry Anywhere™ is ChemDiv’s partnering program which by accessing a variety of online research tools shortens the time from design to wet lab results.
About ChemDiv:
ChemDiv ( www.chemdiv.com ) is a global chemistry-driven contract research organization. ChemDiv is focused on identifying and delivering pre-clinical opportunities and services to life science partners for the treatment of life-threatening diseases. Over 17 years ChemDiv provides Discovery outSourceTM solutions including medicinal and synthetic chemistry, pre-clinical development, screening libraries and global logistics. ChemDiv international research team encompasses 550 chemists and biologists in San Diego and Moscow based R&D centers.
About Quantum Pharmaceuticals:
Quantum (www.q-pharm.com) develops and commercializes industry leading computational drug discovery technologies based on applying quantum, molecular and statistical physics in molecular modeling. Our solutions help pharmaceutical companies and research facilities around the world successfully accelerate the identification and optimization of new compounds that have the potential to become drug products.

Tuesday, December 18, 2007

How good are biological data - II: Trombine, GSK, GPCR

Many bindign affinity prediction methods, such as scores and QSAR models, rely on availability of accurate information on binding constants. The figure on the left is a result of our sdf-file parser applied to trombine (blue) and GSK (yellow) binding data from BindingDB database. The parser is written with python and uses pybel to extract unique molecules from a given multimolecular sdf.
The parser not only finds identical (in Tanimoto-similarity sense) compounds, but also prints the binding constants from the sdf records. The graph shows the correlation of the reported inverse log(binding constants) for the same molecules from different entries (sources).
The result is in fact fairly impressive (the blue points): the discrepancies-"errors" are quite large and are especially profound for good (or better say very good) binders.
The yellow points represent the result of the same script over GSK-kinase activity data. Although the total number of molecules in BindDB is much larger, almost all of them are unique. The difference between different sources is not as much as for trombine.
The Figure on the right is the visualized script output for GPCR(5-HT2B) from PDSP Ki database. The situation is roughly the same: the accuracy of a typical biological experiment reported in a literature amounts roughly to a single unit of pKd.

This and previously reported correlation for HERG ion channel should serve as an example when the results of binding affinity calculations are compared to experimental data.

Monday, December 10, 2007

EMD Serono, Inc licensed Quantum Pharmaceuticals’ drug discovery technology.


Moscow, 10 December, 2007

EMD Serono, Inc entered license agreement with Quantum Pharmaceuticals to get an access to Quantum Pharmaceuticals’ small molecule hit identification computational platform and apply it in in-house research.

The Quantum Pharmaceuticals’ industry leading computational drug design technologies is based on applying quantum, molecular and statistical physics in molecular modeling and was successfully applied in different drug discovery projects. The initial term of the agreement is one year. The financial terms of the agreement were not disclosed.

Quantum Pharmaceuticals is a drug discovery company based in Moscow, Russia specializing in small molecule screening and design through the use of its proprietary technology platform.

EMD Serono, Inc. and Merck Serono S.A. are affiliates of Merck KGaA, Darmstadt, Germany, with over 16,000 employees worldwide and a strong presence on all continents.

Friday, December 7, 2007

How good are biological experiments? HERG binding data analysis


A correlation between predicted and expermentally measured values of biological activity is a natural measure of a model quality. For instance, QUANTUM docking software calculates binding free energies, which are directly comparable with experimental values of -p(binding constant, Kd). Root mean squared error between the measured and the calculated quantities is the quantitative measure of the software performance.
Whatever the correlation is presented to prove the validity of a model, another important issue is the quality of the experimental data itself. The reported values for binding constants (or activities) often vary because of different measurement strategies, experimental errors or interpretation uncertanties. To visualize the situation we investigated a few datasets for HERG binding taken from QSAR World website.
The downloaded files were saved in source folder and processed with the following simple python script (thanks to openbabel):
files = os.listdir('source/')
molecules = []
for file in files:
molfile = readfile("sdf",'source/'+file)
for mol in molfile:
molfp = mol.calcfp()
present = 0
for savedmol in molecules:
savedmolfp = savedmol.calcfp()
if (molfp | savedmolfp == 1):
present = 1
print mol.data, savedmol.data
if (not present):
molecules.append(mol)

The results where analyzed in a spreadsheet program and represented on the graph above. A lot of molecules occur multiple times in the datasets. While in many of the cases the activities coinside up to 0.01 (which most probably indicates citing from a single source), the remaining values thouch correlated with each other, differ by roughly a single pKd unit.



Tuesday, September 18, 2007

Quantum Science Overview, v 0.1 has been released

Quantum Pharmaceuticals is pleased to release Quantum Science Overview. The document conveys an overview Quantum's drug discovery solutions and scientific results. The content of the report are summarized below:
I. A Novel Integrated Approach in Drug Discovery 1
II. QUANTUM free energy vs. statistical scoring functions
III. Inside Quantum Drug Discovery Studio: Molecular Modeling Concepts and Tools
IV. QDDS and Molecular Interactions in Aqueous Environments (Solvation Model).
V. Collective (non-additive) contributions to a protein-ligand complex binding free energy.
VI. Discovery of new classes of HIV integrase inhibitors.
VII. Biological Spectra Analysis: Linking Biological Activity Profiles to Molecular Toxicity.

Appendices
A. Structure of a Basic PBPK Model applied in q-ADME
B. Binding affinity calculation of known drugs tohuman serum albumin
C. Rediscovery of Blockbuster drugs with QUANTUM

Monday, September 3, 2007

QUANTUM and albumin binding calculations: the role of protein flexibility


Drug distribution within the body is determined mainly by free (unbound) concentration of drug in circulating plasma. The unbound fraction, in turn, depends on drug absorption by plasma proteins. Human Serum Albumin (HSA) is the most abundant blood plasma protein and is produced in the liver.

Binding of a compound to HSA results in an increased solubility in plasma, decreased toxicity, and /or protection against oxydation of the bound ligand. Binding can also have a significant impact on the pharmacokinetics of drugs, e.g. prolonging in vivo half life of the therapeutic agent. However too strong binding prevents drug release in tissues. That is why HSA binding information is one of the key characteristics of a compound determining its ADME properties. Successful in silico calculation of HSA binding could provide a structural basis of drug derivatives with altered HSA-binding properties.

HSA has at least two main drug binding sites characterized as Sudlow site I and Sudlow site II, which bind a number of drugs selectively. Site I, also known as the warfarin binding site, is formed by a pocket in subdomain IIA of HSA. Site II is located in subdomain IIIA and is known as the benzodiazepine binding site. Ibuprofen and diazepam are selectively bind to site II. Multiple active sites make HSA a complicated target for structure-based modeling.

The ligands with known HSA binding affinities were taken from and prepared with a set of built in QUANTUM molecular preparation and processing tools. Acenocoumarol, Acetylsalicilic_acid, Azapropazone, Benzylpenicillin, Benzylthiouracil, Bilirubin, Canrenoate, Carbamazepine, Carbenicillin, Chlorpropamide, Diphenylhydantoin, Furosemide, Indomethacin, Methyl_p-hydroxybenzoate, N-acetyl-L-tryptophan, Oxyphenbutazone, Phenobarbital, phenyl_salicylate, Phenylbutazone, Piretanide, Propyl_p-hydroxybenzoate, Quercetin, Salicylate, Sodium_benzoate, Spironolactone, Sulfadimethoxine, Sulfamethizole, Sulfathiazole, sulfisoxazole, Tenoxicam, Tolbutamide, Warfarin, Carprofen, Chlofibrate, Iopanoate, L-tryptophan were docked on to both of the sites. Three different structures – 2BXH, 2BXF and 2BXG were taken for docking. 2BXH was used for docking to the first binding site, 2BXF and 2BXG have different structures of the binding site II and we decided to use both structures for docking. Docking grid 20x20x20A were centered around a central ligand atom in appropriate binding site.

The results of the docking runs are summarized on the Figure. Both binding sites show considerable flexibility, molecular dynamics and the binding free energy calculations with flexible protein (large red points) lead to remarkable improvement of the predicted values of HSA binding affinities with respect to experiment (smaller blue points represent the results of docking on a rigid protein model). Since QUANTUM model does not have training parameters the presented correlation proves QUANTUM abilities to predict HSA binding constants of druglike compounds to both of the active sites.

Monday, August 13, 2007

Quantum Pharmaceuticals and MOLECMO Nanobiotechnologies announce collaboration.

Moscow, Russia (August, 13 2007) -Quantum Pharmaceuticals announced today a collaboration with MOLECMO Nanobiotechnologies, a drug discovery company.
The collaboration is aimed at application of Quantum Pharmaceuticals' industry leading technology to anti-viral drug discovery project led by MOLECMO. The financial terms of the agreement were not disclosed.
About Quantum Pharmaceuticals.
Quantum Pharmaceuticals is a drug discovery company based in Moscow, Russia specializing in small molecule screening and design through the use of its proprietary technology platform.
About MOLECMO.
MOLECMO is an antiviral drug discovery company based in Cambridge, Massachusetts in the heart of the biotechnology circle, proximal to world class research institutes like Harvard and MIT, famed Boston Hospitals and leading Pharmaceutical firms.

Tuesday, May 15, 2007

QUANTUM free energy vs. statistical scoring functions

QUANTUM employs quantum mechanics, thermodynamics, and an advanced continuous water model for solvation effects to calculate ligands binding affinities. This approach differs dramatically from scoring functions that are commonly used for binding affinity predictions. By including the entropy and aqueous electrostatics contributions in to the calculations directly, QUANTUM algorithms produce much more accurate and robust values of binding free energies.
Interaction of a ligand with a protein is characterized by the value of binding free energy. The free energy (F) is the thermodynamic quantity, that is directly related to experimentally measurable value of inhibition constant (IC50) and depends on electrostatic, quantum, aqueous solvation forces, as well as on statistical properties of interacting molecules. There are two major contributing quantities leading to non-additivity in F: 1) the electrostatic and solvation energy, and 2) the entropy.
Most of popular scores employ a reasonable approximation for short-range quantum interactions, but do not perform a detailed calculation of aqueous electrostatics and entropy. Both the solvation energy and the entropy are difficult to compute: so instead of exact computations, scoring function use an approximation. In this approximation, the contributions of non-additve properties are estimated as fractions of easier-to-calculate pairwise interactions: electrostatics and van der Waals forces.
Such approximation works to a some extent because vacuum electrostatics is nearly canceled by solvation energies; at the same time the enthalpy of binding is approximately compensated by entropy. Therefore, the calculations of solvation energies and entropy seemingly can be avoided by combining molecular mechanics, van der Waals and electrostatic forces linearly with usually small numerical coefficients (of the order of 10%).
However, a potential energy surface given by such linear combinations of unrelated quantities with statistics-based coefficients is not necessarily related to the true interaction profile. That is why such a simple score fails frequently to reproduce unique binding modes and hence gives docking false negatives. In the same time, this approximation tends to overestimate the affinity of weak binders producing docking false positives. Thus, in spite of reasonable accuracy of such predictions, the selectivity of scoring function is low. This means that frequently scoring functions will not allow to identify really strong binder among the pool of similar weak binders. Moreover, affinities of weak binders may be overestimated.
QUANTUM software does not rely on approximate cancellations of important physical quantities. Instead, we employ our continuous water model to compute both the vacuum and aqueous electrostatic energies, use quantum mechanics to calculate the short range forces and thermodynamic sampling to obtain the value of the free energy (entropy). As the result we can not only observe the necessary subtractions of individual energy components, but also perform molecular modeling in a more realistic and physically justified potentials. Fig. 1 shows the results of a docking run on a single rigid protein structure for 220 different ligands. R.m.s. error in free energy is 2kJ/mol, the correlation coefficient is 0.7.
The selectivity improvement in our approach is illustrated by the following model calculation. First, we derived a simple linear model directly from our QUANTUM vacuum force field. The short range interactions where variationally adjusted (to allow for empirical hydrogen bonds). The van der Waals “scaling factors” and “protein dielectric constant” were found by correlating the suggested score with experimental binding affinities for a number of known complexes. Both the linear score and complete QUANTUM force field were tested using 300 protein-ligand pairs and showed comparable accuracy.
Fig. 2 shows docking funnels (the energy difference between the conformers plotted as a function of r.m.s. from the known crystallographic position of a ligand). Both full QUANTUM free energies (red lines) and scoring function (blue lines) were used to calculate, in the case of QUANTUM, or estimate, in the case of linear score, the free energies for numerous binding modes (conformers). The conformers where generated by the QUANTUM 3.3 docking program. The resulting energies were averaged over the conformers with similar r.m.s. values and plotted on the same graph.
The model calculation shows that the QUANTUM force field has a much steeper docking funnel, i.e. is more sensitive to misplacements of a ligand, than a scoring function. Therefore QUANTUM complete force field can be used to distinguish similar binding modes and hence obtain much more accurate docking positions. As a result, QUANTUM shows dramatically lower ratio of false positives and false negatives as compared with a scoring function based method.
In fact, non-additive interactions (especially solvation effects) play a key role in molecular recognition of small molecules by proteins. QUANTUM software is practically the only accurate and highly sensitive method available to a broad audience of researchers, which is capable to realistically model the intermolecular interactions.

Thursday, April 19, 2007

Docking selectivity: Additive vs. non-additive force field

The free binding energy (F) of a small molecule and a protein is a non-additive complex function of individual interatomic interactions. There are two major contributing quantities leading to non-additivity in F: the electrostatic energy, and the entropy.

A common approach to molecular docking is to develop a simple (generally additive) model of intermolecular interactions and train it to reproduce experimental values of binding energy for a range of known inhibitors. The obvious advantage is the calculations speed.

Any more sophisticated approach takes requires more computational resources and may hardly lead to an immense improvement in calculations accuracy. The question thus is: do non-additive force fields have any advantages in docking situtations?

To investigate the issue the following experiment was performed:
  1. A simple force Molecular Mechanics (MM) force field, containing a reasonable approximation for (distant-dependent media polarization) electrostatics, van-der-waals and hydrogen bonding, was developed.
  2. Same van-der-waals and hydrogen bonds were paired with vacuum electrostatics and a (non-additive) water model to simulate solvation effects.
Both models were transformed into simple linear regressions (to avoid sophisticated thermodynamic integration) and trained to reproduce the same amount of experimental values.

Although the two models show roughly the same level of accuracy in predicting the binding energies, the selectivity is drustically different. The figure above shows the results of the energy calculations for a set of few hundreds decoys. Both graphs represent the relative binding energy (counted from the minimum position) vs. the r.m.s. deviation of the calculated ligand positions from the known native position. The lines are the energy offset values averaged over the conformers (decoys) with similar r.m.s. positions.

The conclusion?
The solvation energy is the major source of non-additivity in ligand binding. Though being one of the most complicated quantities to be acounted properly in a docking run, the solvation energy is one of the major mechanisms of molecular recognition

Friday, January 19, 2007

What makes a correct binding energy calculation?

A correct calculation of a ligand binding affinity requires proper account of a few large contributions at the same time. The final value of the binding free energy is the sum of multiple sign-alternating entries: electrostatic (ES), solvation (ESolv), exchange-van der Waals (vdW) and entropic (TdS) term.

Common wisdom claims that ES and ESolv nearly compensate each other (up to a few h-bond energies), the remaining (normally large) vdW contribution is nearly canceled by TdS term. In fact, there is another important anti-correlation: strongly burried ligands with large vdW contributions are normally both entropycally and electrostatically constrained: The deeper a ligand gets into a protein (low dielectric constant medium) from water (high dielectric costant medium), the more the dessolvation energy (ES+ESolv) is, the larger the entropy losses normally are.


To clarify this issue, a simple calculation with 32 pdb structures with knows binding affinities was performed (1AU0, 1AU3, 1AU4, 1BR5, 1BR6, 1C83, 1C84, 1C85, 1C86, 1C87, 1EFY, 1EJN, 1JIJ, 1JIK, 1MS6, 1N3W, 1NAX, 1NLI, 1PWY, 1Q6K, 1RRI, 1RRW, 1RRY, 1RS2, 1TOW, 1TSM, 1URG, 1V2M, 2CBR, 2CBS, 3CBS, 4ERK). The binding affinities were taken from the PDBBind database. For every structure both the complex, the ligand and the protein was initially optimized in QUANTUM force field, then a thermodynamic integration was used to establish the free binding energies. The correlation between the calculated and measured (reported in the literature) values is shown on the left Figure 1. The free binding energies are reported in kJ/mol.


First we check the sanity of our solvation energy calculations. The following graph represents the anti-correlation between the electrostatic energy ES and the solvation contribution ESolv (both polar and non-polar, no additional dielectric constant for the protein was used since the protein is fully flexible). The two values nearly cancel each other, as it should be the case.
The only physically meaningful combination is the dessolvation punishment ESolv+ES, which is plotted against the vdW contribution (see Figure 3).


Remarkably, the dessolvation energy anti-correlates with the vdW term, which is a good measure of the contact surface. In fact, many scoring functions are pair sums of relatively short range potentials and hence correlate well with the contact surface. Non-additive desolvation energy does not correlate with the contact terms and (together with entropy losses) provides a limitation on binding energy for large and deeply burried ligands. TdS depends on the details of interactions in a smooth logarithmic way, which means that the dessolvation punishment is a leading effect.

The whole Molecular Mechanics (MM) Hamiltonian was used to generate MM trajectories for thermodynamic integration for each of the complexes. The results of the calculations are represented on the Figure 1. Root mean squared error of the calculations 6.1 kJ/mol. The dessolvation energy is a big contribution and removes (or better to say improves over) the correlation between the binding free energy and contact surface.
Conclusion 1: Binding free energy of a ligand is not a contact surface (or anything close to that approximable with a sum of pairwise short-range potential)
Conclusion 2: Desolvation energy is a major limiting factor restricting the binding energy of large ligands.

Tuesday, January 9, 2007

Molecular dynamics of HIV Integrase with an inhibitor

Below we provide an example of Molecular Dynamics of HIV-integrase monomer with 1- (5- CHLOROINDOL-3- YL)-3 -HYDROXY-3 -(2H- TETRAZOL- 5- YL)-PROPENONE (pdb code 1qs4).



The protein structure is taken from the original 1qs4 pdb data and combined with the missing loop data from 2itg structure. The complete calculation yuilds -27kJ/mol binding energy, close to the experimentally observed value.

The inhibitor molecule is shown in red licorice. The Mg++ ion is shown as a megenta sphere.