key: cord-0041988-0zrfaps7 authors: nan title: Abstracts of QSAR‐related Publications 3/2006 date: 2006-03-07 journal: QSAR Comb Sci DOI: 10.1002/qsar.200690010 sha: b50d5f01a76ab2642a42fccd40df46008a3a7cbc doc_id: 41988 cord_uid: 0zrfaps7 nan plexes were taken from the Protein Data Bank (pdb codes are given)]; Compound sets (two subsets of the COBRA v2.11 collection of pharmacologically active reference compounds used for SVM training: 188 thrombin and 94 COX-2 inhibitors, these subsets were used as a reference for ranking circa 2.7 million substances that are commercially available from different vendors). Data determined: IC 50 [concentration of the test substance (mM) required for 50% inhibition of COX-2 measured using ELISA technique an celecoxib, diclofenac, and rofenoxib as positive controls]. Computational methods: Molecular modeling [each compound was represented by a 3PP fingerprint using the fingerprint generator in the software suite MOE v2004.05, an individual 3PP feature was a triangle, all possible triangles with their vertexes located at the atom centers of a molecule were considered, presence or absence of a certain triangle defined the "on" (i.e., bit is set) or "off" state of the corresponding bit in the fingerprint, the vertex could be either donor (D), acceptor (A), polar (P), donor and planar (D)), acceptor and planar (A)), hydrophobic (H), and hydrophobic and planar (H)) as defined by the rule-based atom-typer implemented in MOE which follows the PAT-TY atom-type definition, SVM was used to constructs a surface in the n-dimensional space that separated active from inactive compounds, where 3PPs were employed to describe a molecule]; SVM [Support Vector Machine, a novel learning machine, based on the idea of mapping the data into a higher-dimensional feature space via a non-linear mapping and then to perform linear regression in this space, where e is a constant parameter depending on the noise of present in the data, and g is a constant parameter of the kernel controlling the amplitude of the Gaussian function, the SVM constructs a surface in the n dimensional space that separates active from inactive compounds]. Results: SVMs were trained to predict COX-2 and thrombin inhibitors. Trained classifiers were obtained using sets of known COX-2 and thrombin inhibitors as "positive examples" and a large collection of screening compounds as "negative examples". The ligands were encoded by topological pharmacophore-point triangles. In a retrospective virtual screening, 50 -90% of the known active inhibitors were listed within the first 0.1% portion of the ranked database. Validation of the constructed classifiers was performed by developing a method for feature extraction and visualization using SVM, where potential pharmacophore points were weighted according to their importance for COX-2 and thrombin inhibition. Known thrombin and COX-2 pharmacophore points were correctly recognized by the machine learning system. The extracted potential pharmacophore patterns coincided with known binding models of thrombin and COX-2 inhibitors. Fig. 1 shows the potential pharmacophore points reflecting their relative contribution (weight) to the SVM classifier. In a virtual screening procedure, the number of potential COX-2 inhibitors were predicted and tested. A benzimidazole derivative exhibited significant inhibitory activity with an IC 50 of 0.2mM, which outperformed Celecoxib in the assay. The study demonstrated that the SVM machine-learning method can be used in virtual screening and the results can be interpreted yielding a set of rules for designing novel molecules. The proposed method complements the suite of modeling techniques that have been employed for designing selective COX-2 inhibitors previously. The SVM method offers particular advantages over other machine-learning approaches: (i) the SVM class/nonclass boundary is constructed as the maximum margin classifier, i.e., it does not represent an arbitrary solution; (ii) it relies only on the so-called "support-vectors", i.e., those molecules that define the classifier function. (B. B.) Title: POEM. Parameter optimization using ensemble methods. Application to target specific scoring functions. Authors: Antes*, I.; Merkwirth, C.; Lengauer, T. a dificult task, depending on the complexity of the model. In this paper a novel parameter optimization method, POEM, is proposed for this task, which employs ensemble methods. The novel method was applied to the optimization of target specific scoring functions in molecular docking. As test applications the FlexX and Screenscore scoring functions were fitted to the kinase and ATPase protein classes. Starting from random parameters promising results were obtained. Fig. 1 shows the relative mean square errors for five model types and four data sets (data set and scoring function, number of data points, number of complexes, linear, quadratic, k-NN, ANN, ANN ensemble): ATP.ssc, 256, 16, 0.75, 0.36, 0.35, 0.31, 0.23; ATP.flexx, 256, 16, 0.85, 0, 46 0, 54 0, 56 0, 33; Kin.scc, 366, 15, 0, 75 The parameter sets generated showed superior performance within the scoring functions functional limits compared to the original values. The POEM procedure converged rapidly, in the present case within 200 -300 cycles, and led to robust minima of the loss function/parameter landscape, thus rendering the approach appropriate for optimizations of rugged landscapes avoiding of getting stuck in local minima. (B. B.) Protein Data Bank (pdb codes are given) with K i binding constant values; 122 complexes with < 2.5 resolution were retained]. Computational methods: Molecular modeling [missing residues and side chains of the protein-ligand complexes that were not part of the active site were added and crystallographic waters were retained, hydrogens were added using either InsightII or Cerius2, in order to remove any unfavorable steric clashes and to optimize hydrogen-bonding patterns, the hydrogen atoms were subsequently minimized utilizing either the CFF force field parameter set or the DREIDING force field parameters in conjunction with Gasteiger charges, two coordinate sets of protein-ligand complexes were obtained (one for each force field) that were then used for training the LigScore1 scoring function, two more coordinate sets for each protein-ligand complex were derived, against which the LigScore2 scoring function was trained, the LigScore scoring functions have been developed by employing the genetic function approximation (theory is given), the van der Waals energy of protein-ligand interactions were computed via a Lennard-Jones 9 -6 potential using van der Waals radii and energy parameters of either the CFF or the DREID-ING force field]; GA (Genetic Algorithm). Data calculated: Descriptors [descriptors in Eq. 1: C pos_tot is the total surface area of the ligand involved in attractive polar interactions with the protein, and TotPol 2 is equal to Cpos_tot 2 þ C neg_tot 2 , where C neg_tot is the total surface area of the ligand involved in repulsive polar interactions with the protein, and hi indicates the summation of all interactions between the ligand and protein atoms; descriptors in Eq. 2: [root mean square deviation () of the position of the corresponding atoms of two superimposed molecular structures]. Results: Two novel empirical scoring functions, LigScore1 and LigScore2, has been developed to accurately predict the binding affinity between ligand molecules and their protein receptors. The LigScore functions consist of three distinct terms that describe the van der Waals interaction, the polar attraction between the ligand and protein, and the desolvation penalty attributed to the binding of the polar ligand atoms to the protein and vice versa. The functional form of LigScore1 is simple and contains only three descriptors (Eq. 1). Utilizing a regression approach on a data set of 118 protein-ligand complexes we have obtained a linear equation, LigScore2, using these three descriptors (Eq. 2). LigScore2 has good predictability with regard to experimental pK i values yielding a correlation coefficient, r 2 , of 0.75 and a standard deviation of 1.04 over the training data set, which consists of a diverse set of proteins that span more than seven protein families. Fig. 1 shows the experimental pK i against computed LigScore1/LigScore2 for 122 protein complexes with energy-minimized ligand poses. The dashed lines indicate the 2.5 pKi cutoff used to determine the outliers in the model. For the entire 122 data points, the r 2 values for LigScore1 (diamonds) and LigScore2 (squares) are 0.60 and 0.65, respectively and the corresponding s values are, 1.50 and 1.29 log units, respectively. It was concluded that the new empirical scoring functions shows high predictive accuracy of ligand-receptor-binding affinities over a wide range of protein classes as well as pKi values, while consisting of only three physicochemical descriptors that can be readily related to fundamental molecular interactions. (B. B.) Title: COSMOfrag. A novel tool for high-throughput ADME property prediction and similarity screening based on quantum chemistry. [Continuum Solvation Model for Real Solvents method applied for the accurate prediction of thermodynamic, environmental, or physiological properties, the molecular information is gathered in the so-called s profiles (e/ 2 ), the COSMO-RS calculations are restricted to small-to medium sized compound sets, because of the CPU time demand of the calculations (theory is given)]; COSMOfrag [a faster version of the COSMO-RS algorithm, because it replaces the costly quantum chemistry step with a selection of suitable fragments from a database of 40 000 DFT/COSMO precalculated molecules, the COSMOfrag approach replaces the single s profile with a composition of partial s profiles, selected by the use of extensive similarity searching algorithms (theory is given)]. Data calculated: Properties (the following properties were predicted using the COSMOfrag methodology: water solubility, soil sorption, logP, intestinal absorption). Chemical descriptors: logP (logarithm of the partition coefficient in 1-octanol/water). Results: A novel tool for high-throughput ADME property prediction and similarity screening based on quantum chemistry, COSMOfrag, is presented. The COSMO-RS method is capable of the accurate prediction of thermodynamic, environmental, or physiological properties. The modified COSMOfrag method utilizes a selection of suitable fragments from a database of, presently, 40 000 DFT/COSMO precalculated molecules and replaces the single s profile with a composition of partial s profiles, selected by the use of extensive similarity searching algorithms. The enhanced performance of COSMOfrag has been demonstrated using five representative datasets. Performance statistics were the following [dataset, n, property, CPU time (s), average CPU time per compound (s)]: pesticides, 107, water solubility, 70, 0.63; pesticides, 53, soil sorption, 40, 0.75; BOSS, 150, water solubility, 150, 1.0; PHYSPROP, 2570 PHYSPROP, , logP, 1750 Abraham, 170, intestinal absorption, 90, 0.53 . The loss of accuracy by using COSMOfrag versus full COSMO-RS calculations has been found to be small (in the range of 0.05 log units). The increased speed of COSMOfrag makes viable COSMO-RS property calculations for more than 100 000 compounds per day on a standard PC CPU allowing high-throughput tasks, especially in life sciences. (B. B.) Title: Autocorrelation of molecular electrostatic potential surface properties combined with partial least squares analysis as new strategy for the prediction of the activity of human A 3 adenosine receptor antagonists. Authors [the 3D models of all 358 A 3 antagonists were obtained by using the 3D structure generator Corina, implemented in the ADRIANA QSAR program suite, conformer generation and best conformer selection have been carried out using standard parameters of Corina, the autocorrelation MEP/PLS model was derived using the autocorrelation vectors as molecular descriptors presented as an intrinsic descriptor of the distribution of an atomic property along the molecular graph, each component of the autocorrelation vector is calculated using the formula A(d) ¼ S ij p i p j , where A is the autocorrelation coefficient referring to atom pairs ij, p i is the atomic property, and d is the i -j topological distance, thus a new 3D descriptor has been introduced that was based on the autocorrelation of properties at distinct points on the molecular surface]; PLS (Partial Least Squares projections to latent structures analysis was performed using the Unscrambler statistical software). [n (number of molecules); N opt (number of significant PLS components); SEC, RMSEC (standard error of calibration, and root mean square error of calibration, respectively); SEP bs , RMSEP bs (standard error of prediction after bootstrapping, and root mean square error of prediction after bootstrapping, respectively); bias (systemic difference between predicted and observed values); r 2 cv (cross-validated correlation coefficient); r pred (predictive correlation coefficient)]. Results: An approach of combining MEP surface properties (autocorrelation vectors) with the conventional PLS analysis has been employed for the prediction of the human A 3 receptor antagonist activities using 358 structurally diverse human A 3 receptor antagonists representing the largest mo-lecular collection used to generate a general human A 3 antagonist structure-activity relationship model. A robust quantitative model has been obtained as described by the following statistical parameters (n, N opt , r, r cv , r bs , slope, offset, SEC, RMSEC, SEP bs , RMSEP bs , bias): 358, 6, 0.82, 0.81, 0.68, À 0.61, 0.77, 0.76, 0.79, 0.79, 5 .37 e À 8 . Fig. 1 shows the predicted versus experimental pK i values obtained using the combined MEP/PLS approach. The model developed was successfully validated using a test set of 40 compounds (r pred ¼ 0.82, slope ¼ 0.57, offset ¼ À 0.92, SEP ¼ 0.75, RMSEP ¼ 0.76, bias ¼ 0.17). It was suggested that the proposed MEP/PLS approach can be considered as an alternative hit identification tool in virtual screening applications. (B. B.) Title: G-protein-coupled receptor affinity prediction based on the use of a profiling dataset. QSAR design, synthesis, and experimental validation. Authors [Predictive Neighborhood Behavior modeling based on the "neighborhood behavior" principle (similar structures! similar properties), and PNB models tend to extrapolate the property of a novel compound as an average of properties of reference molecules that are shown to be structurally similar, according to a well-defined computed similarity score, in addition to the calculated property, the model also returns, for each compound, two confidence thresholds controlling the relative trust in the PNB model prediction: a "density criterion" expressing how well the current compound is surrounded by relevant neighbors, as a function of their dissimilarity to the candidate compound, and a "homogeneity criterion" measuring the (weighed) variance of the property within the set of selected neighbors]; Synergy model [the calibration of a synergy model consists of finding, with regard to the confidence indices of the PNB prediction, an optimal balance of weights for the linear vs the PNB prediction, such that the returned "synergy" estimations (weighted averages of the two independent linear and PNB predictions, respectively) are as close as possible to the experimental values]; QSAR modeling (the final models are "synergy" approaches, combining the predictions of two conceptually independent parent approaches, based on an equation and respectively on neighborhood behavior, to return a more accurate prediction as a weighted average of the estimates provided by each parent). Results: GPCR affinity prediction was based on the use of a profiling dataset. A QSAR model accounting for "average" GPCR binding was built from a large set of experimental standardized binding data of 1939 compounds tested over 40 different GPCRs, and applied to the design of a library of "GPCR-predicted" compounds. The model was able to capture critical structural features for GPCR binding. 360 of these compounds were randomly selected and tested in 21 GPCR binding assays. Positive compounds were defined by their ability to inhibit the binding of reference compounds at 10mM by more than 70%. A 5.5-fold enrichment in positives was achieved when comparing the "GPCR-predicted" compounds with 600 randomly selected compounds predicted as "non-GPCR" from a general compound set. The model was capable of predicting the strongest binders, since enrichment was greater for higher cutoffs. Significant enrichment was also obtained for peptidic GPCRs and receptors not used to develop the QSAR model. Fig. 1 shows the experimental (ACT exp ) versus predicted (ACT pred ) values for the GPCR average activity of the validation set, where full circles mark the position of the synergy model predictions: when both linear and PNB predictions are available, the corresponding marker is spiked. By contrast, hollow circles mark the predicted values for the molecules failing to be predicted by the PNB approach (the case when the synergy predictions equal the linear model-based prediction). For cases, both linear and PNB values exist, the dotted bars span the range between the predicted values of the linear (plus sign) and PNB models (x sign), respectively. The prediction ability of the model was evaluated by synthesizing and testing a collection of GPCR-predicted compounds on a large series of GPCRs from different families. It was suggested that the model is useful for the design of ligand binding with newly identified GPCRs, including orphan ones. The approach should allow identification of new privileged scaffolds for each of these families and understanding how specific structural modulations provide pharmacological selectivity and specificity. (B. B.) Title: QSAR analyses of 3-(4-benzylpiperidin-1-yl)-N-phenylpropylamine derivatives as potent CCR5 antagonists. Authors Results: CCR5 receptor binding affinity data of a set congeners of type I was subjected to QSAR study using the Hansch analysis. Hansch. Indicator variables encoding different group contributions and different physicochemical variables such as p, s, MR and STERIMOL parameters of phenyl ring substituents of the compounds were used as predictor variables for the binding affinity. 3D-QSAR analyses of the same data set using MSA, RSA, and MFA techniques were also performed. The best model with reasonable statistical quality was derived from the MSA (Eq. 1) (method, R 2 , F, q 2 (LOO), PRESS, R 2 pred ): PCA, 0.697, 17.5, 0.585, 28.5, 0.592; PCRA, 0.951, 146.6, 0.898, 7.0, MSA, 0.722, 21.9, 0.650, 22.4, 0.774; RSA, 0.613, 0.582, 19.4, 0.535, 29. MSA revealed the importance of the RNCG, i.e., substituents with a high RNCG value have more binding affinity than the unsubstituted piperidine and phenyl (R 1 position) congeners. The RNCS was found to be detrimental (e.g., R 2 ¼ 3,4-Cl 2 ) for the activity. An increase in the length of the molecule in the Z dimension (Lz) is useful (e.g., R 3 ¼ sulfonylmorpholino), while an increase in the area of the molecular shadow in the XZ plane (Sxz) was detrimental (e.g., R 1 ¼ N-c-hexylmethyl-5-oxopyrrolidin-3-yl) for the binding affinity. The presence of a chiral center made the molecule less active (e.g., R 1 ¼ N-methyl-5-oxopyrrolidin-3-yl). An increase in the van der Waals area, the molecular volume, and the difference between the volume of the individual molecule and the shape reference compound enhanced (e.g., R 3 ¼ (CH 3 ) 2 NSO 2 -) the binding affinity. Substituents with higher JursFPSA_2 values (fractional charged partial surface area) like the N-methylsulfonylpiperidin-4-yl group in R 1 position have better binding affinity than the substituents such as 4-chlorophenylamino in the R 1 position. Unsubstituted piperidines (R 1 position) with less JursFNSA_1 values have lower binding affinity than the 4-chlorophenyl substituted compounds. The MFA derived equation shows interaction energies at different grid points, while the RSA model shows the importance of hydrophobicity and charge at different regions of the molecules. The models were validated through the leave-one-out, leave-15%-out, and leave-25%-out cross-validation techniques. The devel-oped models were also subjected to a randomization test (99% confidence level (17), 5600 -5603. Compounds: 20 Compounds of type I (CSABs), where R ¼ H, 3,4-di-Me, 2,4-di-Me, 2,5-di-Me, 4-Me, 4-Et, 4-i-Pr, 4n-Bu, 4-tert-Bu, 4-n-dodecyl, 2-Cl-4-Me, 4-F, 4-Cl, 4-Br, 2,3,4tri-MeO, 2,5-di-i-Pr, 2,4-di-i-Pr, 2,5-di-tert-Bu, 2,4,6-tri-Et, 2,4,6-tri-i-Pr. Data determined: IC 50 [concentration of the test substance (mmol/L) required for 50% decrease of the target cell survival]. Computational methods: BCUT (the BCUT metrics of Pearlman are whole molecule atom nonadjacent atoms, 29 standard 3D H-suppressed BCUT and a description of the nominal bond-type for adjacent andmetrics). Data calculated: logP (logarithm of the partition coefficient in 1-octanol/water was calculated employing Crippens fragmentation method); BEHv5 (BCUT descriptor defined as the highest eigenvalue n.5 of Burden matrix/weighted by atomic van der Waals volumes calculated using the Dragon v3.0 program). Results: A set of 20 alkyl-, halo-, and methoxy-aryl-substituted 2-[(carboxymethyl) sulfanyl]-4-oxo-4-arylbutanoic acids were synthesized and biologically evaluated. The new compounds, called CSAB, inhibited proliferation of human cervix carcinoma HeLa cells, in vitro in a concentration range of 0.644 to 29.48mM/L. Two compounds displayed antiproliferative activity at submicromolar concentrations, and five compounds were active at low micromolar concentrations (< 2mM/L). The most active compounds in the set exerted lower cytotoxicity toward healthy human peripheral blood mononuclear cells (PBMC and PBMC þ PHA) (selectivity indexes > 10). The following structure-activity relationships were obtained using estimated logP values and BCUT descriptors, was observed. It was suggested that the results of the study provide the basis for the design of new congeners with higher activity. (B. B.) Title: Effect of selection of molecular descriptors on the prediction of blood-brain barrier penetrating and nonpenetrating agents by statistical learning methods. Authors (415 compounds with known BB ratios); (ratio of the steady-state concentrations of a drug in the brain and blood taken from Micromedex, American Hospital Formulary Service, and literature sources). Computational methods: Molecular modeling (the 2D structure of each of the compounds was generated using ChemDraw and DS View-erPro v5.0, which were converted into 3D structures using CONCORD and fully optimized using the semiempirical AM1 method); Statistical learning methods (Bayesian neural network, BNN; principal component analysis, PCA; neural network, NN; linear regression, LR; linear discriminate analysis, LDA; C4.5 decision tree, C4.5 DT, probabilistic neural network, PNN; k-nearest neighbors, k-NN; support vector machine, SVM); RFE (Recursive Feature Elimination); SVM [Support Vector Machine, a learning machine, based on the idea of mapping the data into a 3D feature space via a non-linear mapping and then to perform linear regression in this space, where e is a constant parameter depending on the noise of present in the data, and g is a constant parameter of the kernel controlling the amplitude of the Gaussian function]. Data calculated: Descriptors (R ugty molecular rugosity geometrical; S hpl hydrophilic region geometrical; S hpb hydrophobic region geometrical; C apty capacity factor geometrical; H iwpb , hydrophobic intery moment; H iwpa , amphiphilic moment; dis1, dis2, dis3, length vectors; S anc sum of solvent accessible surface areas of negatively charged atoms; 5 c v CH , valence molecular connectivity c index for cycle of five atoms; 6c v CH , valence molecular connectivity c index for cycle of six atoms; e a , hydrogen bond donor acidity (covalent hbda); m, molecular dipole moment; m cp , chemical potential; c en , electronegativity index; Q H,Max , most positive charge on H atoms; Q O,Max , most positive charge on O atoms; Q H,Min , most negative charge on H atoms; S hal , sum of estate indices of halogen atoms; S(i), atom type E-state indices of diverse atoms]; S i [E-state index for atom i, a structure representation based on atom level topological and electronic information, defined as /r ij 2 sum over all atoms i=j, where r ij is the distance between atoms counted as the graph distance (d ij ) plus one]. Results: BBB-nonpenetration is desirable for non-CNStargeting drugs to minimize potential CNS-related side effects. In this study the effect of selection of molecular descriptors on the prediction of blood-brain barrier-penetrating (BBB þ ) and -nonpenetrating (BBB À ) agents have been modeled by six statistical learning methods yielding outstanding accuracies of 75 -92% and 60 -80%, respectively. The majority of these predictions gave a substantially lower BBB À accuracy, and thus overall accuracy, than the BBB þ accuracy. Of the six statistical learning methods tested, the SVM appeared to give a slightly higher prediction accuracy than the other methods for both BBB þ and BBB À agents. It was found that the effect of proper selection of molecular descriptors can improve both the BBB À and the overall accuracies of statistical learning methods. Molecular descriptors were selected by using RFE. The method was tested using 415 BBB þ and BBB À agents demonstrating that RFE substantially improved both the BBB-and the overall accuracy for all of the methods studied. Differences in the values of descriptors important for distinguishing between BBB-Penetrating (BBB þ ) and -Nonpenetrating (BBB À ) agents were the following (descriptor, average BBB þ value, average BBB À value): m, 1.89(AE 1.90), 2.92(AE 3.48); e a , 0.01(AE 0.87), À 0.34(AE 1.36); S hpb , 2.44(AE 2.85), 2.17(AE 3.96); R ugty , 1.18(AE 2.31), 1.54(AE 2.84); dis1, À 4.52(AE 0.68), À 4.37(AE 1.01). The results of the study demonstrated that statistical learning methods combined with proper feature selection is potentially useful for a more balanced and improved prediction of the BBB penetrating properties of the BBB þ and BBB À agents. It was suggested that the prediction accuracy of statistical learning methods may be further improved by consideration of factors such as hydrogen bonding, active transport, and the relationship with pharmacodynamic properties. . Results: A model for prediction of%Abs of neutral molecules has been developed based upon surface charges of the molecule calculated by DFT. The surface charges were decomposed into s moments which are correlated to a partition coefficient representing transfer of the molecule between water and the epithelial membrane. A predictive model was built and tested using a data set of 241 drugs yielding RMSE ¼ 13% on a training set of 38 compounds as well as on a test set of 107 drugs with high quality experimental %Abs data. Fig. 1 shows the plot of the experimental versus calculated, respectively,%Abs values calculated by COSMO (training set denoted by open squares: n ¼ 38, RMSE ¼ 12.5; high quality test set denoted by closed squares: n ¼ 107, RMSE ¼ 12.8). The predictive quality of COSMO-%Abs was comparable to previous models. Property maps of the molecule, visualizing the atoms that enhance or hinder absorption, were produced to aid drug design. (5), 1343 -1351. Compounds: 30 HLA-DR4 peptidomimetic inhibitors, e.g., Ac-(Cha)RAMASL-NH2 (Inh.1), Ac-(L-Nba)RA-MASL-NH2, or Ac-(L-m-F-Phe)RAMASL-NH2. Biological material: a) HLA-DR4 receptor, a class II MHC molecule b) MHC class II peptides, cell surface proteins, that perform an essential function in immunological detection using cells. Data taken from the literature: IC 50 [concentration of the peptidomimetic inhibitor test substance required for 50% inhibition of HLA-DR4 (details not given)]; MRP [Mean Relative Potency of a peptidomimetic inhibitor expressed as a ratio of IC 50 against that of Ac-(Cha)RAMASL-NH2]; Crystal (atomic coordinates of the compounds were determined by X-ray diffraction techniques). Computational methods: Molecular modeling (the ligand-receptor interaction between some peptidomimetic inhibitors and a class II MHC peptide presenting molecule, the HLA-DR4 receptor, was modeled using 3D-QSAR methods such as the CoMFA, CoMSIA, and the Catalyst program); CoMSIA [Comparative Molecular Similarity Indices Analysis of the molecules was carried out as an alternative approach to CoMFA based on similarity indices calculated at the intersections of a three dimensional lattice, the five physicochemical properties for CoMSIA (steric, electrostatic, hydrophobic, and hydrogen bond donor and acceptor) were evaluated using a common probe atom with 1 radius, þ 1.0 charge]; Catalyst [computer program for automatic generation of pharmacophore models for a training set of molecules specifying the relative alignments and active conformations of the ligands consistent with the binding to a common receptor site, the pharmacophore model (hypothesis) consists of a collection of features necessary for the biological activity of the ligands arranged in 3D space]; GOLD [flexible protein-ligand docking program featuring a (i) genetic algorithm methodology for protein docking; (ii) full ligand and partial protein flexibility; (iii) energy functions partly based on conformation and non-bonded contact information from the Cambridge Structural Database (CSD), scoring was performed using the Xscore program]. GLIDE (docking program comparing the distances from a grid point to the receptor surface to distances from the ligand center to the ligand surface, the resultant refined poses are kept, and then minimized with a distance-dependent dielectric constant, and the conjugate gradient algorithm, the final poses are scored with Glide-Score with an inclusion of an energy score). Data calculated: r s {Spearmans rank correlation coefficient r s ¼ 1 À [6 · S i d 2 i /n(n 2 À 1)], where d i is the difference between two ranks at the point i and n is the total number of points}; RMSD [root mean square deviation () of the position of the corresponding atoms of two superimposed molecular structures]. Results: Ligand-receptor interactions for some MHC Class II HLA-DR4 peptide mimetic inhibitors were modeled modeling using several molecular docking and 3D QSAR techniques. The conformations of the peptidomimetic inhibitors investigated were defined by docking them into the known structure of HLA-DR4 receptor employing the GOLD, GLIDE Rigidly, GLIDE Flexible, and Xscore programs. The goodness of a docking result for docking a series of peptidomimetic inhibitors into the HLA-DR4 receptor was judged by comparing the Spearmans rank correlation coefficient computed between each docking result and the activity data taken from the literature. The best CoMFA and CoMSIA models were obtained by using the aligned structures of the best docking result. The structural features selected by a stepwise CoMSIA were transferred into the Catalyst program for an automatic generation of conformations and constructing pharmacophore models for the peptidomimetic inhibitors studied. Most inhibitors were accurately predicted by the best pharmacophore model, the Catalyst Hypo1 hypothesis. The Hypo1 hypothesis was mapped onto the corresponding structures of the inhibitors. Fig. 1 shows the mapping of the Hypo1 hypothesis onto the structure of the peptidomimetic inhibitor Ac-(Cha)OMASL-NH2, where the pharmacophore features indicated are: (A) for hydrophobic, (B) for hydrogen-bond donor, and (C) for positive ionizable feature. CoMSIA investigated all the single or possible combinations of structural features involved in the binding process and identified only two to be statistically important for pharmacophore construction. A q 2 value of 0.71 was obtained for structures aligned by the GOLD docking while that obtained for structures aligned by some correspondence points was only 0.57 for the same H þ S field indexes employed in the CoMSIA studies. The results indicated that the field alignment methods utilized by GOLD are superior to the point alignment method used by the CoMSIA model. The only binding pocket that remained unaccounted by the best CoM-SIA model was p6 which prefers binding with some hydroxyalkyl groups as defined by the M13 phage display libraries by others. This feature is included as D or a hydrogen-bond donor in the pharmacophore construction process by the Catalyst program. Results: The SARS coronavirus 3C-like proteinase is an important drug design target for the development of drugs against severe acute respiratory syndrome (SARS). In this study a "flexible" 3D model has been built for the proteinase by homology modeling and multicanonical MD method and used for virtual screening of molecule libraries. The test compounds were docked into the protease homology model, followed by the employment of a pharmocophore model, consensus scoring, and "drug-like" filters in order to improve the hit rate and the success rate of the virtual docking hit list. Refinement of the enzyme with MD and using a "flexible model" for docking screening were found to be important aspects of the design. Fig. 1 shows the binding mode of C3930 with SARS coronavirus 3C-like proteinase. 40 Compounds were purchased and their affinities were tested against the proteinase. Three compounds, including calmidazolium, were found to inhibit the enzyme with an apparent K i from 61 to 178mM. Fig. 2 shows the plot of the docking/score energies using homology model structure and SARS coronavirus 3C-like proteinase crystal structures. [all calculations and molecular modeling of Mtb-MAT were carried out on a Silicon Graphics O2 R4400 using the INSIGHTII molecular modeling package, the HOMOLOGY program was used for the comparative protein modeling, the PSI-BLAST algorithm was used to identify homologous structures for the Mtb-MAT sequence by searching the structural database of protein sequences in the PDB, the MAT crystal structures of E. coli and rat from the PDB were selected as template structures for homology modeling of the Mtb-MAT enzyme, sequence alignment, loop and end region modeling, as well as side chain and splice repair was followed by simulated annealing and restrained energy minimizations of the Mtb-MAT model, the homodimer of Mtb-MAT was generated with the Pdbset module of CCP4 (Windows V5.0 using the Biomt transformations, the hydrogens were added at pH 7.4, the homodimer was analyzed with the ProStat module, the accuracy and validity of the model was tested with Profiles-3D and graphically portrayed the properly folded and misfolded region(s) in the protein structure by performing an Eisenberg analysis of the model, the homology model of human MAT (hMAT) was generated using the same protocol as described for Mtb-MAT using only the rat MAT (1QM4) crystal structure as the reference protein]. Data calculated: MEP [Molecular Electrostatic Potential (kcal/mol) was calculated using MOLCAD implemented in Sybyl v6.7]. Results: MAT is an important target for antituberculosis drug discovery. In this study a homology model of MAT has been constructed using the X-ray structures of E. coli MAT and rat MAT as templates. The resulting model showed correct stereochemistry as assessed by the Profiles-3D score (Ramachandran plot and good 3D structure compatibility). All structurally and functionally important residues in the active site of Mtb-MAT have been identified that are also present in the E. coli and rat MAT crystal structures and the reported point mutation data. The homology model conserved well the crucial topological and active site features of the MAT family of proteins. There are some differences in the active sites of Mtb and human MAT, which can be exploited to design specific and selective inhibitors of Mtb-MAT. Fig. 1 shows the active site residues of Mtb-MAT (bold) and hMAT after the superposition onto the rMAT crystal structure (1QM4). The residues of rMAT have not been shown as it shares 100% sequence identity to human MAT, in the active sites. Only the inhibitor L-cis-AMB from the 1QM4 X-ray structure has been shown to indicate its relative position in the Mtb and hMAT active sites. These differences appear when one compares the molecular electrostatic potential of mycobacterial and human MAT. The differences in the MEP maps of Mtb and human MAT indicated that selective and specific Mtb-MAT inhibitors can be designed using the homology model. Compounds: Top 50 compounds identified and ranked by the in silico screening: leukotriene F4, chenodeoxycholic acid, adenosine-4P, solanidine, leukotriene E4, ITP, leukotriene D4, prostaglandin E3, docosahexaenoic acid, geranylgeranyl diphosphate, eicosapentanoic acid, cholesterol, XTP, leukotriene A4, ergosterol, heroin, 2MeSADP, ecdysone, cholic acid, arachidonic acid, deoxycholic acid, TRH, XDP, icosanoic acid, retinol, IDP, leukotriene B4, ajmaline, ATP c, morphine, prostaglandin A1, S-adenosylmethionin-amine, ADPglu-ADP-2P, stearic acid, GTP, ADP, codeine, UDP-glucose, ATP, UDP, elaidic acid, CTP, PRPP, cortisol, UTP, IDP, cinochonine, IMP, CDP, A3P5PS. Biological material: a) Human P2Y 12 and P2Y 1 receptors members of the G-protein coupled class of receptor family; b) CHO cells stably expressing P2Y 12 -G 16 a fusion proteins. Data taken from the literature: Database Crystal structure (in house database of 500 animal metabolites); (atomic coordinates of the bovine rhodopsin were determined by X-ray diffraction techniques). [in silico screening was performed using Auto-Dock v3.0, the top 10 configurations according to dG binding energy were energy minimized using the CHARMm force field employing the InsightII/CHARMm programs, a total of more than 500 compounds in the in house database were processed by this method, the 3D structure of the P2Y 12 receptor was constructed on the basis of the coordinates and the conformation of the 3D model of the P2Y 1 receptor by both Fourier transform analysis and homology modeling using the bovine rhodopsin structure as a template, the transmembrane regions (TM) of the P2Y 12 receptor were determined by alignment with those of the P2Y receptors and bovine rhodopsin, additionally, high temperature molecular dynamics (MD) simulation was performed for energy minimization using the InsightII/CHARMm parameter]; AUTO-DOCK (automated docking program for finding potential docking sites of whole ligands with userspecified dihedral angle flexibility on rigid protein binding sites of known structure by using a Metropolis Monte Carlo algorithm of simulat-ed annealing for positional and conformational searching in combination with a rapid energy evaluation through precalculated grids of molecular affinity potentials allowing the inclusion of van der Waals, electrostatic, and hydrogen bonding interactions); Phylogenetic analysis [evolutionary relationship was analyzed based on the similarity of the primary sequence in the alignment of proteins, the alignments were calculated by ClustalX, and uncertain portions, such as gaps, were deleted using BioEdit]. Data calculated: dG [AutoDock binding energy (kcal/mol)]. Results: Endogenous ligands acting on a human P2Y12 receptor, were searched by in silico screening using AutoDock against an in house molecule library containing more than 500 animal metabolites. Fig. 1 shows the 3D model of PRPP ligand-bound human P2Y 12 receptor. The side chains of the important residues in proximity to the docked PRPP are highlighted and labeled. In addition to the known P2Y 12 ligands, such as 2MeSADP and ADP, the screening yielded a selection of cysteinylleukotrienes (CysLTs) and PRPP, with high free energy changes. These ligands were subjected to an in vitro Ca 2 þ assay using the CHO cells stably expressing P2Y 12 -G 16 a fusion proteins. It was found that CysLTE4 and PRPP behaved at the P2Y 12 receptor as agonists with the EC 50 values of 1.3 and 7.8 nM, respectively. Moreover, the phylogenetic relationship of the P2Y, P2Y-like, and CysLT receptors was analyzed based on sequence alignment followed by evolutionary analyses. The phylogenetic analyses revealed that the P2Y 12 , P2Y 13 , P2Y 14 , GPR87, CysLT-1, and CysLT-2 receptors formed a P2Y-related receptor subfamily with common sequence motifs in the transmembrane regions. Data taken from the literature: IC 50 [concentration of the test substance (mM) required for 50% inhibition of strand transfer]; Data set (33 HIV IN inhibitory DKAs with associated IC 50 values). Computational methods: Molecular modeling (the algorithm HypoGen implemented in the Catalyst package was used to derive automated SAR pharmacophore hypoteses from a data set of b diketo-derivatives); Catalyst [computer program for automatic generation of pharmacophore models for a training set of molecules specifying the relative alignments and active conformations of the ligands consistent with the binding to a common receptor site, the pharmacophore model (hypothesis) consists of a collection of features necessary for the biological activity of the ligands arranged in 3D space, the common ones being hydrogen bond acceptor, hydrogen bond donor, and hydrophobic features, the Catalyst/ HypoGen module takes activity data into account and uses active and both inactive compounds in an attempt to identify hypotheses that are common among the active compounds but not among the inactives]. Results: Pharmacophore-based design of HIV IN inhibitors was performed employing the Catalyst/HypoGen module using a set of 33 b-diketo-acid HIV IN inhibitors (17 training set and 16 test set compounds) reported in the literature. The derived models showed quantitative predictive ability in terms of activity. The best statistical hypothesis consisted of four features (one hydrophobic aromatic region, two hydrogen-bond acceptors, and one hydrogen-bond donor) with correlation coefficient of 0.96. Fig. 1 shows the top scoring Hypo-Gen pharmacophore Hypo1 mapped onto the most active compound (L-870,810) in the training set, where HBD, HBA, and HyAr denote hydrogen bond donor, hydrogen bond acceptor, and hydrophobic aromatic region features of the pharmacophore. [the 3D structures of buxamine-B and -C were constructed using the Sybyl program, energy minimization was carried out using the Tripos force field, the docking studies were carried out using FlexX docking software, Buxamine-B and -C models were docked in the aromatic gorge of AChE (PDB: 1ACL), the docking results were analyzed by LIGPLOT and WebLab ViewerPro]; FlexX (program for automatic protein-ligand docking based on incremental construction while the receptor is kept rigid); LPC (program for analysis of protein-ligand contacts based upon an approach known as surface complementarity). Results: Structural basis of AChE inhibition by triterpenoidal alkaloids of type I has been performed by enzyme ki-netics and FlexX molecular docking experiments. Buxamine-C has been found to be 20-fold potent than buxamine-B (K i ¼ 5.5mM and 110mM, respectively). The results of the docking experiments suggested that the cyclopentanophenanthrene skeleton of these inhibitors precisely fits into the aromatic gorge of AChE (pdb code: 1ACL). It was suggested that the C-3 and C-20 amino groups of both alkaloids mimic the wellknown bis-quaternary ammonium type inhibitors such as decamethonium and interact with the Trp84 and Trp279 residues of the enzyme, respectively. Fig. 1 shows the superposition of both natural inhibitors and decamethonium docked into the aromatic gorge of AChE (pdb code: 1ACL). The C-3 amino group in buxamine-C appears to adopt a good position at the bottom of the aromatic gorge exerting a minimum of destabilizing contacts, which seems to be crucial for the inhibitory activity of such inhibitors. These observations are supported by significant difference between complementarity values calculated for docked buxamine-B (0.3) and buxamine-C (0.5) by the LPC software. Biological material: Glycogen phosphorylase (GP) is a key enzyme in the regulation of glycogen metabolism, catalyzing the breakdown of glycogen to glucose-1-phosphate; among the GP isozymes, human liver GP (HLGP) is the preferred target. Data taken from the literature: Crystal structure [atomic coordinates of rabbit muscle glycogen phosphorylase complexed with a W1807, caffeine at the inhibitor site, and CP320626 at the dimer dimer interface site (1C50) were taken from the Brookhaven Protein Data Bank (pdb codes: 3AMV, AGFZ, 1C50, respectively)]. Computational methods: Molecular modeling conformers of type I with R ¼ NO 2 (Ia) were generated by distance geometry approach, the conformer set was energy minimized with a distance-dependent dielectric of 2r with the MMFFs force field calculations and superposed onto known inhibitors in complex with GP (3AMV), caffeine at the inhibitor site (1GFZ), and CP320626 at the dimer interface site (1C50), the AMP allosteric site was identified as the most likely binding site for compound Ia through the SQ overlay, a docking study was carried out using ICM software yielding 100 initial docking poses within the AMP allosteric site, each of the complexes obtained was energy optimized using the MMFFs force field, the total energy of the complex, the individual energies of the ligand and the enzyme, and the interaction energy between the ligand and the enzyme were calculated, the binding pocket was characterized by generating a grid for FLOG as a series of iso-energetic surfaces to characterize the binding pocket by its polar (hydrogen bond donor and acceptor) and hydrophobic nature as well as its van der Waal limits]. Data calculated: E int [interaction energy between the ligand and the enzyme as well as the total energy of the complex, and the individual energies of the ligand and the enzyme, respectively, were calculated (kcal/mol)]. Results: A novel type of potent glycogen phosphorylase inhibitors were designed using molecular modeling methodologies based on a phenyl diacid lead, W1807. In the absence of suitable competitive binding assays, compound W1807 was predicted to bind at the AMP allosteric site. The allosteric site was located by superposition onto known inhibitors which bind at different sites in the enzyme and analyses of the surrounding protein environment associated with these distinct sites. Possible docking modes of W1807 at the AMP allosteric site were further modeled using the complex crystal structure 3AMV. W1807 was predicted to interact with positively charged arginines at the AMP allosteric site in the docking model. A reasonable docking model was computationally determined and subsequently confirmed by X-ray crystallography. Fig. 1 shows the docking model of W1807 inside the AMP allosteric site. The heavy atoms of W1807 are shown in ball and stick, the GP structure is shown in solid ribbon with helices, hydrogen atoms have been omitted for clarity. Characterization of the binding pocket by FLOG revealed a large empty hydrophobic region near the central phenyl ring suitable for accommodating compounds with larger hydrophobic groups and thus improve binding. Based on the results a fused ring analogues were designed with increased hydrophobic bulk in this unfilled region to improve binding. As predicted, this exercise resulted in a new series of GP inhibitors with significantly improved potency. The new ligands showed significantly improved potency. Compounds: 10 Structurally diverse sulfonamides, e.g., type I. plex were determined by X-ray diffraction techniques). Computational methods: Molecular modeling (the structure of the hCA II/type I complex was analyzed by difference Fourier techniques, the model was refined using the CNS program to crystallographic R-factor and R-free values of 0.204 and 0.251, respectively, model building and map inspections were performed using the program O, the stereochemical quality of the model was assessed by Procheck, the most favored and additionally allowed regions of the Ramachandran plot contained 98.6% of the nonglycine residues). Data calculated: rmsd [root mean square deviation () of the position of the corresponding atoms of two superimposed molecular structures]. Results: Crystallographic structure for the adduct of CA II with type I perchlorate, a membrane-impermeant antitumor sulfonamide, showing antitumor activity, has been determined using X-ray diffraction techniques. Fig. 1 shows the view of the active site region of the hCA II/type I complex showing the residues participating in recognition of the inhibitor molecule, hydrogen bonds and the active site Zn(II) ion coordination are also displayed. Examination of the crystallographic structure of the complex reveled the following features: the phenylethyl moiety of the ligand fills the active site, engaging in van der Waals interactions with side chains of Gln192, Val121, Phe131, Leu198, and Thr200. The 2,4,6-trimethylpyridinium moiety is located at van der Waals distance from the aliphatic chain of Ile91 es-tablishing a strong offset face-to-face stacking with Phe131. Other X-ray crystal structures of hCA II-sulfonamide/sulfamate adducts led to the proposal of two principal binding modes: some inhibitors bind with their tail within the hydrophobic half of the CA II active site, defined by amino acid residues Phe131, Val135, Leu198, Pro202, Leu204. A second group of derivatives, including the positively charged sulfonamide investigated here, produces binding with their tail in a different region of the active site, pointing toward the hydrophilic half of it and making an offset face-to-face stacking with Phe131. It appears that this last interaction orients the inhibitor toward the hydrophilic part of the active site, whereas impossibility to participate in this stacking leads to the binding of inhibitors within the hydrophobic half. It was suggested that these findings are relevant for the design of better inhibitors targeting isozymes II, IX, and XII, some of these being overexpressed in hypoxic tumors showing bad disease prognosis. Biological material: Tyrosinase, a key enzyme catalyzing the first two steps of melanin biosynthesis, it is a copper protein widely distributed in nature. Data taken from the literature: IC 50 [concentration of the test substance (mM) required for 50% inhibition of tyrosinase]. Computational methods: QSAR modeling [the main steps for the application of this method in QSAR were the following: (i) draw the molecular pseudographs for each molecule of the data set, using the software drawing mode; (i) use appropriate atom weights in order to differentiate the molecular atoms, where each atomic nucleus was characterized with the following parameters: atomic mass (M), atomic polarizability (P), atomic Mulliken electronegativity (K), van der Waals atomic volume (V), and the atomic electronegativity in Pauling scale (G)]; TOMO-COMD-CARDD [TOpological MOlecular COMputer Design-Computer Aided "Rational" Drug Design) that has been developed to generate molecular fingerprints on the basis of the application of discrete mathematics and linear algebra theory to chemistry, in this sense, atom, atom-type, and total quadratic and linear molecular fingerprints have been defined in analogy to the quadratic and linear mathematical maps (theoretical framework and algorithm is given)]; LDA [Linear Discriminant Analysis implemented in STATISTICA v6.0; statistical parameters: the Wilksl parameter (U-statistic), square Mahalanobis distance (D 2 ), and Fisher ratio (F) for the training set; linear canonical regression coefficient (R can ) and chi-squared (c 2 ) as the measure of the statistical quality of the developed models]. Data calculated: DP% (posterior classification probability classifying a compound as active or inactive by the discriminant equation). Results: The search for novel tyrosinase inhibitors requires the study of the role of tyrosinase in hyperpigmentation and melanogenesis disorders. In this study novel tyrosinase inhibitors were identified by using TOMOCOMD-CARDD descriptors and pattern recognition techniques for finding functions that discriminate between potent tyrosinase inhibitor compounds and inactive ones. Significant LDA models were developed and globally good classifications of 93.51% and 92.46% were achieved for the best models using non-stochastic and stochastic linear indices respectively, in the training set. For external prediction sets accurate classifications of 91.67% and 89.44% were obtained. Fig. 1 shows the plot of the DP% calculated by one of the developed equations using stochastic linear indices for each compound in the training and test sets (the compounds 1 -183 and 184 -246 are active (tyrosinase inhibitors) in training and test sets, respectively; chemicals 247 -541 and 542 -658 are inactive (non-inhibitors of tyrosinase)in both training and test sets, respectively. The new method increases the speed of the discovery of new lead-like compounds, as a suitable alternative to the screening and in vitro assay. This was proven experimentally through the isolation and characterization of six new cycloartane compounds isolated from herbal plants followed by tyrosinase inhibitory assay. A good concordance was shown between the theoretical and experimental results. It was suggested that the results obtained provides a QSAR tool that can be used in the identification of new tyrosinase inhibitor compounds. [concentration of the test substance (nM) required for 50% inhibition of p38a]; Crystal structure (atomic coordinates of type II complexed by p38a were determined by X-ray diffraction techniques). Computational methods: Molecular modeling [the geometries of the five fragments were fully optimized in the gas phase at the B3LYP/6-31G(d) level of theory, the molecular dipole moments, isotropic polarizabilities, and atomic charges were evaluated by single-point calculations at the B3LYP/6-311G(d,p) level, the ESP atomic charges were calculated according to the CHELPG algorithm, the polarized continuum model (PCM) was adopted at the HF/6-31 þ G(d) level of theory for the geometry optimizations in water]. Data calculated: Descriptors [solvation free energy, DGsolv, and absolute chemical hardness, h , descriptors were calculated using single-point calculations using the Gaussian98/03 program suites, the molecular electrostatic potentials (MEPs) were calculated at the B3LYP/6 -311G(d,p) level of theory and plotted using Spartan02 software, the octanolwater partition coefficients, logP, were predicted using ACD/labs v6.00 software package, the van der Waals molecular volume, V, and wateraccessible hydrophobic surface area, ASA-H, descriptors were evaluated by the MOE software, only atoms with absolute values for the ESP atomic charges of less than 0.2 ewere considered in calculating the ASA-H, the molecular geometries optimized in solution were adopted for these calculations]. Results: Theoretical and experimental design of atypical kinase inhibitors has been performed and applied to the p38 MAP kinase. More specifically, 5 mimics of the benzimidazolone nucleus found in inhibitors of p38 kinase were proposed, and their theoretical potential as bioisosteres were investigated. A set of calculated descriptors relevant to the anticipated binding interaction for the bioisostere fragments were calculated. The design considerations and synthesis of p38 inhibitors based on these H-bond acceptor fragments are described in detail. Fig. 1 shows the cartoon representation of p38a/SB-203580 complex 1IAN of a benzimidazolone in the active site of p38a. Comparative evaluation of the pyridine-, benzimidazolone-, benzotriazole-, and triazolopyridine-fragment containing inhibitors shows the triazoles type II and IV to be significantly more potent experimentally than the benzimidazolone structure of type II after which they were modeled. An X-ray crystal structure of type IV bound to the active site showed that the triazole group serves as the H-bond acceptor but unexpectedly as a dual acceptor, inducing movement of the crossover connection of p38R. Fig. 2 shows the X-ray structure of type IV bound to p38a. As a result of the study, significantly more potent compounds were identified with superior calculated physicochemical properties than the benzimidazolone group that they were designed to replace. (an enhanced, faster version of DISCO, given a set of molecules that are related by their ability to bind to a protein receptor, DISCOtech identifies features that could be elements in a pharmacophore model, DISCOtech operates in distance space and can perform clique detection to generate pharmacophore hypotheses on up to 300 conformers per molecule). Data calculated: rmsd [root mean square deviation () of the position of the corresponding atoms of two superimposed molecular structures]. Results: Two 1,3,5-trisubstituted aromatic scaffolds intended to serve as g-turn mimetics have been synthesized and incorporated in five pseudopeptide analogues of Ang II. Molecular modeling supported the supposition that 1,3,5-trisubstituted aromatic scaffolds may serve as relevant g-turn mimetics. Fig. 1 shows the image of models of scaffolds superimposed onto an inverse g-turn found in a crystallized protein. All the tested compounds exhibited nanomolar affinity for the AT 2 receptor, and the best compound displayed a K i affinity of 1.85 nM. Four pseudopeptides were found to be AT 2 selective, while one also exhibited good affinity for the AT 1 receptor (K i ¼ 30.3 nM). The results of molecular modeling, using the program DISCOtech, showed that the high-affinity li-gands could interact similarly with the AT 2 receptor as other ligands with high affinity for this receptor. Fig. 2 shows the developed DISCOtech model including all compounds, where the spheres indicate the DISCOtech features: N-terminal, (A); C-terminal, (B); tyrosine phenolic oxygen, (C); green, proposed receptor interaction point for the arginine guanidino group, (D). Computational methods: Molecular modeling (sketching, geometry optimization and conformational search of resveratrol was performed BioMedCAche v5.0 software, the minimum energy conformation was obtained first with the MM2 method followed by application of semiempirical AM1 minimization using two dihedral angle search by step, the charges of resveratrol and heme group were computed using the Tripos force field with the Gasteiger-Hückel method, docking of resveratrol was carried with FlexX in Sybyl v7.0 into the peroxidase site of the crystallographic PGSH-1 and PGSH-2 structures, scaling function was used to penalize deviations of the docked conformer from the ideal geometry); FlexX (program for automatic protein-ligand docking based on incremental construction while the receptor is kept rigid). Data calculated: DG bind [binding energy (kcal/mol) estimated as the sum of free energy contributions from hydrogen bonding, ion-pair interactions, hydrophobic and p-stacking interactions of aromatic groups, and lipophilic interactions]. Results: Resveratrol was docked into the PGSH-1 and PGSH-2 receptor in order to study the molecular basis for the selective inhibition of the PGHS-1 peroxidase activity by resveratrol. The docking model predicted a slightly less favorable DG bind (À 17.9 kcal/mol) value of the resveratrol to the PGHS-2 peroxidase site than its corresponding binding to the PGHS-1 (À 20.4 kcal/mol) site. Fig. 1 and Fig. 2 show the molecular docking results of resveratrol with the PGHS-1 and PGHS-2 peroxidase sites, respectively, where only the main amino acid residues within 3 around the inhibitor are shown for clarity. In both complexes resveratrol is attached to the peroxidase site of the corresponding PGHS isoform, describing the molecular surface contour of the ligand, and dashed lines represent the hydrogen bonds formed between resveratrol and the corresponding PGHS isoform. The formation of hydrogen bonds among the hydroxyl groups of the resveratrol phenyl rings, the backbone of Feheme and the carbonyl group of Leu294 inside the PGHS-1 peroxidase site, as well as the absence of His214 in the backbone of PGHS-1 were the crucial features required to keep the aromatic rings of the natural product parallel to the Feheme group and transverse to the peroxidase access channel. This positioning represented a large steric hindrance at this site and led to selective inhibition. Compounds: 10 Oxindole-based compounds of type I, where R 1 ¼ diverse structural moieties; R 2 ¼ H, OMe, Br, I; e.g., GW5074 where R 1 ¼¼C-Ph-3,5-diBr-4-OH; R 2 ¼ I. [all calculations were conducted on Silicon Graphics Octane R12000 machines, ligand structures were built with Sybyl 6.8 and optimized with the MMFF94 force field and atomic charges, the Pfmrk structural model was con-structed in five steps, the active crystal structure of hCDK2 (pdb ID ¼ 1QMZ) was chosen as the template for the homology modeling of Pfmrk, ab initio parameterization involving the EM and MD refinement procedures of the Pfmrk-ATP structural model required parameterization of the Mg 2 þ ion within its hexacoordinate environment: atomic charges, van der Waals radius and force constant for nonbonded interactions, the RHF (spin-restricted Hartree-Fock) Hamiltonian with the 6 -31G* basis set implemented in the Gaussian98 package was employed to calculate the electrostatic potential (ESP) for the Mg 2 þ coordination complex, refinement of the Pfmrk structural model was conducted by EM and MD calculations using AMBER7 supplemented by the Mg 2 þ -related parameters calculated, geometry of the energy minimized structures was analyzed with PRO-CHECK, a set of Pfmrk inhibitors of type I were docked to the structural model of Pfmrk using GOLD and the G Score function]; GOLD [flexible protein-ligand docking program featuring a (i) genetic algorithm methodology for protein docking; (ii) full ligand and partial protein flexibility; (iii) energy functions partly based on conformation and non-bonded contact information from the Cambridge Structural Database (CSD)]. Results: Recent genome sequencing and molecular cloning projects have identified several enzymes from P. falciparum that may represent novel drug targets, including CDKs which are expected to play a crucial role in parasitic growth. In this study a 3D structural model of Pfmrk, a putative human CDK activating kinase (CAK) homolog in P. falciparum has been constructed. Significant features of this structural model were (i) parameterization of the Mg 2 þ hexacoordination system using ab initio quantum chemical calculations to accurately represented the ATP-kinase interaction; and (ii) comparison between the GOLD docking scores and measured binding affinities for a series of oxindole-based Pfmrk inhibitors of known activity. The refined structural model was evaluated by the docking of a series of known Pfmrk inhibitors. A reasonable correlation (r ¼ 0.82) has been calculated between the G Scores and inhibition constants (pIC 50 ) validated the accuracy of the developed Pfmrk structural model. Fig. 1 shows the plot of the G Score versus pIC 50 values for the oxindole-based Pfmrk inhibitors. Fig. 2 shows the binding mode of GW5074 in the ATPbinding pocket of Pfmrk, where the docked GW5074 is rendered in ball-and-stick mode, the protein residues are rendered in stick mode, and the hydrogen bonds are highlighted by dotted lines. Analysis of the ligand-Pfmrk binding interactions revealed specific residues (Met66, Met75, Met91, Met94, and Phe143) within the Pfmrk binding pocket that may play an important role in inhibitor binding affinity and selectivity. It was suggested that the Pfmrk structural model developed, together with information obtained from analysis of ligand-receptor interactions, should help the rational design of novel, potent and selective Pfmrk inhibitors as antimalarial agents. Several novel compounds were identified that exhibit in vitro inhibitory activity against Pfmrk. The chemical scaffolds of these inhibitors were distinct from those associated with known CDK inhibitors. (B. B.) entry 1MHM) and the human enzyme irreversibly complexed with the substrate methyl-ester (2.7 , PDB entry 1I7B) were taken from the Protein Data Bank]; Protein sequences [28 protein sequences were taken from the Swissprot database). Computational methods: Molecular modeling [protein alignment was carried out using Clustalx 1.81, the AdoMetDC domain was modeled using the human and potato X-ray crystal structures as templates, initial models were built with Modeller 6v2 using high refinement to generate 100 models, hydrogens were added in InsightII at a pH 7.2, minimization was performed in InsightII/Discover3 with the cff91 force field, a cutoff of 9.5 was used for van der Waals and electrostatic interactions with a distance dependent dielectric constant of 4.0, results of modeling were analyzed using Procheck, Ligplot 4.1.1, Cerius2 and Pymol, the DSSP algorithm was used to assign secondary structure as implemented in the DSSPCMBI program], ligand docking was performed using the SA-docking module of InsightII]. Data calculated: rmsd [root mean square deviation () of the position of the corresponding atoms of two superimposed molecular structures]. Results: In P. falciparum, the two main regulatory activities of polyamine biosynthesis, ODC and AdoMetDC, occur in a single bifunctional protein. In this study, novel features of the P. falciparum AdoMetDC was revealed by structural alignment. The AdoMetDC domain was modeled using the human and potato X-ray crystal structures as templates. Three parasite-specific inserts and the core active site region was identified by homology modeling. The domain was modeled without the two largest inserts, to yield and rmsd of 1.85 from the human template. Fig. 1 shows the important active site residues in the P. falciparum AdoMetDC, with the ligand AdoMetDC, where Glu72 and Glu438 are acidic residues, Phe5 and Phe415 are aromatic residues, and Cys87 is a polar residue. Hydrogen bonds are highlighted using dashed lines. Contact with the rest of the bifunctional complex was predicted to occur on one face of the PfAdoMetDC domain. In the active site four substitutions were identified compared to the human template. It was hypothesized that one of these substitutions may be responsible for the lack of inhibition by Tris, compared to mammalian AdoMetDC. The model also gave an explanation for the lack of putrescine stimulation in PfAdoMetDC compared to mammalian AdoMetDC (in some organisms the activity and/or proteolytic processing) of AdoMetDC is stimulated by putrescine. It was shown that a network of residues that connects the putrescine-binding site with the active site in human AdoMetDC was conserved in the malarial and plant cognates. Internal basic residues were found to assume the role of putrescine, based on the model and site-directed mutagenesis: the presence of Arg11 was crucial for normal activity, while disrupting Lys15 and Lys215 each cause 50% inhibition of AdoMetDC activity. It was suggested that these novel features of malarial AdoMetDC can guide further experimental investigations that may lead to the discovery of parasite-specific inhibitors. (B. B.) Title: Design, synthesis and biological activity of acyl substituted 3-amino-5-methyl-1,4,5,7-tetrahydropyrazolo[3,4-b]pyridin-6-ones as potential hypnotic drugs. Authors Computational methods: VolSurf [computer program that automatically converts 3D molecular fields into simpler molecular descriptors related to absorption, distribution, metabolism, excretion (ADME) properties, which are easy to understand and to interpret]; "rule of 5" (Lipinskis rule set postulating that poor oral absorption is probable for a compound that have any two of the following properties: molecular weight above 500 amu, ClogP above 5, and having more than 5 hydrogen bonds donors, or 10 hydrogen bond acceptors present); Catalyst [computer program for automatic generation of pharmacophore models for a training set of molecules specifying the relative alignments and active conformations of the ligands consistent with the binding to a common receptor site, the pharmacophore model (hypothesis) consists of a collection of features necessary for the biological activity of the ligands arranged in 3D space, the common ones being hydrogen bond acceptor, hydrogen bond donor, and hydrophobic features]. Data calculated: ADME [ADME properties such as Caco2 (Gastro-Intestinal Barrier) and BBB (Blood-Brain Barrier)]; CLOGP (1-octanol/water partition coefficients of the compounds were estimated using the CLOGP algorithm). Chemical descriptors: N,O, NH,OH (number of H bond acceptors and H bond donors, respectively); MW [molecular weight (g/mol)]. Results: Among the known non-benzodiazepinic hypnotic drugs acting on the a1 subunit of the GABA-A receptor, zolpidem zaleplon and indiplon show high affinity and selectivity. Following a design methodology including Catalyst pharmacophoric requirements and ADME-predicted properties, a set of compounds of type I as potential hypnotic drugs were designed and synthesized based on the scaffold shown in Fig. 1 . The acyl substituted 3-amino-4,5-dihydro-1H-pyrazolo [3,4-b] pyridin-6(7H)-ones were completely devoid of any inhibitory activity (%Bind < 20%), while Zolpidem showed an inhibitory activity of 99.4% (at a concentration of 1.0 · 10 À 5 M). This lack of activity could be attributed to two possible reasons: (i) in the pharmacophoric model shown in Fig. 1 Zolpidem shows a hydrophobic interaction of an aryl substituent present in N2 of the pyrazole ring; such interaction, which is not possible for the molecules of type I, could explain the lack of inhibitory activity; (ii) a literature search has revealed that Zaleplon is metabolized by oxidation of the pyrimidine ring to afford an inactive metabolite that presents a lactam carbonyl group at the same relative position of the pyridone carbonyl group of the molecules of type I. It was concluded that the presence of such carbonyl group, the distinctive feature of the applied synthetic methodology, could be responsible of the loss of activity of the novel compounds. (B. B.) and (iii) encoding the filtered MIF into the GRIND variables]; GOLPE (Generating Optimal Linear PLS Estimations program using D-optimal design to pre-select non-redundant variables and fractional factorial design to run PLS analyses with different variable combinations); Probes (OH2 water, DRY hydrophobic probe, C3 sp3 methyl probe, N1 Neutral flat NH, e.g., amide, N1: sp3 N with lone pair, N1 þ sp3 amine NH cation, NM3 trimethylammonium cation, OH phenol or carboxy OH, O-sp2 phenolate oxygen, O sp2 carbonyl oxygen); CPCA (Consensus Principal Component Analysis). Results: An evaluation and comparison of two different approaches, GRID/CPCA and GRIND/CPCA, that can be used for visualizing the structural differences between related proteins is presented. Ten crystal structures of CDK2/cyclin A and GSK3b complexed with different inhibitors were compared to identify regions that could be potential sites for achieving selectivity for CDK2 versus GSK3b. The study pointed out marked differences in the bottoms of the CDK2-GSK3b ATP binding pockets that guided the optimization to obtain a selective benzodipyrazole CDK2 inhibitor. It had been reported that a bromine substituent at position 6 of indirubin increases the selectivity for GSK3b over CDK2. The superimposition of the crystal structure of 6-bromo-indirubin-3'-oxime bound into the GSK3b and the benzo-pyrazole ligand bound into the CDK2_3 crystal structure is shown in Fig. 1 . The attachment points of the bromine and the gem-dimethyl almost overlap. The two moieties occupy the same area of the ATP pocket (which correspond to the NM3 region, previously described). The gain in selectivity could be associated with two main differences in the ATP pockets of the enzymes. Phe80 of CDK2, the so-called gatekeeper residue that had been often exploited for the design of kinase selective ligands, was replaced by a leucine in GSK3b, and Ala144 was replaced by a cysteine. As a result of these mutations, the CDK2 had a less elongated and less flat buried region at the back of the ATP pocket. The selectivity regions identified using the GRIND/ CPCA approach were in good agreement with those obtained using the well-known GRID/CPCA method. The analysis performed using the GRIND descriptors confirmed the presence of favorable hydrophobic areas near the Phe80 of CDK2 that can be fitted optimally by adding two hydrophobic moieties above and below the plane identified by the benzodipyrazole scaffold. Fig. 2 shows the DRY-DRY GRIND variable able to discriminate CDK2-GSK3b proteins. The nodes A and B indicate the position of the favorable hydrophobic areas at a distance of about 3 from each other. Both methods provided insights on structural features that might be useful for improving selectivity against both GSK3b and CDK2 kinases. In particular, the GRID/CPCA approach was instrumental in rationally designing a novel, potent, and selective CDK2 benzodipirazole inhibitor. (B. B.) tradiol (pdb code: 1ere), CHIME and NCBF (Noncovalent Bond Finder) were used for structure visualization, key ligand receptor interactions, were investigated by superimposing the steroid skeleton of each ligand in the set in the MTD-PLS-selected conformation, to the 17a-estradiol structure from 1ere inside the protein, and then, we the 17a-estradiol was deleted, thus the binding position proposed by MTD-PLS for almost each ligand was investigated without any assumptions regarding protein flexibility]; MTD [Minimal Steric (Topological) Difference is a measure of the misfit of the ligand molecule within the receptor site, a hypermolecule is calculated by superposition of the ligands, the vertices of a hypermolecule are obtained by an optimization procedure, the vertices are attributed to three regions: receptor cavity (e j ¼ À 1), receptor walls (e j ¼ þ 1) and exterior (e j ¼ 0), the MTD i value for the ligand L i is equal to the number of occupied wall vertices, plus the number of unoccupied cavity vertices (Eq. 1), where j ¼ vertex enumeration, s ¼ total number of cavity vertices, x ij ¼ 1 if molecule i occupies vertices j and x ij ¼ 0, if not]; PLS (Partial Least Squares projections to latent structures analysis performed SIMCA P v9.0); LOO (Leave-One-Out cross-validation). Data calculated: Descriptors [fragmental descriptors used in this work: ÀOH, ÀOÀ, ÀNHÀ, ÀSÀ, ¼CHÀ, ÀF, ÀCl, ÀBr, À I, N (nitro), À NH 2 , À CH 2 À , À CH 3 , O (nitro), ¼C < , ÀOÀ, ¼O (keto), ¼O (ester), CH, C, ¼NÀ]; VIP (Variable Influence on Projection); q 2 (cross-validated correlation coefficient). Results: MTD-PLS, a PLS variant of the minimal topologic difference method has been used for mapping interactions between estradiol derivatives and the ERa. A set of 45 estrogen agonist derivatives with associated RBA values was used. Fig. 1 shows the hypermolecule construction and vertex numbering, where the common rigid skeleton is used in superposition, and only the differing vertexes are numbered. Retrospective virtual screening with MTree models generated for angiotensin-converting enzyme and the a1a receptor on a large candidate database. The resulting models were in very good agreement with the available X-ray structural information and known pharmacophores demonstrating the quality and possibility for chemical interpretation in these models. Fig. 1 shows the derived ACE MTree model, where the bold hexagons indicated significant correspondence with potency. The screening yielded enrichment factors up to 71 for the first 1% of the screened database. MTree models developed outperformed database searches using single feature trees in terms of hit rates and quality. The MTree screening procedures additionally identified alternative molecular scaffolds not included in any of the query molecules. Moreover, the new methodology identified relevant molecular features, that are known to be important for affinity to the target. The methodology allows for introducing weights of individual nodes of the MTree model, giving differing importance to these features. Such weights could be derived on the basis of experimental affinity data. (B. B.) Title: Theoretical and experimental design of atypical kinase inhibitors. Application to p38 MAP kinase dihydro benzimidazol-2-one-pyridine b) Compounds of type I, type II, type III, type IV Title: Novel properties of malarial S-adenosylmethionine decarboxylase as revealed by structural modelling. Authors: Wells Biological material: Ornithine decarboxylase (ODC, EC 4.1.1.17) and S-adenosylmethionine decarboxylase (Ado-MetDC, EC 4.1.1.50) domains in Plasmodium falciparum (PfAdoMetDCare) represented by a single bifunctional protein Nerviano Medical Sciences Viale Pasteur 10, I-20014 Nerviano (MI), Italy. E-mail: anna.vulpetti@nervianoms.com A PLS variant of the minimal topologic difference method. 3. Mapping interactions between estradiol derivatives and the alpha estrogenic receptor Estradiol and 60 estradiol derivatives (51 training set and 10 test set compounds) Biological material: Uterus estrogen receptor alpha (ERa) from rat (Ratus norvegicus) Computational methods: Molecular modeling [the docked lactamate/biapenem model was constructed by modifying the enzyme-intermediate complex (code: 1X8I), after removing the intermediate the optimized biapenem structure was docked in the active site, the resulting complex was solvated in a 25 radius of pre-equilibrated TIP3P water sphere centered at the zinc ion and optimized according to the QM/MM and DFT protocols, the quantum region in the QM/MM simulations, which included the Zn(II) ion and its ligands, the antibiotic molecule, the catalytic water, and an active-site histidine residue, was treated using the SCC-DFTB model, 500 ps molecular dynamics (MD) simulations of both the apo enzyme and the substrate-enzyme complex were performed employing the SHAKE algorithm, all DFT calculations were performed using Gaussian 03]; QM/MM, DFT [Quantum Mechanical/Molecular Mechanical, and Density Functional Theory (DFT) methods, respectively]; SCC-DFTB (Self-Consistent Charge Density Functional Tight Binding); SHAKE (the SHAKE algorithm used to constrain all bond lengths of the molecules to their equilibrium values). Results: The active-site dynamics of the binding of biapenem to monozinc Cpha a-lactamase has been studied employing QM/MM and DFT methods. The quantum region in the QM/MM simulations was treated using the SCC-DFTB model. Biapenem was docked at the active site corresponding to a recent X-ray structure of an enzyme-intermediate complex (code: 1X8I). Fig. 1 shows the important bonding interactions at the active site of the CphA-biapenem complex, where the biapenem is drawn with bold lines.The modeling experiments revealed that the substrate is connected in a metal binding through its 3-carboxylate oxygen, and anchored by several hydrogen bonds between the substrate and active-site residues, particularly those made possible by the conformational change of Asn233. An activesite water was found in a pocket near the lactam carbonyl carbon of the substrate hydrogen bonded primarily with the carboxylate side chain of the metal-binding Asp120. It was proposed that the Asp120 residue, like His118, may serve as a general base to activate the catalytic water. The energetic stability of the metal-ligand bonds and the hydrogen-bond network was checked by MD simulations of both the apo enzyme and the substrate-enzyme complex. The structure and dynamics of the substrate-enzyme complex model provided valuable insights into the mode of catalysis in these enzymes that play a central role in the bacterial resistance to a-lactam antibiotics. The theoretical simulations complementing experimental results provided important details of the process and yielded quantities that would be hard to measure accurately. It was suggested that a molecular level modeling of the binding and catalysis of the metallo-a-lactamases is essential in understanding the origin of bacterial resistance and in designing novel and effective inhibitors. (B. B.) Title: Angiotensin II pseudopeptides containing 1,3,5-trisubstituted benzene scaffolds with high AT 2 receptor affinity.Authors Biological material: Angiotensin receptors: AT 1 in the rat liver membranes, and AT 2 in the pig uterus myometrium.Data determined: Using this method spatially assigned analysis of fragment properties can provide receptor site maps, within the limits of the training sets. A steric misfit was identified for the steroidal position 2; Advantageous hydrophobic and van der Waals interactions (enhanced by high polarizability) were found for the 17R-CHdCH-X fragment. The MTD-PLS mapping results were confirmed by the experimentally derived estradiolestrogen receptor binding site contacts based on X-ray crys-tallography data. The results suggest that the proposed MTD-PLS method can yield useful insights for interactions with receptors of unknown 3D structure and, generally, for the steric rigidity of receptor sites. (similarity-based virtual screening based on similarity search, e.g., using Tanimoto coefficient or fingerprint-based similarity searching, the searches are evaluated by the recall at 5%, i.e., the percentage of the database molecules belonging to the same activity class as the reference structure that is retrieved in the top 5% of a ranking of the database); TurboSim (similarity-based virtual screening involving the nearest neighbors of the reference structure); kNN [k-Nearest Neighbors, a pattern recognition method, where the distance (usually Euclidean) between the pattern vector of an unknown and each of the pattern vector of the training set is first computed, the k nearest samples to the unknown are selected and it is classified in the group to which the majority of the k samples belongs]. . Results: A simple way of enhancing the effectiveness of similarity-based virtual screening by using information about the nearest neighbors (NNs) of the initial target structure in a similarity search is reported. In this study the underlying hypothesis was tested that fusing the outputs of similarity It was suggested that the proposed turbo similarity searching approach provides a simple way to enhance the effectiveness of simulated virtual screening searches. The approach requires no modifications to existing similarity software other than the ability to fuse the outputs of the multiple searches to give a single combined ranking of the database structures. In this study 2D fingerprints and the Tanimoto coefficient were used, but any other type of similarity measure can be used that satisfies the similar property principle. Center for Bioinformatics, University of Hamburg Bundesstr. 43, D-20146 Hamburg, Germany. E-mail: rarey@zbh.uni-hamburg.de; Tel.: 49-40-428-387-351; Fax: 49-40-428-387-351. Source: J. Med. Chem. 2005, 48(21) , 6575 -6584. Compounds: 47 691 Compounds.Biological material: a) Angiotensin converting enzyme (ACE), a type membrane-anchored dipeptidyl carboxypeptidase that is essential for blood pressure regulation and electrolyte homeostasis b) a1a adrenergic receptors, that belong to the family of Gprotein coupled receptors (GPCRs). Data taken from the literature: Dataset[the data set for model generation and virtual screening was extracted as a drug-like subset from the World Drug Index (WDI), the filtered candidate database contained 47 691 compounds, comprising 331 a1a and 108 ACE inhibitors]. Computational methods: MTree [all molecules were described by the established feature tree descriptor, which is derived from a topological molecular graph, a consistent topological molecular alignment based on chemically reasonable matching of corresponding functional groups was performed employing a new pairwise alignment algorithm (theory is given), the MTree models generated were used for retrospective virtual screening in compound libraries]; Catalyst [computer program for automatic generation of pharmacophore models for a training set of molecules specifying the relative alignments and active conformations of the ligands consistent with the binding to a common receptor site, the pharmacophore model (hypothesis) consists of a collection of features necessary for the biological activity of the ligands arranged in 3D-space, the common ones being hydrogen bond acceptor, hydrogen bond donor, and hydrophobic features; the virtual screening was performed using the pharmacophores derived by using Catalyst]. Results: A novel approach for ligand-based virtual screening is presented by combining query molecules into a multiple feature tree model called MTree. The proposed approach capable of dealing with a set of known actives simultaneously.