MOE-QSAR课件.ppt
- 【下载声明】
1. 本站全部试题类文档,若标题没写含答案,则无答案;标题注明含答案的文档,主观题也可能无答案。请谨慎下单,一旦售出,不予退换。
2. 本站全部PPT文档均不含视频和音频,PPT中出现的音频或视频标识(或文字)仅表示流程,实际无音频或视频文件。请谨慎下单,一旦售出,不予退换。
3. 本页资料《MOE-QSAR课件.ppt》由用户(三亚风情)主动上传,其收益全归该用户。163文库仅提供信息存储空间,仅对该用户上传内容的表现方式做保护处理,对上传内容本身不做任何修改或编辑。 若此文所含内容侵犯了您的版权或隐私,请立即通知163文库(点击联系客服),我们立即给予删除!
4. 请根据预览情况,自愿下载本文。本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
5. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007及以上版本和PDF阅读器,压缩文件请下载最新的WinRAR软件解压。
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- MOE_QSAR 课件
- 资源描述:
-
1、InforSense&CCG All Rights ReservedAdvanced Application in MOEQSARInforSense&CCG All Rights ReservedOutlineQSAR OverviewDescriptor calculationDescriptor selection(PCA)Deriving QSAR modelsModel ValidationInforSense&CCG All Rights ReservedQSARQuantitative Structure-Activity Relationship(QSAR)applicatio
2、ns correlate experimental data(e.g.biological activity or physical properties)with the structure of chemical compounds in a quantitative manner.QSAR models allow the interpretation and prediction of properties of structurally related compounds.*)The art of deriving a QSAR model lies in:Identifying a
3、 suitable mathematical functional form Reducing the complex dimensionality of reality into as few dimensions as possible while still being able to give useful predictions of specific properties for molecules not experimentally tested so far.Most QSAR models are based on linear correlations.InforSens
4、e&CCG All Rights ReservedQSAR Model DevelopmentRobust QSAR model development generally proceeds as follows:Assemble a database of experimental results and molecular structures.Identify a descriptor set that correlates highly with the property in question,use descriptors which are mutually orthogonal
5、 and as meaningful and intuitive as possible(based on the underlying physico-chemical properties).Split the dataset into an appropriate training and test set.The training set will be used to develop the model.The test set will be used to validate the predictive power of the model.In most cases the a
6、pplicability of the model will be closely limited to the property space of the test set.Apply methods(regression,classification,etc.)to generate the predictive models based on the training set.Predict activities for the test set to assess robustness of the model.Descriptor calculation(QuaSAR-Descrip
7、tor)Descriptor selection(Principle Components,QuaSAR-Contingency)Modelvalidation(Model-Evaluate)Modeldevelopment(QuaSAR-Model,Model-Composer)InforSense&CCG All Rights ReservedQuantitative&Qualitative QSAR Models in MOEBesides the selection of most appropriate descriptors and a meaningful separation
8、of available data into training and test sets,the choice of an appropriate functional form is key to successful QSAR modeling.MOE provides quantitative as well as qualitative QSAR approaches:Quantitative approaches include linear regression methods such as Partial Least Squares(PLS)and Principal Com
9、ponent Regression(PCR).Qualitative approaches include a non-linear binary filter based on Bayesian statistics as well as a binary classification tree.InforSense&CCG All Rights ReservedModeldevelopmentModelvalidationDescriptor selectionDescriptorcalculationDescriptor CalculationInforSense&CCG All Rig
10、hts ReservedInitial Steps in Understanding a DatasetThe initial steps in interpreting an experimental dataset involve:Building preliminary Structure Activity RelationshipsCommon fragments to actives/inactives Looking for patterns within the data structureAre clusters present in the data?Evaluating t
11、he relative importance of descriptors for a potential modelInvolves both stochastic and heuristic evaluation Finding commonality,and diversity within the dataRobustness in chemical spaceStep 1 in data analysis:Find the relevant set of descriptorsInforSense&CCG All Rights ReservedMolecular Descriptor
12、s and FingerprintsMolecular Descriptors encode molecular properties per molecule into single numerical values.Qualitative:yes/no flags for presence or absence of certain features(like bits in fingerprints see below).Quantitative:numerical measures of physico-chemical or structural properties.May dep
13、endent on connectivity and chemistry only(2D)or also on conformation/3D geometry(3D).Fingerprints typically consists of bit strings of several hundreds or even thousands of individual yes/no flags.Each position of the bit string encodes the presence(1)or absence(0)of a distinct property or feature.I
14、ncluding substructure fragments,connectivity patterns or pharmacophore type functional properties.010010100101001.bit stringBrSNCH3OOONH2Molecular weight:385.282logP:2.552#rotatable single bonds:5or2,5,7,10,12,15,bit positionInforSense&CCG All Rights ReservedQuaSAR Descriptor PanelNumerical molecula
15、r descriptors may be calculated either via(MOE|Compute|QuaSAR|QuaSAR-Descriptor)without opening a database or via(DBV|Compute|Descriptors).Input databaseDescriptor synchronization with databaseDescriptor listDisplay filtersInforSense&CCG All Rights ReservedOverview of MOE Descriptors300 2D and 3D de
16、scriptors Topological indices Surface area properties Physical properties Energy termsAdd new descriptors with SVL Automatically added to relevant calculations Existing descriptors can be used as templateProprietary VSA descriptors Subdivision of surface area based on LogP,MR(molar refractivity)and
17、Partial Charge 2D based approximation(for speed on large datasets)Semi empirical descriptors Descriptor names prefixed with Hamiltonian:AM1_,PM3_,MNDO_ Total energy,electronic energy,heat of formation,HOMO,LUMO,Ionization PotentialInforSense&CCG All Rights ReservedBinned VSA Descriptors I A subset o
18、f highly uncorrelated,intuitive and meaningful 2D descriptors has been implemented in MOE to provide a stable“default”approach for new datasets:the binned Van-der-Waals surface area descriptors(referred to as binned VSA descriptors in MOE)1).LogP(partition coefficient),MR(molar refractivity)and part
19、ial charge are used to cover a meaningful property space from hydrophobic to hydrophilic interactions.Each of these descriptor sets is derived from,or related to the Hansch汉施 and Leo descriptors.2)The descriptor returns the approximate surface area of a molecule,produced from a 2D representation,tha
20、t falls into a given range of property values.Using the subset of binned VSA descriptors may help to overcome the necessity of using automatic descriptor selection routines.3)InforSense&CCG All Rights ReservedBinned VSA Descriptors IIThe surface contribution which may be sensed by neighboring molecu
21、les is approximated by subtracting overlapping surface areas from first shell atom neighbors.The 2019 Wildman&Crippen1)atom type model is used to map properties onto individual atoms.Contributions to LogP and MR are derived in linear models from datasets of about 10,000 experimental data points each
22、2).For partial charge calculation,the Gasteiger PEOE charges is used.The approximate surface area contributions of a given molecule are added for each property bin.3)Vi values:V7 V2 V1 V6 V3 V4+V8+V5Pi range:0,1)1,2)2,3)3,4)4,5)5,6)6 Descriptors:D1D2D3D4D5D6C8C3C4C5C6N7O2C1InforSense&CCG All Rights
23、Reserved2D BCUT and GCUT Descriptors BCUT:Burden Matrix eigenvalues The BCUT descriptors*)are calculated from the eigenvalues of a modified adjacency matrix.The adjacency matrix contains a 1 if atoms i,j are bonded;0 otherwise.Each ij entry of the adjacency matrix takes the value bij-1/2 where bij i
24、s the formal bond order between bonded atoms i and j.The diagonal takes the value of the associated PEOE,SMR,logP descriptor.The resulting eigenvalues are sorted and the smallest,1/3 percentile,2/3 percentile and largest eigenvalues are reported.GCUT:Inverse graph distance matrix eigenvalues The GCU
25、T descriptors are calculated from the eigenvalues of a modified graph distance adjacency matrix,similar to BCUT descriptors.Each ij entry of the adjacency matrix takes the value dij-2 where dij is the(modified)graph distance between atoms i and j.The diagonal takes the value of the associated PEOE p
展开阅读全文