Detailed Information

Cited 15 time in webofscience Cited 17 time in scopus
Metadata Downloads

The comparison of automated clustering algorithms for resampling representative conformer ensembles with RMSD matrix

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Hyoungrae-
dc.contributor.authorJang, Cheongyun-
dc.contributor.authorYadav, Dharmendra K.-
dc.contributor.authorKim, Mi-hyun-
dc.date.available2020-02-27T19:42:10Z-
dc.date.created2020-02-06-
dc.date.issued2017-03-23-
dc.identifier.issn1758-2946-
dc.identifier.urihttps://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/6307-
dc.description.abstractBackground: The accuracy of any 3D-QSAR, Pharmacophore and 3D-similarity based chemometric target fishing models are highly dependent on a reasonable sample of active conformations. Since a number of diverse conformational sampling algorithm exist, which exhaustively generate enough conformers, however model building methods relies on explicit number of common conformers. Results: In this work, we have attempted to make clustering algorithms, which could find reasonable number of representative conformer ensembles automatically with asymmetric dissimilarity matrix generated from openeye tool kit. RMSD was the important descriptor (variable) of each column of the N x N matrix considered as N variables describing the relationship (network) between the conformer (in a row) and the other N conformers. This approach used to evaluate the performance of the well-known clustering algorithms by comparison in terms of generating representative conformer ensembles and test them over different matrix transformation functions considering the stability. In the network, the representative conformer group could be resampled for four kinds of algorithms with implicit parameters. The directed dissimilarity matrix becomes the only input to the clustering algorithms. Conclusions: Dunn index, Davies-Bouldin index, Eta-squared values and omega-squared values were used to evaluate the clustering algorithms with respect to the compactness and the explanatory power. The evaluation includes the reduction (abstraction) rate of the data, correlation between the sizes of the population and the samples, the computational complexity and the memory usage as well. Every algorithm could find representative conformers automatically without any user intervention, and they reduced the data to 14-19% of the original values within 1.13 s per sample at the most. The clustering methods are simple and practical as they are fast and do not ask for any explicit parameters. RCDTC presented the maximum Dunn and omega-squared values of the four algorithms in addition to consistent reduction rate between the population size and the sample size. The performance of the clustering algorithms was consistent over different transformation functions. Moreover, the clustering method can also be applied to molecular dynamics sampling simulation results.-
dc.language영어-
dc.language.isoen-
dc.publisherBIOMED CENTRAL LTD-
dc.relation.isPartOfJOURNAL OF CHEMINFORMATICS-
dc.subjectDERIVATIVES-
dc.subjectGENERATION-
dc.subjectINHIBITORS-
dc.subjectMOLECULES-
dc.subjectOPTIMIZE-
dc.subjectDOCKING-
dc.subject3D-QSAR-
dc.subjectMODELS-
dc.subjectTOOLS-
dc.titleThe comparison of automated clustering algorithms for resampling representative conformer ensembles with RMSD matrix-
dc.typeArticle-
dc.type.rimsART-
dc.description.journalClass1-
dc.identifier.wosid000397665500001-
dc.identifier.doi10.1186/s13321-017-0208-0-
dc.identifier.bibliographicCitationJOURNAL OF CHEMINFORMATICS, v.9-
dc.identifier.scopusid2-s2.0-85016130603-
dc.citation.titleJOURNAL OF CHEMINFORMATICS-
dc.citation.volume9-
dc.contributor.affiliatedAuthorKim, Mi-hyun-
dc.type.docTypeArticle-
dc.subject.keywordAuthorConformer ensemble-
dc.subject.keywordAuthor3D shape-based alignment-
dc.subject.keywordAuthork-Means clustering of multidimensional scaled RMSD values (RCKmeans)-
dc.subject.keywordAuthorHierarchical clustering with dynamic tree cut (RCDT)-
dc.subject.keywordAuthorLinear kernel PCA (RCPCA)-
dc.subject.keywordAuthorNonlinear kernel PCA (RCPCA_ RBF)-
dc.subject.keywordPlusDERIVATIVES-
dc.subject.keywordPlusGENERATION-
dc.subject.keywordPlusINHIBITORS-
dc.subject.keywordPlusMOLECULES-
dc.subject.keywordPlusOPTIMIZE-
dc.subject.keywordPlusDOCKING-
dc.subject.keywordPlus3D-QSAR-
dc.subject.keywordPlusMODELS-
dc.subject.keywordPlusTOOLS-
dc.relation.journalResearchAreaChemistry-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryChemistry, Multidisciplinary-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryComputer Science, Interdisciplinary Applications-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
Files in This Item
There are no files associated with this item.
Appears in
Collections
약학대학 > 약학과 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Mi Hyun photo

Kim, Mi Hyun
Pharmacy (Dept.of Pharmacy)
Read more

Altmetrics

Total Views & Downloads

BROWSE