Regression model (classification)
Open in:QDB ExplorerQDB Predictor
Name | Type | n | Accuracy |
---|---|---|---|
Training set | training | 51 | 0.804 |
Test set | external validation | 17 | 0.765 |
Regression model (classification)
Open in:QDB ExplorerQDB Predictor
Name | Type | n | Accuracy |
---|---|---|---|
Training set | training | 51 | 0.843 |
Test set | external validation | 17 | 0.765 |
When using this QDB archive, please cite (see details) it together with the original article:
Piir, G. Data for: Development of binary classification models for grouping hydroxylated polychlorinated biphenyls into active and inactive thyroid hormone receptor agonists. QsarDB repository, QDB.269. 2025. https://doi.org/10.15152/QDB.269
Akinola, L. K.; Uzairu, A.; Shallangwa, G. A.; Abechi, S. E. Development of binary classification models for grouping hydroxylated polychlorinated biphenyls into active and inactive thyroid hormone receptor agonists. SAR and QSAR in Environmental Research 2023, 34, 267–284. https://doi.org/10.1080/1062936x.2023.2207039
Title: | Akinola, L. K.; Uzairu, A.; Shallangwa, G. A.; Abechi, S. E. Development of binary classification models for grouping hydroxylated polychlorinated biphenyls into active and inactive thyroid hormone receptor agonists. SAR and QSAR in Environmental Research 2023, 34, 267–284. |
Abstract: | Some adverse effects of hydroxylated polychlorinated biphenyls (OH-PCBs) in humans are presumed to be initiated via thyroid hormone receptor (TR) binding. Due to the trial-and-error approach adopted for OH-PCB selection in previous studies, experiments designed to test the TR binding hypothesis mostly utilized inactive OH-PCBs, leading to considerable waste of time, effort and other material resources. In this paper, linear discriminant analysis (LDA) and binary logistic regression (LR) were used to develop classification models to group OH-PCBs into active and inactive TR agonists using radial distribution function (RDF) descriptors as predictor variables. The classifications made by both LDA and LR models on the training set compounds resulted in an accuracy of 84.3%, sensitivity of 72.2% and specificity of 90.9%. The areas under the ROC curves, constructed with the training set data, were found to be 0.872 and 0.880 for LDA and LR models, respectively. External validation of the models revealed that 76.5% of the test set compounds were correctly classified by both LDA and LR models. These findings suggest that the two models reported in this paper are good and reliable for classifying OH-PCB congeners into active and inactive TR agonists. |
URI: | http://hdl.handle.net/10967/269
http://dx.doi.org/10.15152/QDB.269 |
Date: | 2025-05-07 |
Name | Description | Format | Size | View |
---|---|---|---|---|
2023SQER267.qdb.zip | Models for detecting thyroid hormone receptor agonists | application/zip | 23.14Kb | View/ |