| Data for the training set was
curated from multiple public domain sources and
normalized to a common basis prior to model building.
Models 1 through 3 contain training sets with
experimental hERG binding IC50 values for 127,
170 and 204 compounds respectively.
Almost all molecules used
in the model are commercially available drugs.
98% of these drugs are considered drug-like based
on Lipinski's rule of five.
| Number
of Lipinski’s Rule Violations |
Number
of Molecules |
|
0 |
170 |
|
1 |
28 |
|
2 |
3 |
|
3 |
3 |
|
4 |
0 |
|

|
A self-dissimilarity test
for the training set was carried by using 2D structural
fingerprints and was determined to be 64%. The
maximal dissimilarity was observed to be 96%.
|