Model MY_2D_QSAR

Calculates the property MY_2D_QSAR. See Output for other properties that can be computed.

The model was built using the Least-Squares method.

The document contains these additional sections of information:
Regression Statistics
Model Coefficients and Variables
Training Data Information
Excluded Variable Information
Model Construction Parameters

Regression Statistics Table

This table contains statistics from the regression modeling procedure.

Statistic  Value
N  70
r  0.977
r2  0.954
r2 (adjusted)  1.211
r2 (prediction)  0.174
RMS residual error  0.1843
q2 (cross-validation)  0.202
RMS residual error (cross-validation)  1.145
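
An adjusted r2 above 1 looks like a misprint, but it may simply follow from the model having more fitted terms than samples. The sketch below is a consistency check, not the report's own code. It assumes the standard adjusted-r2 formula, and it assumes that the number of predictors p is the 84 non-constant terms listed in the Coefficient-Variable Table. It also recomputes r2 from the RMS residual error and the standard deviation of the Y variable (0.86078, taken from the Training Data Information section). The function names are illustrative.

```python
def adjusted_r2(r2: float, n_samples: int, n_predictors: int) -> float:
    """Standard adjusted r2: 1 - (1 - r2) * (n - 1) / (n - p - 1)."""
    dof = n_samples - n_predictors - 1
    if dof == 0:
        raise ValueError("adjusted r2 is undefined when n - p - 1 == 0")
    return 1.0 - (1.0 - r2) * (n_samples - 1) / dof


def r2_from_rms(rms_residual: float, y_std: float) -> float:
    """r2 = 1 - (residual variance / variance of Y)."""
    return 1.0 - (rms_residual / y_std) ** 2


N = 70    # samples, from the statistics table
P = 84    # assumed: non-constant terms in the Coefficient-Variable Table
R2 = 0.954

adj = adjusted_r2(R2, N, P)               # n - p - 1 = -15, so the correction flips sign
r2_check = r2_from_rms(0.1843, 0.86078)   # RMS residual error and Y std. dev.

print(f"adjusted r2 ~ {adj:.2f}, r2 from RMS ~ {r2_check:.3f}")
```

Under these assumptions both reported values are reproduced to within rounding. That suggests the fit is over-parameterized (84 terms for 70 samples), which is in line with the low r2 (prediction) and q2 values.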

Coefficient-Variable Table

This table contains the coefficients and associated variables for the equation.

Coefficient  Variable
0.263585  Constant
0.139511  ALogP
-0.151553  Count<ECFP_6:-591526139>
0.254562  Count<ECFP_6:-1910270391>
0.255009  Count<ECFP_6:-1100000244>
0.255548  Count<ECFP_6:-1074141656>
0.257191  Count<ECFP_6:1559650422>
0.25646  Count<ECFP_6:642810091>
0.257106  Count<ECFP_6:-182236392>
0.257097  Count<ECFP_6:-606302475>
0.256129  Count<ECFP_6:1572579716>
1.11729  Count<ECFP_6:-992506539>
-0.493043  Count<ECFP_6:734603939>
3.02386  Count<ECFP_6:-797085356>
0.256944  Count<ECFP_6:2099970318>
0.258393  Count<ECFP_6:770157610>
0.257053  Count<ECFP_6:-2024255407>
0.257212  Count<ECFP_6:1996767644>
0.25823  Count<ECFP_6:-786013480>
0.258704  Count<ECFP_6:1997021792>
0.257812  Count<ECFP_6:-175146122>
0.259305  Count<ECFP_6:1298504034>
0.259698  Count<ECFP_6:858184972>
0.261082  Count<ECFP_6:-932108170>
-0.000900798  Count<ECFP_6:-1332781180>
0.0869928  Count<ECFP_6:-757679000>
0.331836  Count<ECFP_6:-1102925512>
1.15026  Count<ECFP_6:-177264675>
-1.71094  Count<ECFP_6:859796174>
1.1353  Count<ECFP_6:-1686976258>
0.260358  Count<ECFP_6:-1302110264>
0.261407  Count<ECFP_6:1095683433>
0.263643  Count<ECFP_6:-768126022>
0.261617  Count<ECFP_6:2007300961>
0.261366  Count<ECFP_6:397284699>
0.260751  Count<ECFP_6:1451403962>
0.262348  Count<ECFP_6:2071685859>
-0.620883  Count<ECFP_6:-1952026932>
0.464346  Count<ECFP_6:-658363709>
-1.78009  Count<ECFP_6:-1506130950>
1.28495  Count<ECFP_6:1670941296>
0.261055  Count<ECFP_6:1155958977>
0.260224  Count<ECFP_6:-952707428>
0.260105  Count<ECFP_6:-1278685991>
0.259641  Count<ECFP_6:1079175434>
0.259554  Count<ECFP_6:-2135040425>
0.162945  Count<ECFP_6:-1897341097>
-0.774082  Count<ECFP_6:-167460056>
0.772336  Count<ECFP_6:-1059365320>
0.943147  Count<ECFP_6:-572965350>
0.926212  Count<ECFP_6:-1867561664>
-0.962577  Count<ECFP_6:-1683911134>
-1.47467  Count<ECFP_6:-178525456>
0.101045  Count<ECFP_6:-292555972>
1.49489  Count<ECFP_6:-666950485>
0.366783  Count<ECFP_6:1564392544>
0.00835034  Count<ECFP_6:1571214559>
-0.0527057  Count<ECFP_6:-2019199918>
-0.807647  Count<ECFP_6:-1487746661>
-1.03346  Count<ECFP_6:292958156>
1.75669  Count<ECFP_6:-756348342>
0.479547  Count<ECFP_6:-103562730>
-1.4798  Count<ECFP_6:-1950934120>
0.181543  Count<ECFP_6:-857146788>
0.035885  Count<ECFP_6:-408473190>
-0.627152  Count<ECFP_6:864909220>
0.770691  Count<ECFP_6:-740847217>
0.910334  Count<ECFP_6:408216150>
0.988691  Count<ECFP_6:1595399376>
0.262223  Count<ECFP_6:515773057>
1.45669  Count<ECFP_6:78036066>
-2.06101  Count<ECFP_6:-1884411803>
0.504692  Count<ECFP_6:1021725999>
0.716968  Count<ECFP_6:-665999307>
0.215824  Count<ECFP_6:661073749>
-0.379654  Count<ECFP_6:864518973>
0.257071  Count<ECFP_6:191790798>
-0.694789  Count<ECFP_6:1338334141>
-0.0174989  Molecular_Weight
0.117988  Num_AromaticRings
0.811412  Num_H_Acceptors
0.653521  Num_H_Donors
-0.389538  Num_Rings
0.0348633  Num_RotatableBonds
-11.4967  Molecular_FractionalPolarSurfaceArea
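
The table defines a linear equation: the predicted value is the constant plus the sum of each coefficient multiplied by its variable. The sketch below shows how such an equation could be evaluated. It is not the scoring code of the modeling software. It copies only a handful of the coefficients from the table, so its result is a partial sum rather than a real prediction. The descriptor values in `example` are hypothetical, and computing actual ECFP_6 feature counts for a molecule is outside its scope.

```python
from typing import Mapping

# Subset of the coefficients listed above; a full calculation needs every term.
CONSTANT = 0.263585
COEFFICIENTS = {
    "ALogP": 0.139511,
    "Molecular_Weight": -0.0174989,
    "Num_AromaticRings": 0.117988,
    "Num_H_Acceptors": 0.811412,
    "Num_H_Donors": 0.653521,
    "Num_Rings": -0.389538,
    "Num_RotatableBonds": 0.0348633,
    "Molecular_FractionalPolarSurfaceArea": -11.4967,
    "Count<ECFP_6:-591526139>": -0.151553,
}


def predict(descriptors: Mapping[str, float]) -> float:
    """Return constant + sum(coefficient * value); absent descriptors count as 0."""
    return CONSTANT + sum(
        coef * descriptors.get(name, 0.0) for name, coef in COEFFICIENTS.items()
    )


# Hypothetical descriptor values, for illustration only.
example = {"ALogP": 2.0, "Num_H_Donors": 3, "Count<ECFP_6:-591526139>": 1}
partial_sum = predict(example)
print(f"partial sum over the listed terms: {partial_sum:.6f}")
```

Treating an absent fingerprint feature as a count of 0 is a design choice of this sketch. It mirrors how count variables behave for molecules that lack the feature.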

Training Data Information

The data used to train the model consisted of 70 samples. The following are the statistics for the dependent (Y) and independent (X) variables. (The first row shows statistics for the Y variable. All other rows are for X variables.)

Variable  Min  Max  Mean  Std. Dev.
pki-trypsin  3.854  7.699  5.9975  0.86078
ALogP  -2.386  6.525  2.654  1.3647
ECFP_6  N/A  N/A  N/A  N/A
Molecular_Weight  355.46  676.91  512.58  65.768
Num_AromaticRings  1  4  2.8571  0.72281
Num_H_Acceptors  4  7  5.1143  0.8871
Num_H_Donors  3  4  3.3571  0.47916
Num_Rings  2  6  3.8429  0.87236
Num_RotatableBonds  5  14  8.4429  1.6443
Molecular_FractionalPolarSurfaceArea  0.218  0.403  0.28823  0.036476

Excluded Variable Information

The following table shows statistics for the independent (X) training data variables that were excluded from the model for any of the following reasons: (1) The variable was constant or was a string when a number was expected. [Unexpected string variables appear as constants with values of 0.] (2) The variable contained too few nonzero values (fewer than 8, as specified by the MinSamplesPerVariable parameter; however, fingerprint features excluded due to too few nonzero values are not listed below). (3) The variable was correlated with another variable (correlation coefficient greater in magnitude than 0.9, as specified by the Max Correlation parameter). The Reason column indicates the reason that the variable was excluded.

Variable  Min  Max  Mean  Std. Dev.  Reason
Count<ECFP_6:670515721>  0  1  0.84286  0.36656  Correlated with other variables
Count<ECFP_6:960161451>  0  1  0.24286  0.43191  Correlated with other variables
Count<ECFP_6:20550775>  0  1  0.24286  0.43191  Correlated with other variables
Count<ECFP_6:1658067901>  0  1  0.84286  0.36656  Correlated with other variables
Count<ECFP_6:-1016680330>  0  1  0.24286  0.43191  Correlated with other variables
Count<ECFP_6:2102150379>  0  1  0.98571  0.11952  Correlated with other variables
Count<ECFP_6:-675671408>  0  1  0.24286  0.43191  Correlated with other variables
Count<ECFP_6:571867147>  0  1  0.18571  0.39168  Correlated with other variables
Count<ECFP_6:1574959513>  0  1  0.24286  0.43191  Correlated with other variables
Count<ECFP_6:-978131182>  0  1  0.71429  0.45502  Correlated with other variables
Count<ECFP_6:1454306807>  0  1  0.18571  0.39168  Correlated with other variables
Count<ECFP_6:796830164>  0  1  0.18571  0.39168  Correlated with other variables
Count<ECFP_6:117107367>  0  1  0.24286  0.43191  Correlated with other variables
Count<ECFP_6:944467641>  0  1  0.72857  0.44791  Correlated with other variables
Count<ECFP_6:1336540477>  0  1  0.74286  0.44021  Correlated with other variables
Count<ECFP_6:-1331450522>  0  1  0.52857  0.50279  Correlated with other variables
Count<ECFP_6:1449212896>  0  1  0.71429  0.45502  Correlated with other variables
Count<ECFP_6:-102666057>  0  1  0.11429  0.32046  Correlated with other variables
Count<ECFP_6:-1756464860>  0  1  0.74286  0.44021  Correlated with other variables
Count<ECFP_6:-755462605>  0  1  0.64286  0.48262  Correlated with other variables
Count<ECFP_6:717474525>  0  1  0.7  0.46157  Correlated with other variables
Count<ECFP_6:-1150899835>  0  1  0.64286  0.48262  Correlated with other variables
Count<ECFP_6:1146720904>  0  1  0.62857  0.48668  Correlated with other variables
Count<ECFP_6:710652510>  0  1  0.64286  0.48262  Correlated with other variables
Count<ECFP_6:-1289586824>  0  1  0.62857  0.48668  Correlated with other variables
Count<ECFP_6:1698998511>  0  1  0.11429  0.32046  Correlated with other variables
Count<ECFP_6:-1811366813>  0  1  0.62857  0.48668  Correlated with other variables
Count<ECFP_6:1515192889>  0  1  0.74286  0.44021  Correlated with other variables
Count<ECFP_6:-1650219925>  0  1  0.11429  0.32046  Correlated with other variables
Count<ECFP_6:-281505363>  0  1  0.12857  0.33714  Correlated with other variables
Count<ECFP_6:-1173882748>  0  1  0.64286  0.48262  Correlated with other variables
Count<ECFP_6:1637591468>  0  1  0.64286  0.48262  Correlated with other variables
Count<ECFP_6:2006518499>  0  1  0.64286  0.48262  Correlated with other variables
Count<ECFP_6:-81428579>  0  1  0.68571  0.46758  Correlated with other variables
Count<ECFP_6:1233434266>  0  1  0.64286  0.48262  Correlated with other variables
Count<ECFP_6:-1364467941>  0  1  0.62857  0.48668  Correlated with other variables
Count<ECFP_6:-594723798>  0  1  0.62857  0.48668  Correlated with other variables
Count<ECFP_6:2146640915>  0  1  0.62857  0.48668  Correlated with other variables
Count<ECFP_6:1929265201>  0  1  0.64286  0.48262  Correlated with other variables
Count<ECFP_6:325895898>  0  1  0.62857  0.48668  Correlated with other variables
Count<ECFP_6:865482986>  0  1  0.15714  0.36656  Correlated with other variables
Count<ECFP_6:-1505292865>  0  1  0.15714  0.36656  Correlated with other variables
Count<ECFP_6:-376546800>  0  1  0.45714  0.50176  Correlated with other variables
Count<ECFP_6:601995614>  0  1  0.15714  0.36656  Correlated with other variables
Count<ECFP_6:-954757448>  0  1  0.11429  0.32046  Correlated with other variables
Count<ECFP_6:-1625362884>  0  1  0.11429  0.32046  Correlated with other variables
Count<ECFP_6:833921154>  0  1  0.12857  0.33714  Correlated with other variables
Count<ECFP_6:-174624245>  0  1  0.11429  0.32046  Correlated with other variables
Count<ECFP_6:-1490910266>  0  1  0.11429  0.32046  Correlated with other variables
Count<ECFP_6:-454715551>  0  1  0.11429  0.32046  Correlated with other variables
Count<ECFP_6:2025485523>  0  1  0.11429  0.32046  Correlated with other variables
Count<ECFP_6:-19155222>  0  1  0.14286  0.35245  Correlated with other variables
Count<ECFP_6:-1078835860>  0  1  0.14286  0.35245  Correlated with other variables
Count<ECFP_6:1386744051>  0  1  0.15714  0.36656  Correlated with other variables
Count<ECFP_6:-1719301700>  0  1  0.14286  0.35245  Correlated with other variables
Count<ECFP_6:864287155>  0  1  0.11429  0.32046  Correlated with other variables
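
The three exclusion rules described above (constant variable, too few nonzero values, correlation above the Max Correlation threshold) can be sketched as a simple filter. This is an illustration under stated assumptions, not the software's actual procedure. The function names are hypothetical. The choice to keep the first variable of a correlated pair is a guess. Reading the SqrtEstimate setting as the integer part of the square root of the sample count is also an assumption, although it does give 8 for 70 samples, which matches the threshold quoted in the text.

```python
import math
from typing import Dict, List


def pearson(x: List[float], y: List[float]) -> float:
    """Pearson correlation coefficient of two equal-length, non-constant lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def filter_variables(
    columns: Dict[str, List[float]], min_nonzero: int, max_corr: float = 0.9
) -> Dict[str, str]:
    """Return {excluded variable: reason}; variables are visited in insertion order."""
    excluded: Dict[str, str] = {}
    kept: List[str] = []
    for name, values in columns.items():
        if len(set(values)) == 1:
            excluded[name] = "Constant"
        elif sum(1 for v in values if v != 0) < min_nonzero:
            excluded[name] = "Too few nonzero values"
        elif any(abs(pearson(values, columns[k])) > max_corr for k in kept):
            # Assumption: the earlier variable of a correlated pair is the one kept.
            excluded[name] = "Correlated with other variables"
        else:
            kept.append(name)
    return excluded


# Assumed reading of SqrtEstimate: floor(sqrt(number of samples)).
min_samples_per_variable = int(math.sqrt(70))

# Tiny made-up data set to exercise each rule (threshold lowered to 3).
demo = {
    "a": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10],
    "b": [2, 4, 6, 8, 10, 12, 14, 16, 18, 20],   # perfectly correlated with a
    "c": [5, 5, 5, 5, 5, 5, 5, 5, 5, 5],          # constant
    "d": [0, 0, 0, 0, 0, 0, 0, 0, 1, 1],          # only two nonzero values
    "e": [1, 0, 1, 0, 0, 1, 1, 0, 1, 0],          # passes all three checks
}
reasons = filter_variables(demo, min_nonzero=3)
print(min_samples_per_variable, reasons)
```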

Model Construction Parameters

The following parameter values were specified by the learner component. Some items are internal parameters not exposed by the component. In the course of building the model, certain values may have been adjusted from the values shown below.

Parameter  Value
LearnedPropertyName  MY_2D_QSAR
Name  pki-trypsin
UseProperties  UserSet
PredefinedSet  Estate_Keys
UserSet  ALogP,ECFP_6,Molecular_Weight,Num_AromaticRings,Num_H_Acceptors,Num_H_Donors,Num_Rings,Num_RotatableBonds,Molecular_FractionalPolarSurfaceArea
IgnoreProperties
InitialModelFrom  Least-Squares
Weight Property
kNN Options
Number of Nearest Neighbors  20
Dynamic Smoothing Factor  0.5
Number of XV Groups  11
Additional Options
NumberOfComponents  20
MinSamplesPerVariable  SqrtEstimate
Decorrelation Method  Pearson
Max Correlation  0.90
Learn Options  Perform OPS Analysis, Track Fingerprint Features
Indicator Baseline  Most Common Value
Numeric Distance Function  Euclidean
Numeric Scaling  Mean-Center and Scale, Scale by Number of Dimensions
Fingerprint Distance Function  Tanimoto
Model Domain Fingerprint  FCFP_2
Additional Properties
TopLevelComment  Add Protocol Comment Here
Destination Folder  16606/LearnedProperties
Max OPS Fingerprint Bits  1000
Create Proxy Component  False