Load dataset diabetes.arff, to be found in the "Data" folder in WEKA (download folder), also
loaded in the folder "Assignments&Labs". For an explanation of the dataset see
https://archive.ics.uci.edu/ml/support/diabetes.
Apply Feature Selection using Principal Components Analysis (PCA).
Under Select Attributes -> Attribute Evaluator (select PrincipalComponents). Note the
default Search Method should be Ranker (so click yes at the pop-up or change the method
before selecting attribute evaluator. Leave all default settings and select Start. What are the top 3
features or feature-sets?
. Repeat the PCA, adjust the varianceCovered to 0.7, and run the model. Describe your findings.
Using the same data, Apply Normalization. Under Preprocess -> Filter-> Unsupervised->
Attribute->Normalize->Ok. Close and Then Apply. Use default settings. Then run PCA again
with 0.95 for varianceCovered. Explain your results.
Additionally, click ‘Save’ and save the normalized data as csv OR select ‘Edit’ and take a screenshot of
your normalized data.
Using the same data. Apply Binning. Under Preprocess -> Filter-> Unsupervised-> Attribute-
>Discretize ->Ok. Close and Then Apply. Use default settings. Then run PCA again with 0.95
for varianceCovered. Explain your findings. Note: Make sure to Undo the normalization
before you apply binning.
Sun | Mon | Tue | Wed | Thu | Fri | Sat |
---|---|---|---|---|---|---|
30 | 31 | 1 | 2 | 3 | 4 | 5 |
6 | 7 | 8 | 9 | 10 | 11 | 12 |
13 | 14 | 15 | 16 | 17 | 18 | 19 |
20 | 21 | 22 | 23 | 24 | 25 | 26 |
27 | 28 | 29 | 30 | 1 | 2 | 3 |
Get Free Quote!
344 Experts Online