Skip to content

Final Project for MA463X: Data Analytics & Statistical Learning. Completed by Mike Giancola, Ranier Gran, Cassidy Litch, Charles Lovering, and Cuong Nguyen.

Notifications You must be signed in to change notification settings

mjgiancola/wpi-ma-463x-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

71 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

wpi-ma-463x-project

Final Project for MA463X: Data Analytics & Statistical Learning.

Description

We analyzed the breast cancer data set available here. Our goal was to classify the given tumor cells as malignant or benign. We experimented with the following methods:

  • K-Nearest Neighbors
  • Linear Discriminant Analysis & Quadratic Discriminant Analysis (LDA & QDA)
  • Logistic Regression
  • Random Forest
  • Bagging
  • Boosting

Our final choice based on training/validation results was bagging with logistic regression. This model's accuracy was 96.70% on test (unseen) data.

Our report, which goes into greater deal on our experiments, results, and analysis can be found here.

Collaborators

Completed by Mike Giancola, Ranier Gran, Cassidy Litch, Charles Lovering, and Cuong Nguyen.

Professor

Randy Paffenroth

About

Final Project for MA463X: Data Analytics & Statistical Learning. Completed by Mike Giancola, Ranier Gran, Cassidy Litch, Charles Lovering, and Cuong Nguyen.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •