Machine-Learning-Specification-University-of-Washington-

Coursera Assignment and Project of Machine learning specialization on coursera from University of washington.

Requirements:

Python 3
Graphlab
Ipython notebook

Machine learning: A case study approach:

Certificate: https://www.coursera.org/account/accomplishments/records/EKVNF5DESGQQ

Regression:

Case Study - Predicting Housing Prices

Data set: King County's house sales data-set

Certificate: https://www.coursera.org/account/accomplishments/records/XWHBSSUQQBS4

In our first case study, predicting house prices, I created models that predicts a continuous value (price) from input features (square footage, number of bedrooms and bathrooms,...). This is just one of the many places where regression can be applied. Other applications range from predicting health outcomes in medicine, stock prices in finance, and power usage in high-performance computing, to analyzing which regulators are important for gene expression.

In this course, we explored regularized linear regression models for the task of prediction and feature selection. We handled very large sets of features and select between models of various complexity. We also analyzed the impact of aspects of data -- such as outliers -- on your selected models and predictions. To fit these models, I implemented optimization algorithms that scale to large datasets.

Work done

Compare and contrast bias and variance when modeling data.
Estimate model parameters using optimization algorithms.
Tune parameters with cross validation.
Analyze the performance of the model.
Describe the notion of sparsity and how LASSO leads to sparse solutions.
Deploy methods to select between models.
Exploit the model to form predictions.
Build a regression model to predict prices using a housing dataset.
Implement these techniques in Python.

Classification:

Case Studies: Analyzing Sentiment & Loan Default Prediction

Data set: Amazon review database consisting 183531 entries Lending club data sub-set consisting of 122607 entries

Certificate: https://www.coursera.org/account/accomplishments/records/CYV6DNDRPZUG

In this case study on analyzing sentiment,I created models that predicted a class (positive/negative sentiment) from input features (text of the reviews, user profile information,...). In our second case study for this course, loan default prediction, we tackled financial data, and predicted when a loan is likely to be risky or safe for the bank. These tasks are an examples of classification, one of the most widely used areas of machine learning, with a broad array of applications, including ad targeting, spam detection, medical diagnosis and image classification.

Work done

Tackle both binary and multiclass classification problems.
Implement a logistic regression model for large-scale classification.
Create a non-linear model using decision trees.
Improve the performance of any model using boosting.
Scale your methods with stochastic gradient ascent.
Describe the underlying decision boundaries.
Build a classification model to predict sentiment in a product review dataset.
Analyze financial data to predict loan defaults.
Use techniques for handling missing data.
Evaluate your models using precision-recall metrics.
Implemented these techniques in Python

Clustering:

Case study: Finding Similar Documents

Data set: Wikipedia articles subset

Certificate: https://www.coursera.org/account/accomplishments/records/2TVNUDMQHEZT

In this third case study, finding similar documents, we examined similarity-based algorithms for retrieval. In this course, we also examined structured representations for describing the documents in the corpus, including clustering and mixed membership models, such as latent Dirichlet allocation (LDA). Implemented expectation maximization (EM) to learn the document clusterings, and see how to scale the methods using MapReduce.

Work done

Create a document retrieval system using k-nearest neighbors.
Identify various similarity metrics for text data.
Reduce computations in k-nearest neighbor search by using KD-trees.
Produce approximate nearest neighbors using locality sensitive hashing.
Cluster documents by topic using k-means.
Examine probabilistic clustering approaches using mixtures models.
Fit a mixture of Gaussian model using expectation maximization (EM).
Perform mixed membership modeling using latent Dirichlet allocation (LDA).
Describe the steps of a Gibbs sampler and how to use its output to draw inferences.
Compare and contrast initialization techniques for non-convex optimization

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
Classification		Classification
Clustering		Clustering
Machine Learning Foundations: A Case Study Approach		Machine Learning Foundations: A Case Study Approach
Regression		Regression
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Machine-Learning-Specification-University-of-Washington-

Requirements:

Machine learning: A case study approach:

Certificate: https://www.coursera.org/account/accomplishments/records/EKVNF5DESGQQ

Regression:

Certificate: https://www.coursera.org/account/accomplishments/records/XWHBSSUQQBS4

Work done

Classification:

Certificate: https://www.coursera.org/account/accomplishments/records/CYV6DNDRPZUG

Work done

Clustering:

Certificate: https://www.coursera.org/account/accomplishments/records/2TVNUDMQHEZT

Work done

About

Releases

Packages

Languages

aanandshekharroy/Machine-learning

Folders and files

Latest commit

History

Repository files navigation

Machine-Learning-Specification-University-of-Washington-

Requirements:

Machine learning: A case study approach:

Certificate: https://www.coursera.org/account/accomplishments/records/EKVNF5DESGQQ

Regression:

Certificate: https://www.coursera.org/account/accomplishments/records/XWHBSSUQQBS4

Work done

Classification:

Certificate: https://www.coursera.org/account/accomplishments/records/CYV6DNDRPZUG

Work done

Clustering:

Certificate: https://www.coursera.org/account/accomplishments/records/2TVNUDMQHEZT

Work done

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages