3 Free Courses that Helped Me Land My First Data Scientist Job in Amazon

1. Andrew Ng’s Machine Learning Course

I know what you are thinking. Everyone knows this one! With around 2.4 million views as of October 2022, I am sure most of us know this one but it is still one of the best courses for anyone to start learning about Machine Learning and includes practice quizzes to test one’s learning.

1.1. Syllabus

Andrew Ng, who really does not need an introduction, focuses on what matters most for those getting started with Machine Learning and breaks down the course into the following:

  1. Regression with Multiple Input Variables
  2. Classification

1.2. Link to the Course

Free enrollment is available on Coursera.

2. John Paisley’s Machine Learning Course

John Paisley is an Assistant Professor in the Department of Electrical Engineering at Columbia University. He is also an affiliated member of the Data Science Institute at Columbia.

2.1. Syllabus

  • Maximum likelihood estimation, linear regression, least squares
  • Ridge regression, bias-variance, Bayes rule, maximum a posteriori inference
  • Bayesian linear regression, sparsity, subset selection for linear regression
  • Nearest neighbor classification, Bayes classifiers, linear classifiers, perceptron
  • Logistic regression, Laplace approximation, kernel methods, Gaussian processes
  • Maximum margin, support vector machines, trees, random forests, boosting
  • Clustering, k-means, Expectation Maximization (EM) algorithm, missing data
  • Mixtures of Gaussians, matrix factorization
  • Non-negative matrix factorization, latent factor models, Principal Component Analysis (PCA) and variations
  • Markov models, hidden Markov models
  • Continuous state-space models, association analysis
  • Model selection

2.2. Link to the Course

Course materials are available for free on edX.

3. Andreas Muller’s Applied Machine Learning

Andreas Muller is an Associate Research Scientist in Data Science Institute at Columbia.

3.1. Syllabus

  • Matplotlib and Visualization
  • Supervised learning
  • Preprocessing
  • Linear models for regression
  • Linear models for classification
  • Trees, forests and ensembles
  • Gradient descent and gradient boosting
  • Model evaluation
  • Calibration and imbalanced data
  • Parameter tuning and automatic machine learning
  • Dimensionality reduction
  • Clustering and mixture models
  • Working with text data
  • Topic models for text data
  • Word and document embeddings
  • Neural networks
  • Keras and convolutional neural networks
  • Time series

3.2. Link to the Course

Pro Tip: Make sure to check out his YouTube videos, as well as the course materials, which are both available on the course website.

Leave a Reply