• Stars
    star
    301
  • Rank 137,646 (Top 3 %)
  • Language
  • Created over 4 years ago
  • Updated over 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Resources I used for ML Engineer, Applied Scientist and Quant Researcher interviews.

ML-Interview

This is a list of resources I found useful during my preparation for interviews. Broadly speaking, I interviewd for three different profiles: Machine Learning Engineer, Applied Scientist and Quantitative Researcher.

NOTE: All these profiles usually include multiple "traditional" programming/algorithm rounds, and for that, I relied upon mild leetcoding spree, spread over a period of 3 months.

Classical Machine Learning

  • A very good (slighly advanced) course on Machine Learning by Alex Smola. Link
  • Perhaps everything that you'll ever need to know for the interview sake. Link
  • Generative vs Discriminative Classifiers (you should know the difference, and tradeoffs when choosing one over the other) Link
  • Gradient Boosted Trees Link
  • Gentle Introduction to Gradient Boosting Link
  • ROC and AUC (I like this video) Link
  • Clustering (from Ryan Tibshirani's Data Mining course, other slides are really good as well) Link 1 Link 2 Link 3
  • Good old Linear Regression. Link
  • L0, L1 and L2 regularization (Subset Selection, Lasso and Ridge Regression), a comparison. The Elements of Statistical Learning, Trevor Hastie, Robert Tibshirani, Jerome Friedman, 2nd Edition, Section 3.4.3 Link

Deep Learning

  • Why tanh for Recurrent Networks Link
  • Receptive Fields in CNNs Link
  • For everything Convolution Link
  • For eveything Gradient Descent Link
  • Adaptive Learning rates in SGD Link
  • Backpropagation in Python, Andrej Karpathy Link

Probability and Statistics

  • As the title would say, "Generalized Linear Models, abridged".Link
  • A good course to cover Statistics Link
  • Basic Statistics: Introduction to Mathematical Statistics, Hogg, McKean and Craig, Chapters 1-4. Link
  • Introduction to Hypothesis Testing: Introduction to Mathematical Statistics, Hogg, McKean and Craig, Section 4.5-4.6 Link
  • Examples of Uncorrelated vs Independent Random Variable Link
  • Discrete time Markov Chains,Poisson Processes, Renewal Theory Adventures in Stochastic Processes, 2nd Edition, Sidney Resnick Link TODO: Add a link to more succint notes.
  • Q-Q Plots Link

Large Scale Machine Learning

Assorted Mathematics

  • Some facts about Symmetric Matrices. Link
  • Bare minimum SVD by Gilbert Strang. Link
  • SVD and PCA in real-life. Link
  • If you are not sure how SVD and PCA are related. Link
  • If you want to brush up on Chain Rule (or if you are like me and get confused between gradient and derivative notation) Link. Wikipedia has some useful information as well.
  • Collection of Quantitative Interview problems by Pete Benson, University of Michigan. Link
  • Cholesky Factorization Link
  • QR Factorization Link

System Design (for ML)

  • Structure for Google Index Link
  • Recommender Systems, Xavier Amatriain Link 1 Link 2
  • News Feed Ranking @ Facebook (Lars Backstrom) Link

Uncategorized

  • Sobel Operator Link
  • You have a fair Die, and you can choose to roll it up to 3 times. Whenever you decide to stop, the number thatโ€™s facing up is your score. What strategy would you choose to maximize your score? What is the expected score with your strategy? If you are given more than 3 chances, can you improve your score?