Skip to main content

Python Machine Learning Notebooks (Tutorial style)

Python Machine Learning Notebooks (Tutorial style)

Dr. Tirthajyoti Sarkar, Sunnyvale, CA (You can connect with me on LinkedIn here)

Essential codes/demo IPython notebooks for jump-starting machine learning/data science.
You can start with this article that I wrote in Heartbeat magazine (on Medium platform):

"Some Essential Hacks and Tricks for Machine Learning with Python"

Essential tutorial-type notebooks on Pandas and Numpy

Jupyter notebooks covering a wide range of functions and operations on the topics of NumPy, Pandans, Seaborn, matplotlib etc.

Tutorial-type notebooks covering regression, classification, clustering, dimensionality reduction, and some basic neural network algorithms

Regression

  • Simple linear regression with t-statistic generation
  • Polynomial regression with how to use scikit-learn pipeline feature (check the article I wrote on Towards Data Science)
  • Decision trees and Random Forest regression (showing how the Random Forest works as a robust/regularized meta-estimator rejecting overfitting)

Classification

  • Logistic regression/classification
  • Naive Bayes classification

Clustering

  • K-means clustering
  • Affinity propagation (showing its time complexity and the effect of damping factor)
  • Mean-shift technique (showing its time complexity and the effect of noise on cluster discovery)
  • DBSCAN (showing how it can generically detect areas of high density irrespective of cluster shapes, which the k-means fails to do)
  • Hierarchical clustering with Dendograms showing how to choose optimal number of clusters

Dimensionality reduction

  • Principal component analysis

Deep Learning/Neural Network


Random data generation using symbolic expressions

  • How to use Sympy package to generate random datasets using symbolic mathematical expressions.

Comments

Popular posts from this blog

R tutorials for Data Science, NLP and Machine Learning

R Data Science Tutorials This repo contains a curated list of R tutorials and packages for Data Science, NLP and Machine Learning. This also serves as a reference guide for several common data analysis tasks. Curated list of Python tutorials for Data Science, NLP and Machine Learning . Comprehensive topic-wise list of Machine Learning and Deep Learning tutorials, codes, articles and other resources . Learning R Online Courses tryR on Codeschool Introduction to R for Data Science - Microsoft | edX Introduction to R on DataCamp Data Analysis with R Free resources for learning R R for Data Science - Hadley Wickham Advanced R - Hadley Wickham swirl: Learn R, in R Data Analysis and Visualization Using R MANY R PROGRAMMING TUTORIALS A Handbook of Statistical Analyses Using R , Find Other Chapters Cookbook for R Learning R in 7 simple steps More Resources Awesome-R Repository on GitHub R Reference Card: Cheatsheet R bloggers: blog aggregator R Resources...

AutoML-Papers

Awesome-AutoML-Papers A curated list of automated machine learning papers, articles, tutorials, slides and projects. Introduction to AutoML Machine learning (ML) has achieved considerable successes in recent years and an ever-growing number of disciplines rely on it. However, this success crucially relies on human machine learning experts to perform the following tasks: Preprocess the data Select appropriate features Select an appropriate model family Optimize model hyperparameters Postprocess machine learning models Critically analyze the results obtained. As the complexity of these tasks is often beyond non-ML-experts, the rapid growth of machine learning applications has created a demand for off-the-shelf machine learning methods that can be used easily and without expert knowledge. We call the resulting research area that targets progressive automation of machine learning  AutoML . AutoML draws on many disciplines of machine learning, prominently including ...