data science

Software Engineering for Data Scientists (New book!)

Very excited to announce the early-access preview (MEAP) of my upcoming book, Software Engineering for Data Scientists is available now!…

1 year ago

What to study if you’re under quarantine

If you're staying indoors more often recently because of the current COVID-19 outbreak and looking for new things to study,…

4 years ago

How to get an AUC confidence interval

Background AUC is an important metric in machine learning for classification. It is often used as a measure of a…

5 years ago

How to build a logistic regression model from scratch in R

Background In a previous post, we showed how using vectorization in R can vastly speed up fuzzy matching. Here, we…

6 years ago

ICA on Images with Python

Click here to see my recommended reading list. What is Independent Component Analysis (ICA)? If you're already familiar with ICA,…

6 years ago