Guide to Fuzzy Matching with Python

5 years ago

This post explores the textdistance package in Python, which provides a large collection of algorithms to…

How to read Word documents with Python

5 years ago

This post explains how to read Word documents with Python. We're going to cover three different packages -…

Python, Basket Analysis, and Pymining

5 years ago

Background: Python's pymining package provides a collection of useful algorithms for item set mining, association mining, and more. We'll explore…

How to get an AUC confidence interval

5 years ago

Background: AUC is an important metric in machine learning for classification. It is often used as a measure of a…

Really large numbers in R

5 years ago

This post will discuss ways of handling huge numbers in R using the gmp package. The gmp package: The gmp…

BeautifulSoup vs. Rvest

5 years ago

This post will compare Python's BeautifulSoup package to R's rvest package for web scraping. We'll also talk about additional functionality…

Testing the Collatz Conjecture with R

5 years ago

Background: The Collatz Conjecture is a famous unsolved problem in number theory. If you're not familiar with it - the…

How to hide a password in R with the keyring package

5 years ago

This post introduces the keyring package as a way to hide a password. Short background: The keyring package is a library…

Web Browsing and Parsing with RoboBrowser and requests_html

5 years ago

Background: So you've learned all about BeautifulSoup. What's next? Python is a great language for automating web operations. In a…

Does “Sell in May, Go Away” really work?

5 years ago

If you follow the stock market, you've probably heard the expression "Sell in May, Go Away." This expression generally refers…