Andrew Treadway

Guide to Fuzzy Matching with Python

This post is going to delve into the textdistance package in Python, which provides a large collection of algorithms to…

5 years ago

How to read Word documents with Python

This post will talk about how to read Word Documents with Python. We're going to cover three different packages -…

5 years ago

Python, Basket Analysis, and Pymining

Background Python's pymining package provides a collection of useful algorithms for item set mining, association mining, and more. We'll explore…

5 years ago

How to get an AUC confidence interval

Background AUC is an important metric in machine learning for classification. It is often used as a measure of a…

5 years ago

Really large numbers in R

This post will discuss ways of handling huge numbers in R using the gmp package. The gmp package The gmp…

5 years ago

BeautifulSoup vs. Rvest

This post will compare Python's BeautifulSoup package to R's rvest package for web scraping. We'll also talk about additional functionality…

5 years ago

Testing the Collatz Conjecture with R

Background The Collatz Conjecture is a famous unsolved problem in number theory. If you're not familiar with it - the…

5 years ago

How to hide a password in R with the keyring package

This post will introduce using the keyring package to hide a password. Short background The keyring package is a library…

6 years ago

Web Browsing and Parsing with RoboBrowser and requests_html

Background So you've learned all about BeautifulSoup. What's next? Python is a great language for automating web operations. In a…

6 years ago

Does “Sell in May, Go Away” really work?

If you follow the stock market, you've probably heard the expression "Sell in May, Go Away." This expression generally refers…

6 years ago