Python

Software Engineering for Data Scientists (New book!)

Very excited to announce the early-access preview (MEAP) of my upcoming book, Software Engineering for Data Scientists is available now! Check it out at this link. Use promo code au35tre to save 30% on this book and any products sold from Manning.

Why Software Engineering for Data Scientists?

Data science and software engineering have been merging more and more, especially over the last decade. Software Engineering for Data Scientists is my upcoming book that will help you learn more about software engineering and how it can make your life easier as a data scientist!

This book covers the following key topics:

  • Source control
  • How to implement exception handling and write robust code
  • Object-oriented programming for data scientists
  • How to monitor the progress of training machine learning models
  • Scaling your Python and data science code to handle millions (or billions) or rows. This includes making sure you are using standard tools like pandas to their max potential, as well as learning about Dask and PySpark
  • How to put Python into production, including deploying machine learning models and data pipelines
  • How to create Python packages
  • Creating automated reports and logging
  • How to create dashboards to monitor your machine learning model in production

…and much more! Additionally, almost every chapter in the book comes with a Practice on your own section that allows you to refine your skills on the topics taught in the book, and code samples that you can download from a Github repo.

Please check out the book, and make sure to share with your friends!

Andrew Treadway

Recent Posts

How to stop long-running code in Python

Ever had long-running code that you don't know when it's going to finish running? If…

2 years ago

Faster alternatives to pandas

Background If you've done any type of data analysis in Python, chances are you've probably…

3 years ago

Automated EDA with Python

In this post, we will investigate the pandas_profiling and sweetviz packages, which can be used…

3 years ago

How to plot XGBoost trees in R

In this post, we're going to cover how to plot XGBoost trees in R. XGBoost…

3 years ago

Python collections tutorial

In this post, we'll discuss the underrated Python collections package, which is part of the…

3 years ago

How to create PDF files with Python

In a previous article we talked about several ways to read PDF files with Python.…

3 years ago