Automate Smarter. Scale Faster.

April 22, 2022

How to Setup PySpark in a Jupyter Notebook in 2 Steps on a Mac

Read More
March 8, 2022

How to Dynamically Convert Pandas Object Columns to a String Data Type

Since Pandas allows you to have mixed data type columns, converting them to a string data type can be essential when exporting the data.
Read More
March 8, 2022

How to Return Pandas DataFrame dtypes as a Dictionary

Using a dictionary to set the data types for a Pandas DataFrame gives you greater control over the schema.
Read More
February 28, 2022

How to Compare Two Pandas DataFrame Columns

Comparing columns in a DataFrame is essential when trying to concatenate two Pandas DataFrames with a lot of columns.
Read More
February 28, 2022

How to Pull Jira using JQL with Python and Pandas

In this post, I’ll walk you through pulling Jira issues from the API using JQL with Python Pandas.
Read More
January 7, 2022

How to Install Spark on Google Colab

Installing Spark on your local machine can be a pain. In this post, I’ll show you how to install Spark on Google Colab so that you can easily get going with PySpark.
Read More
December 30, 2021

How to Easily Scrape data with Python from a Website that Requires a Login

Read More
December 17, 2021

How to Plot Geo-Spatial Data in Google Data Studio using Latitude and Longitude

Google Data Studio makes it so easy to take address data and convert it into an interactive map.
Read More
November 23, 2021

How to Return the Complete Address Information from a Place Name or Partial Address in Python

Leverage Google Maps API or Nominatim in Python to return complete address information that you can use for geo charts.
Read More