Data Analytics Simplified
Welcome to Data Analytics Simplified, a blog dedicated to helping you streamline data workflows, automate processes, and scale your infrastructure—without the headaches. Whether you’re battling messy spreadsheets, inefficient pipelines, or trying to get the most out of your data analytics investments, you’re in the right place.
I’ll share proven strategies, tips, and frameworks from my experience in data engineering and analytics, focusing on:
Data doesn’t have to be overwhelming. With the right approach, you can declutter, optimize, and build a solid foundation for data science and analytics.
Let’s get to work.
Plotting data that is organized into pivot table has a slightly different syntax than plotting a columns in a dataframe.
The Federal Bank of St. Louis (FRED) has one of the largest free databases of economic data. It’s an excellent and trusted data source to use for financial analysis. The Pandas Datareader package makes it so easy to start analyzing the data with a few lines of code in Python.
A weighted average calculation takes into account some numbers should influence the average more than others. I typically find myself needing this calculation when working with data that is already summarized in a table.
Save yourself time by using this script to automatically pull data from a CSV file you receive as a Gmail attachment into a Google Sheet. I find this script especially useful when working with CSV files I receive on a recurring basis.
Easily combine multiple CSV files with the Terminal on Mac.
Make the Google Sheets QUERY function even more flexible when you add variable parameters to your query.
By default, the FILTER function in Google Sheets is set up to take multiple criteria using AND logic. With a slight modification, you can make it use OR logic.
Let’s say you have a column with someone’s full name stored in the format of “LastName, FirstName” and you would rather have two columns – one with the first name and one with the last name. The below code to split out the names in your results window.
SQLite doesn’t have a date storage class which makes working with dates a bit tricky. You’ll have to store the data as ‘Text’. You can read more about that here.