Data Analytics Simplified
Welcome to Data Analytics Simplified, a blog dedicated to helping you streamline data workflows, automate processes, and scale your infrastructure—without the headaches. Whether you’re battling messy spreadsheets, inefficient pipelines, or trying to get the most out of your data analytics investments, you’re in the right place.
I’ll share proven strategies, tips, and frameworks from my experience in data engineering and analytics, focusing on:
Data doesn’t have to be overwhelming. With the right approach, you can declutter, optimize, and build a solid foundation for data science and analytics.
Let’s get to work.
In this post, I’ll show you how to return the common list items between two lists in Python.
Using the glob library, we can quickly and dynamically pull a list of files from any directory and loop through them.
You’ve just finished a stellar analysis in a Jupyter notebook and you are ready to share your work with colleagues. However, they’ve never heard of a Jupyter notebook. In this post, I’ll walk you through creating an HTML export that leverages Bootstrap to create a styled exportable report from a Jupyter notebook.
Most of the time, yes. In this post, I’ll explain why there is usually a better chart to choose from than a pie chart and visualization techniques for categorical data.
An analysis of the shows and movies in Netflix as of 2019 using a dataset from Kaggle.
Let’s say you have a column of data that is a string list separated by commas. In this post, I’ll walk you through two ways to break up the data and count the frequency of the list items.
Visualizing data that is highly skewed is tricky because most points get overshadowed. To fix that, try using a log scale.
If you have a list of string items and you want to perform a custom sort order (ie not alphabetical), you can use the key parameter in the list.sort() method to define the order.
An analysis of the 2020 presidential votes by county and voter turnout by state using datasets from Kaggle.