Data Analytics Simplified
Welcome to Data Analytics Simplified, a blog dedicated to helping you streamline data workflows, automate processes, and scale your infrastructure—without the headaches. Whether you’re battling messy spreadsheets, inefficient pipelines, or trying to get the most out of your data analytics investments, you’re in the right place.
I’ll share proven strategies, tips, and frameworks from my experience in data engineering and analytics, focusing on:
Data doesn’t have to be overwhelming. With the right approach, you can declutter, optimize, and build a solid foundation for data science and analytics.
Let’s get to work.
The matplotlib.ticker module provides an easy way to change the y-axis number formatting in Matplotlib. Here’s how to take decimal y-axis values and display them as percentages.
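As a quick taste, here is a minimal sketch of one way to do it with PercentFormatter from matplotlib.ticker. The data values are made up for illustration, and I'm assuming the decimals are on a 0 to 1 scale (so 0.25 should read as 25%).

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so this runs without a display
import matplotlib.pyplot as plt
from matplotlib.ticker import PercentFormatter

# Hypothetical conversion rates stored as decimals (0.25 means 25%).
rates = [0.05, 0.12, 0.27, 0.40]

fig, ax = plt.subplots()
ax.plot(range(len(rates)), rates)

# xmax=1.0 tells the formatter that 1.0 corresponds to 100%.
# decimals=0 drops the fractional part of each tick label.
ax.yaxis.set_major_formatter(PercentFormatter(xmax=1.0, decimals=0))

# Check how a single value would be labelled on the axis.
sample_label = ax.yaxis.get_major_formatter().format_pct(0.25, 1.0)
print(sample_label)
```

If your values are already on a 0 to 100 scale, leaving xmax at its default of 100 should give the same labels without any rescaling.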
The SQLite VACUUM command is worth running after you modify a SQLite database, especially after deleting a lot of data or dropping tables, because it rebuilds the database file and reclaims the unused space.
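To see the effect for yourself, here is a small sketch using Python's built-in sqlite3 module. The table name, row count, and file name are all hypothetical, and it assumes the default SQLite setting where auto-vacuum is off.

```python
import os
import sqlite3
import tempfile

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "demo.db")  # throwaway database file
    conn = sqlite3.connect(path)

    # Fill a hypothetical table with a few megabytes of filler text.
    conn.execute("CREATE TABLE events (payload TEXT)")
    conn.executemany(
        "INSERT INTO events VALUES (?)",
        [("x" * 1000,) for _ in range(5000)],
    )
    conn.commit()

    # Dropping the table frees the pages inside the file,
    # but the file on disk keeps its size.
    conn.execute("DROP TABLE events")
    conn.commit()
    size_before = os.path.getsize(path)

    # VACUUM rebuilds the file and hands the free pages back to the OS.
    # It has to run outside an open transaction, hence the commit above.
    conn.execute("VACUUM")
    size_after = os.path.getsize(path)
    conn.close()

print(f"before VACUUM: {size_before} bytes, after: {size_after} bytes")
```

The exact sizes depend on your SQLite build and page size, so treat the numbers as illustrative rather than a benchmark.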
The GROUP_CONCAT function in MySQL concatenates query results into a single row, which you can then store in a user-defined variable.
If a cell has leading zeros, Excel and Google Sheets will strip them. Those zeros may be important, though, so here is how to keep them.
The INDEX and MATCH functions are close cousins of the VLOOKUP function, and combining them creates a more flexible lookup.
Cells showing up as a row of # symbols is typically the result of cell content that is too wide for the column to display. Let’s check out a few solutions to fix this.
Using the SUBSTRING_INDEX function in MySQL makes it really easy to split your string data based on a specified delimiter. This comes in handy when your string data is not uniform.
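If you have not used SUBSTRING_INDEX before, the rough Python sketch below mimics how I understand it to behave: a positive count keeps everything before that many delimiters counting from the left, and a negative count keeps everything after that many delimiters counting from the right. This is an emulation for illustration only, not MySQL code, and the helper name is my own.

```python
def substring_index(text: str, delim: str, count: int) -> str:
    """Rough Python stand-in for MySQL's SUBSTRING_INDEX(text, delim, count)."""
    if count == 0 or not delim:
        return ""  # treated here as "nothing to return"
    parts = text.split(delim)
    if count > 0:
        # Keep the pieces to the left of the count-th delimiter.
        return delim.join(parts[:count])
    # Negative count: keep the pieces to the right, counting from the end.
    return delim.join(parts[count:])


left_part = substring_index("www.mysql.com", ".", 2)
right_part = substring_index("www.mysql.com", ".", -2)
print(left_part, right_part)
```

When the string contains fewer delimiters than the count asks for, the sketch simply returns the whole string, which is what makes this approach forgiving with non-uniform data.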
Data stored as strings can be problematic if you are exporting raw data from MySQL and using Excel or another application to analyze the data.
Here’s a query you can run to return all the columns of a specified table in SQLite.
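One common way to do this is PRAGMA table_info, shown here through Python's built-in sqlite3 module. The customers table and its columns are hypothetical, and this is a sketch of one option rather than the only query that works.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # in-memory database, nothing touches disk

# Hypothetical table used only for this example.
conn.execute(
    "CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, signup_date TEXT)"
)

# PRAGMA table_info returns one row per column:
# (cid, name, type, notnull, dflt_value, pk). Index 1 is the column name.
columns = [row[1] for row in conn.execute("PRAGMA table_info(customers)")]
conn.close()

print(columns)
```

The same rows also carry each column's declared type and whether it is part of the primary key, which is handy when you are documenting a schema you did not design.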