Category: Python
-
Why Exploratory Data Analysis (EDA) is So Hard and So Manual
Exploratory Data Analysis (EDA) is crucial for gaining a solid understanding of your data and uncovering potential insights. However, this process is typically manual and involves a number of routine functions. Despite numerous technological advancements, EDA still requires significant manual effort, technical skills, and substantial computational power. In this post, we will explore why EDA…
-
Streamlining Data Analysis with Dynamic Date Ranges in BigQuery
Effective data analysis hinges on having complete data sets. Commonly, grouping data by days or months can result in significant gaps due to missing data points. In this post, I’ll guide you through a more efficient strategy: dynamically creating date ranges in BigQuery. This approach allows for on-the-fly date range generation without the overhead of…
-
Enhancing Data Accuracy: How to Fill Missing Date Gaps in Analysis with Python
Data gaps can occur when data is organized into time intervals but observations are missing for certain intervals. For example, let’s say you are tracking sales of snow shovels by month. Snow shovels are typically only in demand during winter months, so it is likely that there will be months with no sales at all.…
-
Exporting Database Tables to Parquet Files Using Python and Pandas
Managing MySQL databases can often be costly and time-consuming. If you’re working with databases containing static data, an effective alternative is to convert your database tables into individual Parquet files. By storing these files and leveraging Python for direct querying, you’ll maintain your existing querying capabilities and benefit from improved query performance, cost reduction, and…
-
How to Quickly and Easily Translate Code to Different Languages with ChatGPT
A Data Engineer is commonly working across multiple data applications that require knowledge of SQL, Python, and Excel, to name a few languages. However, switching between these languages can be time-consuming, especially when it comes to translating complex Excel formulas to SQL statements, for example. I have been really impressed with how well ChatGPT can…
-
Deploy Your Next Flask App Instantly and for Free Using Replit
Replit is a free tool that makes it easy to write Flask code and deploy it instantly. They handle of all the underlying infrastructure, allowing you to focus on building and refining your app without worrying about setup and maintenance.
-
A Short Practical Guide to Window Functions in SQL
A window function allows you to concisely compare rows in a single table.
-
How to Convert a Seconds Column to Minutes and Hours in Pandas
In this post, I’ll walk through how to convert a Pandas column that is in seconds and convert it to a datetime or a formatted string.
-
Things to do – Flask app
This is a little Flask web app I made to get recommendations for things to do when traveling.
-
How to Dynamically Generate a Date Dimension Table using Python Pandas
The Pandas package in Python allows you to generate a list of dates dynamically and then extract their attributes with various datetime functions.