Up to this point in the thesis series, everything has been about writing โ choosing a topic, drafting an outline, running the literature review. This post is a detour into tooling. I'd been using Claude through the web chat for a while, and I wanted to try Claude Code โ the terminal-based version โ alongside... Continue Reading →
The Literature Review Workflow (Part 2)
In the last post I talked about choosing a topic and drafting an outline. This one is about what I actually did once the outline existed: the literature review. Not the idea of a literature review. The tools I opened, the order I used them in, and how I went from a blank Introduction section... Continue Reading →
The First Steps of a Master’s Thesis (Part 1)
Iโm in the final months of my masterโs, and if I donโt finish the thesis, Iโm out. So this is me trying to get it done, and documenting the process as I go. My masterโs program is in Data Science and Artificial Intelligence, and the first question was the one that sounds easiest but isnโt:... Continue Reading →
How to Submit to a Kaggle Competition
Getting your predictions onto the leaderboard, step by step You have built a model, generated predictions, and saved them to a CSV file. Now you need to get that file onto Kaggle and see how it scores. The process is not complicated, but it has a few specific steps that are not immediately obvious the... Continue Reading →
Time Series with Machine Learning (Part 9)
Part 9: The Final Model and Submission Everything up to this point has been preparation. The feature engineering produced 142 inputs. The validation run found the optimal number of boosting iterations and confirmed a validation SMAPE around 13.6%. Feature importance analysis identified which of those 142 features actually contributed. Now all of that feeds into... Continue Reading →
Time Series with Machine Learningย (Part 8)
Part 8: Feature Importance In this section, we will explain the concept of decision tree feature importance and why it matters for understanding your model. The model has been trained. The next natural question is: out of the 142 features that went in, which ones actually drove the predictions? Tree-based models like LightGBM give a... Continue Reading →
Time Series with Machine Learning (Part 7)
Part 7: Training LightGBM; Gradient Boosting, Hyperparameters, and Early Stopping The feature set is ready. The validation split is in place. The next step is fitting the model, and understanding what is actually happening during that fit. LightGBM is a gradient boosted tree framework, which means the training process is iterative: it builds one tree... Continue Reading →
Part 4: What Happens After Someone Clicks (Google Analytics Explained)
In the previous part, I focused on how Google sees my blog. Now I want to answer a different question: What actually happens after someone clicks? Quick note: Google Analytics 4 (GA4) is the latest version of Google Analytics. It focuses on user behavior after someone lands on your site; what they do, how long... Continue Reading →
Roller Skating (Day 12)
Toe Stop (Part 2) I went back to toe stops today, but this time with a slightly different perspective. In the beginning, I treated them as a simple stopping tool. Just drag and slow down. But the more I watched and tried, the more I realized that toe stops are not really about โjust dragging.โ... Continue Reading →
Part 3: How Google Sees Your Blog (Search Console Explained)
After setting everything up, I started seeing metrics. But I didnโt really understand them. Search Console gives you a lot of data, but without context, itโs hard to interpret. So I tried to answer a simple question: What do these numbers actually mean? What Search Console really shows Search Console doesnโt tell you who visited... Continue Reading →
