How to Submit to a Kaggle Competition

Getting your predictions onto the leaderboard, step by step You have built a model, generated predictions, and saved them to a CSV file. Now you need to get that file onto Kaggle and see how it scores. The process is not complicated, but it has a few specific steps that are not immediately obvious the... Continue Reading →

Time Series with Machine Learning (Part 9)

Part 9: The Final Model and Submission Everything up to this point has been preparation. The feature engineering produced 142 inputs. The validation run found the optimal number of boosting iterations and confirmed a validation SMAPE around 13.6%. Feature importance analysis identified which of those 142 features actually contributed. Now all of that feeds into... Continue Reading →

Time Series with Machine Learningย (Part 8)

Part 8: Feature Importance In this section, we will explain the concept of decision tree feature importance and why it matters for understanding your model. The model has been trained. The next natural question is: out of the 142 features that went in, which ones actually drove the predictions? Tree-based models like LightGBM give a... Continue Reading →

Time Series with Machine Learning (Part 7)

Part 7: Training LightGBM; Gradient Boosting, Hyperparameters, and Early Stopping The feature set is ready. The validation split is in place. The next step is fitting the model, and understanding what is actually happening during that fit. LightGBM is a gradient boosted tree framework, which means the training process is iterative: it builds one tree... Continue Reading →

Roller Skating (Day 12)

Toe Stop (Part 2) I went back to toe stops today, but this time with a slightly different perspective. In the beginning, I treated them as a simple stopping tool. Just drag and slow down. But the more I watched and tried, the more I realized that toe stops are not really about โ€œjust dragging.โ€... Continue Reading →

Create a website or blog at WordPress.com

Up ↑