Bragadeesh’s Substack

The VAR of Data Science: Ensuring Accuracy in Predictive Models

Bragadeesh
Jan 05, 2024

In the electrifying arena of football, where every goal can sway the fate of teams, the Video Assistant Referee (VAR) plays a pivotal role in ensuring that each goal scored is legitimate, and every offside is accurately called. Now, let’s pivot slightly and wander into the realm of data science, where models predict outcomes and, metaphorically speaking, “score goals” in decision-making processes. How do we ensure that these predictions, these digital goals, are legitimate and accurate? Enter the VAR of data science: validation mechanisms.

The Kickoff: Building a Predictive Model

Think of building a predictive model as assembling a football team. Each player (variable) is selected for their skills and potential contribution to scoring goals (accurate predictions). But having a team doesn’t guarantee victory; it’s the strategy (algorithm) and the teamwork (interactions between variables) that move the ball towards the goal (the desired outcome).

The Referee’s Dilemma: Ensuring Fair Play

In football, referees make real-time decisions, often under immense pressure. Similarly, predictive models make decisions, determining outcomes based on the data and algorithm at play. But how do we ensure that these decisions are accurate and fair? How do we validate that the model isn’t “cheating” by overfitting to the training data, or making biased predictions?
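To make the overfitting question concrete, here is a minimal sketch of one common check: hold some data back, then compare accuracy on the training rows with accuracy on the held-out rows. This is not necessarily the approach the rest of the post takes. The helper names (`holdout_split`, `fit_memorizer`, `accuracy`) and the synthetic data are assumptions made purely for illustration, and the "model" is a deliberately bad one that simply memorizes its training rows.

```python
import random

def accuracy(predict, rows):
    """Fraction of (feature, label) rows that the predict function gets right."""
    return sum(predict(x) == y for x, y in rows) / len(rows)

def holdout_split(rows, test_fraction=0.3, seed=0):
    """Shuffle a copy of the rows and split it into (train, test)."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    cut = len(shuffled) - n_test
    return shuffled[:cut], shuffled[cut:]

def fit_memorizer(train):
    """A deliberately overfit 'model': a lookup table of the training rows."""
    table = {x: y for x, y in train}
    # Inputs never seen in training fall back to class 0.
    return lambda x: table.get(x, 0)

# Synthetic data (an assumption for illustration): the labels are pure noise,
# so no model can genuinely do better than roughly 50% on unseen rows.
rng = random.Random(42)
rows = [(i, rng.randint(0, 1)) for i in range(1000)]

train, test = holdout_split(rows)
model = fit_memorizer(train)

train_acc = accuracy(model, train)
test_acc = accuracy(model, test)
print(f"train accuracy: {train_acc:.2f}, held-out accuracy: {test_acc:.2f}")
```

The memorizer scores 1.00 on the rows it was trained on, yet lands near chance on the held-out rows. A large gap like that between the two numbers is the replay footage that tells the referee the goal should not stand.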

© 2025 Bragadeesh