Discover Spence's insights, data modeling best practices, and his experiences in Paradime's 'NBA Data Modeling Challenge.'
Welcome to the "NBA Challenge Rewind" series 🙌
This blog series will showcase the “best of” submissions from Paradime’s NBA Data Modeling Challenges, highlighting the remarkable data professionals behind them.
If you’re unfamiliar with the NBA Data Modeling Challenge, enrich your series experience by exploring these essential resources: the challenge introduction video and the winner’s announcement blog. They offer valuable background information to help you fully appreciate the insights shared in this series.
In each "NBA Challenge Rewind" blog, you’ll discover:
Now, let's dive into our next installment, featuring our first-place winner, Spence Perry, and his standout submission!
Hey! My name is Spence Perry (he/him). I am a Senior Data Scientist at Classy/GoFundMe. I live in the Washington, DC metro area, and I have been working in data since 2020.
Recently, I had the opportunity to compete in Paradime’s most recent modeling challenge. The month-long competition tasked data professionals with transforming raw NBA data into insights for fans and general managers. After four tough weeks of pushing code and developing insights, I’m excited to share that I managed to win first place!
Discovering the NBA Data Modeling Challenge on LinkedIn, a part of me that loves diving into new domains and trying out cool tools was instantly energized, thinking, "Just do it!" Basketball isn't just a sport I love to watch; it was also the focus of my very first data analysis class, which used basketball analytics as an introduction to the vast world of econometrics.
This challenge was an unparalleled chance to immerse myself in data, offering a playful yet insightful exploration of basketball. Along the way, it also became a journey of discovery into new data tools and best practices, enriching my understanding and skills.
To tackle this challenge, I used a set of familiar and new data tools:
Here's how I structured my project:
Before diving into SQL queries or Snowflake data exploration, I aimed to deepen my understanding of the vast world of NBA data and analytics. I immersed myself in books like Oliver Dean’s Basketball on Paper and Seth Partnow’s The Mid Range Theory. The depth of basketball analytics was mind-blowing; it's so rich that someone could potentially write a dissertation on a single player’s performance during just one quarter of play.
My thorough exploration led me to draft various questions that would be most interesting to NBA fans and general managers and serve as the backbone of my analysis, such as:
With a solid foundation in basketball analytics and my questions outlined, I turned to Snowflake to evaluate data quality. Here, I faced several hurdles, like missing game records and inconsistent player information. Realizing the existing datasets fell short of answering my inquiries, I incorporated additional data:
Armed with the requisite data and a clear roadmap, I proceeded to the final stages of my project—data modeling and visualization. This segment of the project entailed crafting insightful visualizations and models concurrently, culminating in an extensive analysis of basketball.
Now, let’s take a look at some of the insights I uncovered!
For a comprehensive exploration of my insights, visit the README.md in my GitHub repository. Here’s a summary:
My exploration aimed to detail the NBA salary cap's evolution—a subject with significant consequences for team success, player distribution, and satisfaction. This comprehensive analysis examines key elements of the salary cap, such as:
The surge of three-point shooting has been reshaping the game for over a decade. This analysis taps into field goal attempt data to uncover:
While there are 48 minutes of regulation play, there is something different that happens for fans, coaches, and players alike in the final moments of a game. My investigation aimed to unravel the dynamics of what I've termed "super crunch time" – the intense last 5 seconds of the 4th quarter and overtime. My analysis from 2000 to 2020 dives into:
Overall, participating in Paradime’s NBA Data Modeling Challenge was an exhilarating and insightful experience that diverged significantly from my usual day-to-day work. This month-long journey was filled with valuable lessons, some tied to the realm of data and others to life itself. Personally, it was a battle against self-doubt, a packed schedule, and the pursuit of perfection. This competition became my opportunity to "shoot my shot." Win or lose, I learned to take pride in seizing the chance and giving it my all.
For those interested in my contributions to the fundraising data sphere, check out my posts on Classy’s blog, where I share insights alongside a talented team of GoFundMe data experts.
Looking ahead, Paradime is gearing up for an exciting new challenge focusing on Movie data this April. Transitioning from the basketball court to the cinema, I’m eager to delve into what movie datasets can reveal. If data analysis sparks your interest and you’re a movie enthusiast, this upcoming challenge offers a unique opportunity to dive deep, learn, and potentially win prizes ranging from $500 to $1,500!