MOVIE EDITION

dbt™ Data Modeling Challenge

Attention, analytics and data folks—it's your time to shine! Join Paradime and Lightdash in our hands-on data modeling challenge. Utilize real Movie and TV series data to craft SQL queries, develop dbt™ models, and derive insights, all for a chance to win a $1,500 Amazon gift card!

Introduction

The dbt™ Data Modeling Challenge - Movie Edition, brought to you by Lightdash and Paradime, is a hackathon-style competition designed for data practitioners worldwide. This exciting event provides a platform for analytics enthusiasts to demonstrate their skills in SQL, dbt™, and analytics. Participants will dive into historical movie and TV series data, extracting valuable insights, while competing for Amazon gift cards worth $500 to $1,500.

Join us from anywhere in the world and unleash your creativity! Submissions are open until May 26th. After that, a panel of judges will select the prize winners. Intrigued? Get the full scope below on how to enter, and get ready to showcase your talent.

May the data be ever in your favor.


What’s in it for you

Time to put your game face on and demonstrate your SQL, dbt™, and data analytics expertise for a chance to win up to a $1,500 Amazon gift card! Lightdash and Paradime will honor the top three contestants as follows:

$1,500 Amazon gift card
$1,000 Amazon gift card
$500 Amazon gift card



About the challenge

Ready, Set, Analyze: April 17, 2024
Submission deadline: May 26, 2024
Winners announced: May 29, 2024

Judges

Christopher Hughes, Data Analytics Consultant, Hughes Analytics
Oliver Laslett, Co-founder & CTO, Lightdash
Jessica Cherny, Senior Data Analyst, Fivetran
Julie Beynon, Head of Data, Census
Rob Tucker, Founder & CPO, Fabric®
Timo Dechau, Co-Founder, Deepskydata & NO SLIDES

How it works

When your registration is approved, you'll get access to:

Paradime for SQL & dbt™ development.
Snowflake Data Cloud for computing and storage, pre-loaded with seven historical Movie and TV Series datasets.
Lightdash for dashboards and visualization.
GitHub repository with pre-configured models to get you started.

Your mission:
Craft insightful analyses and visualizations using SQL and dbt™.

While the tools and datasets above are required, incorporating additional tools, techniques, and data to amplify your insights is encouraged. Go on, get creative!

To better understand how this competition works, check out the challenge's GitHub repository.

Challenge details

Entry requirements

Participants must be current or former data professionals (Data Analysts, Analytics Engineers, Data Engineers, Data Scientists, etc.). At this time, we can't accept students.
Solo participation only.
Must have hands-on experience with SQL, dbt™, and Git.
Participants must use, but are not limited to, the following tools:
- Paradime for SQL & dbt™ development.
- Snowflake Data Cloud for computing and storage.
- Lightdash for data visualizations and insights.
- GitHub repository with pre-configured models to get you started!
Must be able to explain their code and insights comprehensively. You can use ChatGPT, but you'd better understand it! 🤣

Challenge deliverables

Participants are expected to submit:

A GitHub repository containing your dbt™ models (Example)
A README.md that briefly narrates your project's story — what you've built and your methodology (Example)
Data visualizations and associated analyses, incorporated into your README.md or presented through alternative formats

Judging criteria

Judges will score each submission based on:

Value of findings (1-10):
Are the results relevant to Movie fanatics?
- Get creative! Uncover something fun and accurate that you'd find interesting if you saw it on social media, for example.
Complexity of findings (1-10):
Are you creating relationships between datasets and providing in-depth analytical conclusions?
- Complexity ≠ value, but you should use multiple datasets to generate valuable conclusions.
Quality of materials (1-10):
Is your code of professional quality? Are your data visualizations well-designed? Are your insights' conclusions clear to the reader?
- Your SQL, dbt™, visualizations, and conclusions should be high quality. If it’s not something that you’d be comfortable sharing with your peers, it won’t be good enough for the judges.
Integration of new data (1-10):
How effectively have you integrated new, relevant data to enhance your project?
- Incorporating additional datasets has the potential to score you higher in other categories: value of findings, complexity of findings, and quality of materials.
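
The "complexity of findings" criterion asks for relationships between datasets. As a rough illustration only, the sketch below joins two tiny in-memory tables that stand in for tmdb_movies and omdb_movies and compares their ratings. The column names (imdb_id, title, vote_average, imdb_rating), the join key, and the sample rows are all assumptions made for this example, not the challenge's actual schema or data; in the challenge itself the same idea would likely live in a dbt™ model on Snowflake rather than in Python.

```python
import sqlite3


def build_sample_db():
    """Create an in-memory database with hypothetical stand-in tables.

    Table contents and column names are illustrative assumptions,
    not the real challenge schema.
    """
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE tmdb_movies (
            imdb_id TEXT PRIMARY KEY,
            title TEXT,
            vote_average REAL
        );
        CREATE TABLE omdb_movies (
            imdb_id TEXT PRIMARY KEY,
            imdb_rating REAL
        );
        INSERT INTO tmdb_movies VALUES
            ('tt001', 'Example Movie A', 8.0),
            ('tt002', 'Example Movie B', 6.5),
            ('tt003', 'Example Movie C', 7.0);
        INSERT INTO omdb_movies VALUES
            ('tt001', 7.5),
            ('tt002', 8.0);
        """
    )
    return conn


def rating_gaps(conn):
    """Join both sources on a shared key and return (title, gap) rows.

    The gap is the first source's rating minus the second's; rows are
    sorted so the largest disagreements come first. Titles missing from
    either source are dropped by the inner join.
    """
    query = """
        SELECT t.title,
               ROUND(t.vote_average - o.imdb_rating, 1) AS rating_gap
        FROM tmdb_movies AS t
        JOIN omdb_movies AS o ON o.imdb_id = t.imdb_id
        ORDER BY ABS(t.vote_average - o.imdb_rating) DESC, t.title
    """
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    for title, gap in rating_gaps(build_sample_db()):
        print(f"{title}: {gap:+.1f}")
```

A finding built this way draws on more than one dataset, which is what the criterion describes; whether the result is interesting to movie fans is a separate question that the "value of findings" score covers.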

How can I obtain technical support from Paradime?

Access our Slack community for 24/7 assistance. For specific tool setup issues (connecting to Paradime, Snowflake, or GitHub), Parker Rogers from Paradime is available for direct support via Zoom.

Is seeking external assistance permitted?

Definitely. While self-reliance is crucial, utilizing your network, online resources, and even ChatGPT to enhance your project is recommended, as long as you fully understand the concepts and their implementation.

Can I utilize additional tools beyond the required ones?

Yes, you are free to use any supplementary technologies or methods that enhance your workflow, in addition to the mandatory Paradime, Snowflake, Lightdash, and GitHub tools.

Do participants begin from scratch?

Not at all! Paradime provides the following resources:

The Paradime platform for SQL & dbt™ development.
The Snowflake Data Cloud for compute and storage.
Lightdash for data visualizations and insights.
A pre-configured GitHub repository to jumpstart your project (example data models included!)
Three comprehensive Movie and TV Series datasets to get you started:
- tmdb_movies - data on 900k movies, including comprehensive statistics and related information.
- omdb_movies - data on 500k movies, including comprehensive statistics and related information.
- tmdb_tv_series - data on 150k TV series, including comprehensive statistics and related information.

Am I allowed to incorporate additional data sources?

Incorporating extra data sources is required. You're also welcome to modify the provided data sources, as they are not 100% accurate.

Is this event a hackathon?

No, this is an asynchronous competition without scheduled kick-off meetings, working sessions, or prize ceremonies. Participants work at their own pace.

If I win, how will I receive a gift card?

Winners will receive an Amazon e-gift card via email.

What are some key strategies for developing insights from movie and TV data sets?

Aim to generate insights that appeal to movie and TV series fans, using the three datasets provided and/or third-party data.

See the GitHub repo for additional details:
- Generating insights
- Example submissions

