Participants must be current or former data professionals (Data Analysts, Analytics Engineers, Data Engineers, Data Scientists, etc.). Therefore, at this time, we can't accept any students.
Solo participation only.
Must have hands-on experience with SQL, dbt™, and Git.
Participants must use, but are not limited to, the following tools:
Paradime for SQL & dbt™ development.
Snowflake Data Cloud for computing and storage.
Lightdash for data visualizations and insights.
GitHub repository with pre-configured models to get you started!
Must be able to explain their code and insights comprehensively. You can use ChatGPT, but you better understand it! 🤣
Participants are expected to submit:
Judges will score each submission based on:
Value of findings (1-10):
Are the results relevant to movie fanatics?
Get creative! Uncover something fun and accurate that you'd find interesting if you saw it on social media, for example.
Complexity of findings (1-10):
Are you creating relationships between datasets and providing in-depth analytical conclusions?
Complexity ≠ value, but you should use multiple datasets to generate valuable conclusions.
Quality of materials (1-10):
Is your code of professional quality? Are your data visualizations well-designed? Are your insights' conclusions clear to the reader?
Your SQL, dbt™, visualizations, and conclusions should be high quality. If it’s not something that you’d be comfortable sharing with your peers, it won’t be good enough for the judges.
Integration of new data (1-10):
How effectively have you integrated new, relevant data to enhance your project?
Incorporating additional datasets has the potential to score you higher in other categories: value of findings, complexity of findings, and quality of materials.
Access our Slack community for 24/7 assistance. For specific tool setup issues (connecting to Paradime, Snowflake, or GitHub), Parker Rogers from Paradime is available for direct support via Zoom.
Definitely. While self-reliance is crucial, utilizing your network, online resources, and even ChatGPT to enhance your project is recommended, as long as you fully understand the concepts and their implementation.
Yes, you are free to use any supplementary technologies or methods that enhance your workflow, in addition to the mandatory Paradime, Snowflake, Lightdash, and GitHub tools.
Not at all! Paradime provides the following resources:
The Paradime platform for SQL & dbt™ development.
The Snowflake Data Cloud for compute and storage.
Lightdash for data visualizations and insights.
A pre-configured GitHub repository to jumpstart your project (example data models included!)
Three comprehensive Movie and TV Series datasets to get you started:
tmdb_movies - data on 900k movies, including comprehensive statistics and related information.
omdb_movies - data on 500k movies, including comprehensive statistics and related information.
tmdb_tv_series - data on 150k TV series, including comprehensive statistics and related information.
Incorporating extra data sources is required. You're also welcome to modify the provided data sources, as they are not 100% accurate.
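As a rough illustration of combining a provided dataset with an extra source, here is a minimal sketch. It uses an in-memory SQLite database in place of Snowflake so it runs anywhere. The extra_awards table and the columns imdb_id, title, revenue, and awards_won are hypothetical and are not taken from the actual datasets.

```python
import sqlite3

# Hypothetical, tiny stand-ins for a provided table and an extra source.
# The real tmdb_movies table has different (and many more) columns.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE tmdb_movies (imdb_id TEXT, title TEXT, revenue INTEGER);
    CREATE TABLE extra_awards (imdb_id TEXT, awards_won INTEGER);
    INSERT INTO tmdb_movies VALUES
        ('tt001', 'Movie A', 100),
        ('tt002', 'Movie B', 250);
    INSERT INTO extra_awards VALUES ('tt001', 3);
""")


def movies_with_awards(connection):
    """Left-join the provided table to the extra source.

    A LEFT JOIN keeps movies that have no match in the extra source;
    COALESCE turns their missing award count into 0.
    """
    query = """
        SELECT m.title, m.revenue, COALESCE(a.awards_won, 0) AS awards_won
        FROM tmdb_movies AS m
        LEFT JOIN extra_awards AS a ON a.imdb_id = m.imdb_id
        ORDER BY m.title
    """
    return connection.execute(query).fetchall()


rows = movies_with_awards(conn)
for title, revenue, awards_won in rows:
    print(title, revenue, awards_won)
```

In a dbt™ project the same join would normally live in a SQL model. The idea is the same: pick a shared key, and decide up front how unmatched rows should be handled.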
No, this is an asynchronous competition without scheduled kick-off meetings, working sessions, or prize ceremonies. Participants work at their own pace.
Aim to generate insights appealing to movie and TV series fans using the three datasets provided and/or third-party data.
See the GitHub repo for additional details: