Prediction Analysis 2022

The Hottest 100 of 2022 was a very interesting one for 100 Warm Tunas. Once again, Triple J played the Hottest 200 the day before the Hottest 100, which let us eliminate from our Hottest 100 prediction any song that had already ranked in the Hottest 200.

In addition, this year we introduced two new features to 100 Warm Tunas. The first was website submission, which significantly grew our sample by allowing site visitors to upload their votes directly to 100warmtunas.com. The second was an ML model that tries to adjust the prediction to account for bias in the collected sample.

Summary

  • We collected 4,623 entries (62% increase since 2021 🔺)
  • We tallied 41,731 votes across these entries (61.3% increase since 2021 🔺)
  • Triple J counted 2,436,565 votes.
  • Therefore, we collected a sample of 1.71%.
  • We successfully predicted #1
  • We predicted 7 out of the top 10 songs.
  • We predicted 14 out of the top 20 songs.
  • We predicted 84 out of the top 100 songs played in the countdown.
  • Throughout December and January, 100warmtunas.com was loaded over 228,000 times by 65,000 users.

Technical Analysis

Overview

This year, we were again able to successfully predict #1, with the assistance of an ML model applied to the data we had collected. Let's take a look at the top 10 of the official countdown and match it up with the predicted places in 100 Warm Tunas:

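| Artist | Title | ABC Rank | Tunas Rank | Difference |
|---|---|---|---|---|
| Flume | Say Nothing [Ft. MAY-A] | 1 | 1 | 0 |
| Eliza Rose & Interplanetary Criminal | B.O.T.A. (Baddest Of Them All) | 2 | 7 | 5 |
| Spacey Jane | Hardlight | 3 | 4 | 1 |
| Steve Lacy | Bad Habit | 4 | 10 | 6 |
| Spacey Jane | It's Been A Long Day | 5 | 18 | 13 |
| Spacey Jane | Sitting Up | 6 | 8 | 2 |
| Lizzo | About Damn Time | 7 | 21 | 14 |
| Ball Park Music | Stars In My Eyes | 8 | 3 | 5 |
| Gang of Youths | in the wake of your leave | 9 | 2 | 7 |
| Joji | Glimpse of Us | 10 | 12 | 2 |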

Let's pull apart this table a bit more and grab some statistics about how we did with our prediction overall:

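| Predicted | Out Of Top N | Percentage |
|---|---|---|
| 7 | 10 | 70.0% |
| 14 | 20 | 70.0% |
| 22 | 30 | 73.3% |
| 29 (*27) | 40 | 72.5% |
| 39 (*35) | 50 | 78.0% |
| 47 (*42) | 60 | 78.3% |
| 56 (*49) | 70 | 80.0% |
| 64 (*56) | 80 | 80.0% |
| 76 (*65) | 90 | 84.4% |
| 84 (*70) | 100 | 84.0% |
| 156 | 200 | 78.0% |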

From the above data, we can see that:

  • The average error for the top ten ranks was about 5 positions (slightly higher than 2021 and 2020, which each had about 3 positions of error).
  • Warm Tunas predicted 7 out of the top 10 songs.
  • Warm Tunas predicted 14 out of the top 20 songs.
  • Warm Tunas predicted 84 out of the 100 songs played in the countdown.

(note, asterisked values represent prediction without top 200 elimination)
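As a rough illustration of how a "predicted N out of the top N" figure can be derived, the sketch below counts how many songs from the official top N also sit inside the predicted top N. The rank pairs are copied from the top 10 table above; the function name `hits_in_top_n` is made up for this example and is not necessarily how 100 Warm Tunas computes it.

```python
# (ABC rank, Tunas rank) pairs copied from the top 10 table above.
TOP_10 = [
    (1, 1), (2, 7), (3, 4), (4, 10), (5, 18),
    (6, 8), (7, 21), (8, 3), (9, 2), (10, 12),
]


def hits_in_top_n(pairs, n):
    """Count songs whose official and predicted ranks both fall within the top n."""
    return sum(1 for abc, tunas in pairs if abc <= n and tunas <= n)


hits = hits_in_top_n(TOP_10, 10)
print(f"{hits} of 10 ({hits / 10:.1%})")  # 7 of 10 (70.0%)
```

Because only membership in the top N matters here, a song predicted at #2 that actually placed #9 still counts as a hit.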

Making exact predictions is difficult, especially when you only have a small fraction of the data and that fraction carries selection bias. The data we collect will generally give a good indication, but it will never allow us to predict a particular outcome with certainty.

The following tables depict positional accuracy and error breakdown exhibited across different cross-sections of the countdown:

Positional Accuracy of the Top 10:

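| Positions Off | Num Occurrences | % | Positions Off (Cumulative) | Occurrences | Cumulative % |
|---|---|---|---|---|---|
| 0-4 | 4 | 40% | 0-4 | 4 | 40% |
| 5-9 | 4 | 40% | 0-9 | 8 | 80% |
| 10-14 | 2 | 20% | 0-14 | 10 | 100% |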
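The banding behind these accuracy tables can be sketched as follows. This is a minimal example under our own assumptions: the errors are the Difference values from the top 10 table, and `band_table` is a hypothetical helper name rather than code from the site.

```python
from collections import Counter

# Positions off for the top 10, copied from the Difference column of the top 10 table.
errors = [0, 5, 1, 6, 13, 2, 14, 5, 7, 2]
BAND_WIDTH = 5  # the top 10 table uses bands of 5; the larger tables use bands of 10


def band_table(errors, width):
    """Return rows of (band label, count, cumulative count, cumulative %)."""
    counts = Counter(e // width for e in errors)
    rows, running = [], 0
    for band in range(max(counts) + 1):
        running += counts[band]  # a Counter returns 0 for empty bands
        label = f"{band * width}-{band * width + width - 1}"
        rows.append((label, counts[band], running, 100 * running / len(errors)))
    return rows


for label, count, cumulative, pct in band_table(errors, BAND_WIDTH):
    print(f"{label:>5}  {count:>2}  {cumulative:>2}  {pct:.0f}%")
```

Songs that did not rank in the prediction at all would need a separate row, as in the top 50 and top 100 tables; this sketch does not handle that case.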

Positional Accuracy of the Top 20:

Across the top 20, 70% of our predictions were accurate within 9 places of their actual position in the countdown.

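| Positions Off | Num Occurrences | % | Positions Off (Cumulative) | Occurrences | Cumulative % |
|---|---|---|---|---|---|
| 0-9 | 14 | 70% | 0-9 | 14 | 70% |
| 10-19 | 4 | 20% | 0-19 | 18 | 90% |
| 40-49 | 1 | 5% | 0-49 | 19 | 95% |
| > 80 | 1 | 5% | > 0 | 20 | 100% |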

Positional Accuracy of the Top 50:

The trend continues across the top 50, with 50% of our predictions accurate within 9 places of their actual position in the countdown and 72% accurate within 19 places. This shows that 100 Warm Tunas is still reasonably good at predicting which songs will play during the countdown, but not necessarily their exact order.

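| Positions Off | Num Occurrences | % | Positions Off (Cumulative) | Occurrences | Cumulative % |
|---|---|---|---|---|---|
| 0-9 | 25 | 50% | 0-9 | 25 | 50% |
| 10-19 | 11 | 22% | 0-19 | 36 | 72% |
| 20-29 | 2 | 4% | 0-29 | 38 | 76% |
| 30-39 | 2 | 4% | 0-39 | 40 | 80% |
| 40-49 | 4 | 8% | 0-49 | 44 | 88% |
| 50-59 | 1 | 2% | 0-59 | 45 | 90% |
| 60-69 | 0 | 0% | 0-69 | 45 | 90% |
| 70-79 | 1 | 2% | 0-79 | 46 | 92% |
| > 80 | 2 | 4% | > 0 | 48 | 96% |
| Did not rank | 2 | 4% | - | - | - |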

Positional Accuracy of the Top 100:

For completeness, the accuracy bands of the top 100 have also been provided:

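| Positions Off | Num Occurrences | % | Positions Off (Cumulative) | Occurrences | Cumulative % |
|---|---|---|---|---|---|
| 0-9 | 32 | 32% | 0-9 | 32 | 32% |
| 10-19 | 18 | 18% | 0-19 | 50 | 50% |
| 20-29 | 10 | 10% | 0-29 | 60 | 60% |
| 30-39 | 7 | 7% | 0-39 | 67 | 67% |
| 40-49 | 11 | 11% | 0-49 | 78 | 78% |
| 50-59 | 3 | 3% | 0-59 | 81 | 81% |
| 60-69 | 2 | 2% | 0-69 | 83 | 83% |
| 70-79 | 2 | 2% | 0-79 | 85 | 85% |
| > 80 | 5 | 5% | > 0 | 90 | 90% |
| Did not rank | 10 | 10% | - | - | - |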

Accuracy Deep Dive: Is accuracy decreasing over time?

A comment often made about 100 Warm Tunas (especially on Hottest 100 Day) is that the predictions are getting worse and are "way off". This criticism isn't new to 2022's countdown predictions. To address this point up front: 100 Warm Tunas was not, and never will be, about providing predictions of exact positions. Whilst our website does display an ordering of songs and their predicted positions from 1-100 (and 101-200), this is provided as an indication only. From the tables in the previous section, and from all past analysis, we know there is inherent error in predicting exact positions.

That said, we believe it's important to provide an analysis of 100 Warm Tunas' accuracy over the last 6 years. In order to do this, we need to define a way to measure "accuracy". For the sake of this analysis, we want to compare both:

  • Positional Accuracy (that is, how closely 100 Warm Tunas can predict exact positions)
  • Bucketed Accuracy (that is, how many songs 100 Warm Tunas can predict in specific buckets, ignoring order, e.g. 7 of top 10, 82 of top 100)

Positional Accuracy

To measure the positional accuracy of 100 Warm Tunas' predictions, we will take the mean and median of the positional error for every year of data. For a more detailed view, we will make this calculation for the Top 10, Top 20, Top 50, and Top 100. For a fair comparison across all years, top 200 elimination is not used.
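To make the measure concrete, here is a small sketch of the calculation using Python's standard `statistics` module and the 2022 top 10 ranks from the Overview table. The helper name `positional_errors` is our own, and the sketch assumes every song has a predicted rank; the post does not say how songs that did not rank in the prediction are scored.

```python
from statistics import mean, median

# (ABC rank, Tunas rank) for the 2022 top 10, from the table in the Overview.
TOP_10 = [
    (1, 1), (2, 7), (3, 4), (4, 10), (5, 18),
    (6, 8), (7, 21), (8, 3), (9, 2), (10, 12),
]


def positional_errors(pairs, n):
    """Absolute rank difference for every song in the official top n."""
    return [abs(abc - tunas) for abc, tunas in pairs if abc <= n]


errs = positional_errors(TOP_10, 10)
print(mean(errs), median(errs))  # 5.5 5.0
```

The mean is pulled up by a few large misses (13 and 14 places here), while the median is less sensitive to them, which is why both are reported.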

Mean Error, Without Top 200 Elimination:

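| Year | ϵ Top 10 | ϵ Top 20 | ϵ Top 50 | ϵ Top 100 |
|---|---|---|---|---|
| 2022 | 5.5 | 15.3 | 40.2 | 49.9 |
| 2021 | 3.1 | 27 | 50.3 | 58.5 |
| 2020 | 3 | 11.2 | 22.1 | 33.3 |
| 2019 | 2.9 | 10.4 | 21.0 | 31.6 |
| 2018 | 7 | 12.5 | 15.5 | 27.4 |
| 2017 | 2.1 | 4.8 | 10.1 | 18.1 |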

Median Error, Without Top 200 Elimination:

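| Year | ϵ Top 10 | ϵ Top 20 | ϵ Top 50 | ϵ Top 100 |
|---|---|---|---|---|
| 2022 | 5.0 | 6.5 | 9.5 | 19.5 |
| 2021 | 3.5 | 4.0 | 11.5 | 22.0 |
| 2020 | 2.0 | 3.5 | 19.0 | 26.0 |
| 2019 | 2.5 | 5.0 | 11.5 | 17.0 |
| 2018 | 3.5 | 4.5 | 10.0 | 16.5 |
| 2017 | 2.0 | 3.0 | 6.0 | 11.0 |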

From the above data and visualisation, we can deduce that there is an upward trend in the error of the Top 10 and Top 20. However, compared to 2020 and 2021, the median error of 2022 in the Top 50 and Top 100 has actually reduced.

Bucketed Accuracy

To measure bucketed accuracy, we will count the number of songs predicted in the correct bucket. For the sake of simplicity, we will only use the following buckets: 1-10, 1-20, 1-50, 1-100.

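| Year | # 1-1 | # 1-10 | # 1-20 | # 1-50 | # 1-100 |
|---|---|---|---|---|---|
| 2022 | 1/1 | 7/10 | 14/20 | 39/50 (*35/50) | 84/100 (*70/100) |
| 2021 | 1/1 | 8/10 | 14/20 | 35/50 (*34/50) | 82/100 (*73/100) |
| 2020 | 1/1 | 8/10 | 12/20 | 33/50 | 75/100 |
| 2019 | 0/1 | 8/10 | 14/20 | 38/50 | 73/100 |
| 2018 | 1/1 | 7/10 | 15/20 | 37/50 | 83/100 |
| 2017 | 1/1 | 8/10 | 16/20 | 42/50 | 83/100 |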

(note, asterisked values represent prediction without top 200 elimination)

From this data, we can deduce that the general bucketed accuracy of 100 Warm Tunas has not drastically changed since 2019. Even without Top 200 elimination, we can still achieve 35/50 and 70/100, which is generally on-trend with the last 4 years.

We can also visualise these trends over time with a stacked bar chart (showing results without top 200 elimination):

Data Volume over time:

Finally, a "data over time" analysis wouldn't be complete without looking at trends in the volume of data we collect and comparing that to the number of votes counted by Triple J.

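| Year | Tunas Votes | ABC Votes | Sample Size |
|---|---|---|---|
| 2022 | 41,731 | 2,436,565 | 1.71% |
| 2021 | 25,877 | 2,500,409 | 1.03% |
| 2020 | 36,156 | 2,790,224 | 1.30% |
| 2019 | 45,112 | 3,211,596 | 1.40% |
| 2018 | 58,463 | 2,758,584 | 2.12% |
| 2017 | 67,085 | 2,386,133 | 2.81% |
| 2016 | 65,412 | 2,250,000 | ~2.91% |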
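The Sample Size column and the year-on-year growth figure are simple ratios of the vote counts. The sketch below reproduces them for the two most recent years; the dictionary layout and the function names `sample_pct` and `growth_pct` are our own choices for illustration.

```python
# (Tunas votes, ABC votes) per year, copied from the table above.
VOTES = {
    2022: (41_731, 2_436_565),
    2021: (25_877, 2_500_409),
}


def sample_pct(year):
    """Tunas votes as a percentage of the votes Triple J counted that year."""
    tunas, abc = VOTES[year]
    return 100 * tunas / abc


def growth_pct(year, previous):
    """Percentage change in Tunas votes between two years."""
    return 100 * (VOTES[year][0] / VOTES[previous][0] - 1)


print(f"{sample_pct(2022):.2f}%")        # 1.71%
print(f"{growth_pct(2022, 2021):.1f}%")  # 61.3%
```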

One thing that initially stands out is the fact that Triple J's number of votes counted has been decreasing since 2019. Is the Hottest 100 becoming less popular these days?

As for 100 Warm Tunas, we can observe that the data collected had been trending downward. Fortunately, this year we were able to buck the trend of a decreasing sample size and managed to collect 61% more votes than in 2021, giving a sample that is a larger share of Triple J's votes (1.71%) than in 2021, 2020, and 2019.

ML Performance Analysis

This year, we introduced an ML model that adjusts vote counts to reorder the prediction, based on historical trends learned from the data of all our previous predictions.

In this section we will analyse the performance of the model and determine whether it improved the outcome and accuracy of the prediction.

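| Artist | Title | ABC Rank | Tunas Rank (no ML) | Tunas Rank (ML) |
|---|---|---|---|---|
| Flume | Say Nothing [Ft. MAY-A] | 1 | 3 | 1 |
| Eliza Rose | B.O.T.A. (Baddest Of Them All) | 2 | 9 | 7 |
| Spacey Jane | Hardlight | 3 | 4 | 4 |
| Steve Lacy | Bad Habit | 4 | 14 | 10 |
| Spacey Jane | It's Been A Long Day | 5 | 16 | 18 |
| Spacey Jane | Sitting Up | 6 | 6 | 8 |
| Lizzo | About Damn Time | 7 | 20 | 21 |
| Ball Park Music | Stars In My Eyes | 8 | 2 | 3 |
| Gang of Youths | in the wake of your leave | 9 | 1 | 2 |
| Joji | Glimpse of Us | 10 | 17 | 12 |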

When comparing ML vs no ML, we initially see that:

  • When ML is not used, 100 Warm Tunas does not successfully predict #1.
  • When ML is not used, 100 Warm Tunas only predicts 6 of the top 10 (vs 7 with ML).
  • When ML is used, some Spacey Jane songs are incorrectly down-ranked ('Sitting Up' 6 → 8, 'It's Been A Long Day' 16 → 18). The same applies to Lizzo's 'About Damn Time' (20 → 21).
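To check the last point systematically, the sketch below lists every top 10 song whose predicted rank moved further from its official rank once ML was applied. The rows are copied from the table above; `worsened_by_ml` is a hypothetical helper written for this post-hoc comparison and has nothing to do with the model itself.

```python
# (title, ABC rank, Tunas rank without ML, Tunas rank with ML), from the table above.
ROWS = [
    ("Say Nothing", 1, 3, 1),
    ("B.O.T.A. (Baddest Of Them All)", 2, 9, 7),
    ("Hardlight", 3, 4, 4),
    ("Bad Habit", 4, 14, 10),
    ("It's Been A Long Day", 5, 16, 18),
    ("Sitting Up", 6, 6, 8),
    ("About Damn Time", 7, 20, 21),
    ("Stars In My Eyes", 8, 2, 3),
    ("in the wake of your leave", 9, 1, 2),
    ("Glimpse of Us", 10, 17, 12),
]


def worsened_by_ml(rows):
    """Titles whose predicted rank moved further from the official rank with ML."""
    return [title for title, abc, no_ml, ml in rows if abs(abc - ml) > abs(abc - no_ml)]


print(worsened_by_ml(ROWS))
# ["It's Been A Long Day", 'Sitting Up', 'About Damn Time']
```

Note that 'Stars In My Eyes' and 'in the wake of your leave' were also moved down by the model, but in their case the move was towards the official rank, so they count as improvements.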

Positional Accuracy

Mean Error

| ML/No ML | ϵ Top 10 | ϵ Top 20 | ϵ Top 50 | ϵ Top 100 |
|---|---|---|---|---|
| No ML | 6.5 | 12.8 | 31.7 | 34.1 |
| ML | 5.5 | 12.2 | 31.1 | 33.8 |

Median Error

| ML/No ML | ϵ Top 10 | ϵ Top 20 | ϵ Top 50 | ϵ Top 100 |
|---|---|---|---|---|
| No ML | 7.0 | 7.5 | 8.5 | 16.5 |
| ML | 5.0 | 6.0 | 8.0 | 15.5 |

From this data, we can deduce that ML brings a general improvement to the prediction, since the error for every bucket has been reduced when ML adjustment is used.

Bucketed Accuracy

| No ML | ML | Out Of Top N |
|---|---|---|
| 6 | 7 | 10 |
| 15 | 14 | 20 |
| 22 | 22 | 30 |
| 29 | 29 | 40 |
| 39 | 39 | 50 |
| 84 | 84 | 100 |

As we have already seen, using ML allows us to predict 7 of the top 10. Interestingly, using ML decreases the bucketed accuracy of the top 20, reducing it by 1, to 14/20. The remainder of the buckets maintain the same outcome, and this is expected, as we only apply the model to the top 25 predictions.

Vote Sources

100 Warm Tunas collects votes from a handful of different sources. This year we introduced the ability for users to upload their votes directly to the website. Here is the break-down of how many votes were counted across the different sources:

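| Source | Num Entries | Num Votes | Votes Per Entry |
|---|---|---|---|
| Instagram DM | 321 | 2983 | 9.2928 |
| Instagram Story | 240 | 2009 | 8.3708 |
| Instagram Feed | 871 | 7780 | 8.9323 |
| Reddit | 226 | 1999 | 8.8451 |
| Twitter | 249 | 2121 | 8.5181 |
| Website Upload | 3143 | 28741 | 9.1444 |
| Total | 5050 | 45633 | 9.0362 |
| Total (after de-duplication) | 4623 | 41731 | 9.0268 |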

This year we collected a majority of our data through direct upload from our website.
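For anyone wanting to check the figures, the totals, the votes-per-entry column, and each source's share of entries can be recomputed from the per-source counts. This is a minimal sketch using the numbers from the table above (before de-duplication); the variable names are our own.

```python
# (entries, votes) per source, copied from the table above (before de-duplication).
SOURCES = {
    "Instagram DM": (321, 2983),
    "Instagram Story": (240, 2009),
    "Instagram Feed": (871, 7780),
    "Reddit": (226, 1999),
    "Twitter": (249, 2121),
    "Website Upload": (3143, 28741),
}

total_entries = sum(entries for entries, _ in SOURCES.values())
total_votes = sum(votes for _, votes in SOURCES.values())

for name, (entries, votes) in SOURCES.items():
    share = entries / total_entries
    print(f"{name:<16} {votes / entries:.4f} votes/entry, {share:.1%} of entries")

print(total_entries, total_votes, f"{total_votes / total_entries:.4f}")  # 5050 45633 9.0362
```

Website uploads account for 3143 of the 5050 entries, which is where the "majority" claim comes from.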

A question that we'd like to ask ourselves is "Which is the most accurate source, or combination of sources?". We can determine this by doing both a positional and bucketed accuracy analysis on every combination of vote sources.
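As a sketch of what "every combination of vote sources" involves, the example below enumerates the combinations with `itertools.combinations`. The source names come from the table above; the helper `source_combinations` is an assumption for illustration, and the step of re-tallying votes and scoring each combination is left out because it depends on the raw vote data.

```python
from itertools import combinations

SOURCES = [
    "Instagram DM", "Instagram Story", "Instagram Feed",
    "Reddit", "Twitter", "Website Upload",
]


def source_combinations(sources):
    """Yield every non-empty combination of sources, smallest combinations first."""
    for size in range(1, len(sources) + 1):
        yield from combinations(sources, size)


combos = list(source_combinations(SOURCES))
print(len(combos))  # 63
print(combos[0])    # ('Instagram DM',)
print(combos[-1])   # all six sources together, i.e. the baseline
```

With six sources there are 2^6 - 1 = 63 non-empty combinations, so running the positional and bucketed analysis on each one is cheap.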

Accuracy by Source

From the data, we can deduce the following:

  • The best top 10 source is "IG DM", with a median error of 3.5 (Baseline is 5). Using just this source also predicts 3 of the top 5.
  • The worst top 10 and top 100 source is "Reddit Comment", predicting only 59 of the top 100 (Baseline is 84)


Vote Submissions

It comes as no surprise that the most popular day for people to share their votes was the day voting closed, with 287 entries collected:

We can further break down the data by source and apply a cumulative sum to visualise it in a "race" format:

Finally, we can then break down the votes collected for each song over time:

Vote Counts Per Day (with ML adjustments)

Vote Counts Per Day (without ML, raw vote counts)

Wrap Up

This year was an absolute blast - we got to experiment with new technology (ML Model), and successfully predicted #1.

We'd like to thank everyone who shared their votes, visited our site or interacted with us on Instagram. Without our audience, 100 Warm Tunas wouldn't be possible. See you next year!

If you enjoyed this analysis and would like to show your appreciation for the work that I do, you can buy me a coffee ☕️.