Category Archive: Pre-Game

Predictions and pre-show analysis

Mar 20 2013

Top 9 pre-game

I’m working on a longer piece about song selection, and what choices are smart (statistically speaking) and which are not. For now, I’ve listed below all performances of Beatles songs, along with the breakdown of how well the performance was received and what the voting result was (the final column indicates if that contestant had already been in the Bottom 3). These are listed in order of approval rating. Anything below 50 is considered to have been scored below average (and, indeed, most of the not-safe people are in that half of the table).

There haven’t been too many repeats of any of these. Across the Universe (twice), Let It Be (twice) and Come Together (three times) have worked out well for those who sang them. Oh Darling and The Long and Winding Road have been a mixed bag—they worked for one person, not for another. That’s one reason I don’t put too much stock in analyses that focus solely on song choice.

 

Song Contestant WNTS Rating Result Bottom
Prev?
Let It Be Brooke White 84 Safe No
Come Together Crystal Bowersox 83 Safe No
Come Together Carly Smithson 81 Safe No
Eleanor Rigby David Cook 81 Safe No
She’s A Woman Chikezie 81 Safe No
Blackbird Carly Smithson 71 Bottom Group No
While My Guitar Gently Weeps James Durbin 70 Safe No
With A Little Help From My Friends Casey Abrams 70 Safe No
Let It Be Katie Stevens 68 Safe Yes
The Long And Winding Road David Archuleta 68 Safe No
Across The Universe Siobhan Magnus 65 Safe No
Oh! Darling Katelyn Epperly 64 Safe No
Day Tripper David Cook 63 Safe No
Something Taylor Hicks 63 Safe No
Fixing A Hole Lilly Scott 62 Safe No
Yesterday Syesha Mercado 62 Safe Yes
Across The Universe Michael Johns 59 Safe No
All My Loving Tim Urban 57 Safe Yes
If I Fell Jason Castro 57 Safe No
Hey Jude Lee DeWyze 56 Safe No
Eleanor Rigby Michael Lynche 51 Saved No
You Can’t Do That Amanda Overmyer 51 Safe No
I’ve Just Seen A Face Chikezie 47 Safe No
Come Together Kris Allen 46 Safe Yes
Oh! Darling Jen Hirsh 45 Eliminated No
Here Comes The Sun Brooke White 43 Safe No
Michelle Jason Castro 43 Safe No
Here, There & Everywhere Clay Aiken 39 Eliminated No
A Day In The Life Michael Johns 38 Safe No
Got To Get You Into My Life Syesha Mercado 38 Bottom Group No
Can’t Buy Me Love Andrew Garcia 36 Bottom Group No
I Should Have Known Better Ramiele Malubay 35 Safe No
In My Life Ramiele Malubay 33 Safe No
The Long And Winding Road Aaron Kelly 33 Bottom Group No
Back In The U.S.S.R. Amanda Overmyer 29 Eliminated No
I Saw Her Standing There David Hernandez 26 Eliminated No
We Can Work It Out David Archuleta 18 Safe No
You’ve Got To Hide Your Love Away Kristy Lee Cook 17 Bottom Group Yes
I Want To Hold Your Hand Haeley Vaughn 12 Safe No
Eight Days A Week Kristy Lee Cook 5 Bottom Group No

May 18 2012

Jessica Sanchez vs. Phil Phillips

Phil vs Jessica. These two contestants could hardly be more different.

Phillip Phillips has many traits that make him the favorite to win. He was featured in the auditions a lot, with a total pre-exposure time of more than 1000 seconds, and with his initial audition having been shown in-episode and in promos. He’s white, male, plays guitar, and sings mostly songs that are new to Idol. Compare that to winners past, and you check off a lot of boxes:

Contestant Season Sex Race Age
at start
Hometown Avg Of
WNTS Rating
New
songs
Bottom
2/3 ever?
Kelly Clarkson 1 F White 20 Burleson, TX 76.8 11 No
Ruben Studdard 2 M Black 24 Birmingham, AL 63.6 16 Yes (Top 5)
Fantasia Barrino 3 F Black 19 High Point, NC 65.4 10 Yes (twice)
Carrie Underwood 4 F White 21 Checotah, OK 57.3 15 No
Taylor Hicks 5 M White 29 Birmingham, AL 61.3 10 No
Jordin Sparks 6 F Black 17 Glendale, AZ 59.6 11 No
David Cook 7 M White 25 Blue Springs, MO 64.7 15 No
Kris Allen 8 M White 23 Conway, AR 61.1 7 Yes (Top 5)
Lee DeWyze 9 M White 23 Mount Prospect, IL 57.8 9 No
Scotty McCreery 10 M White 17 Garner, NC 52.5 13 No

Here are the stats for the finalists this year:

Contestant Sex Race Age
at start
Hometown Avg Of
WNTS Rating
New
songs
Bottom
2/3 ever?
Phillip Phillips M White 21 Leesburg, GA 54.1 11 No
Jessica Sanchez F Asian/
Hispanic
16 San Diego, CA 66.6 6 Yes (Top 7,
Saved)

The mean age of winners is 21.8 years, and Phil is 21. Phil was never in the Bottom Group, just like 7/10 of his predecessors. All winners except Lee DeWyze and Jordin Sparks hailed from the South or Midwest United States, and Phil is from Georgia.

On the flip side you have Jessica Sanchez. She had significantly less pre-exposure (600 seconds), with her audition never having been shown. She’s non-white (mixed race, if I’m not mistaken), female, has never played an instrument on stage, sings mostly songs that have been sung on Idol before. If she were to win, she would be the youngest ever, at 16. She was the bottom-vote-getter in the Top 7 and had to be saved. She is from California, which has not yet produced a winner.

So, on a purely demographic and historical basis, she would appear to be the underdog.

What can we put in Jessica’s column that favors her? Her approval rating, as computed by WNTS, is significantly higher than Phil’s, at about 67, much more in line with previous winners. She has four performances rated higher than 80 on WNTS (Love You I Do, And I Am Telling You I’m Not Going, I Will Always Love You, and Sweet Dreams), whereas Phil has only one (Volcano). The only contestants to win with as few as Phil has were Lee DeWyze (1 song over 80) and Scotty McCreery (0 songs over 80), so in this respect Jessica has the edge.

There is the matter that Jessica has a large following specifically because of her racial identity. Looking at Dialidol’s Geopredictions, Phil and Joshua barely registered any votes at all in Hawaii, a state with a large population of Filipinos. However, relying on that did not work out so well for Jasmine Trias, Camile Velasco, or Jonah Moananu.

As I said in an earlier post, I think Jessica can certainly win, but probably won’t. In light of the evidence above, I think you might agree that her odds hover somewhere around 2:1. But that’s enough of a margin that if she really kills it, she could take the title. This may be the first Finale since season 8 that actually mattered, and as such I think we can say that it’s been an entertaining year, despite the producer’s fumbles and the egregious judging.

Mar 07 2012

Top 13 pre-game analysis

Here are some thoughts that may be of interest for this week.

The theme is split (first time that has happened, to my recollection), with Men singing Stevie Wonder songs and Women singing Whitney Houston songs. The site TIP reported that they thought it might therefore be a double elimination. However, the listing of the episode name on zap2it is “1 voted off”, so I highly doubt this.

This theme to me seems to play most against Skylar Laine and Jermaine Jones. Jones is sort of a crooner of lumbering tunes, a bad fit for the typically uptempo  tenor vocal style of Stevie Wonder. Laine may decide to xerox the Dolly Parton original of “I will always love you”, but I will note that the Parton version never charted on the Hot 100 (although it did on the Country version), mainly because it sucks. Moreover, there would obviously be some competition for that tune, and the producers haven’t been letting contestants repeat songs in recent years (but, again, who knows?).

As I noted a couple days ago, 3 out of 5 seasons that had a Wild Card pick had one removed in the first voting round. If we want to narrow that a bit, I could point out that they were all women. That would seem to single out Erika Van Pelt, but I would be very cautious, as Whitney Houston songs must be her bread and butter. Moreover, season 8 was a double elimination, which arguably knocks the number down to 2 out of 5.

But if I’m going to pick on someone whom I think is the most unlikely of any finalist, it’s Jeremy Rosado. To me he seems to have none of the things that make for a successful Idol contestant. His choice last week (can you recall?) was a Coldplay song that I had never heard of, never charted, and had very low recognition. His performance was bland; it’s better to be bad than forgettable.

Feb 28 2012

Pre-game projections (for what they’re worth)

Here are the projections for the men and women in the first week, based on a model that assumes that each contestant puts in an average performance (50/100). As such, any variance depends only on pre-exposure. Note that I have included here Jermaine Jones and Schyler Dixon, and assumed that only 5 from each gender will get through to the finals based on voting, with perhaps 2-3 being chosen by the judges in a wild card round. This drags the percentages down from  the last pre-voting projection, which included only 12 people per gender and assumed 6 got through. As such, this is only a projection on what will happen on Thursday from voting. The individual probabilities are higher to “make the finals”, because that includes a wild card round, which a better singer is more likely to get picked in. These are probabilities for “being selected by the audience”. Read the rest of this entry »

May 18 2011

Top 3 pre-game: 3 chances to get it right

In the recent weeks, I’ve been saying that to the extent that any variable is important to who is eliminated, that variable was how well the contestant had done lately. That is, there was a significant effect of momentum, and contestants were eliminated even if they had a good week because people stuck with their favorites from the prior weeks.

For the Top 3, though, this isn’t the case. There is good statistical evidence that not only does performance matter, but the performance that matters most is the best performance that a contestant has that evening. The contestant can screw up on two songs so long as they don’t have the worst top-score for the night.

That being said, usually the worst average performer also has the lowest top-number for the night.

Supposing, though, that Haley Reinhart has a bad-to-disastrous run at What Is And What Should Never Be tomorrow night, which is very likely, as long as she comes out and kills it with one of her other songs (You Oughta Know being the most likely), I would still expect her to be safe. We don’t know what Lauren or Scotty will be singing, but historically these contestants have produced generally consistent but not stratospheric approval ratings. This is particularly true of Lauren Alaina, who is perceived (rightly) not to have had “a moment” on the show. This would be the time to do it, if she were going to.

So, no numbers this week. What happens this week is almost entirely dependent on what happens tomorrow night.

By the way, Dialidol has gotten 5/6 Top 3 calls correct. However, 7/9 calls were correct based on the above strategy, which is much stronger evidence.

May 11 2011

Top 4 pre-game odds

I’m actually not all that confident in James Durbin compared to where I was yesterday.

If I were to give my best guess as to the fit to the data, and assume that all contestants performed roughly how they did in the Top 5, then the model predicts the following probabilities (not normalized)

Name Prior Average (4 rounds) Probability of elim.
Scotty 53.333 0.323193
James 58.666 0.219311
Lauren 63.1666 0.164075
Haley 71.666 0.075083

Why is James higher than Lauren? He had a bad week. But, in fact, he’s had two bad weeks in a row compared with the other contestants. Lauren had one bad performance and one good performance. She was in the Bottom 2, and so may be running about 5 percentage points behind where she should have been. Scotty, on the other hand, has been running 5-10 points better than he should.

If you asked me my feeling, it’s going to be Lauren. The model predicts Scotty, then James.

Technical note: it may seem that I have been much more confident in the model in past weeks. There is a very good reason for this. Each round of American Idol decreases the total available data from week to week. In the Top 12, you have 12 pieces of data per season (in this case, about 110 data points). But in the Top 4, you have only 4 pieces of data per season. Yes, they sing 2 songs each, but we don’t get the response from individual songs—we only know the outcome of the vote, which was the result of both songs. As such, I only have 36 pieces of data to begin with for judging the Top 4. Factor in the fact that Dialidol only has results since Season 4, and you’re really operating on about 25 degrees of freedom for a statistical fit. And, well, that just adds up to a crummy statistical significance.

May 10 2011

Top 4: The winner may already be locked in

The Top 4 continues the trend of being much less about how somebody sings that week, and much more about how they’ve already done. It would be reasonable to say that America has already picked its favorite, and is just eliminating the others one by one until the finale.

This is not to say that it does this indiscriminately. In fact, it probably still eliminates its least favorite quicker. Nonetheless, even if James Durbin screeches out a tone deaf version of a Bee Gees song, my guess is he’ll still make the finale.

The above chart shows the WNTS approval rating for the 4 previous rounds and the outcome (safe or eliminated) that was the result. The points hew relatively close to the logistical regression curve, though the statistical significance leaves something to be desired. Here are variables that are even less well-correlated, though:

  • The contestant’s Top 4 average approval
  • The contestants Top 4 best approval or worst approval
  • Gender
  • Dialidol score

The chart also shows two regions, one that is “generally safe”, above 55, and generally in danger, below 55. There are certainly contestants who were doing well in the previous rounds and got booted. The 3 most prominent ones (the points in the upper right-hand part of the chart) were all women (Tamyra Gray, Latoya London, and Allison Iraheta). However, I wouldn’t read too much into that, as it’s likely statistical noise.

More than half of the people eliminated in the Top 4 fell below this dividing line.

Haley Reinhart is a bit of a puzzle to me. She has seriously underperformed her numbers in the voting, ending up in the Bottom Group when she clearly didn’t deserve it. However, this hasn’t happened for the past couple weeks, and she may have benefited disproportionately from the elimination of Casey Abrams. Scotty McCreery, on the other hand, has consistently overperformed his scores. Were he pulling the votes he deserved, Scotty would have been in the Bottom 2 last week instead of Lauren Alaina. Lauren and Scotty are the two that fall below the line. That being the case, Lauren is probably the contestant in the most danger.

May 04 2011

Top 5 Pre-game

As far as who’s going to be eliminated, I’m not going to say anything that isn’t well-known.

Contestant Sex Prior Average Probability of Elim.
Jacob Lusk M 32.5 0.63144
Scotty McCreery M 48.75 0.231642
Lauren Alaina F 55 0.133852
James Durbin M 63.5 0.058627
Haley Reinhart F 65.5 0.04788

There is no point in speculating about how well Jacob would have to do to be considered safe. The model believes that no matter what score Jacob gets, he will be eliminated.

Why is this the case? Funny enough, during the Top 5, America doesn’t vote based on how well the contestant sings. That may seem weird to you, but it doesn’t seem too far off to me. During the Top 5, America tends to shed the contestant who has been doing the worst lately, rather than the one who did the worst that day. There is, in fact, no significant correlation with the Top 5 performance approval and the outcome!

What is notable, though, is that Ms. Haley Reinhart may have moved into front-runner territory. Now, no winner of American Idol has ever been in the Bottom 3 three times, as Haley has (the closest is Fantasia, who was in the B3 twice). However, Haley is probably one of the most improved contestants that I can remember. Even Jordin Sparks started stronger than Haley, and while she did improve, it was not as markedly.

Apr 27 2011

Top 6 pre-game

Contestant Prior Average Prob
Jacob Lusk 41.0 0.217113
Scotty McCreery 48.0 0.158728
Casey Abrams 51.7 0.133645
Haley Reinhart 59.7 0.090357
Lauren Alaina 61.3 0.083084
James Durbin 69.0 0.056084

The Top 6 is going to be an entertaining show. Not that I think the performances will be good (I really don’t), but we can see that the rankings are getting very narrow. A bit of handicapping, though, should perhaps be entertained at this point. Looking at last week‘s prediction, we can say that Scotty McCreery appears to be overperforming by perhaps 6 points, and Haley underperforming by about 5 points. I expect this effect to diminish over time, but assuming it is still present, the rankings would be Jacob Lusk first, and then Casey and Haley essentially in a dead heat.

Continuing the practice from last week, I’ll give the approximate caliber performance needed to keep a contestant out of the Bottom 3 for those at risk, and the score that would put the a contestant in the Bottom 3 for those predicted to be safe.

Contestant Prior
Average
Safe Score/
Not Safe Score
Change from
recent average
Jacob Lusk 41 78 +37
Scotty McCreery 48 77 +29
Casey Abrams 51.66 72 +21
Haley Reinhart 59.66 39 -20
Lauren Alaina 61.33 37 -24
James Durbin 69 25 -44

Jacob Lusk is unlikely to be able to overcome that deficit. Although I would put him at a slight advantage this week, that advantage still requires a level of performance that we haven’t seen from him in many weeks. If Scotty persists in overperforming like he has been, he would need only a 60 to be considered safe according to the model.

In sum, there is a large gulf between the projected bottom 3 (Jacob, Scotty, Casey) and the top 3, if this year is like past years. Whether or not it is like past years is another matter.

As to this week’s being a duet, I don’t know that it changes things much. Last year’s contest saw Michael Lynche go home in what many considered a long-coming elimination. That was in the Top 4, and each contestant did a duet and a song by themselves. Even with the Top 6 doing duets, I would think they will do one by themselves and one in a duet, as only three songs during a show would be fewer than any other episode.

Apr 19 2011

Top 7 Pre-game: Anybody’s game

Now that we’re in the thick of it, any one of the remaining contestants could go this week. No result would be shocking. Based on prior years and based on the average of the previous 3 rounds, the pregame odds look something like this:

Contestant Sex PriorAverage Prob
Jacob Lusk M 44 0.203924
Stefano Langone M 46 0.185417
Casey Abrams M 51 0.144875
Haley Reinhart F 51 0.144875
Scotty McCreery M 57.5 0.103467
Lauren Alaina F 62 0.081281
James Durbin M 70.5 0.050831

Not even a James Durbin elimination would be a total shock. However, if things keep going like they have been, I expect Jacob to go.

We can take this analysis a bit further. Supposing that all the other contestants performed at their recent average, we can ask how much better a performance Jacob or Stefano would need to be predicted to be ranked higher than Casey and Haley:

Contestant Score Needed For Safety Prior Average Difference
Stefano Langone 61.0 46.0 +15.0
Jacob Lusk 65.0 44.0 +21.0

These scores are perfectly achievable. Many times in Idol history a contestant has managed a score on the order of 20 points above his average, or 20 points above his prior performance. It happens. However, I don’t think that this is the most likely week for it, as the theme (Songs from the 21st century) doesn’t particularly suit either contestant.

We can do the same thing in reverse for James and Lauren. How badly would they need to screw the pooch to be in danger?

Contestant Score Needed For Elimination Prior Average Difference
James Durbin 19.0 70.5 -51.5
Lauren Alaina 34.0 62.0 -28.0

Has a contestant ever had a week-to-week swing of 50 points? In fact, it has happened. Anthony Federov succeeded in this feat, getting a 64 in the Top 6 but a 12 in the Top 5. So I would not call James totally safe, but he’s pretty damn safe.

Lauren is another story. Many contestants have had a week where they underperformed by 25-30 points. I’m not ready to call this one safe, but I also wouldn’t put any money on Lauren.

Older posts «