Monthly Archive: May 2012

May 23 2012

That’s all for Season 11 (spoiler)

There you have it, folks. Phil, and conventional wisdom, wins, and Dialidol remains undefeated. Jessica’s fans did an admirable job tightening the race, but you can’t fight the entirety of the east coast with only two states. White guys with guitars is a monster to be slayed another day.

I still think that 60/40 was close to the right handicapping, but of course we’ll never know. Intrade had Phil at over 80% all day. Zabasearch and Votefair made a bad call, but that just shows that Idol prediction is a tough game.

We’ve had something like 40,000 page views this year, more than 10 times the traffic of last year. We were covered in at least two newspapers and several website articles. And I published a model that did … pretty lousy overall.

For regular readers, I hope that you’ve found my observations at least provocative, if not necessarily insightful. I’ve tried to be honest, thoughtful, and show the readers how I work through a problem.

This site is about to go totally dark, and no updates will be posted until next January. Feel free to hop over to my regular blog where I am running a web series on thermodynamics for non-scientists, Thermo for Normals. Follow Jessica at @jessicadennis and me at @schweinhundert on Twitter.

See you next year.

May 23 2012

The internet is underrating Jessica Sanchez’s chances

Right now many experts have selected Phil Phillips as the next American Idol. I’ve seen the sentiment expressed by MJ of mjsbigblog, by IdolBlogLive, by at least one member of each of the Idol Radio Show and FarmvilleRant Radio. And, of course, Dialidol. Intrade still has Phil at 80%.

I am not longer convinced that it should be nearly as high.

Yes, Dialidol has never been wrong about a finale. But I agree with what people have said: Dialidol has underestimated the Hawaii vote. I also think that within the margin of error, there’s a very good possibility that Dialidol has gotten lucky on the two calls that they’ve made where it wasn’t a landslide.

In short, I think I’m saying that it’s close to even odds. Anybody saying otherwise is being overconfident in Phil’s chances, perhaps specifically because of Dialidol.

Zabasearch, which does not have great accuracy this year but is OK, has projected Jessica to win. I have no idea how close it was, so I have no idea how confident to be in their results. But as one more indicator, it pushes the needle away from Phil at least a little bit, doesn’t it?

80% is too high for Phil. Maybe Phil has a 60% chance, and no more. The internet pundits seem to be 100% behind Phil, and I think that’s kind of crazy.

May 23 2012

Too close to call

Jessica Sanchez’s fans failed to pull her ahead in Dialidol, as her support petered out at about 3:30 AM eastern time last night:

They did, however, pull her much closer to even with Phil than at the beginning of the night. As such, there is no official Dialidol call, because the difference between the two is within the margin of error. Also note that I believe (not sure) that Dialidol should have left the poll open for another hour. However, at the rate measured at 4:00 AM, that would not have changed the fact that Phil would be on top.

Some people think that Phil being ahead on Dialidol is dispositive, but I’m not so sure. We saw just last week that Jessica’s votes were at least slightly undercounted compared with Joshua’s. The question is how big this effect is. We know that she’s being undercounted by at least 0.3 points. But could she be undercounted by 1.4? We don’t really know, but it could be.

Much has been made of Dialidol’s accuracy, but many of their calls are outside the margin of error. 2 out of 7 were too close to call. Yes, Dialidol ranked the winner #1 in both of those cases, but they had a 1 in 4 chance of that happening by random guess.

Meanwhile, Intrade’s estimate for Phil winning dropped from 90% to 80%, which is where I thought it ought to be. Also, you can read Votefair’s official explanation for why it’s OK that Jessica has led every poll for the year. I’m not sure I find it entirely convincing. He says that he is controlling for votes from the Philippines. But what about votes from Asian Americans? It’s not like he can IP filter those. Votefair is a self-selecting poll, and there are lots of potential problems with that.

I still lean toward Phil being the winner, but there isn’t enough information right now.

May 23 2012

Jessica may be closing the gap

UPDATE 2: Ok, I am going to bed. This is ridiculous. According to my projection, Jessica will overtake Phil in about an hour, 4:30 AM eastern. If that happens, well, you’re going to see a post here tomorrow morning that says I think Jessica Sanchez will win.

UPDATE: The effect is accelerating for Jessica Sanchez. To look at this, it appears that she will overtake Phil Phillips before the voting closes. Wow. I’ve updated the graph from my original post to reflect the change.

Original post: In between about 1 and 2 am tonight, the gap between Phil and Jessica on Dialidol narrowed from 16.477/11.431 to 16.119/11.543, or a change of about half a point. Looking at the DIHardGraphs, Jessica’s voters might be doing it.

(Times are all eastern time zone, times beyond 2:30 AM are projected by me) If Jessica can maintain the pace (that is a big if), her supporters can barely squeak her into the lead by the time Hawaii’s voting closes, I think. Now, if a majority of those votes come from California, it won’t happen, because their voting will close off. But if more from Hawaii replace them, they might be able to squeak by.

May 23 2012

Finale performance show impressions

The early returns from WNTS have been posted, and the contestants had an average score that was equal to within a rounding error (53). The proprietors over there say that it was neck-and-neck, but that the performances were perfunctory because people have already made up their minds.

Fair enough, that might be. But if anybody was going to be swayed, I do not think it would be in Jessica’s direction. Her rendition of the truly execrable Idol song “Change Nothing” scored a 26, tied for 6th worst of any performance during a finale. Phil’s version of “Stand By Me” was also panned, but does not rank in the top 10 worst finale performance polled by that service. With near-rave reviews of his Idol song, Phil would perhaps be at an advantage if any was to be had.

Dialidol, which I’ll remind you has not yet missed a call in 7 years of finales, has Jessica trailing badly in all metrics (Dialidol score, raw votes, and busy signals). As of right now, she’s being trounced. California can help her, but they will need to get busy. She may pick up some of Joshua’s votes, but there is no evidence that she has so far. The current slope in the DIHardGraphs show no flagging of Phil’s support. The voting closes at 1am on the east coast, so maybe after that she can pick up some steam.

Intrade, the betting market where people buy and sell “stock” in a given outcome, has Phil at a 90% probability of winning. That is the same probability they had Barack Obama at on Nov 3, 2008, an election whose outcome was more or less assumed. I happen to think that giving Jessica only a 10% chance is a bit low. My feeling is that it’s more like 80% to 20% in Phil’s favor.

Finally, what will we make of Votefair’s result? Currently Jessica is winning on that service 70% to 30%. It might seem compelling that she has 2.3 times as many votes as Phil, except for one little problem:

Choice Vote count Percentage
Jessica Sanchez 414 39
Phillip Phillips 159 15
Colton Dixon 141 13
Elise Testone 92 9
Hollie Cavanagh 86 8
Skylar Laine 82 8
Joshua Ledet 76 7

That’s Votefair’s results for the Top 7. She had 2.6 times as many votes as Phil that week on Votefair. You know who actually had more votes than Jessica? Phil, Joshua, Skylar, Colton, Hollie, and yes, even Elise. She was dead last.

What would be the point of crunching numbers? There isn’t currently any single substantive reason to think that Phil will not win. Squinting at data trying to see a given result is something that we’ve all been guilty of, but no amount of squinting is going to reveal a probable Jessica win. It would be a major upset, though I might stop short of saying a shocking upset.

Yes, upsets happen. The US beat the Russians in hockey at the 1980 Winter Olympics. Buster Douglas beat Mike Tyson. The Patriots beat the Rams after bringing a 6th round draft pick as their quarterback mid-season. Anything can happen, I suppose. But this isn’t where the smart money is.

May 22 2012

How the model works (and doesn’t work): Part 2

This is a continuation of a thorough explanation of the forecasting model I’ve used. The present article is a wrap-up of the season and an assessment of the model’s accuracy. Please see Part 1 of this series for a somewhat detailed explanation of the analytical model.

Overview

Most people’s perception of the model this season is that it’s really terrible, but in fact it made a ton of very good calls early on in the season. Many people did not start reading this blog until later, when its foibles became more evident. The semi-finals, Top 13, and Top 11 went swimmingly. It had great accuracy. Then the wheels came off.

Of the 13 lowest-vote getters, 6 of them were either the bottom contestant or the next-to-bottom contestant in the model:

Read the rest of this entry »

May 21 2012

Interview with me on San Diego Union Tribune

Read here: http://www.utsandiego.com/news/2012/may/21/conspiracy-is-always-on-the-idol-ballot/

Point of correction, though. I said “idiots like me”‘ not “people like me”.

May 20 2012

Why all models are wrong

“All models are wrong”, my PhD adviser told me one day. We were having a conversation about why a given experiment of ours was off compared to the predictions of another group. He smiled at me, but I knew, realized, right away that he was correct in a certain sense.

This seems to be a point that isn’t appreciated.

Suppose I were to write down Newton’s Law of gravitation. That law is correct (up to a small deviation very near the Sun due to General Relativity). I could then build a model of the orbit of the Earth around the Sun, which predicts an elliptical path. This is a good model, much better than even the Copernican model, probably one you learned about in school. But it’s wrong.

Why is it wrong? Because it was treated as a two-body system, neglecting the effect of all the other planets, asteroids, comets, of the micrometeoritic material around the Earth, etc. One can then try to add those factors in, which is a Herculean task necessitating tons of observations and computing time. You improve the model. Then, an asteroid comes up, and someone asks you if there will be a collision. As an example, the meteor designated 2011 AG5 will cross our path sometime between 2040 and 2047; NASA lists the probability of impact at 0.2%.

You could fairly ask why NASA would have to list a mere probability of impact, and not tell us definitely what will happen. The answer is that it’s because their response is based on a model, that model has statistical noise in its variables, and it’s imperfect in conception. It’s wrong.

In weather forecasts, often a probability of rain is listed. Why? Why can’t they just tell you whether or not it will rain? What does it even mean that there’s a 30% chance of rain? It means that 30% of the time when the conditions looked like today, it ended up raining. Is the person wrong if it rains? No. He said there was a chance: 30%. If he said there was a 0% chance, he would be wrong. By saying there was a 30% chance he is being honest about the uncertainty in his statistical analysis and weather modeling capability. His model is wrong.

An actuary is a person who works at an insurance company. He makes sure that the premiums collected on life insurance are enough to cover the losses due to people with policies who die. How does he know how many are going to die? Can he predict who is going to die? In a certain sense, yes. He builds a model very much like the one that I’ve built for this blog. He collects a data set of 10,000 people, looking at various variables: age, weight, gender, smoker/nonsmoker, miles driven per day, for instance. Then he builds a model that predicts the probability of someone dying in the next year under all of those conditions. If many of the people stop smoking, this will reduce the number of deaths. The premiums for those people can then be lowered.

But now let’s take a small subset of people, say 3 of them. Bob is 55, still smokes (though he’s tried to cut back), is a bit overweight and drives 20 miles a day. John is 20, nonsmoker, thin, and drives 50 miles a day. Alice is 40, nonsmoker, in good shape, and rides the subway.

The actuary’s boss brings these people into a room and asks the actuary which will die first. The actuary uses the model he’s built from the 10,000 data points and predicts a probability of dying. Bob is the most likely, as he’s male, older, and smokes, so he’s a prime candidate for heart disease. Next is John, who is a young man, and young men die in traffic collisions far more than 40 year old women. Finally, Alice has the lowest probability of dying. If the boss says that one of them will die, the actuary would make a table:

Name Age Sex Smoker? Miles/day
driven
BMI Probability
of death
Bob 55 M Y 20 27 56%
John 20 M N 50 20 27%
Alice 40 F N 0 22 17%

Later that year, John wraps his car around a tree, dying. The actuary’s boss comes into his office, screaming mad that the actuary was wrong. “But”, says the actuary, “I was not wrong. I said that Bob was the most likely to die, not that he would definitely die”.

Was the model wrong? Yes. Why? Because it didn’t take into account every single thing about the entire world and how it affected these three people. That is impossible. Instead, what it did was reduce their lives to a few quantitative variables and project what the likelihood of death was, and said that Bob was the most likely to die. But the actuary was not wrong. He gave the honest probability as well as anybody could possibly have determined. If the man’s boss interpreted the table above as saying “Bob will definitely die”, then that is his problem.

Surely the reader can understand what I am getting at by now.

Contestant WNTS
Rating (avg.)
Dialidol Rank Bottom 3 Previously? Previous Rating Probability of
Elimination (%)
Jessica Sanchez 45.7 2 Yes 74.5 45.8
Joshua Ledet 54.7 2 Yes 69.0 31.7
Phillip Phillips 61.0 1 No 63.5 22.4

This was my forecast on last Thursday. Right there it says that Jessica Sanchez was the most likely to be eliminated. She was ranked 3rd on Dialidol. People eliminated in the Top 3 are usually ranked 3rd on Dialidol (though I bumped her to a tie because of how close the numbers were). She had the lowest WNTS approval rating of the night. The person eliminated in the Top 3 typically has a lower WNTS approval rating than the others. As such, the model predicted a 46% chance of her being eliminated. For Joshua, he was ranked second on Dialidol and had the second-lowest WNTS rating. Sometimes those people are eliminated, and the probability is reckoned to be about 32%. This leaves Phil with a 22.4% chance.

Jessica Sanchez was not eliminated, and Joshua was. The one with the highest probability was not eliminated. My question is: so what? A 32% chance is not small, not even remotely surprising. Twice in the past week here in Atlanta there was a 30% chance of rain, and it rained. So what?

The model that I use is wrong. It reduces all of the possible effects on voting into just two variables, which is not correct. It is, however, feasible, and the most intellectually sound one I can find. If it gets a fair amount during the entire season, then I’m happy. If it misses any one particular one, that is totally meaningless.

The model is a formula. It’s not racist. It’s not sexist. It’s not based on falsehoods. It’s a straightforward correlation and logistical regression. It’s dispassionate, reductive, and wrong. However, it is not totally wrong. Sometimes it hits, and sometimes it misses, and it’s been better than some experts out there predicting what will happen. That’s more than I expected.

I don’t have skin in this game. I don’t care for any of the contestants, really. The last Idol contestant that I really liked was Blake Lewis. I kind of liked Erika Van Pelt. What the model “thinks” is not always what I think, nor is it what I want. To be clear, I would prefer if Jessica won over Phillip, if for no other reason that I think women have been badly treated in the past 4 years. I’ve remarked many times in the Liveblog that Phil isn’t singing what I recognize to be notes.

It would actually be much easier for me to just sit back and not make predictions. The reason I built a model was to see if it was possible to do a prediction of who’s eliminated from just a few statistical indicators. That model makes a lot of calls, and a lot are wrong. But there are some that are right, and there are many that are in the ballpark. If you get some enjoyment or information out of reading them, that’s great. If they’re wrong, you should take that as a sign that 1. the data is noisy and sparse, 2. things with the voting change, and 3. there are many factors that are not quantifiable. Any comments beyond that are pointless, and the unlettered attacks flying around in comments sections of this blog tell me that these points haven’t been communicated well.

May 18 2012

Jessica Sanchez vs. Phil Phillips

Phil vs Jessica. These two contestants could hardly be more different.

Phillip Phillips has many traits that make him the favorite to win. He was featured in the auditions a lot, with a total pre-exposure time of more than 1000 seconds, and with his initial audition having been shown in-episode and in promos. He’s white, male, plays guitar, and sings mostly songs that are new to Idol. Compare that to winners past, and you check off a lot of boxes:

Contestant Season Sex Race Age
at start
Hometown Avg Of
WNTS Rating
New
songs
Bottom
2/3 ever?
Kelly Clarkson 1 F White 20 Burleson, TX 76.8 11 No
Ruben Studdard 2 M Black 24 Birmingham, AL 63.6 16 Yes (Top 5)
Fantasia Barrino 3 F Black 19 High Point, NC 65.4 10 Yes (twice)
Carrie Underwood 4 F White 21 Checotah, OK 57.3 15 No
Taylor Hicks 5 M White 29 Birmingham, AL 61.3 10 No
Jordin Sparks 6 F Black 17 Glendale, AZ 59.6 11 No
David Cook 7 M White 25 Blue Springs, MO 64.7 15 No
Kris Allen 8 M White 23 Conway, AR 61.1 7 Yes (Top 5)
Lee DeWyze 9 M White 23 Mount Prospect, IL 57.8 9 No
Scotty McCreery 10 M White 17 Garner, NC 52.5 13 No

Here are the stats for the finalists this year:

Contestant Sex Race Age
at start
Hometown Avg Of
WNTS Rating
New
songs
Bottom
2/3 ever?
Phillip Phillips M White 21 Leesburg, GA 54.1 11 No
Jessica Sanchez F Asian/
Hispanic
16 San Diego, CA 66.6 6 Yes (Top 7,
Saved)

The mean age of winners is 21.8 years, and Phil is 21. Phil was never in the Bottom Group, just like 7/10 of his predecessors. All winners except Lee DeWyze and Jordin Sparks hailed from the South or Midwest United States, and Phil is from Georgia.

On the flip side you have Jessica Sanchez. She had significantly less pre-exposure (600 seconds), with her audition never having been shown. She’s non-white (mixed race, if I’m not mistaken), female, has never played an instrument on stage, sings mostly songs that have been sung on Idol before. If she were to win, she would be the youngest ever, at 16. She was the bottom-vote-getter in the Top 7 and had to be saved. She is from California, which has not yet produced a winner.

So, on a purely demographic and historical basis, she would appear to be the underdog.

What can we put in Jessica’s column that favors her? Her approval rating, as computed by WNTS, is significantly higher than Phil’s, at about 67, much more in line with previous winners. She has four performances rated higher than 80 on WNTS (Love You I Do, And I Am Telling You I’m Not Going, I Will Always Love You, and Sweet Dreams), whereas Phil has only one (Volcano). The only contestants to win with as few as Phil has were Lee DeWyze (1 song over 80) and Scotty McCreery (0 songs over 80), so in this respect Jessica has the edge.

There is the matter that Jessica has a large following specifically because of her racial identity. Looking at Dialidol’s Geopredictions, Phil and Joshua barely registered any votes at all in Hawaii, a state with a large population of Filipinos. However, relying on that did not work out so well for Jasmine Trias, Camile Velasco, or Jonah Moananu.

As I said in an earlier post, I think Jessica can certainly win, but probably won’t. In light of the evidence above, I think you might agree that her odds hover somewhere around 2:1. But that’s enough of a margin that if she really kills it, she could take the title. This may be the first Finale since season 8 that actually mattered, and as such I think we can say that it’s been an entertaining year, despite the producer’s fumbles and the egregious judging.

May 17 2012

Quick note about tonight’s result (spoilers)

Spoiler alert!
Read the rest of this entry »

Older posts «