Don't Count Your Models Before They've Hatched

Ezra Klein touts a new statistical model forecasting "an expected Democratic gain of 32 seats with Democratic control (a gain of 18 seats or more) a near certainty." Ezra remarks, "All the usual disclaimers apply, but things would have to go mighty awry for this election to slip through the left's fingers."

Well, let me offer some disclaimers. One is that there are two kinds of models based on historical data. One kind looks at the historical data, devises a model that fits the historical data well, and then offers a prediction based on the model. That's what these guys have done. In another kind of model, you do the same thing and then, having offered your prediction, you wait for the election outcome and it turns out that your prediction was good. Then, next time around, the same thing happens. That is to say that in the second sort of situation your model is not only based on historical data, but has an actual track record of success. I'd be a lot more confident in a model with a track record, since there are actually any number of formulae that might fit the historical data well.

More concretely, I still worry that we might see a new al-Qaeda video aimed at tipping things toward the GOP. I wish more liberals were putting this worry, and Brad Plumer's argument about it, out there before it happens.

Comments

Haven't you heard? The GOP was so worried that Osama wouldn't make a video to help them like he did in 2004 that the RNC went ahead and made a nice al-Qaeda recruiting video on their own. It was on all the news channels.

Posted by: DP on October 24, 2006 02:27 PM

My only electoral model is despair.

Whatever.

Posted by: Jeffrey Davis on October 24, 2006 02:30 PM

Agreed that we should be out ahead of the next al-Qaeda video.

Even more important, we should be out ahead of Bush's surprise bombing of Iran.

Maybe it won't happen. But I don't like the moves afoot with the Navy.

Posted by: kid bitzer on October 24, 2006 02:34 PM

The researchers wouldn't have to wait until the next election to better evaluate the quality of their model. They could hold out a set of elections from the set from which they derive the model's parameters, then use the held-out set to fairly evaluate the quality of the model (i.e., they could see how well their model predicts the outcomes in the held-out set).

I haven't read the paper, so I'm not sure if they did this. If they did, I don't think your quibble stands. If they didn't, I'd wonder why not.

Posted by: ErnieP on October 24, 2006 02:35 PM

Actually, all models of this sort are purely based on historical data (since if we had future data, there wouldn't be much reason to build a model). Let's say you have all the historical data and your model. Common practice is to take some of the data out (in time series data, that would be the most recent observations--in this case, the most recent election cycle(s)), fit your model on the remaining training data, then see how well it works on the holdout data.

The 2nd kind of model you describe uses future data as its holdout. That's fine, but it means you have to wait a year before applying a model, which is totally unnecessary. The statisticians can just use the most recent election cycle as their holdout set and use it to test the model. Indeed, I would be a little surprised if they didn't do that.

Posted by: RWB on October 24, 2006 02:43 PM
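
A minimal sketch of the hold-out check ErnieP and RWB describe: fit on the earlier elections, then score on the most recent ones. Everything here is an assumption for illustration, not the forecasters' actual method: the data are synthetic, and the names `poll_margin`, `seat_change`, `fit_line`, and `temporal_holdout` are hypothetical.

```python
import random

def fit_line(xs, ys):
    """Ordinary least squares for y = a + b * x (one predictor)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b

def mean_abs_error(a, b, xs, ys):
    """Average absolute gap between predicted and observed values."""
    return sum(abs(a + b * x - y) for x, y in zip(xs, ys)) / len(xs)

def temporal_holdout(xs, ys, n_holdout):
    """Fit on all but the last n_holdout observations, score on those."""
    train_x, test_x = xs[:-n_holdout], xs[-n_holdout:]
    train_y, test_y = ys[:-n_holdout], ys[-n_holdout:]
    a, b = fit_line(train_x, train_y)
    return {
        "train_mae": mean_abs_error(a, b, train_x, train_y),
        "holdout_mae": mean_abs_error(a, b, test_x, test_y),
    }

# Synthetic stand-in for historical elections, oldest first (assumed data).
rng = random.Random(0)
poll_margin = [rng.uniform(-10, 10) for _ in range(30)]
seat_change = [3.0 * m + rng.gauss(0, 8) for m in poll_margin]

print(temporal_holdout(poll_margin, seat_change, n_holdout=3))
```

The holdout error is only an honest estimate if the held-out elections played no part in choosing or tuning the model.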

ErnieP beat me to it--and said it better.

Posted by: RWB on October 24, 2006 02:45 PM

Whatever model is used may be thwarted by the coming Election Day antics. Screw-ups, both unplanned and planned, might significantly raise the noise floor for any useful signal they might glean from the model.

Posted by: blank_stare on October 24, 2006 03:00 PM

Today is October 24. Four years ago, Paul Wellstone's plane crashed on October 25.

Don't discount the possibility of "things going mighty awry" at the last minute.

Posted by: Chris on October 24, 2006 03:04 PM

Agree totally. I wish some of the rest of Left Blogistan would cease and desist with the victory laps and concentrate on GOTV. This thing will pivot on turn-out.

Matt, any way you could prevail on, say, myDD and firedoglake to get real?

Posted by: BrklynLibrul on October 24, 2006 03:05 PM

A model that is validated using hold-out data is better supported than one that isn't. However, Matt's overall point still stands. Suppose I come up with thousands of different random models, and evaluate them all using hold-one-out cross validation. The one that scores the best will, in effect, have been "trained" on the hold out data.

In other words, in something as complicated as election forecasting, a model that is fit to past election results will end up being biased by factors particular to them. If the election process is described by stationary statistics, and you have enough historical elections to use in your training set, then the bias will be negligible. However, that is almost certainly not the case here.

Posted by: Jim W on October 24, 2006 03:10 PM
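
A rough illustration of Jim W's point about picking the best of thousands of random models, under stated assumptions: the outcomes are pure coin flips, each "model" is just a list of random guesses, and the function name `best_of_random_models` is hypothetical. The winner looks strong on the hold-out set only because it was selected on it.

```python
import random

def accuracy(pred, truth):
    """Fraction of predictions that match the observed outcomes."""
    return sum(p == t for p, t in zip(pred, truth)) / len(truth)

def best_of_random_models(n_models, n_holdout, n_fresh, seed=0):
    """Pick the random model that scores best on the hold-out set,
    then score that same model on fresh, never-seen outcomes."""
    rng = random.Random(seed)
    # Outcomes are coin flips, so no model has real predictive power.
    holdout = [rng.getrandbits(1) for _ in range(n_holdout)]
    fresh = [rng.getrandbits(1) for _ in range(n_fresh)]
    best_holdout, best_fresh = -1.0, 0.0
    for _ in range(n_models):
        guesses = [rng.getrandbits(1) for _ in range(n_holdout)]
        score = accuracy(guesses, holdout)
        if score > best_holdout:
            best_holdout = score
            # The winner's guesses on fresh data are just as random.
            fresh_guesses = [rng.getrandbits(1) for _ in range(n_fresh)]
            best_fresh = accuracy(fresh_guesses, fresh)
    return best_holdout, best_fresh

holdout_acc, fresh_acc = best_of_random_models(
    n_models=2000, n_holdout=10, n_fresh=1000
)
print("best hold-out accuracy:", holdout_acc)
print("same model on fresh data:", fresh_acc)
```

With only ten hold-out elections and thousands of candidates, a near-perfect hold-out score is expected by chance, while the fresh-data score should sit near one half.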

Maybe if you guys had a better foreign policy, voters would not run to the Republicans every time al Qaeda put out a new video.

Posted by: Al on October 24, 2006 03:22 PM

Maybe if the GOP actually cared about our country instead of enriching themselves, they wouldn't be doing Bin Laden's work for him.

Every time the GOP says "vote for us or die," Osama smiles.

Posted by: DP on October 24, 2006 03:40 PM

Just to restate my point to make it a little clearer: the problem is that if researchers used past elections as part of the process of developing their model, even if only in a hold-one-out validation procedure, then they will inevitably end up fitting their model to the validation data. This is because they will tweak the model parameters until the model does well on the validation data.

If, on the other hand, there was a past election that the researchers were completely unaware of, and then they tested their final model on that election, then that would be a true validation of the model, in just the same way that a future election will be.

Posted by: Jim W on October 24, 2006 03:46 PM

Speaking from a data mining perspective. Say one had perhaps 100 elections in which you had sufficient data to be useful -- I'd say it's probably even less than that, as what you really want is a lot of pre-election polling, and that didn't come into widescale use until the last half-century.

In the simplest case, you simply take about 2/3rds of the data to train your model, then test it on the remaining third. (In actuality, you'd want to do some other stuff -- some elections are more 'interesting' than others and you'd want to expand your samples to account for that).

This doesn't overfit the data -- your model must accurately predict the training data AND the test data. I mean, it COULD overfit the data if you kept going back and nudging it, but I'm used to automated processes here. All I get out of test data is how well I predicted it. I don't get stuff like "I was too optimistic about GOP chances in elections where condition X was less than Y". Just "67% accurate" or "15% more Type II errors than Model XYZ".

It'd be difficult to create a model that is 100% accurate on the test data unless you allow some really rigorous feedback -- the sort you explicitly allow for training data and disallow for test data.

Posted by: Morat20 on October 24, 2006 04:00 PM
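
The simplest case Morat20 describes, sketched under assumptions: a random 2/3 train, 1/3 test split where the test set returns a single accuracy number and nothing else. The data are synthetic, the one-feature threshold classifier is a stand-in for whatever model is actually used, and the names `split_train_test`, `fit_threshold`, and `score_accuracy` are hypothetical.

```python
import random

def split_train_test(rows, train_fraction=2 / 3, seed=0):
    """Shuffle a copy of the rows and cut it into train and test parts."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]

def fit_threshold(train):
    """Toy model: the midpoint between the two classes' mean feature value."""
    pos = [x for x, y in train if y == 1]
    neg = [x for x, y in train if y == 0]
    return (sum(pos) / len(pos) + sum(neg) / len(neg)) / 2

def score_accuracy(threshold, test):
    """The only feedback taken from the test set: one accuracy number."""
    hits = sum((x > threshold) == (y == 1) for x, y in test)
    return hits / len(test)

# Synthetic elections: x is a pre-election margin, y is 1 if the party gained.
rng = random.Random(1)
rows = []
for _ in range(99):
    x = rng.gauss(0, 5)
    y = 1 if x + rng.gauss(0, 3) > 0 else 0
    rows.append((x, y))

train, test = split_train_test(rows)
threshold = fit_threshold(train)
print("test accuracy:", score_accuracy(threshold, test))
```

Rerunning the fit after looking at which test elections were missed would turn the test set into training data, which is the feedback loop the comment warns against.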

Maybe if you guys had a better foreign policy, voters would not run to the Republicans every time al Qaeda put out a new video.

Al, we'd still be the party of blacks, gays, atheists, and man-hating women.

Posted by: SomeCallMeTim on October 24, 2006 04:49 PM

SCMT: and latte-drinkers, Volvo-drivers, etc. But none of those things has anything to do with Matthew's expressed worry about an OBL video.

My only point is that you wouldn't have to worry about a late-arriving OBL video causing people to vote Republican if, you know, people didn't vote Republican when they see an OBL video. You might want to consider how to accomplish that.

On the other hand, it is entirely possible that OBL videos don't have any effect on people's voting at all... but that would ruin the (widely-held?) theory on the left that the late-breaking video in '04 caused Kerry to lose.

Posted by: Al on October 24, 2006 05:19 PM

I think OBL counts as a brown person, and therefore a Democrat. We can't win on this issue until the Nazis come back. I think Bush might be overseeing the recreation of our Cold War enemies, but I don't know if he has time--only two years--to convince a reunified Germany to aggressively rearm. That might have to be left to McCain.

Posted by: SomeCallMeTim on October 24, 2006 06:20 PM

the (widely-held?) theory on the left that the late-breaking video in '04 caused Kerry to lose.

Strawman, from Al the Inveterate Fuckwit. But I didn't realise that the CIA's chief analysts were part of 'the left' (see The One Percent Doctrine, where Suskind's sourcing almost certainly is John McLaughlin.)

Posted by: pseudonymous in nc on October 24, 2006 07:13 PM

Everyone should make their own Osama video and post it to YouTube. Flood the zone!

Posted by: warren on October 24, 2006 07:38 PM
Posted by: Tassled Loafered Leech on October 24, 2006 08:46 PM

The best political analysis is one part deduction, and seven parts ESP.

Bill Clinton is a psychic whore.

Posted by: Linus on October 25, 2006 01:23 AM

Al, if your guys had a better foreign policy OBL wouldn't be making videos. It's all irrelevant anyway, though, as it looks like the New Jersey Supreme Court will come through to save the day for the Republicans.

Posted by: William Burns on October 25, 2006 10:13 AM

What BrklynLibrul said. Even October/November surprises aside, Dem cakewalks in recent history have had a tendency to fail to caketalk (or whatever). We need to get our voters to the polls. (Speaking of which, BL, there's a MoveOn phonebanking office in Brooklyn, email me if you're interested in making some calls...great way to have an impact outside of a district where the real election happened on 9/12.)

Posted by: tps12 on October 25, 2006 10:46 AM

Whoops, forgot my email address: tps12@columbia.edu

Posted by: tps12 on October 25, 2006 10:47 AM

It's GOTV, stupid! The repubs beat us 'cause we are lazy fcks.

Posted by: Judson on October 25, 2006 12:26 PM

but I don't know if he has time--only two years--to convince a reunified Germany to aggressively rearm.

I stand corrected.

Posted by: SomeCallMeTim on October 25, 2006 03:35 PM
