Hey NBC & WSJ–What’s With Your Poll Report?


Today NBC and the Wall Street Journal released a new survey conducted by the polling company Hart/McInturff.  MSNBC.com’s deputy political director, Mark Murray writes,

The GOP’s ‘likely’ advantage

In the survey, 50 percent of likely voters say they prefer a Republican-controlled Congress, versus 43 percent who want Democrats in charge.

Last month, Republicans held a 46 percent to 43 percent advantage among likely voters on this question.

The GOP’s current seven-point lead, McInturff observes, is on pace — historically — to result in a shift of power in Congress. “The Democrats, with two weeks left, are facing very, very difficult arithmetic.”

Yet among the wider universe of registered voters, Democrats hold a two-point edge, 46 to 44 percent, which is up from the 44 percent to 44 percent tie in September.

But Hart calls that lead “hollow,” because not all registered voters will participate, especially in a midterm election.

Indeed, among those expressing a high interest in voting this midterm season, Republicans hold a 13-point advantage on the generic ballot, 53 percent to 40 percent.

So, as I often do, I clicked on the link to the published survey results and read it through looking for the results described in the article.  First time through I thought I missed this 50% to 43% advantage of the Republicans over the Democrats.  So I read it again, now looking line by line where that percentage comparison came up.  I couldn’t find it.  Okay, so one more time, very deliberately reading through the survey.  Nada.  What I did discover were two very interesting questions that belie a different mood in the electorate.

Keep in mind this was a survey of 1000 registered voters, which, as I have explained in previous posts, I hold to the theory that registered voters provide a more reliable sample and predictor of which way the election is more likely to go.  In two weeks we’ll know.

Let’s look first at question Q11a:

Q11a:   What is your preference for the outcome of this year’s congressional elections — (ROTATE:) a Congress controlled by Republicans or a Congress controlled by Democrats?

The result favors the Democrats 46% to Republicans 44%. That is mentioned in the quote above.  It’s also within the published margin of error of +/- 3.10%.  One could say, therefore, it’s a wash, but there is some interesting info in the trends.  The median (the midpoint of the 11 surveys NBC/WSJ has conducted since January 2010 for the Republicans is 44% and the Democrats, 43%.  Now that is tight!  The slope of those eleven surveys for this question is also small, 0.21% for the GOP and 0.25% for the Dems.  Not what you might write home about.  But still, when you apply those numbers to tens of millions of voters, small changes can make the difference.  It also shows that the Democrats are perhaps not in quite as bad a shape as the pundits have been droning on about month after month.

But there is more.  Question Q12A reads:

Q12a: If you had the choice in your congressional district, would you be more likely to vote for a (ROTATE:) Republican, Democratic, Libertarian, or  Green Party candidate for Congress?

Since the smaller party candidates are at best wild cards, we can’t make a prediction if those who claim affiliation will actually vote for them.  But here, we find the responses again favor the Democrats over the Republicans, 44% to 41%.  Unfortunately, the survey does not provide the historical results of this question.

If you have looked at the published survey results, you might have noticed a number of questions are missing.  So, we can perhaps infer that NBC and the WSJ decided the most interesting information in the survey they wanted to keep to themselves, which is their prerogative since they paid for it.  And perhaps that 50% Republican advantage is among those survey items that were, shall we say, redacted.  But if the Democrats are in such dire shape going into the election and the survey shows that is very clear, why bother with cutting questions out of the published report?  Wouldn’t a reasonable person, or especially a partisan one, such as the Wall Street Journal’s clear editorial preference for Conservatives, want that information right there for everyone to see?  I can’t answer that question, but I do find it perplexing.

Nevertheless, there is enough information to ponder the strength of the GOP’s “surge” as reported.  My updated graph with the trend lines still shows the Democrats in a stronger growth curve:

Registered Voters Surveys 18 Oct 10. Data Courtesy HuffPost Pollster

I will be eagerly looking forward to the next batch of polls to be released.  Each data point provides a world of information about might or can happen on November 2nd.

Where the Wild Things Are–Reading the Polls

Tuesday night, September 14, 2010: The final day of the big primaries prior to the general election in November.  I’m sitting watching the results come in.  My network of choice tonight is MSNBC.  It’s just so much fun watching Rachel Maddow narrate the primary like it was a Super Bowl.

Statistics during the election season are thrown around like cheap bead necklaces at a Mardi Gras parade.  Polls are quoted like they mean something and are perfect predictors of the future.

Well, I’m going to tell you something the politicians and pollsters and pundits would prefer you don’t know:

Polls Results Can Be Stretched Like Bungee-Jumping.

Yep, its true.  And I can give some simple examples.

You can follow along.  But open a new window in your browser so you can click back and forth more easily.

First click here on Pollster.com.  This should take you to the 2010 National Congressional Ballot. This is for the House of Representatives only.  What we’re interested in is the polling chart.  It should look like this:

2010 National Congressional Ballot. Image: Pollster.com

So, how do you read this mess of dots and lines?  The dots are called a scatter plot and each one represents a poll taken on a certain day or period of days (usually 2-3).  The date of the poll is on the horizontal line and the percentage Republican (red) and Democratic (blue) is on the vertical line.  The squiggly lines in the middle are called a trend line and represents the mid-point of the dots for that day.  For those of you who may have taken statistics some time in the past, this is also called a linear regression. The colors represent the same political parties as the dots.  Correction: Dr. John Bogen, an Extreme Thinkover contributor, corrected my error in labeling the trend line as a “linear regression.”  Although the trend lines are based on regression formulas, I should have labeled it as Pollster.com calls it, a “trend estimate.”  For more info on Pollster’s statistical methods for trend estimates, click here.

Now take a moment to go to the Pollster.com site and look at their “live” chart.  Each dot will open a fly-by box telling you the pollster, date and the results.  Pretty nifty, huh.  There is also an expand box in the upper right hand corner if you want to open the chart to fill your screen.  The features still all work.

What, then, does this chart, as presented, convey?  Notice the date begins in November 2008, at the time of the Presidential election and covers the time since then.  Here the trend line is easier to read because you can see the ups and downs of the popularity of each of the two major parties over the past two years.

If this chart was the only one you looked at you would conclude that the Republicans have made huge gains beginning about May 2010 and now hold a 47.1% to 40.6% lead over the Democrats.  And you would be wrong.  Something is missing.  First of all what about the undecided voters?  Where are they?  How many of them are there? What is their trend?  For that answer, click here.

A new set of black dots with a trendline appears on the chart representing those voters who answered “undecided” on who they plan to vote for.  You can also see that as this year has progressed, the line has trended just slightly upward, and only since August have more of the people made up their minds.  As of today though, the undecideds are still 10.4% of the total, which is larger than the gap between the Republicans and the Democrats.  This is where it gets interesting.

I submit that in this gap is where the wild things are, to reframe the title of Maurice Sendak’s beloved children’s book.

Reading the gaps is where the information about the most dynamic trends in the electorate are.  Follow me on this.  Go back to the original chart.  You should already have the Red, Blue and Black trends open.  On the footer is a button titled “Tools.” Click on it and it will open another footer just above with six different choices on it.  Click on “Filter.”  This will open a small window with three check boxes: Live phone interviews, automated phone (i.e. robocall) interviews, and internet.  Place your cursor on each one and you will get a list of “filter options.”  Notice on the first option, Live phone interviews, there is an arrow in the top right hand corner.  This option has three pages and we’ll use them.

We want to narrow our polling data to the most relevant and the highest chance for honest answers.  To do that, based on my criteria (you are free to choose your own), I say let’s eliminate the internet surveys, first.  They are very hard to get a true random sample and very easy to lie on.  Next, let’s eliminate the robocalls, too.  Even though the calls go out to a random sample (supposedly) it is very easy to lie to a machine.  That leaves us with the live interviews.  These surveyors, you will notice are familiar big name pollsters, who have a reputation to uphold, and nearly all of them publish their survey questions and results online for free access for anyone interested in reading them (which would include geeks like me).   We want to cull some of these, still.  They are the pollsters for both political parties because there is a greater chance they will ask weighted questions that favor their side.

So, uncheck the internet, robocalls, and on the live call pages every pollster ID’d with either a R or a D.  Now we have a select set of pollsters who are as neutral as possible and use real people to talk to voters to decrease the chance for lying or misrepresentation.

One more thing.  We really only want to look at the results for the current primary season.  So again, click on tools and then on “Date Range.”  On the left date, click on the month and set it to “01”, the day, “01” and the year, “10” and then click on the blue “Set Range” button.

Look at your results on the chart now.  The polling results have changed.  The Republicans sit at 47.7%, the Democrats at 41.1% and the undecided at a whopping 17.1%!  This, I would suggest is a much clearer picture of the state of the electorate regarding the races in the House of Representatives.  By eliminating those polls that introduce bias into the big picture, either by the way they are administered, or by the way they are designed to benefit their candidates, we can see that the November election is far less certain than most pundits and politicians are leading us to believe.

The fact that apparently over 17% of the electorate is still vacillating about who they will vote for in the general election means the predicted gains by the Republicans has to be called into question, the predicted losses by the Democrats has to be called into question, and the outcomes across the country will very possibly be different than is now being predicted.  It may also mean that the gains or losses may be greater than predicted and one party or the other end up with a significant lop-sided outcome.

But one principle in polling must not be forgotten.  Each poll is a snap-shot in time and by itself can be either an accurate or inaccurate reflection of the voters’ will. It is also important to remember the truism that all politics is local and as Dr. Bogen also points out, the undecided percentage is likely to be smaller on the local scene.  He also rightly suggests this local phenomenon, all things being equal, favors the challenger.   This same principle applies to groups of polls as well because they are aggregates of local polls.  Political trending, although becoming more sophisticated all the time still cannot reliably predict the outcome on election day every time.  We have far to go to reach the algorithmic precision of Isaac Asimov’s Foundation “psychohistory.”  In the mean time we  have to search for the data where the wild things are.