Archive for the 'General' Category

Testing the Seer-Sucker Theory: Are pundits any good at their jobs?

pundit watch logo

Earlier this month, Dan Frommer, a writer for Silicon Alley Insider, made a prediction right before Steve Jobs’s WWDC Keynote address.

“We also expect Jobs to unleash some sales milestones; to some extent, his keynotes are like mini earnings calls,” Frommer wrote. “Specifically, we think it’s likely he’ll announce that Apple has sold its 5 billionth song via iTunes.

The prediction was specific, easily understood, and reasonable. But, more importantly, it was wrong. Jobs made many announcements during that address, but iTunes’s 5 billionth sale wasn’t one of them.

One could argue that Jobs’s keynote addresses are primary targets for all sorts of predictions, many of which never turn out to be true. And normally Frommer’s wrong guess would have been drowned out by the thousands of other blog posts, articles, and twitter updates hitting the web that day.

But perhaps unbeknownst to the writer, someone was keeping score. A U.K. man named Nigel Eccles made a note of the prediction and its subsequent failure to come true. It was eventually published on Pundit Watch, a site that launched on June 6.

Several media critics have noted that punditry is one of the few professions in which its participants aren’t punished for being terrible at their jobs. Many of the pundits who predicted we’d find WMDs in Iraq, for instance, are still pulling in multi-million dollar contracts while continuing to impart their widely-discredited wisdom to the masses.

Last year, two researches, Kesten C. Green and J. Scott Armstrong, gave eight “conflict scenarios” to 106 experts — mostly business professors — and asked them to make predictions of outcome. They then turned around and gave the same scenarios to 169 students who were not considered experts. The results, published in the journal Interfaces, showed that predictions made by “experts” were only slightly better than those made by the general population.

This wasn’t the first time that one of the researchers, Armstrong, had dived into such a topic. In 1980 he published a paper titled “The Seer-Sucker Theory: The Value of Experts in Forecasting.”

Pundit Watch takes this theory and applies a modified version of it to the real world. “The Seer Sucker Theory shows that no matter how much evidence there is that proves seers (psychics) don’t exist, there will always be suckers who believe that they do,” Eccles told me in a phone interview yesterday.

To understand how Pundit Watch works, one must first consider the site with which it’s affiliated: Hub Dub.

In previous years, Eccles had worked on “betting exchange” sites, where users made predictions for certain scenarios and then bet for or against them. He began to notice that the way he consumed news closely followed this model; he viewed it as a series of outcomes and found himself making bets on how events would unravel. “I decided it was much more useful instead of reading the polls or taking analysis that I should just look at what the odds were,” he said. “The original idea was let’s do something where users could follow news stories, trade predictions, and in the process they could produce exciting markets.”

hubdub logo

He teamed up with three others and, after raising a seed round of investing, launched Hub Dub in January.

When a new user signs up, he’s given $1,000 in play money. He can then take that currency and bet on user-created questions or write a question of his own. Questions span across a whole range of categories — movie opening weekends, gas prices, election results– and the amount of money at stake hinges on how many people are betting on a particular outcome. Most of the questions are U.S.-centric; though the company is based in the U.K., they realized that the States offered a much larger, “sophisticated” market, and over 80% of their current users live in America.

“So about three months ago we were thinking about all the journalists and pundits out there and saying to ourselves, ‘I wonder how they would perform if they were on the site?’” Eccles said. “So we decided that instead of inviting them on, let’s follow their predictions and record it on Pundit Watch. We did that through stealth for a month, and then at the end of the month we launched it and said, ‘Hey, we’ve been following your predictions, and here is your performance.’”

The three pundit categories — technology, politics, and celebrity gossip — are assigned to “category editors” who are supposed to follow the pundits very closely, making note of any forecasts they make. When Chris Matthew said that Hillary Clinton may wait until the Democratic Convention to drop out of the race, a Pundit Watch editor was there to record it. And when she dropped out well before the convention, that same person returned to the site to deliver the verdict — case closed.

chris matthews pundit

So how do the nine pundits they chose to follow measure up? Not always so well. Michael Arrington, founder of TechCrunch, has only seen 14% of his predictions come true (though technically Pundit Watch is following all the writers on his site). Pat Buchanan, surprisingly enough, is currently running four for four; all his fortellings have come to fruition.

“Some of the pundits we’re tracking are saying, ‘Well it’s really subjective, and I don’t think I made that prediction,’” Eccles said. “What we’ve said is we are trying to be as objective as possible; we’re taking very much a reader’s view. Some of the pundits said, ‘I didn’t make that prediction’ And we replied, ‘Well, you reported a rumor that that was the case.’ We take the view that reporting a rumor is quite similar to making a prediction.”

His category is technology, and he noticed that pundit forecasts tend to ebb and flow; before major conferences there are huge spikes, followed by inactive lulls once the event has passed.

Unlike many media critics, Eccles doesn’t view a wrong prediction as necessarily a bad thing. In his philosophy, punditry is more of a game than a serious stab at making weighed decisions. “To me as a reader, I want to reach someone with a strong opinion who makes interesting points and does so in a clear fashion,” he said. “These are the guys who are prepared to do that…I think the fact of the matter is — and we can’t admit it — is that a lot of news is entertainment, and I would personally much prefer to read an entertaining journalist who’s wrong a lot of the time than one who’s right most of the time and is quite dry and dull.”

I asked Eccles about the implications that such a site could have over the long term, especially if it became more successful. If these pundits became more aware that someone is keeping a quasi-scientific score of their predictions, would they be more careful and reluctant to make them?

“My biggest fear is that I force a lot of entertaining journalists to turn into dull caveat-driven detail-oriented journalists who wouldn’t be that interesting, would make fewer predictions, and would only make predictions if they were 100% sure [of their outcomes],” he replied. “I think that would be a really unwanted result…To some extent I think of it as an experiment. What will happen? Will the journalists themselves change how they perceive things, or will it change how the readers perceive them? I don’t know what the answer is.”

For his part, a low success rate for a pundit’s forecasts won’t make Eccles any less likely to read or listen to the person’s opinions. “The classic example is TechCrunch. They’re at the bottom of our leader board and yet I never for a moment thought I wouldn’t continue reading it. Because I find it very entertaining writing with very opinionated mixed predictions. Though usually they’re wrong.”

Reason why Youtube was invented #5939389

I love amateur content like this (via hcwdb):

Now that’s what I call uber niche!

abandoned couch

In my last post about the flaws of hyperlocal, I thought about including what I consider to be the future of internet news: not hyperlocal but hyperniche. I’ve been thinking actually about writing an article about this topic for awhile but haven’t gotten around to it yet.

But anyway, I by chance just now came across the perfect example of what I call uber-niche: Abandoned Couches.

Think about it; before Web 2.0. was such a news source possible? How could someone take an idea similar to this and turn it into a profit?

And it’s been around since 2006!

Does anyone know how to do a screen grab?

Hey, does anyone out there know how to do a screen grab? For instance, if I want to take out a slice of something that I see in my browser window from a website and save it as an image, how do I do this? What if I don’t want to grab the whole screen but a very specified part of the screen?

I currently use Firefox. Any help would be greatly appreciated. Just leave the answer in my comments section.

Atheists DO Have Holiday Cards!

Damn, it appears that my opening line in my article on the mainstreaming of atheism is false, atheists do have holiday cards, though these don’t come from Hallmark. A reader named Andrew Shaffer wrote in to say:

Re: Hunter’s comments in the “Dawkins Effect” article: “If they had hallmark cards, maybe they wouldn’t feel so left out. We have Christmas cards. We have Kwanza cards now. Maybe they need to get some atheist cards and get that whole ball rolling so more people can get involved with what they’re doing.”

…funny thing is, we DO have holiday cards!

(wink, wink, points at links to own shop below)

Atheist Holiday Card

***
Related posts:
1. Facebook users fight back

How poker bots will affect online gaming

I first heard about the rise of poker bots a year or two ago. They’re basically computer programs that can enter online poker games on your behalf and gamble — and they’re getting increasingly good at it. Poker websites have tried to figure out ways of detecting them, but it’s not an easy task.

The Freakonomics blog tackles this issue and compares it to a computer program that plays a perfect game of checkers:

It is not an opportune time to start an online gambling site for checkers.

In July, researchers at University of Alberta “solved the game” using brute computer force. As such, their computer knew the best strategy to play in any of the possible 50 billion checker positions. Humans should now be very scared to bet money against any virtual opponent, for fear that they are really facing the Alberta computer or its clone. (You also wouldn’t want to play a money game against a computer in Connect Four or Othello, or even backgammon.)

I don’t particularly agree with the comparison because — unlike Checkers — there’s no such thing as a perfect game of Poker. Though the programs have some statistical advantages, they don’t have the insight to make risky judgment calls. While they probably do have an advantage over less skilled players, I think it’s still very possible to beat a poker bot. But still, it does attenuate the honor of the game.

Cingular: Raising the (apparently useless) bar

Everyone who has seen commercials for Cingular Wireless (now AT&T) has no doubt heard its slogan: “Raising the bar.” What it’s referring to, of course, are the bars that indicate the strength of the wireless signal.

What they fail to mention is that sometimes those bars are utterly useless. For the last half hour, my phone has shown that it has three bars, but whenever I try to make phone calls, it beeps at me saying “Emergency Calls Only.” And this isn’t a rare occurrence; it happens a few times a week. The bars certainly don’t mean anything if all they allow you to do is dial 911.

Well, this is the second major wireless company I’ve tried (I had Verizon before) that turned out to be shitty. You would think that with all the hatred people have for the major wireless phone companies out there that there would be a viable competitor to emerge. I guess that the current ones have such a monopoly with the FCC that it’s difficult for an underdog to rise.

But once one does, the blogosphere will be waiting to cheer it on. It’ll get all the free publicity that money can’t buy.