« Radiohead Freeloaders Abound, But Does the Business Model Work? | Main | The Growing Importance of Online Sales »

For those of you wondering about comScore's Radiohead study…


By Andrew Lipsman

Radiohead released a statement today calling into question the validity of our recent report. Since we released our report on Monday, November 5, there has been a great deal of media coverage, and while much of the reporting has been excellent, there have been instances in both news media and blog postings that indicate not everyone understands how comScore arrives at its market projections.

comScore reports are derived from a representative sample of 2 million Internet users, who opt in to our panel and allow us to observe their actual online behavior, including e-commerce transactions. Because the data are based on passively observed consumer behavior, as opposed to polls or survey responses, there is no potential for recall error. When we observe an e-commerce transaction in our panel, the value we observe represents the actual price paid by that consumer.

As an affirmation of the validity and representivity of our panel, we regularly release quarterly U.S. e-commerce spending estimates, several weeks in advance of the U.S. Department of Commerce releasing its own figures, and during the past 7 years our figures have rarely deviated from the official Commerce numbers by more than a few percent.

For the Radiohead study, we observed the activity of nearly one thousand people who visited the “In Rainbows” site, a significant percentage of whom downloaded the album. We ultimately observed several hundred paid transactions, all of which ranged between $0-$20, representing a very robust sample for estimating the average price paid per transaction. It’s true that any sample has natural variability, so these numbers are, in fact, estimates. However, when you have a relatively large sample falling within a narrow range of values (i.e. there’s a small standard deviation), the margin of error in the estimate is minimized.

If you’re not yet convinced that sampling can produce an accurate estimate, think of it this way....

Let’s say you work in an office with 500 people and you want to find out how much the average person in the office spent on lunch today. You decide to randomly select 50 people from the office to get an estimate. Now you could ask them to tell you how much they spent on lunch, and you might get a reasonable estimate if they can remember. Some people might have forgotten what they spent, others may round to the nearest dollar, etc. But if you averaged the response of 50 people, you’d probably be in the ballpark.

Now what if, instead of asking them how much they spent on lunch, you asked for their lunch receipt. You would have data based on their actual observed behavior, and you would be able to calculate a much more accurate average. With the lunch receipts of 50 people in hand, how close of an estimate do you think it would produce? Because most people probably spend between $5-$10 on lunch (with a lunch of more than $15 being a rarity), you would get a very accurate estimate. Probably within pennies of the actual average.

This is effectively what we did with the comScore study on Radiohead’s album sales. We observed the actual online spending behavior from a robust sample of hundreds of individuals in order to produce an accurate estimate. If we didn’t have a reasonable sample from which to extrapolate, we wouldn’t have released the data. But we did, and we’re confident in what the data showed.

I hope this helps clarify any confusion on the study.

TrackBack

TrackBack URL for this entry:
http://www.comscore.com/mt/mt-tb.cgi/77

Listed below are links to weblogs that reference For those of you wondering about comScore's Radiohead study…:

» I Radiohead ed il bicchiere mezzo vuoto from Il Blog di Stefano Epifani
[Read More]

» RadioHead Speaks back - "take that, Comscore ... your all wet. from WebMetricsGuru
I just wrote RadioHead Download Metrics - 60% don't pay anything - Comscore a few days ago, when the Comscore study came out finding 3 out of 5 people downloading "In Rainbows", Radiohead's new album, did not pay anything - were... [Read More]

Comments (6)

ra:

You have a sample set of a "couple of hundred" buyers out of 2mm users (who all are potential consumers) in the narrowest sense you would have 1000 potential consumers from which you got these "couple of hundred". You sample set is a fraction of a percent of the 2MM you imply in your write-up.

Data does not lie, but too often it gets molded to meet our message.

luke:

Looks like time will tell.
How do you control for selection bias?

Just a couple of quick questions to help me get a more meaningful understanding on this.

- The 2 MM internet users that serve as your sampling pool. How do you maintain integrity to insure you have a wide variety of internet users (in this case radiohead fans vs. non-radiohead fans). In other words, what is the incentive you offer for people who "opt-in" to have their online behavior tracked by ComScore? How do you guys think about that effecting the overall personality of your pool?

- Do you guys know the relative basis your 1000 "opt-in" sampling is to the overall number downloads? That seems important. In other words, how have you guys internally rationalized your sampling pools online behavior vs. the average radiohead fan visiting the site to download the new album from their favorite band. Is that relationship and the behaviors each may exhibit important in ComScope's eyes?

Finally, to the analogy - eating lunch is not something that you "are" or "are not" a fan of. If there was a group of people who were "devoted fans" of lunch, and ComScore had no access to their data. And another group who may or may not have been a fan of lunch, but ate lunch anyway and gave you the receipt afterward you would have a situation closer to the Radiohead situation. The analogy is mathematically tenable, but it does not seem relevant in my opinion. But your comments are welcomed on this.

To stay that over 60% of "fans" did not pay (which unfortunately was the headline) is different than saying 60% of the 1000 people who are part of a group that allow us to monitor their online behavior did not pay.

It sounds like that was what you are clarifying here to some degree, but the story has already broken, so the message may be lost coming this late in the game.

Thanks and I look forward to your response.

Andrew:

"We ultimately observed several hundred paid transactions" how exactly did ComScore accomplish this?

Rishi

I think that Mitchell's questions are good ones. I also wonder whether the knowledge that one's online behavior is being captured and analyzed changes said behavior.
Having said that (and as I just posted on my own blog), I don't share all the doubts expressed about the report. In particular, the *size* of the sample doesn't bother me.

Let me subscribe everything Mitchell Davis said. I am also waiting for your further clarifications.

Post a comment

(comments policy)

Please complete the test below to help us reduce spam:

About

This page contains a single entry from the blog posted on November 8, 2007 2:24 PM.

The previous post in this blog was Radiohead Freeloaders Abound, But Does the Business Model Work? .

The next post in this blog is The Growing Importance of Online Sales.

Many more can be found on the main index page or by looking through the archives.