Survey Software, Web Survey, Online Surveys, and Enterprise Feedback Management solutions from Vovici

Your email:
   

Welcome to the Listening Post!

Your single source for everything Voice of the Customer (VoC) and Customer Experience (CxP). And, don’t forget you can follow us on twitter @vovici, or come check us out on Facebook and join the Vovici Network on LinkedIn.

 

Current Articles | RSS Feed RSS Feed

Query Bias in Social Media Research

 

Civilization VOne of the difficulties in conducting social media market research is the sheer quantity of information available. The traditional way to weed through it is to use search queries. For instance, if you were interested in what people thought of the video game Sid Meier's Civilization V, just do a search on “Civilization V” right?

But what about all those people who typed “Civilization 5”? Or “Civ V” or “Civ 5”? Or used the hashtags “#CivV” or “#Civ5” on Twitter? (Hashtags are a convention for marking a tweet as being about a specific subject.) Heck, some people might be writing about “the new Civ” or “the new Civilization” or “the new Firaxis game” or “the new Sid Meier game”. Where do you draw the line? Does it really matter?

Sadly, it matters a tremendous amount.

To demonstrate this, I downloaded 100 comments from Twitter for six variations of “Civilization V” and did a manual sentiment analysis, scoring positive sentiment a +1, negative sentiment a -1 and mixed or neutral sentiment a 0. Sentiment varied widely by search query, with only “#CivV” having negative average sentiment. If someone used that as their only query, they’d be missing much of the picture.

Query Bias for Sentiment Analysis

Why the wide variance? Well, the types of tweets differ by language formality. The following chart shows the percent of tweets by query term that were “Newsy” (i.e., they contained a link) and the percent that were “Conversational” (i.e., were replies to another Twitter user).

Query Bias for Language Register

The phrase “Civilization V” was newsy and unconversational, reflecting its level of formality; sadly, this is the term you first think to query on, since it is closest to the full product name. Such a search returns tweets that echo the headlines of game reviews and link to those reviews. The phrase “Civilization 5” was less newsy and less unconversational, since it’s a slight misrepresentation of the formal name. The “Civ” abbreviations were much more conversational and much less likely to contain links.

As Andrew Jeavons points out, there were 25 billion tweets on Twitter last year – there’s no way to read them all and see which apply to the subject you are researching. You have to use queries to narrow your search. Just be aware that your choice of query terms introduces its own bias.

Comments

I think this really reinforces how important it is to get to know the subject matter you're researching. A researcher without a whole lot of experience in the video game industry or even with the Civ series would not necessarily know all the usual slang terms for the games. But someone who has spent the time in the franchise and in the industry, or who has done their homework would know at least some of the slang terms and include them on a search. The legwork matters.
Posted @ Thursday, January 20, 2011 9:10 AM by Christine Luft
Great point Jeffrey. There are lots of good examples of terminology issues for brands. First, everyone knows Walmart. But, we also talk about wallyworld, and walfart and walbarf and waltard (aren't we childish!). What about McDonald's where we talk about MickeyD and McD. And, what about Starbucks which rhymes quite nicely with Star****s. Anyone doing social media research should have a number of different automated and manual processes in place to accurately identify all of the important slang terms, misspellings, and alternate spellings of brand names in order to conduct quality work. Quality is job #1. :)
Posted @ Thursday, January 20, 2011 10:15 AM by Annie Pettit
To Kris: 
That's a good point. Some of the worst and biggest brand names include Target and the Gap. And even in this age of social media, we've already seen cases of poor name change choices - Electrasol changed to Finish; Spray 'n Wash changed to Resolve. These are all horrible names for social media research. But, with very sophisticated techniques, a quality social media research company will still be able to collect sufficient quantities of quality data, just no where near the amounts that could be collected with distinct names like Adidas and Microsoft.
Posted @ Thursday, January 20, 2011 11:20 AM by Annie Pettit
And what about Dick's (sporting goods) and IBM! 
 
I still shake my head at the first one -- an obvious URL for them is definitely where a shopper does NOT want to venture.
Posted @ Thursday, January 20, 2011 3:40 PM by Kris Hodges
This issue is just one example why anyone planning on using this powerful new methodology (or set of methods, to be precise) needs to know the potential challenges up front. Pardon the plug, but I think I captured some of the best thinking on the pros and cons of social media research from smart folks including Annie and Jeffrey in this white paper (published a few months ago): http://www.researchrockstar.com/social-media-research-white-paper/
Posted @ Monday, February 14, 2011 3:49 PM by kathryn korostoff
Post Comment
Name
 *
Email
 *
Website (optional)
Comment
 *

Allowed tags: <a> link, <b> bold, <i> italics

Latest Posts

Loading
What's New
Don't Be in the 4%
VoC on Twitter
Verint Blog
Verint Blog: Read the Latest from the Verint Systems Blog