24.8.09

Mining the Web for Feelings, Not Facts

Published: August 23, 2009

Computers may be good at crunching numbers, but can they crunch feelings?



Peter DaSilva for The New York Times

Margaret Francis and Mars Hall of the San Francisco company Scout Labs.

Enlarge This Image

Minh Uong/The New York Times

The rise of blogs and social networks has fueled a bull market in personal opinion: reviews, ratings, recommendations and other forms of online expression. For computer scientists, this fast-growing mountain of data is opening a tantalizing window onto the collective consciousness of Internet users.


An emerging field known as sentiment analysis is taking shape around one of the computer world’s unexplored frontiers: translating the vagaries of human emotion into hard data.

This is more than just an interesting programming exercise. For many businesses, online opinion has turned into a kind of virtual currency that can make or break a product in the marketplace.


Yet many companies struggle to make sense of the caterwaul of complaints and compliments that now swirl around their products online. As sentiment analysis tools begin to take shape, they could not only help businesses improve their bottom lines, but also eventually transform the experience of searching for information online.


Several new sentiment analysis companies are trying to tap into the growing business interest in what is being said online.


“Social media used to be this cute project for 25-year-old consultants,” said Margaret Francis, vice president for product at Scout Labs in San Francisco. Now, she said, top executives “are recognizing it as an incredibly rich vein of market intelligence.”


Scout Labs, which is backed by the venture capital firm started by the CNet founder Halsey Minor, recently introduced a subscription service that allows customers to monitor blogs, news articles, online forums and social networking sites for trends in opinions about products, services or topics in the news.


In early May, the ticket marketplace StubHub used Scout Labs’ monitoring tool to identify a sudden surge of negative blog sentiment after rain delayed a Yankees-Red Sox game.


Stadium officials mistakenly told hundreds of fans that the game had been canceled, and StubHub denied fans’ requests for refunds, on the grounds that the game had actually been played. But after spotting trouble brewing online, the company offered discounts and credits to the affected fans. It is now re-evaluating its bad weather policy.


“This is a canary in a coal mine for us,” said John Whelan, StubHub’s director of customer service.


Jodange, based in Yonkers, offers a service geared toward online publishers that lets them incorporate opinion data drawn from over 450,000 sources, including mainstream news sources, blogs and Twitter.


Based on research by Claire Cardie, a former Cornell computer science professor, and Jan Wiebe of the University of Pittsburgh, the service uses a sophisticated algorithm that not only evaluates sentiments about particular topics, but also identifies the most influential opinion holders.


Jodange, whose early investors include the National Science Foundation, is currently working on a new algorithm that could use opinion data to predict future developments, like forecasting the impact of newspaper editorials on a company’s stock price.


In a similar vein, The Financial Times recently introduced Newssift, an experimental program that tracks sentiments about business topics in the news, coupled with a specialized search engine that allows users to organize their queries by topic, organization, place, person and theme.


Using Newssift, a search for Wal-Mart reveals that recent sentiment about the company is running positive by a ratio of slightly better than two to one. When that search is refined with the suggested term “Labor Force and Unions,” however, the ratio of positive to negative sentiments drops closer to one to one.


Such tools could help companies pinpoint the effect of specific issues on customer perceptions, helping them respond with appropriate marketing and public relations strategies.

For casual Web surfers, simpler incarnations of sentiment analysis are sprouting up in the form of lightweight tools like Tweetfeel, Twendz and Twitrratr. These sites allow users to take the pulse of Twitter users about particular topics.


A quick search on Tweetfeel, for example, reveals that 77 percent of recent tweeters liked the movie “Julie & Julia.” But the same search on Twitrratr reveals a few misfires. The site assigned a negative score to a tweet reading “julie and julia was truly delightful!!” That same message ended with “we all felt very hungry afterwards” — and the system took the word “hungry” to indicate a negative sentiment.


While the more advanced algorithms used by Scout Labs, Jodange and Newssift employ advanced analytics to avoid such pitfalls, none of these services works perfectly. “Our algorithm is about 70 to 80 percent accurate,” said Ms. Francis, who added that its users can reclassify inaccurate results so the system learns from its mistakes.


Published: August 23, 2009

(Page 2 of 2)


Translating the slippery stuff of human language into binary values will always be an imperfect science, however. “Sentiments are very different from conventional facts,” said Seth Grimes, the founder of the suburban Maryland consulting firm Alta Plana, who points to the many cultural factors and linguistic nuances that make it difficult to turn a string of written text into a simple pro or con sentiment. “ ‘Sinful’ is a good thing when applied to chocolate cake,” he said.

Skip to next paragraph
Enlarge This Image

Scout Labs tracks positive and negative sentiments about keywords like “cash for clunkers.”


The simplest algorithms work by scanning keywords to categorize a statement as positive or negative, based on a simple binary analysis (“love” is good, “hate” is bad). But that approach fails to capture the subtleties that bring human language to life: irony, sarcasm, slang and other idiomatic expressions. Reliable sentiment analysis requires parsing many linguistic shades of gray.


“We are dealing with sentiment that can be expressed in subtle ways,” said Bo Pang, a researcher at Yahoo who co-wrote “Opinion Mining and Sentiment Analysis,” one of the first academic books on sentiment analysis.


To get at the true intent of a statement, Ms. Pang developed software that looks at several different filters, including polarity (is the statement positive or negative?), intensity (what is the degree of emotion being expressed?) and subjectivity (how partial or impartial is the source?).

For example, a preponderance of adjectives often signals a high degree of subjectivity, while noun- and verb-heavy statements tend toward a more neutral point of view.


As sentiment analysis algorithms grow more sophisticated, they should begin to yield more accurate results that may eventually point the way to more sophisticated filtering mechanisms. They could become a part of everyday Web use.


“I see sentiment analysis becoming a standard feature of search engines,” said Mr. Grimes, who suggests that such algorithms could begin to influence both general-purpose Web searching and more specialized searches in areas like e-commerce, travel reservations and movie reviews.

Ms. Pang envisions a search engine that fine-tunes results for users based on sentiment. For example, it might influence the ordering of search results for certain kinds of queries like “best hotel in San Antonio.”


As search engines begin to incorporate more and more opinion data into their results, the distinction between fact and opinion may start blurring to the point where, as David Byrne once put it, “facts all come with points of view.”

Share/Save/Bookmark

Sun and snow

Whether you're heading to the beach or the slopes, there are precautions you should take to avoid the effects of the weather


At the beach

Even if it's cloudy, protect yourself from harmful UV rays by using a suncream with a high protection factor. Reapply it every couple of hours and don't forget to protect your ears, lips, hair parting and the tip of your nose with sunblock.

Stay in the shade between 11am and 3pm when the rays are at their most damaging.

Sunburn

Sunburn leads to premature ageing and an increased risk of skin cancer. To avoid it, make sure you:

  • Wear a wide-brimmed hat and loose clothing
  • Choose sunglasses with UV filters to protect your eyes
  • Protect children and babies from direct sunlight - apply a suncream with SPF25 or higher, and make sure they wear hats and long sleeves
  • Keep babies under 12 months out of the sun
FULL ARTICLE>>>


Share/Save/Bookmark

Why is the African continent poor?

Children in South Sudan

By Mark Doyle
BBC world affairs correspondent

The desolate, dusty town of Pibor on South Sudan's border with Ethiopia has no running water, no electricity and little but mud huts for the population to live in.

You would be hard put to find a poorer place anywhere on earth.

I went there as part of a journey across Africa to ask the question "Why is Africa poor?" for a BBC radio documentary series.

We have oil and many other minerals - go name it
Barnaba Benjami, South Sudan regional co-operation minister

I was asked to investigate why it is that every single African country - with the exceptions of oil-rich Gabon and Algeria - is classified by the United Nations as having a "low" broadly defined Human Development Index - in other words an appalling standard of living for most of the people.

In Pibor, the answer to why the place is poor seems fairly obvious.

The people - most of whom are from the Murle ethnic group - are crippled by tribal conflicts related to disputes over cattle, the traditional store of wealth in South Sudan.

The Murle have recently had fights with the Lol Nuer group to the north of Pibor and with ethnic Bor Dinkas to the west.

In a spate of fighting with the Lol Nuer earlier this year several hundred people, many of them women and children, were killed in deliberate attacks on villages.

There has been a rash of similar clashes across South Sudan in the past year (although most were on a smaller scale than the fights between the Lol Nuer and the Murle).

And so the answer to why South Sudan is poor is surely a no-brainer: War makes you destitute.

Why is there so much war?

And yet South Sudan is potentially rich.

Fisherman on Lake Victoria
Our leaders, they just want to keep on being rich. And they don't want to pay taxes
Fisherman on Lake Victoria

"It's bigger than Kenya, Uganda, Rwanda and Burundi combined," the South Sudan Regional Co-operation Minister Barnaba Benjamin, enthused.

"Tremendous land! Very fertile, enormous rainfall, tremendous agricultural resources. Minerals! We have oil and many other minerals - go name it!"

The paradox of rich resources and poor people hints at another layer of explanation about why Africa is poor.

It is not just that there is war. The question should, perhaps be: "Why is there so much war?"

And the headline question is in fact misleading; Africans as a people may be poor, but Africa as a place is fantastically rich - in minerals, land, labour and sunshine.

That is why outsiders have been coming here for hundreds of years - to invade, occupy, convert, plunder and trade.

But the resources of South Sudan, for example, have never been properly developed.

During colonial rule South Sudan was used as little more than a reservoir of labour and raw materials.

Then independence was followed by 50 years of on-off war between the south and north - with northerners in Khartoum continuing the British tactic of divide and rule among the southern groups.

Some southerners believe this is still happening today.

Corruption

On my journey across the poorest, sub-Saharan swathe of the continent - that took in Liberia and Nigeria in the west, Sudan in the centre, and Kenya in the east - people explored the impact that both non-Africans and Africans had had on why Africa is poor.

Ellen Johnson-Sirleaf
Ellen Johnson-Sirleaf says she underestimated the problem of graft

Almost every African I met, who was not actually in government, blamed corrupt African leaders for their plight.

"The gap between the rich and the poor in Africa is still growing," said a fisherman on the shores of Lake Victoria.

"Our leaders, they just want to keep on being rich. And they don't want to pay taxes."

Even President Ellen Johnson-Sirleaf of Liberia came close to this when she told me she had underestimated the level of corruption in her country when she took office.

"Maybe I should have sacked the whole government when I came to power," she said.

"Africa is not poor," President Johnson-Sirleaf added, "it is poorly managed."

This theme was echoed by an architect in Kenya and a senior government official in Nigeria.

Both pointed out that the informal sector of most African economies is huge and almost completely unharnessed.

Eastleigh in Nairobi
Eastleigh has the most expensive real estate in Nairobi

Marketplaces, and a million little lean-to repair shops and small-scale factories are what most urban Africans rely upon for a living.

But such is their distrust of government officials that most businesspeople in the informal sector avoid all contact with the authorities.

Kenyan architect and town planner Mumo Museva took me to the bustling Eastleigh area of Nairobi, where traders have created a booming economy despite the place being almost completely abandoned by the government.

Eastleigh is a filthy part of the city where rubbish lies uncollected, the potholes in the roads are the size of swimming pools, and the drains have collapsed.

Kenyan architect and town planner Mumo Musev
Africa is not poor. Africa is just poorly managed
Architect Mumo Museva

But one indication of the success of the traders, Mr Museva said, was the high per-square-foot rents there.

"You'll be surprised to note that Eastleigh is the most expensive real estate in Nairobi."

He added that if Eastleigh traders trusted the government they might pay some taxes in return for decent services, so creating a "virtuous circle".

"It would lift people out of poverty," he said.

"Remember, poverty is related to quality of life, and the quality of life here is appalling, despite the huge amount of wealth flowing through these areas."

Then the young Kenyan architect echoed the Liberian president, some 5,000km (3,000 miles) away on the other side of the continent.

"Africa is not poor," he also said.

"Africa is just poorly managed."


You can hear the first of Mark Doyle's programmes Why is Africa poor? on the BBC World Service on Monday 24 August 2009 at 0906 GMT, 1406 GMT and 1906 GMT. It will also be available for a week on the website.




Share/Save/Bookmark