classifying microblogs for disasters

21
Classifying Microblogs for Disasters Sarvnaz Karimi Jessie Yin Cecile Paris

Upload: sarvnaz-karimi

Post on 07-Jul-2015

174 views

Category:

Data & Analytics


0 download

DESCRIPTION

Paper presentation at ADCS 2013, Brisbane Images used in the presentation are taken from various websites. Credits goes to their creators.

TRANSCRIPT

Page 1: Classifying Microblogs For Disasters

Classifying Microblogs for Disasters

Sarvnaz Karimi Jessie Yin Cecile Paris

Page 2: Classifying Microblogs For Disasters

Social media plays an important role during disasters

CSIRO: positive impact | Classifying Microblogs for Disasters | Sarvnaz Karimi2 |

• Realtime, popular, free• Accessible• Available

Page 3: Classifying Microblogs For Disasters

During disasters people share useful information

• lyttelton tunnel had reopened last night #eqnz

Or ask for help or information

• Kindercare in Fendalton, Christchurch - all okay? We are trying to get through with no luck. #eqnz

• Need help. Any donors of medicines for diarrhea cases in Baganga, Davao Oriental pls? #reliefPH #PabloPH pls tweet @KarloPuerto

Or even offer help

• I hv final yr medstudents in parade rd addington! They cn help. Bruce n boys #eqnz

And sometimes not so useful

• Someone just wondered aloud if the #eqnz was just another sign from God that he doesn't want The Hobbit to get made. #maybe?

CSIRO: positive impact | Classifying Microblogs for Disasters | Sarvnaz Karimi3 |

Page 4: Classifying Microblogs For Disasters

Challenges of Working with Twitter Data

• In fact, lots of times Tweets are useless babbles

• Tweets are really short (140 characters)

• People often speak informal language

• And even in serious messages, tweets can be abbreviated to compensate for the length

CSIRO: positive impact | Classifying Microblogs for Disasters | Sarvnaz Karimi4 |

Finding useful content can become looking for a needle in a haystack!

I hv final yr medstudents in parade rd addington! They cn help. Bruce n boys #eqnz

Page 5: Classifying Microblogs For Disasters

How to filter massive amount of Twitter messages in order to identify high value tweets related to natural or man-made disasters, or even specific types of disaster?

CSIRO: positive impact | Presentation title | Presenter name5 |

Page 6: Classifying Microblogs For Disasters

Keyword search to find disaster-related tweets

• Lots of false-positives due to multiple senses or ambiguities of keywords such as “fire”, or even “earthquake”

CSIRO: positive impact | Classifying Microblogs for Disasters | Sarvnaz Karimi6 |

She’s a natural disaster: a tsunami in her eyes an earthquake in her chest a hurricaneflooding her mind she’s a travelingcatastrophe

In a pool of over 5700 tweets retrieved using keyword search, we had over 50% false positives.

Page 7: Classifying Microblogs For Disasters

Our work: Classify Twitter Stream for Disasters

•Classify tweets as Disaster and Non-disasterBinary Classification

•Classify tweets into disaster types:

– Earthquake

– Storm (hurricane, tornado, cyclone)

– Fire

– Flooding

– Other (e.g Civil disorder, Traffic accident)Multi-class classification problem

CSIRO: positive impact | Classifying Microblogs for Disasters | Sarvnaz Karimi7 |

Page 8: Classifying Microblogs For Disasters

Related Studies

• Tweet classification: o Papers that used classifiers for categories such as news and junk, or opinion,

and private messages.

o Papers that heavily used hashtags.

o Adding context to short tweets by aggregating those that share the same hashtags, or by adding URL contents.

• Twitter during disasters:o Qualitative analysis on tweets published during a specific event to study

microblogger behaviour.

o On of the most cited works is by Sakaki et al. (2010), which made a classifier for earthquake to alert people. Their classifier was based on tweet length, position of query term (earthquake or shaking) in the tweet, n-grams, context of the query terms.

CSIRO: positive impact | Presentation title | Presenter name8 |

We do not focus on specific incidents, and do not assume the hashtags are known.We study different types of disasters, not just one.

Page 9: Classifying Microblogs For Disasters

Twitter Data

• Sampled a total of 6,500 tweets published in a range of two years, from December 2010 till November 2012

• Data was gathered using keyword search (fire, flooding, storm, tornado, hurricane, cyclone, and earthquake, accident).

• No retweets

• A number of disasters were included, among others: earthquake in Christchurch, New Zealand, 2011, Cyclone Yasi QLD, 2011, QLD floods, 2010-2011, bushfires in VIC, 2011, and the Hurricane Sandy, US 2012.

CSIRO: positive impact | Classifying Microblogs for Disasters | Sarvnaz Karimi9 |

Page 10: Classifying Microblogs For Disasters

Annotations

• Two stage annotations

• Crowd-sourced the annotations using Crowdflower.

• Annotators where asked:1. Is this tweet talking about a disaster? (Yes or No);

2. What type of disaster is it talking about? (multiple choice)

• Each tweet was annotated by three annotators

• 5,747 had full agreement

• 2850 tweets were identified as disaster-related and 2,897 as non-disaster

CSIRO: positive impact | Classifying Microblogs for Disasters | Sarvnaz Karimi10 |

Page 11: Classifying Microblogs For Disasters

Classifiers

• SVM Classifier

• Multinomial Naive Bayes Classifier

• We only reported SVM. Naive Bayes consistently performed worse in all the experiments.

CSIRO: positive impact | Classifying Microblogs for Disasters | Sarvnaz Karimi11 |

C. Chang and C. Lin. LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology

Page 12: Classifying Microblogs For Disasters

Classification Features

Specific Features:• N-grams

• Hashtags

• Mentions

Generic Features:• Mention count

• Hashtag count

• Links

• Tweet length

CSIRO: positive impact | Presentation title | Presenter name12 |

What is the effect of using incident-specific compared to generic features inclassification accuracy? What are the best features to use for disaster classifiers?

Page 13: Classifying Microblogs For Disasters

Evaluation: Cross-validation vs. Time-Split

• K-fold cross-validation (e.g., 10 fold) is used in most similar studies (Sriram et al., 2010, Takemura and Tajima, 2012, Vosecky et al., 2012)

Problem:

• It overlooks the time-dependency among microblog data, and uses future-evidence, including hashtags, disaster names

Alternative:

• Time-split evaluation: Sort the data based on time, take the latest chunk as testing and others for training.

CSIRO: positive impact | Classifying Microblogs for Disasters | Sarvnaz Karimi13 |

Page 14: Classifying Microblogs For Disasters

Disaster or Non-Disaster

CSIRO: positive impact | Classifying Microblogs for Disasters | Sarvnaz Karimi14 |

Page 15: Classifying Microblogs For Disasters

Disaster-Type Classification

CSIRO: positive impact | Classifying Microblogs for Disasters | Sarvnaz Karimi15 |

Page 16: Classifying Microblogs For Disasters

What features worked

• When training data is small, counts were better features. – Disaster-related tweets had 1.2 hashtags on average, versus 0.4 for non-

disaster tweets

• When our knowledge of an event is limited, hashtags or mentions are not so useful.

• In our experiments, classification accuracy using bigram features was worse than unigram.

CSIRO: positive impact | Presentation title | Presenter name16 |

Page 17: Classifying Microblogs For Disasters

Generic Features vs. Event-specific Features

• We need to learn the patterns that imply a type of natural or man-made disaster:

Same location, no disaster:

CSIRO: positive impact | Presentation title | Presenter name17 |

A massive cloud of smoke can be seen in south-west LakeMacquarie from the Wyee bushfire #nswfires #wyeefire@NewcastleHerald

Lake Macquarie is big & beautiful http: // lockerz.com/ s/ 257143427

Page 18: Classifying Microblogs For Disasters

Can we cross-train for disaster types?

CSIRO: positive impact | Classifying Microblogs for Disasters | Sarvnaz Karimi18 |

Application:

- Compromise for disaster types with little training data.

- Reduce ambiguity

Training Testing

Page 19: Classifying Microblogs For Disasters

Cross-Disaster Classification

CSIRO: positive impact | Classifying Microblogs for Disasters | Sarvnaz Karimi19 |

Generic featureSpecific Feature

How much our classifiers can be generalised to identify previously unseen disaster types?

• We used under-sampling to create training and testing data

Page 20: Classifying Microblogs For Disasters

Can we cross-train for disaster types?

CSIRO: positive impact | Classifying Microblogs for Disasters | Sarvnaz Karimi20 |

• Yes! Our results showed promise, especially for fire.

• “Language of disaster”

• Using generic features was more effective.

Page 21: Classifying Microblogs For Disasters

What’s Next

Events are often associated with a location1. Better Classifiers: We can use existence of location information

as a feature to strengthen our classifiers

2. Help taking actions on the information: Once we know a tweet is talking about a disaster, we can then extract information on locations. This could help emergency responders in resource allocation.

• We have already established that traditional Named Entity Recognisers are able to identify locations in tweets with high accuracy*. Now we need to pinpoint them on the map!

CSIRO: positive impact | Classifying Microblogs for Disasters | Sarvnaz Karimi21 |

* J. Lingad, S. Karimi, J. Yin, Location Extraction From Disaster-Related Microblogs, Proceedings of the 22nd international conference on World Wide Web companion, 2013