用户名: 密码: 验证码:
Efficient online extraction of keywords for localized events in twitter
详细信息    查看全文
  • 作者:Hamed Abdelhaq ; Michael Gertz ; Ayser Armiti
  • 关键词:Local keywords ; Localized event ; Event detection ; Social media
  • 刊名:GeoInformatica
  • 出版年:2017
  • 出版时间:April 2017
  • 年:2017
  • 卷:21
  • 期:2
  • 页码:365-388
  • 全文大小:
  • 刊物类别:Earth and Environmental Science
  • 刊物主题:Geographical Information Systems/Cartography; Data Structures, Cryptology and Information Theory; Computer Science, general; Information Storage and Retrieval; Multimedia Information Systems;
  • 出版者:Springer US
  • ISSN:1573-7624
  • 卷排序:21
文摘
Messages published via social media sites, such as Twitter, Facebook, and Foursquare hide a considerable amount of information about real world events. The timely identification of such events from this huge, unstructured, and noisy user-generated content plays an important role in increasing situation awareness and in supporting useful applications such as recommendation systems. Interestingly, a large number of these messages are enriched with location information, due to the recent advancements of today’s location acquisition techniques. This, in turn, enables location-aware event mining, i.e., the detection and tracking of localized events such as sport events, demonstrations, or traffic jams, to name but a few. The main building blocks of a localized event are local keywords that exhibit a surge in usage at the event location. In this paper, we propose an approach that aims at extracting local keywords from a stream of Twitter messages by (1) identifying local keywords, and (2) estimating the central location of each keyword. This extraction procedure is performed in an online fashion using a sliding window over the Twitter stream. Additionally, we address the problem of spatial outliers that adversely affect a sound identification of local keywords. Spatial outliers occur when people far away from the location of an event use related keywords in their Tweets. We handle this problem by adjusting the spatial distribution of keywords based on their co-occurrence with place names that may refer to the location of an event. To ensure scalability, we utilize a hierarchical spatial index to gradually prune the geographic space and thus to efficiently perform complex spatial computations. Extensive comparative experiments are conducted using Twitter data. The analysis of the experimental results demonstrates the superiority of our approach over existing methods in terms of efficiency and precision of the obtained results.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700