A data-intensive approach for discovering user similarities in social behavioral interactions based on the Bayesian network
Abstract
Discovering user similarities from social media can provide a basis for user targeting, product recommendation, and the evolution and understanding of user relationships. User similarities depend not only on the topological structure but also on the degrees of dependence between users. In this paper, we adopt the Bayesian network (BN), an important and popular probabilistic graphical model, as the underlying framework and propose a data-intensive approach for discovering user similarities. First, from massive social behavioral interactions, we give a method for measuring direct similarities between users and a MapReduce-based algorithm for constructing a BN that describes these similarities, called the user Bayesian network (UBN). We also outline how large-scale UBNs can be stored in a distributed file system. Then, to measure indirect similarities between users, we give a method for measuring the closeness of user connections in terms of the properties of the UBN's graphical structure. Further, we give a MapReduce-based algorithm for measuring the dependence degrees by means of the UBN's probabilistic inference. By combining these two measures, the indirect similarity degree between users can be obtained, with its applicability guaranteed theoretically. Finally, we present experimental results that show the efficiency and effectiveness of our method.
