用户名: 密码: 验证码:
MIRACLE: A multiple independent random walks community parallel detection algorithm for big graphs
详细信息    查看全文
文摘
Community detection is a common problem in various types of complex networks. With the emerging of large scale real networks like social networks, community detection meets new technology challenges of extremely large computational cost and lack of prior information. Although several literatures recently try to solve these new challenges, they still have limitations of parallelism and running time. With the scale of data increases sharply, the parallelism is necessary but few codes exist. In this work, we analyze the process of random walking in graphs, and observe that the weight of an edge gotten by processing the vertices visited by the walker could be an indicator to measure the closeness of vertex connection. Based on this finding, we first propose a novel parallel computing community detection algorithm for big unweighted undirected graphs in the true sense. The algorithm consists of three steps, including random walking using multiple independent random walks, weight calculating for edges and community detecting. The time complexity of our algorithm is class="mathmlsrc">class="formulatext stixSupport mathImg" data-mathURL="/science?_ob=MathURL&_method=retrieve&_eid=1-s2.0-S1084804516300947&_mathId=si0008.gif&_user=111111111&_pii=S1084804516300947&_rdoc=1&_issn=10848045&md5=37d2775fb15d210b0d310e24218e2295" title="Click to view the MathML source">O(nlogn)class="mathContainer hidden">class="mathCode">O(nlogn) without prior information. In order to implement our parallel computing algorithm efficiently, we also propose a novelty graph partition model. Experimental results show that our algorithm is capable of detecting the community structure and the overlapping parts of communities in real-world effectively (reduce the running time by 400 times at least), and handling the challenges of community detection in big graph era.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700