Community Detection in Node Attributed Networks: A Late-fusion Approach

Institution

University of Alberta

Degree Level

Master's

Degree

Master of Science

Department

Department of Computing Science

Specialization

Statistical Machine Learning

Abstract

With the growth of online social media and the deluge of information in today's "big data" era, traditional community mining, which relies on the connections between nodes, no longer suffices to find communities in which node attributes play an important role. Although much research has incorporated attribute information into the search for network communities, little of it has focused on the late-fusion approach, in which two partitions of a network are identified separately, one by a traditional community detection algorithm and the other by a clustering algorithm, and are later combined to produce the final communities. We propose a new late-fusion method that combines the two sources of information by creating an integrated graph whose edges represent the agreement between the communities in the two partitions. For networks with binary or categorical attributes, we design a new technique in which a community detection algorithm, run on a virtual graph, finds clusters that reflect node similarities. We introduce a weighting parameter that controls the relative influence of node connections and node attributes. We experimentally demonstrate the performance of our method on a variety of synthetic and real networks, and show that it is a flexible, accurate and efficient solution to the problem of community detection in attributed networks.
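The late-fusion idea in the abstract can be illustrated with a minimal Python sketch. It is not the thesis's implementation. The abstract does not give the exact edge-weight formula or name the algorithm run on the integrated graph, so both are assumptions here. The function names `integrated_graph` and `components_above`, the parameter `alpha`, and the toy partitions are all hypothetical. The sketch gives a pair of nodes weight `alpha` if the connection-based partition puts them in the same community, plus `1 - alpha` if the attribute-based partition does. In place of a full community detection step, it takes connected components of the edges at or above a threshold.

```python
from itertools import combinations


def integrated_graph(structure_part, attribute_part, alpha=0.5):
    """Build weighted edges from the agreement of two partitions.

    Assumed weighting (for illustration only): alpha for agreement in the
    connection-based partition, (1 - alpha) for agreement in the
    attribute-based partition.
    """
    edges = {}
    for u, v in combinations(sorted(structure_part), 2):
        w = 0.0
        if structure_part[u] == structure_part[v]:
            w += alpha
        if attribute_part[u] == attribute_part[v]:
            w += 1.0 - alpha
        if w > 0:
            edges[(u, v)] = w
    return edges


def components_above(nodes, edges, threshold):
    """Stand-in for community detection on the integrated graph:
    connected components over edges with weight >= threshold."""
    parent = {n: n for n in nodes}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for (u, v), w in edges.items():
        if w >= threshold:
            parent[find(u)] = find(v)

    groups = {}
    for n in nodes:
        groups.setdefault(find(n), set()).add(n)
    return sorted(groups.values(), key=min)


# Toy partitions (hypothetical): node -> community / cluster label.
structure_part = {"a": 0, "b": 0, "c": 0, "d": 1, "e": 1}
attribute_part = {"a": 0, "b": 0, "c": 1, "d": 1, "e": 1}

edges = integrated_graph(structure_part, attribute_part, alpha=0.5)
communities = components_above(structure_part, edges, threshold=1.0)
print(communities)
```

In this toy input, node `c` agrees with `a` and `b` on connections but with `d` and `e` on attributes. Raising `alpha` above 0.5 strengthens its ties to the first group and lowering it strengthens its ties to the second, which is the kind of trade-off the weighting parameter is meant to expose.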

Item Type

http://purl.org/coar/resource_type/c_46ec

Other License Text / Link

Permission is hereby granted to the University of Alberta Libraries to reproduce single copies of this thesis and to lend or sell such copies for private, scholarly or scientific research purposes only. Where the thesis is converted to, or otherwise made available in digital form, the University of Alberta will advise potential users of the thesis of these terms. The author reserves all other publication and other rights in association with the copyright in the thesis and, except as herein before provided, neither the thesis nor any substantial portion thereof may be printed or otherwise reproduced in any material form whatsoever without the author's prior written permission.

Language

en
