• search hit 20 of 2684
Back to Result List

LazyFox: fast and parallelized overlapping community detection in large graphs

  • The detection of communities in graph datasets provides insight about a graph's underlying structure and is an important tool for various domains such as social sciences, marketing, traffic forecast, and drug discovery. While most existing algorithms provide fast approaches for community detection, their results usually contain strictly separated communities. However, most datasets would semantically allow for or even require overlapping communities that can only be determined at much higher computational cost. We build on an efficient algorithm, FOX, that detects such overlapping communities. FOX measures the closeness of a node to a community by approximating the count of triangles which that node forms with that community. We propose LAZYFOX, a multi-threaded adaptation of the FOX algorithm, which provides even faster detection without an impact on community quality. This allows for the analyses of significantly larger and more complex datasets. LAZYFOX enables overlapping community detection on complex graph datasets with millionsThe detection of communities in graph datasets provides insight about a graph's underlying structure and is an important tool for various domains such as social sciences, marketing, traffic forecast, and drug discovery. While most existing algorithms provide fast approaches for community detection, their results usually contain strictly separated communities. However, most datasets would semantically allow for or even require overlapping communities that can only be determined at much higher computational cost. We build on an efficient algorithm, FOX, that detects such overlapping communities. FOX measures the closeness of a node to a community by approximating the count of triangles which that node forms with that community. We propose LAZYFOX, a multi-threaded adaptation of the FOX algorithm, which provides even faster detection without an impact on community quality. This allows for the analyses of significantly larger and more complex datasets. LAZYFOX enables overlapping community detection on complex graph datasets with millions of nodes and billions of edges in days instead of weeks. As part of this work, LAZYFOX's implementation was published and is available as a tool under an MIT licence at https://github.com/TimGarrels/LazyFox.show moreshow less

Export metadata

Additional Services

Search Google Scholar Statistics
Metadaten
Author details:Tim Garrels, Athar KhodabakhshORCiD, Bernhard Y. RenardORCiDGND, Katharina BaumORCiDGND
DOI:https://doi.org/10.7717/peerj-cs.1291
ISSN:2376-5992
Pubmed ID:https://pubmed.ncbi.nlm.nih.gov/37346513
Title of parent work (English):PEERJ Computer Science
Publisher:PeerJ Inc.
Place of publishing:London
Publication type:Article
Language:English
Date of first publication:2023/04/20
Publication year:2023
Release date:2024/06/24
Tag:C++ tool; Community analysis; Graph algorithm; Heuristic triangle estimation; Large networks; Open source; Overlapping community detection; Parallelized algorithm; Runtime improvement; Weighted clustering coefficient
Volume:9
Article number:e1291
Number of pages:30
Funding institution:Add-on Fellowship for Interdisciplinary Life Sciences of the Joachim; Herz Stiftung; Deutsche Forschungsgemeinschaft (DFG, German Research; Foundation) [491466077]
Organizational units:Digital Engineering Fakultät / Hasso-Plattner-Institut für Digital Engineering GmbH
DDC classification:0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 004 Datenverarbeitung; Informatik
Peer review:Referiert
Grantor:Publikationsfonds der Universität Potsdam
Publishing method:Open Access / Gold Open-Access
DOAJ gelistet
License (German):License LogoCC-BY - Namensnennung 4.0 International
Accept ✔
This website uses technically necessary session cookies. By continuing to use the website, you agree to this. You can find our privacy policy here.