• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Computing communities in large networks using random walks. Computer and Information Sciences-ISCIS (2005)

by P Pons, M Latapy
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 226
Next 10 →

Comparing community structure identification

by Leon Danon, Albert Díaz-guilera, Jordi Duch - Journal of Statistical Mechanics: Theory and Experiment , 2005
"... ..."
Abstract - Cited by 295 (8 self) - Add to MetaCart
Abstract not found
(Show Context)

Citation Context

...n the link with the highest link clustering). This time Zhou presents an algorithm to detect communities similar to hierarchical clustering algorithms described In a similar approach Latapy and Pons (=-=Latapy and Pons, 2004-=-) also employ the intuitive idea that a random walker will get trapped for a longer time in a a densely connected community. They calculate a distance measure between two nodes, and apply an agglomera...

Graph Clustering Based on Structural/Attribute Similarities

by Yang Zhou, Hong Cheng, Jeffrey Xu Yu
"... The goal of graph clustering is to partition vertices in a large graph into different clusters based on various criteria such as vertex connectivity or neighborhood similarity. Graph clustering techniques are very useful for detecting densely connected groups in a large graph. Many existing graph cl ..."
Abstract - Cited by 99 (7 self) - Add to MetaCart
The goal of graph clustering is to partition vertices in a large graph into different clusters based on various criteria such as vertex connectivity or neighborhood similarity. Graph clustering techniques are very useful for detecting densely connected groups in a large graph. Many existing graph clustering methods mainly focus on the topological structure for clustering, but largely ignore the vertex properties which are often heterogenous. In this paper, we propose a novel graph clustering algorithm, SA-Cluster, based on both structural and attribute similarities through a unified distance measure. Our method partitions a large graph associated with attributes into k clusters so that each cluster contains a densely connected subgraph with homogeneous attribute values. An effective method is proposed to automatically learn the degree of contributions of structural similarity and attribute similarity. Theoretical analysis is provided to show that SA-Cluster is converging. Extensive experimental results demonstrate the effectiveness of SA-Cluster through comparison with the state-of-the-art graph clustering and summarization methods. 1.
(Show Context)

Citation Context

...vertex distances and similarities. Jeh and Widom [12] designed a measure called SimRank, which defines the similarity between two vertices in a graph by their neighborhood similarity. Pons and Latapy =-=[18]-=- proposed to use short random walks of length l to measure the similarity between two vertices in a graph for community detection. Other studies which use random walk with restarts include connection ...

WEB-based GEne SeT AnaLysis Toolkit (WebGestalt): update 2013. Nucleic Acids Res. 2013;41(Web Server issue):W77–83

by Jing Wang, Dexter Duncan, Zhiao Shi, Bing Zhang
"... Functional enrichment analysis is an essential task for the interpretation of gene lists derived from large-scale genetic, transcriptomic and proteomic studies. WebGestalt (WEB-based GEne SeT AnaLysis Toolkit) has become one of the popular software tools in this field since its publication in 2005. ..."
Abstract - Cited by 87 (0 self) - Add to MetaCart
Functional enrichment analysis is an essential task for the interpretation of gene lists derived from large-scale genetic, transcriptomic and proteomic studies. WebGestalt (WEB-based GEne SeT AnaLysis Toolkit) has become one of the popular software tools in this field since its publication in 2005. For the last 7 years, WebGestalt data holdings have grown substantially to satisfy the re-quirements of users from different research areas. The current version of WebGestalt supports 8 or-ganisms and 201 gene identifiers from various data-bases and different technology platforms, making it directly available to the fast growing omics com-munity. Meanwhile, by integrating functional categories derived from centrally and publicly curated databases as well as computational analyses, WebGestalt has significantly increased the coverage of functional categories in various bio-logical contexts including Gene Ontology, pathway, network module, gene–phenotype association, gene–disease association, gene–drug association and chromosomal location, leading to a total of 78 612 functional categories. Finally, new interactive features, such as pathway map, hierarchical network visualization and phenotype ontology visu-alization have been added to WebGestalt to help users better understand the enrichment results. WebGestalt can be freely accessed through
(Show Context)

Citation Context

.... Here we briefly describe our network clustering method. A detailed description of our implementation will be included in a separate manuscript. First, using the random walk-based walktrap algorithm =-=(20)-=-, we identify the best partition of the network by maximizing the modularity score (21); Secondly, we use the edge switching algorithm (22) to generate 1000 random networks with the same attributes as...

Community Structure in Graphs

by Santo Fortunato , Claudio Castellano , 2007
"... Graph vertices are often organized into groups that seem to live fairly independently of the rest of the graph, with which they share but a few edges, whereas the relationships between group members are stronger, as shown by the large number of mutual connections. Such groups of vertices, or communi ..."
Abstract - Cited by 44 (0 self) - Add to MetaCart
Graph vertices are often organized into groups that seem to live fairly independently of the rest of the graph, with which they share but a few edges, whereas the relationships between group members are stronger, as shown by the large number of mutual connections. Such groups of vertices, or communities, can be considered as independent compartments of a graph. Detecting communities is of great importance in sociology, biology and computer science, disciplines where systems are often represented as graphs. The task is very hard, though, both conceptually, due to the ambiguity in the definition of community and in the discrimination of different partitions and practically, because algorithms must find “good ” partitions among an exponentially large number of them. Other complications are represented by the possible occurrence of hierarchies, i.e. communities which are nested inside larger communities, and by the existence of overlaps between communities, due to the presence of nodes belonging to more groups. All these aspects are dealt with in some detail and many methods are described, from traditional approaches used in computer science and sociology to recent techniques developed mostly within statistical physics.

Multi-level algorithms for modularity clustering

by Andreas Noack, Randolf Rotta , 2008
"... ..."
Abstract - Cited by 42 (0 self) - Add to MetaCart
Abstract not found
(Show Context)

Citation Context

... of merge prioritizers are clearly possible. For example, vertex distances from random walks or eigenvectors of certain matrices have been successfully applied in several clustering algorithms (e.g., =-=[31, 28, 11]-=-). However, preliminary experiments suggest that these relatively complicated and computationally expensive prioritizers may not be more effective than the simple prioritizers in this section [34]. 4 ...

Laplacian dynamics and multiscale modular structure in networks

by R. Lambiotte, J. -c. Delvenne, M. Barahona - ArXiv , 2009
"... Most methods proposed to uncover communities in complex networks rely on their structural properties. Here we introduce the stability of a network partition, a measure of its quality defined in terms of the statistical properties of a dynamical process taking place on the graph. The time-scale of th ..."
Abstract - Cited by 35 (3 self) - Add to MetaCart
Most methods proposed to uncover communities in complex networks rely on their structural properties. Here we introduce the stability of a network partition, a measure of its quality defined in terms of the statistical properties of a dynamical process taking place on the graph. The time-scale of the process acts as an intrinsic parameter that uncovers community structures at different resolutions. The stability extends and unifies standard notions for community detection: modularity and spectral partitioning can be seen as limiting cases of our dynamic measure. Similarly, recently proposed multi-resolution methods correspond to linearisations of the stability at short times. The connection between community detection and Laplacian dynamics enables us to establish dynamically motivated stability measures linked to distinct null models. We apply our method to find multi-scale partitions for different networks and show that the stability can be computed efficiently for large networks with extended versions of current algorithms. The relation between the structure of a network and the dynamics that takes place on it 1 Lambiotte et al. has been studied extensively in the last years 1,2,3. A growing body of research has shown how
(Show Context)

Citation Context

...hich a partition is optimal. Let P1 be the optimal partition of a network N1 for some value of t, i.e., P1 optimises the quality function ∑ [ Aij Q(t) = (1 − t) + ∑ C∈P i,j∈C t − 2m1 kikj (2m1) 2 ] , =-=(12)-=- where m1 is the total number of links in N1. Let us now consider a new network Ntot made of N1 and of another network N2 such that there are no links between N1 and N2. Then one can show that P1 stil...

A fast algorithm to find overlapping communities in networks

by Steve Gregory - In Proc. of ECML/PKDD 2008 , 2008
"... Abstract. Many networks possess a community structure, such that vertices form densely connected groups which are more sparsely linked to other groups. In some cases these groups overlap, with some vertices shared between two or more communities. Discovering communities in networks is a computationa ..."
Abstract - Cited by 28 (1 self) - Add to MetaCart
Abstract. Many networks possess a community structure, such that vertices form densely connected groups which are more sparsely linked to other groups. In some cases these groups overlap, with some vertices shared between two or more communities. Discovering communities in networks is a computationally challenging task, especially if they overlap. In previous work we proposed an algorithm, CONGA, that could detect overlapping communities using the new concept of split betweenness. Here we present an improved algorithm based on a local form of betweenness, which yields good results but is much faster. It is especially effective in discovering small-diameter communities in large networks, and has a time complexity of only O(n log n) for sparse networks. 1 Introduction and Related Work In recent years, networks (graphs) have increasingly been used to represent various kinds of complex system in the real world. Many networks exhibit community structure: the tendency of vertices to form communities (or modules) such that intracommunity edges are denser than the edges between communities. Communities often

An experimental investigation of graph kernels on a collaborative recommendation task

by François Fouss, Kevin Françoisse, Luh Yen, Alain Pirotte, Marco Saerens - Proceedings of the 6th International Conference on Data Mining (ICDM 2006 , 2006
"... This paper presents a survey as well as a systematic empirical comparison of seven graph kernels and two related similarity matrices (simply referred to as graph kernels), namely the exponential diffusion kernel, the Laplacian exponential diffusion kernel, the von Neumann diffusion kernel, the regul ..."
Abstract - Cited by 27 (7 self) - Add to MetaCart
This paper presents a survey as well as a systematic empirical comparison of seven graph kernels and two related similarity matrices (simply referred to as graph kernels), namely the exponential diffusion kernel, the Laplacian exponential diffusion kernel, the von Neumann diffusion kernel, the regularized Laplacian kernel, the commute-time kernel, the random-walk-with-restart similarity matrix, and finally, three graph kernels introduced in this paper: the regularized commute-time kernel, the Markov diffusion kernel, and the cross-entropy diffusion matrix. The kernel-on-a-graph approach is simple and intuitive. It is illustrated by applying the nine graph kernels to a collaborative-recommendation task and to a semisupervised classification task, both on several databases. The graph methods compute proximity measures between nodes that help study the structure of the graph. Our comparisons suggest that the regularized commute-time and the Markov diffusion kernels perform best, closely followed by the regularized Laplacian kernel. 1
(Show Context)

Citation Context

...stance at the other end was proposed in [111] while a correlation or co-betweenness measure between nodes was developed in [54, 67]. The commute-time kernel is described in Section 4.5. Recent papers =-=[61, 71, 72, 79]-=- proposed an intuitively appealing distance measure between nodes of a graph based on a continuous-time diffusion process, called the “diffusion distance”. We defined a discrete-time counterpart of th...

A family of dissimilarity measures between nodes generalizing both the shortest-path and the commute-time distances

by Luh Yen, Marco Saerens, Amin Mantrach, Masashi Shimbo - in Proceedings of the 14th SIGKDD International Conference on Knowledge Discovery and Data Mining
"... This work introduces a new family of link-based dissimilarity measures between nodes of a weighted directed graph. This measure, called the randomized shortest-path (RSP) dissimilarity, depends on a parameter θ and has the interesting property of reducing, on one end, to the standard shortest-path d ..."
Abstract - Cited by 24 (11 self) - Add to MetaCart
This work introduces a new family of link-based dissimilarity measures between nodes of a weighted directed graph. This measure, called the randomized shortest-path (RSP) dissimilarity, depends on a parameter θ and has the interesting property of reducing, on one end, to the standard shortest-path distance when θ is large and, on the other end, to the commute-time (or resistance) distance when θ is small (near zero). Intuitively, it corresponds to the expected cost incurred by a random walker in order to reach a destination node from a starting node while maintaining a constant entropy (related to θ) spread in the graph. The parameter θ is therefore biasing gradually the simple random walk on the graph towards the shortest-path policy. By adopting a statistical physics approach and computing a sum over all the possible paths (discrete path integral), it is shown that the RSP dissimilarity from every node to a particular node of interest can be computed efficiently by solving two linear systems of n equations, where n is the number of nodes. On the other hand, the dissimilarity between every couple of nodes is obtained by inverting an n × n matrix. The proposed measure can be used for various graph mining tasks such as computing betweenness centrality, finding dense communities, etc, as shown in the experimental section.
(Show Context)

Citation Context

...en et al. [32, 33]. In this work, the authors propose to replace the effective conductance by a cycle-free effective conductance. In two recent papers [39, 40], Nadler et al. as well as Latapy et al. =-=[46]-=- proposed a well-formulated distance measure between nodes of a graph based on a diffusion process, called the “diffusion distance”. A valid kernel, called the “Markov diffusion kernel” has been deriv...

Graph nodes clustering with the sigmoid commute-time kernel: A . . .

by Luh Yen, et al. - DATA & KNOWLEDGE ENGINEERING , 2009
"... ..."
Abstract - Cited by 23 (8 self) - Add to MetaCart
Abstract not found
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University