Topology

Algorithms for the analysis of graph topology.

Connectivity

sknetwork.topology.get_connected_components(input_matrix: scipy.sparse.csr.csr_matrix, connection: str = 'weak', force_bipartite: bool = False) → numpy.ndarray[source]

Extract the connected components of a graph.

Parameters

input_matrix – Input matrix (either the adjacency matrix or the biadjacency matrix of the graph).
connection – Must be 'weak' (default) or 'strong'. The type of connection to use for directed graphs.
force_bipartite (bool) – If True, consider the input matrix as the biadjacency matrix of a bipartite graph.

Returns

Connected component of each node. For bipartite graphs, rows and columns are concatenated (rows first).

Return type

labels

sknetwork.topology.is_connected(input_matrix: scipy.sparse.csr.csr_matrix, connection: str = 'weak', force_bipartite: bool = False) → bool[source]

Check whether the graph is connected.

Parameters

input_matrix – Input matrix (either the adjacency matrix or the biadjacency matrix of the graph).
connection – Must be 'weak' (default) or 'strong'. The type of connection to use for directed graphs.
force_bipartite (bool) – If True, consider the input matrix as the biadjacency matrix of a bipartite graph.

sknetwork.topology.get_largest_connected_component(input_matrix: scipy.sparse.csr.csr_matrix, connection: str = 'weak', force_bipartite: bool = False, return_index: bool = False) → Union[scipy.sparse.csr.csr_matrix, Tuple[scipy.sparse.csr.csr_matrix, numpy.ndarray]][source]

Extract the largest connected component of a graph. Bipartite graphs are treated as undirected.

Parameters

input_matrix – Adjacency matrix or biadjacency matrix of the graph.
connection – Must be 'weak' (default) or 'strong'. The type of connection to use for directed graphs.
force_bipartite (bool) – If True, consider the input matrix as the biadjacency matrix of a bipartite graph.
return_index (bool) – Whether to return the index of the nodes of the largest connected component in the original graph.

Returns

output_matrix (sparse.csr_matrix) – Adjacency matrix or biadjacency matrix of the largest connected component.
index (array) – Indices of the nodes in the original graph. For bipartite graphs, rows and columns are concatenated (rows first).

Structure

sknetwork.topology.is_bipartite(adjacency: scipy.sparse.csr.csr_matrix, return_biadjacency: bool = False) → Union[bool, Tuple[bool, Optional[scipy.sparse.csr.csr_matrix], Optional[numpy.ndarray], Optional[numpy.ndarray]]][source]

Check whether a graph is bipartite.

Parameters

adjacency – Adjacency matrix of the graph (symmetric).
return_biadjacency – If True, return a biadjacency matrix of the graph if bipartite.

Returns

is_bipartite (bool) – A boolean denoting if the graph is bipartite.
biadjacency (sparse.csr_matrix) – A biadjacency matrix of the graph if bipartite (optional).
rows (np.ndarray) – Index of rows in the original graph (optional).
cols (np.ndarray) – Index of columns in the original graph (optional).

sknetwork.topology.is_acyclic(adjacency: scipy.sparse.csr.csr_matrix) → bool[source]

Check whether a graph has no cycle.

Parameters: adjacency – Adjacency matrix of the graph.
Returns: is_acyclic – A boolean with value True if the graph has no cycle and False otherwise
Return type: bool

class sknetwork.topology.DAG(ordering: Optional[str] = None)[source]

Build a Directed Acyclic Graph from an adjacency.

Graphs
DiGraphs

Parameters

ordering (str) –

A method to sort the nodes.

If None, the default order is the index.
If 'degree', the nodes are sorted by ascending degree.

Variables

indptr_ (np.ndarray) – Pointer index as for CSR format.
indices_ (np.ndarray) – Indices as for CSR format.

fit(adjacency: scipy.sparse.csr.csr_matrix, sorted_nodes=None)[source]

Fit algorithm to the data.

Parameters

adjacency – Adjacency matrix of the graph.
sorted_nodes (np.ndarray) – An order on the nodes such that the DAG only contains edges (i, j) such that sorted_nodes[i] < sorted_nodes[j].

Core decomposition

class sknetwork.topology.CoreDecomposition

K-core decomposition algorithm.

Graphs

Variables

labels_ (np.ndarray) – Core value of each node.
core_value_ (int) – Maximum core value of the graph

Example

>>> from sknetwork.topology import CoreDecomposition
>>> from sknetwork.data import karate_club
>>> kcore = CoreDecomposition()
>>> adjacency = karate_club()
>>> kcore.fit(adjacency)
>>> kcore.core_value_
4

fit(adjacency: scipy.sparse.csr.csr_matrix) → sknetwork.topology.kcore.CoreDecomposition

K-core decomposition.

Parameters: adjacency – Adjacency matrix of the graph.
Returns: self
Return type: CoreDecomposition

fit_transform(adjacency: scipy.sparse.csr.csr_matrix)

Fit algorithm to the data and return the core value of each node. Same parameters as the fit method.

Returns: Core value of the nodes.
Return type: labels

Cliques

class sknetwork.topology.Triangles(parallelize: bool = False)

Count the number of triangles in a graph, and evaluate the clustering coefficient.

Graphs

Parameters

parallelize – If True, use a parallel range while listing the triangles.

Variables

n_triangles_ (int) – Number of triangles.
clustering_coef_ (float) – Global clustering coefficient of the graph.

Example

>>> from sknetwork.data import karate_club
>>> triangles = Triangles()
>>> adjacency = karate_club()
>>> triangles.fit_transform(adjacency)
45

fit(adjacency: scipy.sparse.csr.csr_matrix) → sknetwork.topology.triangles.Triangles

Count triangles.

Parameters: adjacency – Adjacency matrix of the graph.
Returns: self
Return type: Triangles

fit_transform(adjacency: scipy.sparse.csr.csr_matrix) → int

Fit algorithm to the data and return the number of triangles. Same parameters as the fit method.

Returns: n_triangles_ – Number of triangles.
Return type: int

class sknetwork.topology.Cliques(k: int)

Clique counting algorithm.

Parameters: k (int) – Clique order (e.g., k = 3 means triangles).
Variables: n_cliques_ (int) – Number of cliques.

Example

>>> from sknetwork.data import karate_club
>>> cliques = Cliques(k=3)
>>> adjacency = karate_club()
>>> cliques.fit_transform(adjacency)
45

References

Danisch, M., Balalau, O., & Sozio, M. (2018, April). Listing k-cliques in sparse real-world graphs. In Proceedings of the 2018 World Wide Web Conference (pp. 589-598).

fit(adjacency: scipy.sparse.csr.csr_matrix) → sknetwork.topology.kcliques.Cliques

K-cliques count.

Parameters: adjacency – Adjacency matrix of the graph.
Returns: self
Return type: Cliques

fit_transform(adjacency: scipy.sparse.csr.csr_matrix) → int

Fit algorithm to the data and return the number of cliques. Same parameters as the fit method.

Returns: n_cliques – Number of k-cliques.
Return type: int

Isomorphism

class sknetwork.topology.WeisfeilerLehman(max_iter: int = - 1)[source]

Weisfeiler-Lehman algorithm for coloring/labeling graphs in order to check similarity.

Parameters: max_iter (int) – Maximum number of iterations. Negative value means until convergence.
Variables: labels_ (np.ndarray) – Label of each node.

Example

>>> from sknetwork.topology import WeisfeilerLehman
>>> from sknetwork.data import house
>>> weisfeiler_lehman = WeisfeilerLehman()
>>> adjacency = house()
>>> labels = weisfeiler_lehman.fit_transform(adjacency)
>>> labels
array([0, 2, 1, 1, 2], dtype=int32)

References

Douglas, B. L. (2011). The Weisfeiler-Lehman Method and Graph Isomorphism Testing.
Shervashidze, N., Schweitzer, P., van Leeuwen, E. J., Melhorn, K., Borgwardt, K. M. (2011) Weisfeiler-Lehman graph kernels. Journal of Machine Learning Research 12, 2011.

fit(adjacency: Union[scipy.sparse.csr.csr_matrix, numpy.ndarray]) → sknetwork.topology.weisfeiler_lehman.WeisfeilerLehman[source]

Fit algorithm to the data.

Parameters: adjacency (Union[sparse.csr_matrix, np.ndarray]) – Adjacency matrix of the graph.
Returns: self
Return type: WeisfeilerLehman

fit_transform(adjacency: Union[scipy.sparse.csr.csr_matrix, numpy.ndarray]) → numpy.ndarray[source]

Fit algorithm to the data and return the labels. Same parameters as the fit method.

Returns: labels – Labels.
Return type: np.ndarray

sknetwork.topology.are_isomorphic(adjacency1: scipy.sparse.csr.csr_matrix, adjacency2: scipy.sparse.csr.csr_matrix, max_iter: int = - 1) → bool[source]

Weisfeiler-Lehman isomorphism test. If the test is False, the graphs cannot be isomorphic, otherwise, they might be.

Parameters

adjacency1 – First adjacency matrix.
adjacency2 – Second adjacency matrix.
max_iter (int) – Maximum number of coloring iterations. Negative value means until convergence.

Returns

test_result

Return type

bool

Example

>>> from sknetwork.topology import are_isomorphic
>>> from sknetwork.data import house, bow_tie
>>> are_isomorphic(house(), bow_tie())
False

References

Douglas, B. L. (2011). The Weisfeiler-Lehman Method and Graph Isomorphism Testing.
Shervashidze, N., Schweitzer, P., van Leeuwen, E. J., Melhorn, K., Borgwardt, K. M. (2011) Weisfeiler-Lehman graph kernels. Journal of Machine Learning Research 12, 2011.