Similarity for Topic Maps?

The Weisfeiler-Lehman algorithm and estimation on graphs by Alex Smola.

From the post:

Imagine you have two graphs $G$ and $G’$ and you’d like to check how similar they are. If all vertices have unique attributes this is quite easy:

FOR ALL vertices $v \in G \cup G’$ DO

• Check that $v \in G$ and that $v \in G’$
• Check that the neighbors of v are the same in $G$ and $G’$

This algorithm can be carried out in linear time in the size of the graph, alas many graphs do not have vertex attributes, let alone unique vertex attributes. In fact, graph isomorphism, i.e. the task of checking whether two graphs are identical, is a hard problem (it is still an open research question how hard it really is). In this case the above algorithm cannot be used since we have no idea which vertices we should match up.

The Weisfeiler-Lehman algorithm is a mechanism for assigning fairly unique attributes efficiently. Note that it isn’t guaranteed to work, as discussed in this paper by Douglas – this would solve the graph isomorphism problem after all. The idea is to assign fingerprints to vertices and their neighborhoods repeatedly. We assume that vertices have an attribute to begin with. If they don’t then simply assign all of them the attribute 1. Each iteration proceeds as follows:

(…)

Something a bit more sophisticated than comparing canonical representations in syntax.