Graph Edit Distance Computation
This repository implements graph edit distance (GED) computation and GED verification (i.e., verify whether the GED between two (labeled) graphs is smaller than a given threshold) algorithms proposed in the following two papers. Note that, our implementations assume uniform edit cost. If you are using the code, please cite our papers.
Lijun Chang, Xing Feng, Xuemin Lin, Lu Qin, Wenjie Zhang, Dian Ouyang. Speeding Up GED Verification for Graph Similarity Search. Proceedings of the 36th International Conference on Data Engineering (ICDE’20), 2020
Lijun Chang, Xing Feng, Kai Yao, Lu Qin, Wenjie Zhang. Accelerating Graph Similarity Search via Efficient GED Computation. Accepted by TKDE in Feburary 2022
Compile the code
$ make clean
$ make
It generates an executable “ged”.
Run the code
You can find how to use the code by
$ ./ged -h
An example of graph similarity search on the AIDS dataset with threshold 5 is as follows
$ ./ged -d datasets/AIDS.txt -q datasets/AIDS_query100.txt -m search -p astar -l LSa -t 5
An example of one-by-one GED computation between graphs in two files is as follows
$ ./ged -d datasets/graph_g.txt -q datasets/graph_q.txt -m pair -p astar -l LSa -g
Data format
t [starts a new graph, followed by two arbitrary strings]
v [vertex_id] [vertex_label]
e [vertex_id1] [vertex_id2] [edge_label]
graph_q.txt and graph_g.txt are two example data files. Note that, vertex_id must be consecutive numbers starting from 0.