ADVERTISEMENT

Fast Computation Of Graph Edit Distance

Measuring the similarity between two graphs is a basic operation with many applications. For example, in the fields of chemical informatics and drug design, we often need to compare the similarity of a chemical (as modeled by a labeled graph) to a known drug (as modeled by another labeled graph) in order to evaluate the potential of the chemical for novel drug design. Applications include bioinformatics, data mining, pattern recognition, and graph classification.

Graph edit distance (GED) is a useful metric that gives a measure of similarity of two graphs. Intuitively, graph edit distance measures how many operations must be applied in order to transform the first graph into the second. Typical operations include inserting or deleting a node, inserting or deleting an edge, changing a node label, and changing an edge label. Computing the graph edit distance is an NP-hard problem, and thus computational cost is a concern, especially when we need to compute graph edit distances many times, as in some applications.

ADVERTISEMENT
Figure 1: An Illustration of Graph Edit Distance. Figure republished with permission from Elsevier from doi.org/10.1016/j.knosys.2018.10.002

Several approaches have been developed for the efficient computation of graph edit distance.  Typically, the possible graph mappings (between the two graphs) are organized as an ordered search tree, where the inner nodes denote partial mappings and the leaf nodes denote complete mappings.  Riesen et al. recently proposed the A-GED method.  The method is vertex-based because the inner nodes (i.e., partial mappings) are extended in an iterative manner by extending unmapped vertices of the two graphs.  The authors developed a heuristic function to estimate a lower bound on the GED in order to prune the search and reduce the number of mappings considered.  Nevertheless, a major drawback of A-GED is that it often needs to store too many partial mappings, resulting in huge memory consumption; in practice, A-GED is only applicable to relatively small graphs.

In this paper, Chen and co-authors present BSS_GED, a novel vertex-based mapping method that calculates the GED in a reduced search space, which is enabled by more aggressively identifying and discarding invalid and redundant mappings.  BSS_GED employs the beam-stack search paradigm, a widely utilized search algorithm in AI, combined with two specially designed heuristics to improve the GED computation, achieving a trade-off between memory utilization and expensive backtracking calls.  Through extensive experiments, the authors demonstrate that BSS_GED is highly efficient on both sparse and dense graphs and outperforms the state-of-the-art methods.  BSS_GED was further evaluated to solve the well-investigated graph similarity search problem.  The experimental results show that this method is an order-of-magnitude faster than the state-of-the-art graph similarity search methods.

These findings are described in the article entitled An Efficient Algorithm for Graph Edit Distance Computation, recently published in the journal Knowledge-Based Systems, 163, January 2019, 762–775, doi.org/10.1016/j.knosys.2018.10.002. This work was conducted by Xiaoyang Chen, Hongwei Huo, Jun Huan, and Jeffrey Scott Vitter.

ADVERTISEMENT

Comments

READ THIS NEXT

Fatty Acid Dynamics In The Fish Gulf Menhaden

“Essential fatty acids” are critical to organismal growth and development and, because they cannot be synthesized, they must be obtained […]

Growth Mindset Plays An Important Role In Health And Fitness

Growth mindsets have become quite a popular topic, and for a good reason. Today we have an enormous amount of […]

Using Open Door/Windows To Ventilate Bedrooms And The Consequent Effect On Sleep Quality

Bedrooms are an important part of our homes, considering we spend around one-third of our life sleeping. Further, our sleep […]

The Study Of The Universe

The study of the universe is called Cosmology, it is the physical investigation of the universe from the earliest forms […]

Social Cognitive Theory And Exercise: How Can We Stay Motivated?

We know that exercise is good for us. There is plenty of evidence regarding its health benefits! So why do […]

Formal Region: Definition And Types

Regions are categories, and like all categories, they exist to help us group things together and make sense of the […]

The Need For Sustainability In Our Soil

Soil forms at the interface of the lithosphere, biosphere, atmosphere, and hydrosphere and is the foundation of terrestrial life. It […]

Science Trends is a popular source of science news and education around the world. We cover everything from solar power cell technology to climate change to cancer research. We help hundreds of thousands of people every month learn about the world we live in and the latest scientific breakthroughs. Want to know more?