Academia.eduAcademia.edu

Graph Pattern Matching

description9 papers
group0 followers
lightbulbAbout this topic
Graph Pattern Matching is a computational problem that involves identifying subgraphs within a larger graph that match a specified pattern or structure. This field encompasses algorithms and techniques for efficiently searching, comparing, and analyzing graph data, with applications in areas such as social network analysis, bioinformatics, and database querying.
lightbulbAbout this topic
Graph Pattern Matching is a computational problem that involves identifying subgraphs within a larger graph that match a specified pattern or structure. This field encompasses algorithms and techniques for efficiently searching, comparing, and analyzing graph data, with applications in areas such as social network analysis, bioinformatics, and database querying.

Key research themes

1. How can graph pattern matching scale efficiently on large, distributed, or dynamic graphs with temporal and structural constraints?

This research theme addresses the challenges of performing graph pattern matching on large-scale graphs that are distributed across multiple machines or are dynamic and evolving over time. It includes handling temporal information in graphs, minimizing computational overhead especially in distributed and multiprocessor environments, and enabling pattern matching without costly index reconstructions. Efficiently scaling graph pattern matching is critical for applications such as social networks analysis, bioinformatics, malware detection, and other big data scenarios.

Key finding: Introduces the concept of time-respecting flow graphs and presents asynchronous, distributed algorithms tailored for temporal graphs on NUMA (non-uniform memory access) multiprocessor systems. It demonstrates how by... Read more
Key finding: Presents a data-oriented architecture (DORA) enabling efficient asynchronous graph pattern matching on NUMA multiprocessor systems that preserves data locality and minimizes concurrency bottlenecks. Through fine-grained... Read more
Key finding: Proposes a method that eliminates the need for maintaining or reconstructing inverted indexes upon graph database changes, enabling concurrent index updating and querying on dynamic graph databases. The approach utilizes a... Read more
Key finding: Introduces GraphINC, a novel framework that leverages programmable network switches to offload the processing of skewed dense regions in graph pattern mining. By partitioning and isolating hotspots and processing them on... Read more

2. What formal foundations and algorithmic techniques enable efficient and expressive graph pattern matching and querying over property graphs and dynamic graph data?

This theme encompasses the theoretical underpinnings, query language design, and algorithmic frameworks that support graph pattern matching on property graph models and streaming graphs. It highlights the formal semantics of graph pattern languages standardizing match semantics, handling attributes and complex constraints, and enabling practical implementations capable of operating on streaming or evolving graphs with controlled computational complexity.

Key finding: Analyzes the common graph pattern matching sub-language (GPML) standardized by ISO for both GQL (a forthcoming property graph query language) and SQL/PGQ (an extension of SQL enabling graph views and queries). It formalizes... Read more
Key finding: Develops streaming algorithms that operate on edges arriving as streams, enabling approximate computation of graph descriptors (vector embeddings) that capture structural properties of graphs without storing the entire graph... Read more
Key finding: Presents DIONYSUS, a system architecture that integrates scale-out distribution with scale-up optimizations to handle high-volume, heterogeneous RDF graph streams. It addresses scalability, state management, and integration... Read more
Key finding: Demonstrates a methodology for implementing linear-time graph algorithms using rule-based graph programs (GP 2 language) by leveraging rooted graph transformation rules that enable constant-time matching under bounded degree... Read more

3. How can approximate, heuristic, or constraint-based techniques improve graph pattern matching effectiveness and reduce complexity in real-world applications?

This research direction focuses on approximate matching methods, heuristic algorithms, constraint-enforced mining, and pattern selection strategies that balance scalability, accuracy, and interpretability. By incorporating domain-specific knowledge, leveraging statistical or sampling approaches, and defining structural constraints, these methods aim to overcome computational hardness (e.g., NP-completeness), handle pattern redundancy, and improve relevance of discovered graph patterns.

Key finding: Introduces a novel graph-to-string coding scheme called OSG-L that converts unlabeled neighborhood subgraphs into strings, enabling efficient approximate graph matching via Levenshtein edit distance. This method simplifies... Read more
Key finding: Proposes MaNIACS, a randomized, sampling-based algorithm for the approximate discovery of frequent subgraph patterns in large vertex-labeled graphs under the MNI-frequency measure. Using concepts from statistical learning... Read more
Key finding: Presents gPrune, a unified framework that integrates both traditional and structural constraints (e.g., density, diameter) into graph pattern mining by exploiting newly identified pruning properties, notably... Read more
Key finding: Proposes an unsupervised, domain-knowledge-driven pattern selection method that reduces redundancy among frequent subgraphs by selecting representative 'unsubstituted' patterns informed by a similarity (substitution) matrix.... Read more
Key finding: Studies the problem of mapping sequences onto De Bruijn graphs, an NP-complete problem due to path constraints, and develops heuristic algorithms based on edit distance minimization to efficiently find walks in De Bruijn... Read more

All papers in Graph Pattern Matching

Approximate sub-graph matching is important in many graph data mining fields. At present, current solutions can be difficult to implement, have an expensive pre-processing phase, or only work for given types of graph. In this paper a... more
Node similarity is a fundamental problem in graph analytics. However, node similarity between nodes in different graphs (inter-graph nodes) has not received a lot of attention yet. The inter-graph node similarity is important in learning... more
Given a query graph q and a data graph G, computing all occurrences of q in G, namely exact all-matching, is fundamental in graph data analysis with a wide spectrum of real applications. It is challenging since even finding one occurrence... more
Graph pattern matching, one of the most fundamental graph problems, has been extensively investigated in the literature. Nonetheless, existing efforts mostly focus on general graphs without time information, few studies concentrate on... more
Emails: { edwardgb,dcorrales,jcorral }@unicauca.edu.co. http://www.unicauca.edu.co , jiglesia.at.inf.uc3m.es https://www.uc3m.es Highlights • A set of rules are created based on weather attributes and area properties of crops. • An expert... more
Diseases in agricultural production systems represent one of the main reasons of losses and poor-quality products. For coffee production, experts in this area suggest that weather conditions and crop physical properties are the main... more
Arguably, the most significant obstacle to handle the emerging application’s data deluge is to design a system that addresses the challenges for big data’s volume, velocity and variety. Work in RDF stream processing (RSP) systems partly... more
Approximate sub-graph matching is important in many graph data mining fields. At present, current solutions can be difficult to implement, have an expensive pre-processing phase, or only work for given types of graph. In this paper a... more
Node similarity is a fundamental problem in graph analytics. However, node similarity between nodes in different graphs (inter-graph nodes) has not received a lot of attention yet. The inter-graph node similarity is important in learning... more
Attributed graphs are powerful data structures for the representation of complex objects. In a graph-based representation, vertices and their attributes describe objects (or part of objects) while edges represent interrelationships... more
In this paper, a new binary linear programming formulation for computing the exact Graph Edit Distance (GED) between two graphs is proposed. A fundamental strength of the formulations lies in their genericity since the GED can be computed... more
This paper presents a binary linear program which computes the exact graph edit distance between two richly attributed graphs (i.e. with attributes on both vertices and edges). Without solving graph edit distance for large graphs, the... more
Graphs play notable role in daily life. For instance, they are used in variety fields such as social networks, malware detection, and biological networks. Graph data processing performed to extract useful information is known as graph... more
In this paper, we characterize the strong connected resolving sets in the join and corona of graphs. We also determine the strong connected resolving numbers of these graphs.
by usman ali and 
1 more
Topological index (numeric number) is a mathematical coding of the molecular graphs that predicts the physicochemical, biological, toxicological, and structural properties of the chemical compounds that are directly associated with the... more
neighbourhood vertices have relatively prime labels. Gaussian integers are the complex numbers whose real and imaginary parts are both integers. We extend the neighbourhood prime labelling concept to Gaussian integers. Using the order on... more
The rust is the main coffee crop disease in the world. In the Colombian and Brazilian plantations, the damage leads to a yield reduction of 30 % and 35 % respectively in regions where the meteorological conditions are propitious to the... more
This is a paper in  Graph theory. Here i used dynamic algorithm to solve the traveling salesman problem.
For k = 2, 3 and a cubic graph G let ν k (G) denote the size of a maximum k-edge-colorable subgraph of G. Mkrtchyan, Petrosyan and Vardanyan proved that ν 2 (G) ≥ 4 5 · |V (G)|, ν 3 (G) ≥ 7 6 · |V (G)| . They were also able to show that
Approximate sub-graph matching is important in many graph data mining fields. At present, current solutions can be difficult to implement, have an expensive pre-processing phase, or only work for given types of graph. In this paper a... more
Diseases in Agricultural Production Systems represent one of the biggest drivers of losses and poor quality products. In the case of coffee production, experts in this area believe that weather conditions, along with physical properties... more
For agroindustry, crop diseases constitute one of the most common problems that generate large economic losses and low production quality. On the other hand, from computer science, several tools have emerged in order to improve the... more
The climate change has caused threats to agricultural production; the extremes of temperature and humidity, and other abiotic stresses are contributing factors to the etiology of disease and pest on crops. About the matter, recent... more
Rust is a disease that leads to considerable losses in the worldwide coffee industry. In Colombia, the disease was first reported in 1983 in the department of Caldas. Since then, it spread rapidly through all other coffee departments in... more
Abstract: Social and technical information systems usually consist of a large number of interacting physical, conceptual, and human/societal entities. Such individual entities are interconnected to form large and sophisticated networks,... more
Download research papers for free!