Key research themes
1. How can string matching algorithms be optimized for biological sequence analysis, particularly DNA pattern matching?
This body of research focuses on developing and evaluating string matching algorithms tailored for bioinformatics applications, especially DNA and protein sequence analysis. These algorithms aim to efficiently handle huge biological datasets, improve exact and approximate pattern matching accuracy, and reduce computational time and resource usage, addressing key challenges arising from the complexity and volume of biological sequence data.
2. What advances in data structures and indexing methods can improve multiple and approximate string matching performance?
This research cluster explores algorithmic and data structural innovations for multiple pattern matching and approximate matching. The work aims to speed up searches across large texts or databases by reducing redundant computations, using compact indexing techniques, and combining automata with transformation approaches. Emphasis is placed on optimizing runtime, memory efficiency, and scalability, with applications spanning bioinformatics, text retrieval, and network security.
3. How can string matching algorithms be effectively applied in practical systems such as customer data management and healthcare text analysis?
This theme covers the application-driven development of string matching algorithms designed to optimize search accuracy and speed within real-world systems. Research here focuses on addressing noisy, unstructured, or approximate matching challenges in domains like customer databases and clinical pathology reports. These applied studies adapt and enhance classical algorithms to accommodate domain-specific needs including fuzzy matching, phonetic similarity, and workflow integration.