Academia.eduAcademia.edu

Sequential Patterns

description52 papers
group2 followers
lightbulbAbout this topic
Sequential patterns refer to ordered sets of events or items that occur in a specific sequence within a dataset. This concept is widely used in data mining and analysis to identify trends, behaviors, or relationships over time, enabling the discovery of meaningful patterns in temporal or sequential data.
lightbulbAbout this topic
Sequential patterns refer to ordered sets of events or items that occur in a specific sequence within a dataset. This concept is widely used in data mining and analysis to identify trends, behaviors, or relationships over time, enabling the discovery of meaningful patterns in temporal or sequential data.

Key research themes

1. How can sequential pattern mining algorithms be adapted to handle partially ordered and weighted sequences for improved pattern generalization and significance?

This research area addresses the limitations of classical sequential pattern mining algorithms that rely on strictly ordered sequences and uniform item importance. It explores more flexible pattern definitions such as partially-ordered sequential rules and weighted sequential patterns. These approaches help to discover patterns that are more generalizable across multiple sequences, reduce redundancy by capturing unordered itemsets within sequences, and highlight patterns with significant but possibly infrequent items by incorporating weights. This theme matters as it improves the utility, interpretability, and prediction accuracy of sequential patterns in complex, real-world datasets where temporal orderings may vary or item importance differs.

Key finding: Introduced the concept of partially-ordered sequential rules (POSR) which generalizes sequential rules by allowing unordered itemsets in antecedents and consequents, reducing redundant variation in rules. Developed RuleGrowth... Read more
Key finding: Proposed a weighted sequential pattern mining algorithm (WSPM PreTree) combining minimum support and item weight constraints to discover high-value patterns that contain infrequent yet important items. Utilized a prefix-tree... Read more
Key finding: Identified performance degradation in SPADE algorithm when mining sequences with frequent consecutive item repetitions, common in discretized quantitative data. Introduced concept of generalized occurrences to represent... Read more

2. What advances in algorithmic strategies and data structures enable effective, scalable, and incremental mining of sequential patterns in large or evolving sequence databases?

This research theme focuses on designing efficient algorithms and data structures for sequential pattern mining that address scalability, dynamic datasets, and domain-specific challenges. Contributions include tree-based pattern growth methods, incremental mining techniques to avoid rescanning large updated databases, efficient data representations (such as WAP-trees and decision diagrams), and similarity-based sequence clustering frameworks leveraging hidden Markov models. These advances enable practical mining solutions suitable for big data, streaming, and applications requiring rapid adaptation to data changes.

Key finding: Presented STISPM, an incremental mining algorithm that uses a sequence tree space structure to store all frequent sequences and their counts. It enables mining new sequential patterns from updated databases without rescanning... Read more
Key finding: Developed Seq2Pat, a Python tool using multi-valued decision diagrams for efficient constraint-based sequential pattern mining (CSPM). The library supports high-level declarative pattern mining with complex constraints,... Read more
Key finding: Proposed GO-SPADE, an extension of SPADE that efficiently manages sequences with consecutive repetitions by introducing generalized occurrences and operators. This compact data structure reduces memory and computational... Read more
Key finding: Introduced a similarity-based clustering scheme where each sequence is represented as a vector of similarities to other sequences, with similarities computed using Hidden Markov Models. This feature-based approach transfers... Read more

3. How do methodological innovations in sequential pattern mining contribute to the discovery of non-trivial, unexpected, or hierarchically-structured patterns with implications for learning and complex system analysis?

This theme investigates the use of sequential pattern analysis beyond frequency-based discovery to uncover surprising, semantically unexpected, or hierarchically embedded patterns. It includes mining patterns that contradict domain knowledge (unexpected sequences), exploring hierarchical and nested structures in sequence data (important in language and cognitive psychology), and applying pattern concepts to educational recommendations and complex systems modeling. Such innovations expand the understanding of sequence data, enabling new insights into human cognition, educational performance, and emergent system behaviors.

Key finding: Proposed the USER approach for mining unexpected sequential patterns that semantically contradict existing domain beliefs, combining subjective belief-driven measures with objective support and confidence criteria. This... Read more
Key finding: Demonstrated that humans can learn and anticipate hierarchical, recursively nested structures in binary sequences generated by deterministic Fibonacci grammars, beyond mere statistical or flat pattern learning. Results from... Read more
Key finding: Identified four variations of socially shared metacognitive regulation (SSMR) among university students during peer tutoring through latent class clustering: 'interrogative', 'affirmative', 'interfering', and 'progressive'.... Read more

All papers in Sequential Patterns

Fouille de données de mobilité 2Définition Extraction automatique ou semi-automatique de connaissances cachées, potentiellement utiles, à partir de données stockées dans des grandes bases de données. connaissances cachées
An Organization need to understand their customers' behavior, preferences and future needs which depend upon past behavior. Web Usage Mining is an active research topic in which customers session clustering is done to understand the... more
The present study aims at investigating whether events of socially shared metacognitive regulation (SSMR) differ from each other when comparing their characteristics. These differences are labelled "variations in SSMR". The study is... more
A real world challenging task of the web master of an organization is to match the needs of user and keep their attention in their web site. So, only option is to capture the intuition of the user and provide them with the recommendation... more
Data mining application includes a variety of methodologies that have been developed by commercial & research centers. This technique has been used for industrial, commercial and scientific purposes. It is most useful in an exploratory... more
Millions of visitors interact daily with web sites around the world. The several kinds of data have to be organized in a manner that they can be accessed by several users effectively and efficiently. Web mining is the extraction of... more
Data mining methods extract knowledge from huge amounts of data. Recently with the explosion of mobile technologies, a new type of data appeared. The resulting databases can be described as spatiotemporal data in which spatial information... more
ÐDiscovering sequential patterns is an important problem in data mining with a host of application domains including medicine, telecommunications, and the World Wide Web. Conventional sequential pattern mining systems provide users with... more
The number of patterns discovered by data mining can become tremendous, in some cases exceeding the size of the original database. Therefore, there is a requirement for querying previously generated mining results or for querying the... more
The rapid development of bioinformatics has resulted in the explosion of DNA sequence data which is characterized by large number of items. Studies have shown that biological functions are dictated by contagious portions of the DNA... more
A real world challenging task of the web master of an organization is to match the needs of user and keep their attention in their web site. So, only option is to capture the intuition of the user and provide them with the recommendation... more
A real world challenging task of the web master of an organization is to match the needs of user and keep their attention in their web site. So, only option is to capture the intuition of the user and provide them with the recommendation... more
Mining sequential patterns from sequence database has consequential responsibility in the data mining region as it can find the association from the ordered list of events. Mining methods that predicated on the pattern growth approach,... more
Les avancées technologiques en termes d'acquisition de données permettent de mieux surveiller les phénomènes évolutifs dans divers domaines dont l'environnement. Les données collectées sont de plus en plus complexes (spatiales,... more
Tracking technologies and location-acquisition have led to the increase of the availability of trajectory data. Many efforts are devoted to develop methods for mining and analysing trajectories due to its importance in lots of... more
Sequence pattern mining is one of the essential data mining tasks with broad applications. Many sequence mining algorithms have been developed to find a set of frequent sub-sequences satisfying the support threshold in a sequence... more
The number of patterns discovered by data mining can become tremendous, in some cases exceeding the size of the original database. Therefore, there is a requirement for querying previously generated mining results or for querying the... more
Topic directories are popular means of organizing information resources in the web. In this work, we introduce a methodology for personalizing topic directories. The key feature of our methodology is that the personalization is based on... more
In this paper we describe both the theoretical and practical results of a novel approach that combines hybrid techniques of association analysis and classical sequentiation algorithms of genomics to generate the grammatical structures of... more
In recent years, there are a great deal of efforts on sequential pattern mining, but some challenges have not been resolved, such as large search spaces and the ineffectiveness in handling highly similar, dense and long sequences. This... more
This paper presents the first available system for mining patterns from Displacement Field Time Series (DFTS) along with the confidence measures inherent to these series. It consists of four main modules for data preprocessing, pattern... more
Textual documents created and distributed on the Internet are ever changing in various forms. Most of existing works are devoted to topic modeling and the evolution of individual topics, while sequential relations of topics in successive... more
Many users of social media publishing many posts over Internet through social media like Facebook and Twitter. They share different views about different stories each day. Sequential pattern mining is exploring idea for finding the most... more
The discovery of frequent sequences in temporal databases is an important data mining problem. Most current work assumes that the database is static, and a database update requires rediscovering all the patterns by scanning the entire old... more
In this paper we develop new techniques for predicting failures and monitoring in categorical event sequences. New techniques are needed because failures are rare and previous data mining algorithms were overwhelmed by the staggering... more
In this paper, we introduce a model to classify cooking activities using their visual and temporal coherence information. We fuse multiple feature descriptors for fine-grained activity recognition as we would need every single detail to... more
In this paper, we use Sequential Pattern Mining from Probabilistic Databases to learn trajectory patterns. Trajectories which are a succession of points are firstly transformed into a succession of zones by grouping points to build the... more
Nowadays, the explosive growth of the World Wide Web generates tremendous amount of web data and consequently web data mining has become an important technique for discovering useful information and knowledge. Web mining is a vivid... more
Sequential pattern mining discovers frequent subsequences as patterns in a sequence database. Most of the previously developed sequential pattern mining methods, such as GSP, explore a candidate generation-and-test approach [1] to reduce... more
Multidimensional databases have been designed to provide decision makers with the necessary tools to help them understand their data. This framework is different from transactional data as the datasets contain huge volumes of historicized... more
In this paper, we propose an improvement of pattern growth-based PrefixSpan algorithm, called I-PrefixSpan. The general idea of I-PrefixSpan is to use sufficient data structure for Seq-Tree framework and separator database to reduce the... more
Mining sequential patterns on environment sensor data is a challenging task; the data can present noises and may also contain sparse patterns, which are difficult to be detected. The knowledge extracted from environment sensor data can be... more
In this paper, we develop a new indexing method for large web access-logs. We are concerned with pattern queries, which advocate the search for access sequences that contain certain query patterns. This kind of queries find applications... more
The number of patterns discovered by data mining can become tremendous, in some cases exceeding the size of the original database. Therefore, there is a requirement for querying previously generated mining results or for querying the... more
Two important and active areas of current research are data mining and the World Wide Web. A natural combination of the two areas, sometimes referred to as Web mining, has been the focus of several recent research projects and papers. As... more
Web-based organizations often generate and collect large volumes of data in their daily operations. Analyzing such data can help these organizations to determine the life time value of clients, design cross marketing strategies across... more
The assessment of the interestingness of sequential rules (generally temporal rules) is a crucial problem in sequence analysis. Due to their unsupervised nature, frequent pattern mining algorithms commonly generate a huge number of rules.... more
The assessment of the interestingness of sequential rules (generally temporal rules) is a crucial problem in sequence analysis. Due to their unsupervised nature, frequent pattern mining algorithms commonly generate a huge number of rules.... more
Despite the fact that the facial expressions of emotions are naturally dynamic social signals, their communicative value has typically been studied using static photographs. In this paper, we focus on the perception of emotions from... more
Mining sequential patterns aims at discovering correlations between events through time, like A customer who bought a TV and a DVD player at the same time later bought a recorder. Even if many works have dealt with sequential pattern... more
Millions of visitors interact daily with web sites around the world. The several kinds of data have to be organized in a manner that they can be accessed by several users effectively and efficiently. Web mining is the extraction of... more
A real world challenging task of the web master of an organization is to match the needs of user and keep their attention in their web site. So, only option is to capture the intuition of the user and provide them with the recommendation... more
With the mining capabilities of the several data mining methodologies, there are several interesting extensions on frequent pattern mining. The discovery of sequential patterns is one of them. It has a vast array of real world... more
A real world challenging task of the web master of an organization is to match the needs of user and keep their attention in their web site. So, only option is to capture the intuition of the user and provide them with the recommendation... more
Pattern mining is an important field of data mining. The fundamental task of data mining is to explore the database to find out sequential, frequent patterns. In recent years, data mining has shifted its focus to design methods for... more
Frequent sequence mining is a fundamental and essential operation in the process of discovering the sequential rules. Most of the sequence mining algorithms use apriori methodology or build the larger sequences from smaller patterns, a... more
In this paper we describe both the theoretical and practical results of a novel approach that combines hybrid techniques of association analysis and classical sequentiation algorithms of genomics to generate the grammatical structures of... more
The goal of this work is to increase the relevance and the interestingness of patterns discovered by a Web Usage Mining process. Indeed, the sequential patterns extracted on web log files, unless they are found under constraints, often... more
Download research papers for free!