Academia.eduAcademia.edu

Compressed Data Structures

description16 papers
group30 followers
lightbulbAbout this topic
Compressed data structures are specialized data representations that utilize algorithms to reduce the amount of memory required to store data while maintaining efficient access and manipulation capabilities. They are designed to optimize space complexity, enabling the storage of large datasets in a compact form without significant loss of performance.
lightbulbAbout this topic
Compressed data structures are specialized data representations that utilize algorithms to reduce the amount of memory required to store data while maintaining efficient access and manipulation capabilities. They are designed to optimize space complexity, enabling the storage of large datasets in a compact form without significant loss of performance.
The growing interest in big data problems implies the need for unsupervised methods for data visualization and dimensionality reduction. Diffusion Maps (DM) is a recent technique that can capture the lower dimensional geometric structure... more
We propose a forecasting procedure based on multivariate dynamic kernels, with the capability of integrating information measured at different frequencies and at irregular time intervals in financial markets. A data compression process... more
Let D = {T1, T2,. .. , TD} be a collection of D string documents of n characters in total, that are drawn from an alphabet set Σ = [σ]. The top-k document retrieval problem is to preprocess D into a data structure that, given a query (P... more
In many cases, the relation between encoding space and execution time translates into combinatorial lower bounds on the computational complexity of algorithms in the comparison or external memory models. We describe a few cases which... more
A repetitive sequence collection is one where portions of a base sequence of length n are repeated many times with small variations, forming a collection of total length N . Examples of such collections are version control data and genome... more
In many cases, the relation between encoding space and execution time translates into combinatorial lower bounds on the computational complexity of algorithms in the comparison or external memory models. We describe a few cases which... more
Operations rank and select over a sequence of symbols have many applications to the design of succinct and compressed data structures managing text collections, structured text, binary relations, trees, graphs, and so on. We are... more
The inverted index supports efficient full-text searches on natural language text collections. It requires some extra space over the compressed text that can be traded for search speed. It is usually fast for single-word searches, yet... more
The growing interest in big data problems implies the need for unsupervised methods for data visualization and dimensionality reduction. Diffusion Maps (DM) is a recent technique that can capture the lower dimensional geometric structure... more
Streaming techniques use main memory and avoid random access to disk [DFR06].
An important challenge in Data Mining and Machine Learning is the proper analysis of a given dataset, especially for understanding and working with functions defined over it. In this paper we propose Auto-adaptive Laplacian Pyramids (ALP)... more
The growing interest in big data problems implies the need for unsupervised methods for data visualization and dimensionality reduction. Diffusion Maps (DM) is a recent technique that can capture the lower dimensional geometric structure... more
Streaming techniques use main memory and avoid random access to disk [DFR06].
Operations rank and select over a sequence of symbols have many applications to the design of succinct and compressed data structures managing text collections, structured text, binary relations, trees, graphs, and so on. We are... more
Nowadays, rapid evolution of computers and mobile devices has caused the explosive increase in network traffic. So it becomes more and more necessary to archive network traffic for analyzing network events and a lot of emerging... more
SECOMPAX (Scope-Extended COMPressed Adaptive indeX), a new bitmap index compression scheme, can performs better compression ratio and achieves fast encoding speed compared with the state-of-art bitmap index compression algorithm, such as... more
Download research papers for free!