Big high-dimension data cube designs for hybrid memory systems.

Imagem de Miniatura

Data

2020

Título da Revista

ISSN da Revista

Título de Volume

Editor

Resumo

In Big Data cubes with hundreds of dimensions and billions of tuples, the indexing and query operations are a challenge and the reason is the time-space exponential complexity when a full cube is computed. Therefore, solutions based on RAM may not be practical and the solutions based on hybrid memory (RAM and disk) become viable alternatives. In this paper, we propose a hybrid approach, named bCubing, to index and query high-dimension data cubes with high number of tuples in a single machine and using RAM and disk memory systems. We evaluated bCubing in terms of runtime and memory consumption, comparing it with the Frag-Cubing, HIC and H-Frag approaches. bCubing showed to be faster and used less RAM than Frag-Cubing, HIC and H-Frag. bCubing indexed and allowed to query a data cube with 1.2 billion tuples and 60 dimensions, consuming only 84 GB of RAM, which means 35% less memory than HIC. The complex holistic measures mode and median were computed in multidimensional queries, and bCubing was, on average, 50% faster than HIC.

Descrição

Palavras-chave

Multidimensional database, Multidimensional query, Data cube, Holistic measure

Citação

SILVA ,R. R.; HIRATA, C. M.; LIMA, J. de C. Big high-dimension data cube designs for hybrid memory systems. Knowledge and Information Systems, 2020. Disponível em: <https://link.springer.com/article/10.1007%2Fs10115-020-01505-9>. Acesso em: 25 ago. 2021.

Avaliação

Revisão

Suplementado Por

Referenciado Por