Multidimensional cyclic graph approach : representing a data cube without common sub-graphs.

dc.contributor.authorLima, Joubert de Castro
dc.contributor.authorHirata, Celso Massaki
dc.date.accessioned2015-01-26T11:25:29Z
dc.date.available2015-01-26T11:25:29Z
dc.date.issued2011
dc.description.abstractWe present a new full cube computation technique and a cube storage representation approach, called the multidimensional cyclic graph (MCG) approach. The data cube relational operator has exponential complexity and therefore its materialization involves both a huge amount of memory and a substantial amount of time. Reducing the size of data cubes, without a loss of generality, thus becomes a fundamental problem. Previous approaches, such as Dwarf, Star and MDAG, have substantially reduced the cube size using graph representations. In general, they eliminate prefix redundancy and some suffix redundancy from a data cube. The MCG differs significantly from previous approaches as it completely eliminates prefix and suffix redundancies from a data cube. A data cube can be viewed as a set of sub-graphs. In general, redundant sub-graphs are quite common in a data cube, but eliminating them is a hard problem. Dwarf, Star and MDAG approaches only eliminate some specific common sub-graphs. The MCG approach efficiently eliminates all common sub-graphs from the entire cube, based on an exact sub-graph matching solution. We propose a matching function to guarantee one-to-one mapping between sub-graphs. The function is computed incrementally, in a top-down fashion, and its computation uses a minimal amount of information to generate unique results. In addition, it is computed for any measurement type: distributive, algebraic or holistic. MCG performance analysis demonstrates that MCG is 20–40% faster than Dwarf, Star and MDAG approaches when computing sparse data cubes. Dense data cubes have a small number of aggregations, so there is not enough room for runtime and memory consumption optimization, therefore the MCG approach is not useful in computing such dense cubes. The compact representation of sparse data cubes enables the MCG approach to reduce memory consumption by 70–90% when compared to the original Star approach, proposed in [33]. In the same scenarios, the improved Star approach, proposed in [34], reduces memory consumption by only 10–30%, Dwarf by 30–50% and MDAG by 40–60%, when compared to the original Star approach. The MCG is the first approach that uses an exact sub-graph matching function to reduce cube size, avoiding unnecessary aggregation, i.e. improving cube computation runtime.pt_BR
dc.identifier.citationLIMA, J. de C.; HIRATA, C. M. Multidimensional cyclic graph approach: representing a data cube without common sub-graphs. Information Sciences, v. 181, p. 2626-2655, 2011. Disponível em: <http://www.sciencedirect.com/science/article/pii/S002002551000215X>. Acesso em: 22 jan. 2015.pt_BR
dc.identifier.doihttps://doi.org/10.1016/j.ins.2010.05.012
dc.identifier.issn0020-0255
dc.identifier.urihttp://www.repositorio.ufop.br/handle/123456789/4375
dc.language.isoen_USpt_BR
dc.rights.licenseO periódico Information Sciences concede permissão para depósito do artigo no Repositório Institucional da UFOP. Número da licença: 3552521449494.pt_BR
dc.subjectData warehousept_BR
dc.subjectOnline analytical processingpt_BR
dc.subjectData cubept_BR
dc.titleMultidimensional cyclic graph approach : representing a data cube without common sub-graphs.pt_BR
dc.typeArtigo publicado em periodicopt_BR

Arquivos

Pacote original

Agora exibindo 1 - 1 de 1
Imagem de Miniatura
Nome:
ARTIGO_MultidimensionalCyclicGraph.pdf
Tamanho:
2.55 MB
Formato:
Adobe Portable Document Format

Licença do pacote

Agora exibindo 1 - 1 de 1
Nenhuma Miniatura Disponível
Nome:
license.txt
Tamanho:
2.57 KB
Formato:
Item-specific license agreed upon to submission
Descrição: