On hierarchical clustering-based approach for RDDBS design
Journal of Big Data ( IF 8.1 ) Pub Date : 2023-11-18 , DOI: 10.1186/s40537-023-00849-7
Hassan I. Abdalla , Ali A. Amer , Sri Devi Ravana

Distributed database system (DDBS) design remains an open challenge after decades of research, especially in dynamic network settings. To meet the demands of high-speed data gathering and the management and preservation of very large systems, it is important to construct a distributed database for real-time data storage. Fragmentation schemes such as horizontal, vertical, and hybrid are widely used in DDBS design. Data allocation cannot be carried out without first physically fragmenting the data, because fragmentation is the foundation of DDBS design. Extensive research has been conducted to develop effective solutions to DDBS design problems, but the great majority of it barely considers the initial design of the RDDBS. This work therefore proposes a clustering-based horizontal fragmentation and allocation technique that handles both the early and late stages of DDBS design. So that each operation flows into the next without any increase in complexity, fragmentation and allocation are done simultaneously. The main goals of the approach are to minimize communication costs, response time, and irrelevant data access. Most importantly, it has been observed that the proposed approach may effectively improve RDDBS performance by simultaneously fragmenting and allocating multiple relations. Through simulations and experiments on synthetic and real databases, we demonstrate the viability of the strategy and show that it considerably lowers communication costs for typical access patterns at both the early and late stages of design.
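
To make the idea of clustering-based horizontal fragmentation followed by allocation concrete, the following is a minimal illustrative sketch. It is not the authors' algorithm, which the abstract does not specify. It assumes that selection predicates are grouped by average-linkage agglomerative clustering on the Jaccard distance between the sets of queries that use them, and that each resulting fragment is placed at the site that accesses it most often. All names (`cluster_predicates`, `allocate`, `usage`, `query_freq`, the sites `S1` and `S2`) and the sample data are hypothetical.

```python
from itertools import combinations


def jaccard_distance(a, b):
    """Return 1 - |a & b| / |a | b| for two sets of query ids."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def cluster_predicates(usage, threshold):
    """Average-linkage agglomerative clustering of predicates.

    usage maps a predicate to the set of query ids that use it.
    Merging stops once the closest pair of clusters is farther
    apart than threshold. Each final cluster is one fragment.
    """
    clusters = [[p] for p in sorted(usage)]

    def linkage(c1, c2):
        dists = [jaccard_distance(usage[p], usage[q]) for p in c1 for q in c2]
        return sum(dists) / len(dists)

    while len(clusters) > 1:
        # Find the closest pair of clusters under average linkage.
        (i, j), dist = min(
            (((i, j), linkage(clusters[i], clusters[j]))
             for i, j in combinations(range(len(clusters)), 2)),
            key=lambda item: item[1],
        )
        if dist > threshold:
            break
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return [sorted(c) for c in clusters]


def allocate(fragments, usage, query_freq):
    """Place each fragment at the site with the highest total access frequency.

    query_freq maps a query id to {site: frequency}.
    """
    placement = {}
    for frag in fragments:
        queries = set().union(*(usage[p] for p in frag))
        totals = {}
        for q in queries:
            for site, freq in query_freq[q].items():
                totals[site] = totals.get(site, 0) + freq
        # sorted() makes tie-breaking deterministic.
        placement[tuple(frag)] = max(sorted(totals), key=totals.get)
    return placement


# Hypothetical workload: which queries use which predicates, and from where.
usage = {
    "region='EU'": {"q1", "q2"},
    "region='US'": {"q3"},
    "status='open'": {"q1", "q2"},
    "status='closed'": {"q3", "q4"},
}
query_freq = {
    "q1": {"S1": 10},
    "q2": {"S1": 5, "S2": 1},
    "q3": {"S2": 8},
    "q4": {"S2": 2},
}

fragments = cluster_predicates(usage, threshold=0.5)
placement = allocate(fragments, usage, query_freq)
```

With this sample workload, the predicates used by `q1` and `q2` end up in one fragment placed at `S1`, and those used by `q3` and `q4` in another placed at `S2`. The sketch runs the two steps one after the other for readability; the paper describes performing fragmentation and allocation simultaneously.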




Updated: 2023-11-19