UCSY's Research Repository

Data Deduplication using B+ Tree Indexing

dc.contributor.author Thwel, Tin Thein
dc.contributor.author Thein, Ni Lar
dc.date.accessioned 2019-08-06T11:33:01Z
dc.date.available 2019-08-06T11:33:01Z
dc.date.issued 2009-12-30
dc.identifier.uri http://onlineresource.ucsy.edu.mm/handle/123456789/1893
dc.description.abstract As storage utilization grows larger and larger, people have tried to find efficient ways to save storage space. Single instance storage, or data deduplication, has come into vogue in storage management because it can eliminate duplicated data or segments within files. In this paper, we propose a Data Deduplication System that works at the sub-file level. The system performs deduplication through the integrated use of a file chunking algorithm, a secure hash function, and B+ tree indexing. The system first separates a file into variable-length segments, or chunks, using the Two Thresholds Two Divisors (TTTD) chunking algorithm. ChunkIDs are then obtained by applying the hash function to the chunks. The resulting ChunkIDs are used as indexing keys in a B+ tree-like index structure. This reduces the indexing time complexity from O(n) to O(log n). The performance of the proposed system will be compared, in terms of performance metrics, with other systems such as WinZIP and WinRAR. en_US
dc.language.iso en en_US
dc.publisher Fourth Local Conference on Parallel and Soft Computing en_US
dc.title Data Deduplication using B+ Tree Indexing en_US
dc.type Article en_US
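
The pipeline described in the abstract (TTTD chunking, hashing each chunk into a ChunkID, then looking the ChunkID up in an ordered index) can be illustrated with a minimal sketch. Everything specific here is an assumption and not taken from the paper: the function and class names, the window size, the threshold and divisor values, and the choice of SHA-1 as the secure hash. A sorted list searched with binary search stands in for the B+ tree, so lookups are O(log n) as in the abstract, but insertions into the list are not; a real B+ tree would make both logarithmic.

```python
import hashlib
from bisect import bisect_left

WINDOW = 16                          # rolling-hash window in bytes (assumed value)
BASE = 257
MOD = (1 << 31) - 1
POW = pow(BASE, WINDOW - 1, MOD)     # weight of the byte leaving the window


def fingerprints(data: bytes) -> list:
    """Rolling polynomial hash of the last WINDOW bytes ending at each position."""
    fps, h = [], 0
    for i, byte in enumerate(data):
        if i >= WINDOW:
            h = (h - data[i - WINDOW] * POW) % MOD   # drop the oldest byte
        h = (h * BASE + byte) % MOD                  # shift in the new byte
        fps.append(h)
    return fps


def tttd_chunks(data: bytes, t_min=64, t_max=512, d_main=128, d_backup=64) -> list:
    """Two Thresholds (t_min, t_max), Two Divisors (d_main, d_backup) chunking."""
    fps = fingerprints(data)
    chunks, start, backup, i = [], 0, -1, 0
    while i < len(data):
        length = i - start + 1
        cut = -1
        if length >= t_min:                          # never cut below the minimum
            if fps[i] % d_backup == d_backup - 1:
                backup = i                           # remember a fallback breakpoint
            if fps[i] % d_main == d_main - 1:
                cut = i                              # main divisor matched
            elif length >= t_max:
                cut = backup if backup >= 0 else i   # forced cut at the maximum
        if cut >= 0:
            chunks.append(data[start:cut + 1])
            start, backup, i = cut + 1, -1, cut + 1  # rescan from the new chunk start
        else:
            i += 1
    if start < len(data):
        chunks.append(data[start:])                  # trailing partial chunk
    return chunks


class ChunkIndex:
    """Ordered ChunkID index; a sorted list stands in for the B+ tree."""

    def __init__(self):
        self.keys = []

    def add(self, chunk_id: str) -> bool:
        """Insert chunk_id if absent. Returns True when the chunk is new."""
        pos = bisect_left(self.keys, chunk_id)       # O(log n) search
        if pos < len(self.keys) and self.keys[pos] == chunk_id:
            return False
        self.keys.insert(pos, chunk_id)
        return True


def deduplicate(data: bytes, index: ChunkIndex, store: dict) -> list:
    """Store only unseen chunks; return the list of ChunkIDs that rebuilds data."""
    recipe = []
    for chunk in tttd_chunks(data):
        chunk_id = hashlib.sha1(chunk).hexdigest()   # SHA-1 is an assumed choice
        if index.add(chunk_id):
            store[chunk_id] = chunk
        recipe.append(chunk_id)
    return recipe
```

In this sketch, deduplicating the same file a second time against the same index adds no new chunks to the store, and joining the stored chunks in recipe order reproduces the original bytes.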

