Data Deduplication using B+ Tree Indexing

dc.contributor.author	Thwel, Tin Thein
dc.contributor.author	Thein, Ni Lar
dc.date.accessioned	2019-08-06T11:33:01Z
dc.date.available	2019-08-06T11:33:01Z
dc.date.issued	2009-12-30
dc.identifier.uri	http://onlineresource.ucsy.edu.mm/handle/123456789/1893
dc.description.abstract	As the amount of storage utilization become larger and larger, people have been tried to find out the efficient ways to safe storage space. The single instance storage or data deduplication becomes vague in storage management as it can eliminate duplicated data or segments in those files. In this paper, we proposed Data Deduplication System for sub-file level. This system can perform deduplication with the integrated use of file chunking algorithm; secure hash function and B+ tree indexing. In this system, we will first separate the file into variable_length segments or chunks using Two Thresholds Two Divisors chunking algorithm. ChunkIDs are then obtained by applying hash function to the chunks. The resulted ChunkIDs are used to build as indexing keys in B+ tree like index structure. This system can reduce the indexing time complexity from O (n) to O (log n). The performance of proposed system will be compared with the other systems in terms of performance metrics such as WinZIP, WinRAR, etc.	en_US
dc.language.iso	en	en_US
dc.publisher	Fourth Local Conference on Parallel and Soft Computing	en_US
dc.title	Data Deduplication using B+ Tree Indexing	en_US
dc.type	Article	en_US