Cover for Data Deduplication Approaches

Data Deduplication Approaches

Concepts, Strategies, and Challenges

Book2021

Edited by:

Tin Thein Thwel and G.R. Sinha

Data Deduplication Approaches

Concepts, Strategies, and Challenges

Book2021

 

Cover for Data Deduplication Approaches

Edited by:

Tin Thein Thwel and G.R. Sinha

About the book

Browse this book

Book description

In the age of data science, the rapidly increasing amount of data is a major concern in numerous applications of computing operations and data storage. Duplicated data or redundant ... read full description

Browse content

Table of contents

Actions for selected chapters

Select all / Deselect all

  1. Full text access
  2. Book chapterAbstract only

    1 - Introduction to data deduplication approaches

    G.R. Sinha, Tin Thein Thwel, ... Divya Prakash Shrivastava

    Pages 1-15

  3. Book chapterAbstract only

    2 - Data deduplication concepts

    Pritish A. Tijare

    Pages 17-35

  4. Book chapterAbstract only

    3 - Concepts, strategies, and challenges of data deduplication

    Prakash Chandra Sharma, Sulabh Bansal, ... Su Su Hlaing

    Pages 37-55

  5. Book chapterAbstract only

    4 - Existing mechanisms for data deduplication

    Devendra Kumar Mishra and Sanjiv Sharma

    Pages 57-68

  6. Book chapterAbstract only

    5 - Classification criteria for data deduplication methods

    Sulabh Bansal and Prakash Chandra Sharma

    Pages 69-96

  7. Book chapterAbstract only

    6 - File chunking approaches

    Kapil Kumar Nagwanshi

    Pages 97-109

  8. Book chapterAbstract only

    7 - Study of data deduplication for file chunking approaches

    C.S.N. Koushik, Shruti Bhargava Choubey, ... G.R. Sinha

    Pages 111-124

  9. Book chapterAbstract only

    8 - Essentials of data deduplication using open-source toolkit

    Shivani Girish Dhok and Ankit A. Bhurane

    Pages 125-151

  10. Book chapterAbstract only

    9 - Efficient data deduplication scheme for scale-out distributed storage

    Myat Pwint Phyu and G.R. Sinha

    Pages 153-182

  11. Book chapterAbstract only

    10 - Identification of duplicate bug reports in software bug repositories: a systematic review, challenges, and future scope

    Naresh Kumar Nagwani

    Pages 183-201

  12. Book chapterAbstract only

    11 - A survey and critical analysis on energy generation from datacenter

    Riman Mandal, Manash Kumar Mondal, ... Utpal Biswas

    Pages 203-230

  13. Book chapterAbstract only

    12 - Review of MODIS EVI and NDVI data for data mining applications

    Sangram Panigrahi, Kesari Verma and Priyanka Tripathi

    Pages 231-253

  14. Book chapterAbstract only

    13 - Performance modeling for secure migration processes of legacy systems to the cloud computing

    Ankit Kumar, Pankaj Dadheech, ... Linesh Raja

    Pages 255-279

  15. Book chapterAbstract only

    14 - DedupCloud: an optimized efficient virtual machine deduplication algorithm in cloud computing environment

    Sudhansu Shekhar Patra, Sudarson Jena, ... Mahendra Kumar Gourisaria

    Pages 281-306

  16. Book chapterAbstract only

    15 - Data deduplication for cloud storage

    C.S.N. Koushik, Shruti Bhargava Choubey, ... G.R. Sinha

    Pages 307-317

  17. Book chapterAbstract only

    16 - Data duplication using Amazon Web Services cloud storage

    M. Varaprasad Rao

    Pages 319-334

  18. Book chapterAbstract only

    17 - Game-theoretic analysis of encrypted cloud data deduplication

    Xueqin Liang, Zheng Yan, ... Qinghua Zheng

    Pages 335-356

  19. Book chapterAbstract only

    18 - Data deduplication applications in cognitive science and computer vision research

    G.R. Sinha and Varun Bajaj

    Pages 357-368

  20. Book chapterNo access

    Index

    Pages 369-380

About the book

Description

In the age of data science, the rapidly increasing amount of data is a major concern in numerous applications of computing operations and data storage. Duplicated data or redundant data is a main challenge in the field of data science research. Data Deduplication Approaches: Concepts, Strategies, and Challenges shows readers the various methods that can be used to eliminate multiple copies of the same files as well as duplicated segments or chunks of data within the associated files. Due to ever-increasing data duplication, its deduplication has become an especially useful field of research for storage environments, in particular persistent data storage. Data Deduplication Approaches provides readers with an overview of the concepts and background of data deduplication approaches, then proceeds to demonstrate in technical detail the strategies and challenges of real-time implementations of handling big data, data science, data backup, and recovery. The book also includes future research directions, case studies, and real-world applications of data deduplication, focusing on reduced storage, backup, recovery, and reliability.

In the age of data science, the rapidly increasing amount of data is a major concern in numerous applications of computing operations and data storage. Duplicated data or redundant data is a main challenge in the field of data science research. Data Deduplication Approaches: Concepts, Strategies, and Challenges shows readers the various methods that can be used to eliminate multiple copies of the same files as well as duplicated segments or chunks of data within the associated files. Due to ever-increasing data duplication, its deduplication has become an especially useful field of research for storage environments, in particular persistent data storage. Data Deduplication Approaches provides readers with an overview of the concepts and background of data deduplication approaches, then proceeds to demonstrate in technical detail the strategies and challenges of real-time implementations of handling big data, data science, data backup, and recovery. The book also includes future research directions, case studies, and real-world applications of data deduplication, focusing on reduced storage, backup, recovery, and reliability.

Key Features

  • Includes data deduplication methods for a wide variety of applications
  • Includes concepts and implementation strategies that will help the reader to use the suggested methods
  • Provides a robust set of methods that will help readers to appropriately and judiciously use the suitable methods for their applications
  • Focuses on reduced storage, backup, recovery, and reliability, which are the most important aspects of implementing data deduplication approaches
  • Includes case studies
  • Includes data deduplication methods for a wide variety of applications
  • Includes concepts and implementation strategies that will help the reader to use the suggested methods
  • Provides a robust set of methods that will help readers to appropriately and judiciously use the suitable methods for their applications
  • Focuses on reduced storage, backup, recovery, and reliability, which are the most important aspects of implementing data deduplication approaches
  • Includes case studies

Details

ISBN

978-0-12-823395-5

Language

English

Published

2021

Copyright

Copyright © 2021 Elsevier Inc. All rights reserved.

Imprint

Academic Press

Editors

Tin Thein Thwel

Myanmar Institute of Information Technology (MIIT), Mandalay, Myanmar

G.R. Sinha

International Institute of Information Technology (IIIT), Bangalore, India

Myanmar Institute of Information Technology (MIIT), Mandalay, Myanmar