Securing Hadoop: A Survey to investigate need of security in Big Data Processing using Hadoop Ecosystem
Authors: Sonal Jain , Mohit Jain
Certificate: View Certificate
Abstract
Big data is used to store bulk amount of data. Hadoop processing system involves large data to deal with and offers scalable and distributed storage. In every minutes and seconds, large data is generated, with the generation of large data they required to be store in a safe and secure manner. As data, leakage is common in Hadoop Distributed File System so security methods need to be implemented in scenario. Existing work uses ARIA and AES algorithm and faces the issue of memory overheads and extra computation time. The drawbacks of existing work are overcome in presented work by replacing AES with Blowfish algorithm and ARIA by RC6. And also serve with high security architecture.
Introduction
Big data comprises of data, which is in structured and unstructured format. A massive data with different file formats are stored in Big Data. Hadoop provides with a platform to deal with bulk data by offering them scalable and distributed storage area. Case study on big data says that it is a never-ending deal of data evolution, which is too vast and big. Large data is complex to handle and care of with creating complicated environment. Even security becomes complicated for big data to encrypt due to vulnerable environment for attackers.
Conclusion
Concluded work satisfies the author by replacing AES with Blowfish and ARIA by RC6. As existing work is implemented using AES algorithm with ARIA and face multiple issue of time and memory with security. So Blowfish and RC6 will be implemented in presented work to define high security algorithm using Hybrid Architecture. Security architecture is proposed in solution section where client uploads data in HDFS using security algorithm and in return gets cipher text.
Copyright
Copyright © 2025 Sonal Jain , Mohit Jain. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.