Transforming Legacy Data Systems to Modern Big Data Platforms Using Hadoop

Authors

  • Satish Vadlamani Independent Researcher, Amberpet, Hyderabad-500007, Telangana State, India,
  • Shashwat Agrawal Independent Researcher, Northeastern University, Mehrauli, Ghaziabad, Uttar Pradesh, India,
  • Swetha Singiri Independent Researcher, JNTU University, Hyderabad ,
  • Akshun Chhapola Independent Researcher, Delhi Technical University, Delhi,
  • Om Goel Independent Researcher, Abes Engineering College Ghaziabad,
  • Shalu Jain Reserach Scholar, Maharaja Agrasen Himalayan Garhwal University, Pauri Garhwal, Uttarakhand

Keywords:

Legacy data systems, big data platforms, Hadoop, Hadoop Distributed File System (HDFS), MapReduce, data migration

Abstract

The transition from traditional legacy data systems to contemporary big data platforms presents a significant opportunity for organizations seeking enhanced data management and analytical capabilities. This paper explores the migration process to modern big data frameworks, with a particular focus on utilizing Hadoop—a widely adopted open-source framework designed for scalable and fault-tolerant data processing. Legacy systems, often characterized by rigid architectures and limited scalability, struggle to accommodate the increasing volume, velocity, and variety of data generated in today’s digital landscape. Hadoop, with its distributed storage and processing capabilities, offers a robust solution to address these challenges.

This study investigates the core components of Hadoop, including Hadoop Distributed File System (HDFS) and MapReduce, and their roles in facilitating seamless data integration, storage, and processing. The paper highlights key strategies for successful migration, such as data assessment, system compatibility evaluation, and incremental implementation. Additionally, it examines case studies where organizations have leveraged Hadoop to modernize their data infrastructure, resulting in improved data accessibility, real-time analytics, and operational efficiency.

By delineating the benefits and addressing the complexities associated with transitioning to Hadoop, this paper aims to provide a comprehensive guide for organizations contemplating a shift from legacy systems to big data platforms. The insights presented are intended to assist stakeholders in making informed decisions and optimizing their data management strategies in the era of big data.

References

Venkata Ramanaiah Chintha, Priyanshi, Prof.(Dr) Sangeet Vashishtha, "5G Networks: Optimization of Massive MIMO", IJRAR - International Journal of Research and Analytical Reviews (IJRAR), E-ISSN 2348-1269, P- ISSN 2349-5138, Volume.7, Issue 1, Page No pp.389-406, February-2020. (http://www.ijrar.org/IJRAR19S1815.pdf )

Cherukuri, H., Pandey, P., & Siddharth, E. (2020). Containerized data analytics solutions in on-premise financial services. International Journal of Research and Analytical Reviews (IJRAR), 7(3), 481-491 https://www.ijrar.org/papers/IJRAR19D5684.pdf

Sumit Shekhar, SHALU JAIN, DR. POORNIMA TYAGI, "Advanced Strategies for Cloud Security and Compliance: A Comparative Study", IJRAR - International Journal of Research and Analytical Reviews (IJRAR), E-ISSN 2348-1269, P- ISSN 2349-5138, Volume.7, Issue 1, Page No pp.396-407, January 2020. (http://www.ijrar.org/IJRAR19S1816.pdf )

"Comparative Analysis OF GRPC VS. ZeroMQ for Fast Communication", International Journal of Emerging Technologies and Innovative Research, Vol.7, Issue 2, page no.937-951, February-2020. (http://www.jetir.org/papers/JETIR2002540.pdf )

Ahmed, F., & Rauf, A. (2019). A Review of Big Data Migration Strategies and their Impact on Legacy Systems. International Journal of Information Management, 45, 131-143. https://doi.org/10.1016/j.ijinfomgt.2018.10.004

Alazab, M., & Ganaie, S. (2020). Challenges and Solutions for Legacy System Migration to Cloud Computing. Journal of Cloud Computing: Advances, Systems and Applications, 9(1), 12. https://doi.org/10.1186/s13677-020-00177-8

Bansal, A., & Bhardwaj, A. (2018). Big Data and Hadoop: Opportunities and Challenges. International Journal of Computer Applications, 182(14), 12-16. https://doi.org/10.5120/ijca2018916868

Bhatia, A., & Saini, H. (2019). Evaluating the Impact of Hadoop in Data Processing and Management. International Journal of Computer Applications, 182(27), 17-22. https://doi.org/10.5120/ijca2019918440

Das, S., & Maji, A. (2019). Data Migration in Hadoop: A Comprehensive Survey. Journal of Information Technology, 34(2), 145-160. https://doi.org/10.1057/s41265-018-0061-0

Elghannam, A., & Abulkhair, M. (2017). The Role of Hadoop in Big Data Analytics: A Review. Big Data Research, 6, 17-26. https://doi.org/10.1016/j.bdr.2017.01.002

Farooq, U., & Majid, A. (2016). Impact of Big Data on the Legacy Systems: A Case Study of Data Migration to Hadoop. Journal of Computer Science and Technology, 31(4), 829-843. https://doi.org/10.1007/s11390-016-1651-8

Gupta, S., & Gupta, A. (2020). Assessing the Effectiveness of Big Data Migration Strategies: Lessons Learned from Hadoop Implementations. Information Systems Management, 37(2), 162-175. https://doi.org/10.1080/10580530.2020.1740801

Jadhav, A., & Pande, A. (2018). Strategies for Migrating Legacy Systems to Big Data Platforms: A Systematic Review. Journal of Cloud Computing: Advances, Systems and Applications, 7(1), 25. https://doi.org/10.1186/s13677-018-0122-0

Kumar, A., & Gupta, R. (2019). Data Migration to Hadoop: A Review of Techniques and Challenges. Journal of Data Science, 17(3), 355-375. https://doi.org/10.6339/JDS.201903_17(3).0008

Liu, X., & Zhang, Y. (2020). Understanding the Impact of Big Data Migration on Legacy System Performance. Journal of Systems and Software, 165, 110560. https://doi.org/10.1016/j.jss.2019.110560

Mandal, S., & Chakraborty, M. (2018). Big Data Management with Hadoop: An Overview and Future Directions. Journal of Management Information Systems, 35(3), 1024-1039. https://doi.org/10.1080/07421222.2018.1498257

Patel, S., & Kumar, R. (2017). Legacy Systems and Big Data: A Study on Migration Strategies. International Journal of Computer Applications, 167(6), 1-6. https://doi.org/10.5120/ijca2017913831

Reddy, S., & Jha, M. (2020). Performance Analysis of Hadoop in Big Data Management: A Comprehensive Review. International Journal of Data Warehousing and Mining, 16(2), 12-27. https://doi.org/10.4018/IJDWM.2020040102

Sharma, R., & Sharma, A. (2019). Transforming Legacy Data Systems to Big Data Platforms: Challenges and Solutions. International Journal of Information Technology, 11(4), 851-859. https://doi.org/10.1007/s41870-019-00437-5

Singh, P., & Singh, A. (2016). Cloud Computing and Big Data: Synergy and Challenges in Migration. International Journal of Cloud Computing and Services Science, 5(4), 235-246. https://doi.org/10.11591/ijccs.v5i4.6328

Soni, P., & Thakar, D. (2017). Hadoop: A New Paradigm for Data Management. International Journal of Computer Applications, 162(3), 20-24. https://doi.org/10.5120/ijca2017915540

Thomas, S., & Zhang, W. (2018). Analyzing Data Migration Challenges in Hadoop Implementations. Journal of Systems and Software, 140, 77-88. https://doi.org/10.1016/j.jss.2018.03.040

Varma, A., & Kaur, R. (2020). Leveraging Hadoop for Effective Data Management: A Case Study. International Journal of Information Management, 52, 102067. https://doi.org/10.1016/j.ijinfomgt.2019.102067

Yadav, M., & Yadav, P. (2015). Data Migration to Big Data Technologies: Exploring the Challenges and Best Practices. International Journal of Advanced Computer Science and Applications, 6(7), 145-152. https://doi.org/10.14569/IJACSA.2015.060721

Mokkapati, C., Jain, S., & Pandian, P. K. G. (2022). "Designing High-Availability Alahari, Jaswanth, Dheerender Thakur, Punit Goel, Venkata Ramanaiah Chintha, & Raja Kumar Kolli. (2022). "Enhancing iOS Application Performance through Swift UI: Transitioning from Objective-C to Swift." International Journal for Research Publication & Seminar, 13(5): 312. https://doi.org/10.36676/jrps.v13.i5.1504.

Vijayabaskar, Santhosh, Shreyas Mahimkar, Sumit Shekhar, Shalu Jain, & Raghav Agarwal. (2022). "The Role of Leadership in Driving Technological Innovation in Financial Services." International Journal of Creative Research Thoughts, 10(12). ISSN: 2320-2882. https://ijcrt.org/download.php?file=IJCRT2212662.pdf.

Voola, Pramod Kumar, Umababu Chinta, Vijay Bhasker Reddy Bhimanapati, Om Goel, & Punit Goel. (2022). "AI-Powered Chatbots in Clinical Trials: Enhancing Patient-Clinician Interaction and Decision-Making." International Journal for Research Publication & Seminar, 13(5): 323. https://doi.org/10.36676/jrps.v13.i5.1505.

Agarwal, Nishit, Rikab Gunj, Venkata Ramanaiah Chintha, Raja Kumar Kolli, Om Goel, & Raghav Agarwal. (2022). "Deep Learning for Real Time EEG Artifact Detection in Wearables." International Journal for Research Publication & Seminar, 13(5): 402. https://doi.org/10.36676/jrps.v13.i5.1510.

Voola, Pramod Kumar, Shreyas Mahimkar, Sumit Shekhar, Prof. (Dr.) Punit Goel, & Vikhyat Gupta. (2022). "Machine Learning in ECOA Platforms: Advancing Patient Data Quality and Insights." International Journal of Creative Research Thoughts, 10(12).

Salunkhe, Vishwasrao, Srikanthudu Avancha, Bipin Gajbhiye, Ujjawal Jain, & Punit Goel. (2022). "AI Integration in Clinical Decision Support Systems: Enhancing Patient Outcomes through SMART on FHIR and CDS Hooks." International Journal for Research Publication & Seminar, 13(5): 338. https://doi.org/10.36676/jrps.v13.i5.1506.

Alahari, Jaswanth, Raja Kumar Kolli, Shanmukha Eeti, Shakeb Khan, & Prachi Verma. (2022). "Optimizing iOS User Experience with SwiftUI and UIKit: A Comprehensive Analysis." International Journal of Creative Research Thoughts, 10(12): f699.

Agrawal, Shashwat, Digneshkumar Khatri, Viharika Bhimanapati, Om Goel, & Arpit Jain. (2022). "Optimization Techniques in Supply Chain Planning for Consumer Electronics." International Journal for Research Publication & Seminar, 13(5): 356. doi: https://doi.org/10.36676/jrps.v13.i5.1507.

Mahadik, Siddhey, Kumar Kodyvaur Krishna Murthy, Saketh Reddy Cheruku, Prof. (Dr.) Arpit Jain, & Om Goel. (2022). "Agile Product Management in Software Development." International Journal for Research Publication & Seminar, 13(5): 453. https://doi.org/10.36676/jrps.v13.i5.1512.

Khair, Md Abul, Kumar Kodyvaur Krishna Murthy, Saketh Reddy Cheruku, Shalu Jain, & Raghav Agarwal. (2022). "Optimizing Oracle HCM Cloud Implementations for Global Organizations." International Journal for Research Publication & Seminar, 13(5): 372. https://doi.org/10.36676/jrps.v13.i5.1508.

Salunkhe, Vishwasrao, Venkata Ramanaiah Chintha, Vishesh Narendra Pamadi, Arpit Jain, & Om Goel. (2022). "AI-Powered Solutions for Reducing Hospital Readmissions: A Case Study on AI-Driven Patient Engagement." International Journal of Creative Research Thoughts, 10(12): 757-764.

Arulkumaran, Rahul, Aravind Ayyagiri, Aravindsundeep Musunuri, Prof. (Dr.) Punit Goel, & Prof. (Dr.) Arpit Jain. (2022). "Decentralized AI for Financial Predictions." International Journal for Research Publication & Seminar, 13(5): 434. https://doi.org/10.36676/jrps.v13.i5.1511.

Mahadik, Siddhey, Amit Mangal, Swetha Singiri, Akshun Chhapola, & Shalu Jain. (2022). "Risk Mitigation Strategies in Product Management." International Journal of Creative Research Thoughts (IJCRT), 10(12): 665.

Arulkumaran, Rahul, Sowmith Daram, Aditya Mehra, Shalu Jain, & Raghav Agarwal. (2022). "Intelligent Capital Allocation Frameworks in Decentralized Finance." International Journal of Creative Research Thoughts (IJCRT), 10(12): 669. ISSN: 2320-2882.

Agarwal, Nishit, Rikab Gunj, Amit Mangal, Swetha Singiri, Akshun Chhapola, & Shalu Jain. (2022). "Self-Supervised Learning for EEG Artifact Detection." International Journal of Creative Research Thoughts (IJCRT), 10(12). Retrieved from https://www.ijcrt.org/IJCRT2212667.

Kolli, R. K., Chhapola, A., & Kaushik, S. (2022). "Arista 7280 Switches: Performance in National Data Centers." The International Journal of Engineering Research, 9(7), TIJER2207014. tijer tijer/papers/TIJER2207014.pdf.

Agrawal, Shashwat, Fnu Antara, Pronoy Chopra, A Renuka, & Punit Goel. (2022). "Risk Management in Global Supply Chains." International Journal of Creative Research Thoughts (IJCRT), 10(12): 2212668.

Downloads

Published

2022-10-30

How to Cite

Satish Vadlamani, Shashwat Agrawal, Swetha Singiri, Akshun Chhapola, Om Goel, & Shalu Jain. (2022). Transforming Legacy Data Systems to Modern Big Data Platforms Using Hadoop. Universal Research Reports, 9(4), 426–450. Retrieved from https://urr.shodhsagar.com/index.php/j/article/view/1379

Issue

Section

Original Research Article

Most read articles by the same author(s)

1 2 3 4 > >>