Data-Driven Storage Optimization for Digital Repositories Using Big Data Techniques
Authors: Dr. Mariana Oliveira, Dr. Stefan Keller, Dr. Lucia Fernandez
DOI: 10.87349/JBUPT/271202
Page No: 4-06
Abstract
Software Configuration Management (SCM) deals with various changes and evolution in the software. Each software comprises of thousands of versions. Individual versions need to be stored again and again. Every software keeps on evolving so we need to keep track on each evolution. Software engineer uses mining techniques to store and retrieve these kinds of data’s. This research paper deals with the design, and implementation of an efficient storage management for SCM repositories that facilitates a developer’s to store revisions of software changes using Map reduce Techniques. The main objective of this research work is at making the storing and retrieval process of SCM repositories easier. Storage of SCM repository keeps on increasing. SCM repository needs a processing technique to process those data before storing. Map reduce Technique is used to process those data in Divide and Conquer manner. It stores source code in the format of the graph so storing and retrieval process is much easier. It, in turn, reduces the storage whole SCM repository. Thus, an efficient storage system for SCM repositories is achieved and a prototype is discussed in this paper.



