Deprecated: mysql_connect(): The mysql extension is deprecated and will be removed in the future: use mysqli or PDO instead in /home/ejournals/public_html/include/connect.php on line 6

Warning: mysqli_query() expects parameter 1 to be mysqli, resource given in /home/ejournals/public_html/include/verify.php on line 5

Warning: mysqli_num_rows() expects parameter 1 to be mysqli_result, null given in /home/ejournals/public_html/include/verify.php on line 7

Warning: mysqli_fetch_assoc() expects parameter 1 to be mysqli_result, null given in /home/ejournals/public_html/include/verify.php on line 10
Philippine EJournals| Development of a File Duplicate Detector System Using Hashing Algorithm

HomeInternational Journal on Social Innovation & Researchvol. 6 no. 1 (2013)

Development of a File Duplicate Detector System Using Hashing Algorithm

Danny G. Umoso

Discipline: Information Technology

 

Abstract:

The problem on duplication of files in an external storage has been increasing. The inability of built-in software to detect such duplication results to a massive loss of storage space. Files can be the same in terms of content and structure even if they have different filenames. The use of different techniques and algorithms leads to the identification of similar files. Elimination of duplicate files results to freeing space and optimal use of file storage. Hashing algorithm is one of the possible algorithms that can be used to detect file duplication. The capability of the hashing technique to identify the structure (file hash) of data files paves the way to validate whether files are duplicates or not. Such technique is a bit crude because of its simplicity; however, it is effective for academic use. This research further supports those discussions on hashing algorithm capabilities in determining duplicate files.