Skip to content

Shobhitsingh-2503/DupAlert

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

37 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DupAlerts

SIH1659 : Data download Duplication Alert System (DDAS)

Background

  • In an institute or organization, several users may need access to the same datasets for different purposes.
  • Due to a lack of communication or visibility into existing downloads, these users may unintentionally download the same dataset multiple times.
  • This duplication leads to wasted resources, including bandwidth and storage, and makes data management more complex.

Objective

  • The DDAS aims to address this problem by alerting users when they try to download a dataset that has already been downloaded by someone else within the organization.
  • The system maintain logs of metadata for all downloaded datasets. This metadata includes details like file names, owners, cid, departments, and time.

Features

  • IPFS Hashing: Utilizes IPFS hashing to detect duplicates even if file names differ, ensuring accurate identification.
  • Download Alerts: Notifies users at the time of download if the file has already been downloaded by another user, providing a link to the original file.
  • Real-Time Synchronization: Ensures up-to-date information and alerts by synchronizing downloaded datasets across users in real-time with Firebase.
  • Intuitive UI: Features a user-friendly interface with search and filter functionality for easy navigation and dataset management.
  • Resource Conservation: Helps users avoid unnecessary downloads, conserving resources, saving time, and simplifying data management.

Applications

This system is applicable across various fields and industries, including:

  • Academic Institutions: Prevents duplication in research projects and promotes efficient collaboration by making existing datasets easily accessible.
  • Research Facilities: Manages large scientific datasets efficiently and fosters better data sharing in cross-disciplinary research.
  • Government Agencies: Optimizes data storage and bandwidth in data-intensive operations and enhances interdepartmental coordination.
  • Corporate Environments: Avoids unnecessary duplication in data analytics and ensures efficient data management across global operations.
  • Libraries and Archives: Efficiently manages digital collections, preventing redundant downloads.
  • Healthcare and Biotech Organizations: Prevents redundant downloads in medical data repositories, ensuring efficient data handling.

Tech Stack

  • React : Front-end framework for building a dynamic and responsive user interface.
  • IPFS : Decentralized storage system for content-addressable file storage, ensuring reliable duplication detection.
  • Firebase : Real-time database and authentication services for seamless data management and synchronization.

Future Improvement Scope

  • Local Download Alerts: Warns users before downloading if the file has already been downloaded by them on their local system.
  • Role-Based Access Control: Only authorized users can approve downloads and delete files, enhancing security and data integrity.
  • User Feedback Mechanism: Allows users to report false positives and suggest improvements, contributing to the system's continuous refinement.

Team members

Static Badge Static Badge Static Badge Static Badge Static Badge Static Badge

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published