By 2025, IDC predicts worldwide data will grow 61% to 175 zettabytes, with as much of the data residing in the cloud as in data centers.
Data retention policies based on value, risk and compliance requirements are gaining significance as emerging technologies such a Big Data, Analytics, IoT etc have gained prominence for enterprises. The sheer amount of aged data and rapidly increasing new data from a variety of sources stored both within on-premises environments and in the cloud creates severe governance, security and management challenges.
“Storing Everything” Is not an effective information governance practice & maybe the reason of, slowing down your Database Application performance , increasing costs and increasing time for maintenance activities such as patching, backup recovery, cloning etc.
It is important to understand your organizations environmental triggers where ERP Database Cleaning through archive & Purge can be most effective.
8 Best practices for running archive and purge on your growing data are as follows:
- Determine the retention period based on the business requirements, rules and policies used.
- Determine whether to purge or archive, determine frequency by checking on data growth rate.
- Make sure replication is not enabled during the window when these scripts are run.
- Run these during off peak load hours, Oracle recommends this too.
- If only archiving is required, make sure there is enough disk space available on the database server.
- Plan your purging strategy since purging requires a significant amount of time.
- In a multi-data center, it is recommended that you run purges at low data flow since the data in tables is replicated.
- Custom purging scripts only include the tables used by the standard purging scripts provided. Oracle recommends this too.
1 Determine the retention period based on the business requirements and rules and policies used.
A data retention policy is part of an organization’s overall data management. A policy is important because data can pile up dramatically, so it’s crucial to define how long an organization needs to hold on to specific data. If this important piece on archive strategy is missing in your organization, you can Download the free “Toolkit: Sample Data Retention Policy” and begin here.
Different data types should have different retention periods. How Long Should Data Be Kept? A good rule of thumb is that data should be kept only as long as it’s useful (i.e., for your business needs) and as short a time as required (i.e. according to laws and regulations). Retaining data longer than necessary takes up unnecessary storage space, costs and decrease performance of your application systems more than needed.
Data retention is defined by legal and legislative needs, internal audit needs, and functional business needs. The Data Owners are the ultimate decision maker to understand these needs and address them.
2 Determine whether to purge or archive
The policy should also explain who is responsible for each category of data, and if data that is no longer needed should be archived or deleted. The frequency will be determined on the basis of the relevance of the data with respect to the business / legal needs along with determining the data growth rate, IT will have to determine the storage or processing speed of the infrastructure being used. The execution of this activity can be done through an experienced DBA, however if the data is mission critical, it is highly recommended to include an expert database vendor to help you assess and plan this Archive and Purge strategy to success.
3 Make sure replication is not enabled during the window when these scripts are run
The state of a replication rule can be enabled or disabled. If a replication rule is disabled, data to which the rule is assigned is not replicated. Disabling replication for a client node causes the server to skip the node during replication processing.
4 Run these during off peak load hours, Oracle recommends these too.
Archive and purge could be resource (like CPU) intensive. The most basic performance tuning strategy for archive processing is to reduce competition from normal application processing. For increased performance, run archive processing during off-peak or maintenance hours. You can further speed archive and delete processing by increasing the number of database connections. In cases where there is no competition for the database, you can remove any unnecessary constraints from the database. Another way to enhance efficiency is to design the partitioning of your database tables so that you can process the rows more easily.
5 If archiving is required, make sure there is enough disk space available on the database server.
Data would be moved to archive tables instead of simply purging, archival space should be equal to or greater than the current table’s storage.
6 Plan your purging strategy since purging requires a significant amount of time.
If there are millions of rows that need to be deleted or copied from the database, would take significant amount of time to complete the archive and Purge process. Pre-planning, executing and post clean-up check and validation for accuracy & integrity of your existing system & reports, significant time must be planned for this exercise and the appropriate skills of the DBA or the expert vendor. A database expert vendor can bring in the necessary skills, tools and also advice cost effective methods to accelerate this process to success. An Oracle Database specialist like IT Convergence can expedite your ERP Database cleaning through Archive and Purge click here for your free assessment.
7 In a multi-data center, it is recommended that you run purges at low data flow.
In a Multi-data center, it is recommended that you run purges at low data flow, since the data in tables is replicated. You should consult your database administrator if you have multidimensional clustering (MDC) set up and require purging.
8 Custom purging scripts only include the tables used by the standard purging scripts provided. Oracle recommends this too.
The alterations to the provided purge scripts can include parameterization for user ID. Such alterations should be thoroughly tested before being used in production to ensure they function as expected.