Sunday, May 19, 2019

Data Masking

An vaticinator uninfected Paper July 2010 selective in formation concealment lift out Practices vaticinator White Paper entropy viewing outstrip Practices Executive Overview 1 Introduction 1 The Challenges of book binding entropy . 2 Implementing entropy mask .. 2 Comprehensive endeavor-wide baring of Sensitive info .. Enforcing referential Relationships during selective information Masking .. 4 Rich and extensible Mask Library.. 6 Sophisticated Masking Techniques .. 7 High Performance Mask effect 9 merged testing with Application tint Management solutions11 vaticinators Comprehensive Solutions for entropybase gage .. 12 node Case Studies . 2 Conclusion .. 13 seer White PaperData Masking Best Practices Executive Overview Enterprises need to sh atomic number 18 intersection info with various constituents while as healthful as protecting keen or personally identifiable aspects of the information. As the number of activitys increases, more and more sele ctive information gets sh ard, thus further increasing the risk of a data breach, where rude(a) data gets exposed to unauthorised parties. illusionist Data Masking addresses this problem by irreversibly replacing the original clear data with possible -looking scrubbed data that has akin emblem and characteristics as the original sensitive data thus enabling formations to sh ar this information in conformation with information warrantor policies and government regulations. This paper describes the best practices for deploying seer Data Masking to protect sensitive information in prophesier and refreshing(prenominal) motley databases such(prenominal) as IBM DB2, Microsoft SQLServer.Introduction Enterprises sh are data from their production finishings with other users for a variety of melodic line purposes. Most organizations assume production data into footrace and development environments to will application developers to rill application elevates. Retail c omp anies share customer point-of-sale data with market researchers to analyze customer buying patterns. Pharmaceutical or healthcare organizations share patient data with aesculapian researchers to assess the efficacy of clinical trials or medical treatments.Numerous industry studies on data privacy carry concluded that almost all companies copy tens of millions of sensitive customer and consumer records to non-production environments for exam, development, and other uses. Very few companies do allthing to protect this data even when sharing with outsourcers and ternion parties. Almost 1 out of 4 companies responded that withstand data apply for development or analyzeing had been lost or stolen and 50% s aid they had no manner of k straighting if data in non-production environments had been compromised. 1 oracle White PaperData Masking Best Practices The Challenges of Masking DataOrganizations begin essay to address these issues with custom hand-crafted solutions or repurpo sed existing data manipulation tools within the opening move to puzzle out this problem of sharing sensitive information with non-production users. Take for example, the most common solution database scripts. At introductoryly glance, an advantage of the database scripts approach would appear that they specifically address the unique privacy needs of a occurrence database that they were designed for. They may have even been tuned by the DBA to run at their fastest Lets look at the issues with this approach. 1.Reusability Because of the tight association between a script and the associated database, these scripts would have to re-written from scratch if apply to another database. There are no common capabilities in a script that offer be easily leveraged crosswise other databases. 2. Transparency Since scripts tend to be monolithic programs, auditors have no enhancer into the cover procedures used in the scripts. The auditors would find it extremely difficult to offer any recommendation on whether the dissembleing bidding reinforced into a script is secure and offers the enterprise the appropriate degree of protection. 3.Maintainability When these enterprise applications are upgraded, new disheartens and columns containing sensitive data may be added as a part of the upgrade process. With a script-based approach, the entire script has to be revisited and updated to accommodate new tables and columns added as a part of an application patch or an upgrade. Implementing Data Masking Based on seer Data Masking , prophet has developed a comprehensive 4-step approach to implementing data cloak called stick, Assess, Secure, and Test (FAST). These steps are ? Find This phase involves identifying and cataloging sensitive or regulated data crossways the entire enterprise.Typically carried out by stemma or credential analysts, the goal of this exercise is to come up with the comprehensive list of sensitive data elements specific to the organization and discover the associated tables and columns across enterprise databases that contain the sensitive data. ? Assess In this phase, developers or DBAs in conjunction with business or security measure analysts identify the coating piece algorithms that represent the optimal techniques to alternate the original sensitive data. Developers croupe leverage the existing covering program library or extend it with their have masking routines. ? Secure This and the next steps may be iterative.The security administrator executes the masking process to secure the sensitive data during masking trials. Once the masking process has comp permited and has been verified, the DBA thusly custody over the environment to the application testers. 2 Oracle White PaperData Masking Best Practices ? Test In the final step, the production users execute application processes to test whether the resulting wrapped data can be turned over to the other non-production users. If the masking routines need to be tweaked further, the DBA restores the database to the pre-masked state, fixes the masking algorithms and re-executes the masking process.Comprehensive Enterprise-wide Discovery of Sensitive Data To begin the process of masking data, the data elements that need to be masked in the application must be identified. The first step that any organization must take is to watch what is sensitive. This is because sensitive data is pushd to specific to the government regulations and industry standards that govern how the data can used or shared. Thus, the first step is for the security administrator to publish what constitutes sensitive data and get arranging from the companys compliance or risk officers. A typical list of sensitive data elements may includePerson Name Bank Account way out Maiden Name control board effect (Credit or Debit Card go) Business Address Tax Registration Number or National Tax ID Business Telephone Number Person Identification Number Business telecommunic ate Address W elfare Pension Insurance Number Custom Name Unemployment Insurance Number Employee Number Government Affiliation ID User Global Identifier Military Service ID Party Number or client Number Social Insurance Number Account Name Pension ID Number Mail detail Article Number GPS Location Civil Identifier Number Student Exam Hall Ticket Number Credit Card Number Club Membership IDSocial trade protection Number Library Card Number Trade Union Membership Number Oracle Data Masking provides several easy-to-use mechanisms for isolating the sensitive data elements. 3 Oracle White PaperData Masking Best Practices ? Data computer simulation driven Typical enterprise applications, such as E-Business Suite, Peoplesoft and Siebel, have published their application data assume as a part of their product documentation or the support knowledge base. By supplement the published data models, data masking users can easily associate the relevant tables and columns to the mask formats to prepare the mask definition. ?Application Masking Templates Oracle Data Masking supports the concept of application masking templates, which are XML representations of the mask definition. Software vendors or service suppliers can obtain these pre-defined templates and make them available to enterprises to enable them to merchandise these templates into the Data Masking rapidly and thus, accelerate the data masking implementation process. ? Ad-hoc search Oracle Data Masking has a robust search mechanism that go outs users to search the database quickly based on ad hoc search patterns to identify tables and columns that represent sources of sensitive data.With all the database management capabilities, including the ability to query sample rows from the tables, built into Enterprise double-decker, the Data Masking a can assist enterprise users rapidly construct the mask definition the pre-requisite to mask the sensitive data. For deeper searches, Oracle provides the Oracle Data Finder tool during data masking implementation to search across enterprises based on data patterns, such as NNN-NN-NNNN for social security numbers or 16 or 15 digit sequences beginning with 3, 4 or 5 for credit account . numbers.Using the combination of schema and data patterns and augmenting them with published application meta data models, enterprises can now develop a comprehensive data privacy catalog that captures the sensitive data elements that exist across enterprise databases. To be clear, this is not a static list. This is a dynamic living catalog managed by security administrators that needs to be refreshed as business rules and government regulations transmute as sound as when applications are upgraded and patched and new data elements containing sensitive data are now discovered. Enforcing Referential Relationships during Data MaskingIn todays relational databases (RDBMS), data is stored in tables related by legitimate recognise columns , called primary nominate columns, which allows efficient storage of application data without have to duplicate data. For example, an EMPLOYEE_ID generated from a compassionate capital management (HCM) application may be used in sales force automation (SFA) application tables employ foreign attain columns to keep track of sales reps and their accounts. When deploying a masking solution, business users are often concerned with referential integrity, the family between the primary key and the foreign key columns, in a database or across databases. 4Oracle White PaperData Masking Best Practices CUSTOMERS EMPLOYEES ? ? ? EMPLOYEE_ID FIRST_NAME LAST_NAME Database oblige Application enforced ? ? ? CUSTOMER_ID SALES_REP_ID COMPANY_NAME SHIPMENTS ? ? ? SHIPMENT_ID SHIPPING_CLERK_ID CARRIER Figure 1The Importance of Referential Integrity Oracle Data Masking automatically identifies referential integrity as a part of the mask definition creation. This convey that when a business user chooses to mask a key colu mn such as EMPLOYEE_ID, the Oracle Data Masking discovers all the related foreign key relationships in the database and enforces the same mask format to the related foreign key columns.This guarantees that the relationships between the various applications tables are preserved while ensuring that privacyrelated elements are masked. In applications where referential integrity is enforced in the database, Oracle Data Masking allows these relationships to be registered as relate d columns in the mask definition, thereby applying the same masking rules as applied to the database-enforced foreign key columns. 5 Oracle White PaperData Masking Best Practices Figure 2 machine-driven enforcement of referential Integrity Rich and Extensible Mask LibraryOracle Data Masking provides a centralized library of out-of-the-box mask formats for common types of sensitive data, such as credit card numbers, phone numbers, national identifiers (social security number for US, national insurance number fo r UK). By leveraging the Format Library in Oracle Data Masking, enterprises can apply data privacy rules to sensitive data across enterprise-wide databases from a hit source and thus, ensure consistent compliance with regulations. Enterprises can also extend this library with their own mask formats to act as their specific data privacy and application requirements. Oracle White PaperData Masking Best Practices Figure 3 Rich and extensible Mask Format Library Oracle Data Masking also provides mask primitives, which serve as building blocks to allow the creation of nearly infinite custom mask formats ranging from numeric, alphabetic or date/time based. Recognizing that the objective-world masking needs require a high degree of flexibility, Oracle Data Masking allows security administrators to create user-defined-masks. These user-defined masks, written in PL/SQL, let administrators create unique mask formats for sensitive data, e. g. enerating a unique email address from fictitiou s first and last names to allow business applications to send test notifications to fictitious email addresses. Sophisticated Masking Techniques Data masking is in general a trade-off between security and reproducibility. A test database that is same to the production database is 100% in terms of reproducibility and 0% in terms of security because of the fact that it exposes the original data. Masking technique where data in sensitive columns is replaced with a single glacial abide by is 100% in terms of security and 0% in terms of reproducibility.When considering various masking techniques, it is important to consider this trade-off in mind when selecting the masking algorithms. Oracle Data Masking provides a variety of sophisticated masking techniques to meet application requirements while ensuring data privacy. These techniques ensure that applications continue to live without errors later on masking. For example, ? Condition-based masking this technique makes it possible to apply different mask formats to the same data particularize depending on the rows that match the conditions.For example, applying different national identifier masks based on country of origin. ? Compound masking this technique ensures that a set of related columns is masked as a group to ensure that the masked data across the related columns retain the same relationship, e. g. city, state, zip values need to be consistent after masking. 7 Oracle White PaperData Masking Best Practices settled Masking Deterministic masking is an important masking technique that enterprises must consider when masking key data that is write across multiple applications.Take, for example, three applications a human capital management application, a customer relationship management application and a sales data warehouse. There are some key handle such as EMPLOYEE ID referenced in all three applications and needs to be masked in the corresponding test systems a employee identifier for each employee i n the human resources management application, customer service lesson identifiers, which may also be EMPLOYEE IDs, in the customer relationship management application and sales delegate IDs, which may be EMPLOYEE IDs in the sales data warehouse.To ensure that data relationships are preserved across systems even as privacy-related elements are removed, deterministic masking techniques ensure that data gets masked systematically across the various systems. It is vital that deterministic masking techniques used produce the transposition masked value consistently and yet in a manner that the original data cannot be derived from the masked value. One way to think of these deterministic masking techniques is as a melt down that is applied on the original value to generate a unique value consistently that has the same format, type and characteristics as the original value, e. . a deterministic function f(x) where f(x1) will always produce y1 for a given value x1. In ramble for the de terministic masking to be applied successfully, it is important that the function f(x) not be reversible, i. e. the inverse function f-1(y1) should not produce x1 to ensure the security of the original sensitive data. Deterministic masking techniques can be used with mathematical entries, e. g. social security numbers or credit card numbers, as well as with text entries, e. g. , to generate names.For example, organizations may require that names always get masked to the same set of masked names to ensure consistency of data across runs. Testers may find it dissipated if the underlying data used for testing is changed by production refreshes and they could no longer locate certain types of employees or customer records that were examples for specific test cases. Thus, enterprises can use the deterministic masking functions provided by Oracle Data Masking to consistently generate the same replacement mask value for any type of sensitive data element.Deterministic masking becomes extr emely critical when testing data feeds coming from external systems, such as employee expense data provided by credit card companies. In production environments, the feed containing real credit card numbers are processed by the accounts payable application containing employees twin(a) credit card information and are used to reconcile employee expenses. In test systems, the employee credit card numbers have been obfuscated and can no longer be matched against the data in the flat files containing the employees real credit card number.To address this requirement, enterprises pre-load the flat file containing data using tools such as SQL*Loader, into standard tables, then mask the sensitive columns using deterministic masking provided by Oracle Data Masking and then extract the masked data back into flat file. Now, the application will be able to process the flat files correctly just as they would have been in Production systems. 8 Oracle White PaperData Masking Best Practices High Pe rformance Mask executing Now that the mask definition is complete, the Oracle Data Masking can now execute the masking process to replace all the sensitive data.Oracle Enterprise Manager offers several options to clone the production database ? obtain from backup Using the Oracle Managed Backups functionality, Oracle Enterprise Manager can create a test database from an existing backup. ? Clone Live Database Oracle Enterprise Manager can clone a live production data into any non production environment within a few clicks. The clone database readiness also provides the option to create a clone image, which can then be used for other cloning operations.With the cloned (non-production) database now ready for masking, the Oracle Data Masking builds a work list of the tables and columns chosen for masking. Other tables that are not required to be masked are not touched. Further, the tables selected for masking are processed in the optimal order to ensure that only one pass is made at any time even if there are multiple columns from that table selected for masking. Typically, the tables with the primary keys get masked first, followed by the dependent tables containing foreign keys.Once the mask work list is ready, the Oracle Data Masking generates office tables for all the sensitive fields and their corresponding masked values. These are temporary tables that are created as a part of the masking process, which will be dropped once all data has been masked successfully. Using a highly efficient data the great unwashed mechanism, Oracle Data Masking rapidly recreates the masked replacement table based on original tables and the mapping tables and restores all the related database elements, such as indexes, constraints, grants and triggers identical to the original table.Compare this with the typical data masking process, which usually involves performing table row updates. Because rows in a table are usually scattered all over the disk, the update process is ex tremely ineffectual because the storage systems attempts to locate rows on data file stored on extremely large disk s. The bulk mechanism used by Oracle Data Masking lays down the new rows for the masked table in rapid succession on the disk. This enhanced efficiency makes the masked table available for users in a fraction of the time spent by an update-driven masking process.For large tables, Oracle Data Masking automatically invokes SQL parallelism to further speed up the masking process. Other performance enhancements include using the NOLOGGING option when recreating the table with the masked data. Typical database operations such as row inserts or updates generate redo logs, which are used by the database to capture changes made to files. These redo logs are completely supernumerary in a data masking operation since the non-production database is not running in a production environment, requiring continuous availability and recoverability.Using the NOLOGGING option, the Oracl e Data Masking bypasses the logging mechanism to further accelerate the masking process efficiently and rapidly. 9 Oracle White PaperData Masking Best Practices In internal tests run on a single-core Pentium 4 (Northwood) D1 system with 5. 7G of memory, the following performance results with reported. Criteria Baseline metric function Column scalability 215 columns 100 tables of 60G 20 minutes Row scalability 100 million rows 6 columns 1 arcminute 20 minutes Figure 4 Oracle Data Masking Performance scalability testsAs these results clearly indicate, Oracle Data Masking can handle significant volumes of sensitive data effortlessly both in terms of the number of sensitive columns as well as tables with large numbers of rows. Oracle Data Masking is also structured with Oracle Provisioning and Patch Automation in Oracle Enterprise Manager to clone-and-mask via a single workflow. The secure high performance nature of Oracle Data Masking feature with the end-to-end workflow ensures th at enterprise can provision test systems from production rapidly instead of geezerhood or weeks that it would with separate manual processes.Optimized for Oracle databases Oracle Data Masking leverages key capabilities in Oracle databases to enhance the overall manageability of the masking solution. Some of these include ? Flashback Administrators can optionally configure Oracle databases to enable flashback to a premasked state if they encounter problems with the masked data. ? PL/SQL Unlike other solutions, Oracle Data Masking generates DBA-friendly PL/SQL that allows DBAs to tailor the masking process to their needs. This PL/SQL script can also be easily integrated into any cloning process. 0 Oracle White PaperData Masking Best Practices Support for heterogeneous databases Oracle Data Masking supports masking of sensitive data in heterogeneous databases such as IBM DB2 and Microsoft SQLServer through with(predicate) the use of Oracle Database Gateways. Figure 5 Data masking sup port for heterogeneous databases Integrated Testing with Application Quality Management solutions The final step of the masking process is to test that the application is performing successfully after the masking process has completed.Oracle Enterprise Managers Application Quality Management (AQM) solutions provide high quality testing for all tiers of the application stack. Thorough testing can help you identify application quality and performance issues prior to deployment. Testing is one of the most challenge and time consuming parts of successfully deploying an application, but it is also one of the most critical to the sheds success. Oracle Enterprise Managers AQM solutions provide a unique combination of test capabilities which enable you to ?Test fundament changes Real Application Testing is designed and optimized for testing database tier infrastructure changes using real application workloads captured in production to validate database performance in your test environment . 11 Oracle White PaperData Masking Best Practices ? Test application changes Application Testing Suite helps you ensure application quality and performance with complete end-to-end application testing solutions that allow you to automate functional & regression testing, execute load tests and manage the test process.Oracles Comprehensive Solutions for Database Security Oracle provides a comprehensive portfolio of security solutions to ensure data privacy, protect against insider threats, and enable regulatory compliance. With Oracles goodly privileged user and multifactor access control, data classification, transparent data encryption, auditing, monitoring, and data masking, customers can deploy rock-steady data security solutions that do not require any changes to existing applications, saving time and money. Customer Case StudiesCustomers have had a variety of business needs which drove their decision to adopt the Oracle Data Masking for their sensitive enterprise data. These b enefits of using Oracle Data Masking were realized by a major global telecommunications products company that implemented the above methdology. Their database administrators (DBAs) had developed custom scripts to mask sensitive data in the test and development environments of their human resources (HR) application. As the company was growing and offering new services, their IT infrastructure was also growing thus placing an increased burden on their DBAs.By implementing Oracle Data Masking, the organization was able to use the role-based separation of duties to allow the HR analysts to define the security policies for masking sensitive data. The DBAs then automated the implementation of these masking policies when provisioning new test or development environments. Thus, the telecommunications company was able to allow business users to ensure compliance of their non-production environments while eliminating another manual task for the DBAs through automation. The need for data maski ng can come from internal compliance requirements.In the case of this UKbased government organization, the internal audit and compliance team had identified that the nonproduction copies of human resource management systems used for testing, development and reporting did not meet the established standards for privacy and confidentiality. In joint consultations with their IT service provider, the organization quickly identified the Oracle Data Masking as ideally suited to their business needs based on the fact that it was integrated with their day-to-day systems management operations provided by Oracle Enterprise Manager.Within a few weeks, the service provider deployed the mask definitions for their Oracle eBusiness Suite HR application and thereby rapidly brought the internal non-productions systems into compliance. There are organizations that have internally developed data masking solutions that have discovered that custom scripts ultimately have their limits and are not able to scale up as enterprise data sets increase in volume. This Middle East-based real estate company found that their data masking scripts were running for several hours and were mental retardation down as data volumes increased.Due to the stringent requirement to create production copies available for testing within rapid time-frames, the company evaluated the Oracle Data Masking among other commercial solutions. Upon deploying the Oracle 12 Oracle White PaperData Masking Best Practices Data Masking, they discovered that they were able to accelerate the masking time from 6 hours using their old scripts to 6 minutes using the Oracle Data Masking, an usefulness of 60x in performance. ConclusionStaying compliant with policy and government regulations while sharing production data with nonproduction users has become a critical business imperative for all enterprises. Oracle Data Masking is designed and optimized for todays high volume enterprise applications running on Oracle databases. L everaging the power of Oracle Enterprise Manger to manage all enterprise databases and systems, Oracle Data Masking accelerates sensitive data identification and executes the masking process with a simple easy-to-use web interface that puts the power of masking in the workforce of business users and administrators.Organizations that have implemented Oracle Data Masking to protect sensitive data in test and development environment have realized significant benefits in the following areas ? Reducing peril through Compliance By protecting sensitive information when sharing production data with developers and testers, organizations have able to ensure that non -production databases have remained compliant with IT security policies while enabling developers to conduct production-class testing. ?Increasing productivity through Automation By automating the masking process, organizations have been able to reduce the burden on DBAs who previously had to maintain manuallydeveloped maskin g scripts. 13 Data Masking Best Practices July 2010 Copyright 2010, Oracle and/or its affiliates. each rights reserved . This document is provided for information purposes only and the Author Jagan R. Athreya contents hereof are subject to change without notice. This document is not warranted to be error -free, nor subject to any other Contributing Authors arranties or conditions, whether convey orally or implied in law, including implied warranties and conditions of merchant ability or fitness for a particular purpose. We specifically disclaim any liability with respect to this document and no contractual obligations are Oracle pile formed either directly or indirectly by this document. This document may not be reproduced or transmitted in any form or by a ny W orld headquarter means, electronic or mechanical, for any purpose, without our prior written permission. 500 Oracle Parkway Redwood Shores, CA 94065Oracle and Java are registered trademarks of Oracle and/or its affiliat es. Other names may be trademarks of their respective U. S. A. owners. W orldwide Inquiries AMD, Opteron, the AMD logo, and the AMD Opteron logo are trademarks or registered trademarks of Advanced Micro Devices. Intel Phone +1. 650. 506. 7000 and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and are Fax +1. 650. 506. 7200 trademarks or registered trademar ks of SPARC International, Inc. UNIX is a registered trademark licensed through X/Open oracle. com Company, Ltd. 0110

No comments:

Post a Comment

Note: Only a member of this blog may post a comment.