Detection tasks scan the target source system for sensitive data elements.
Select the Sampling Configuration from the drop-down.
Percent: Scans approximately 100% of the data in a table.
Rows per Region: Scans 1000 rows from each region.
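The difference between the two options can be illustrated with a small sketch. It is not part of the product; the function name `estimate_sampled_rows`, its parameters, and the region sizes are hypothetical, and it assumes that Percent applies to the whole table while Rows per Region caps the rows read from each region.

```python
def estimate_sampled_rows(region_row_counts, mode, value):
    """Estimate how many rows a scan would read (illustrative only).

    region_row_counts: rows stored in each region (hypothetical input)
    mode: "percent" or "rows_per_region"
    value: a percentage in (0, 100] or a row limit per region
    """
    if mode == "percent":
        if not 0 < value <= 100:
            raise ValueError("percent must be greater than 0 and at most 100")
        # Percent is applied to the total number of rows in the table.
        return round(sum(region_row_counts) * value / 100)
    if mode == "rows_per_region":
        if value < 1:
            raise ValueError("rows per region must be at least 1")
        # A region smaller than the limit contributes all of its rows.
        return sum(min(count, value) for count in region_row_counts)
    raise ValueError(f"unknown sampling mode: {mode}")


# Hypothetical table split across three regions.
regions = [5000, 800, 12000]
print(estimate_sampled_rows(regions, "percent", 100))           # 17800
print(estimate_sampled_rows(regions, "rows_per_region", 1000))  # 2800
```

Under these assumptions, Percent grows with the size of the table, whereas Rows per Region grows with the number of regions.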
Perform the following steps to create a sample:
Go to the Hadoop > Hbase > Add/Edit Sampling Configuration tab and click the + Add Configuration button. Alternatively, you can create a sample by clicking the +Add button next to the Sampling Configuration drop-down on the Add New Task Definition screen.
Provide values for the fields depicted in the screenshot:
Enter a unique name for the sampling configuration in the Name field. This field is mandatory.
Enter a brief description of the sampling configuration in the Description field.
Select the Set Sampling Config As Default option to make this sampling configuration the default for all tasks.
Check the Show Advanced Sampling Details option to view and set the advanced settings for the sampling. Below are the options for advanced settings:
Row count Range: Specify the start of the row count range.
To: Specify the end of the row count range.
By: There are two ways to specify how to pick data for sampling from the table:
Rows per Region: Select this option and enter the number of rows to be sampled from each region.
Percent: Select this option and enter the percentage of the table's data to be sampled.
After specifying the values, click the Add button to add the user-defined sampling configuration to the list. Click the Save button to save the configuration in the system, or click Cancel to discard it.
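As a rough aid for preparing values before entering them on the screen, the fields described above can be modelled as follows. This is a minimal sketch, not the product's API: the class `SamplingConfig`, its attribute names, its default values, and the validation rules are assumptions derived from the field descriptions.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SamplingConfig:
    """Hypothetical model of the fields on the sampling configuration screen."""
    name: str                             # Name (mandatory, unique)
    description: str = ""                 # Description
    is_default: bool = False              # Set Sampling Config As Default
    row_count_from: Optional[int] = None  # Row count Range (start)
    row_count_to: Optional[int] = None    # To (end)
    by: str = "percent"                   # "percent" or "rows_per_region"
    value: float = 100                    # percentage, or rows per region

    def validate(self) -> List[str]:
        """Return a list of problems; an empty list means the values look usable."""
        errors = []
        if not self.name.strip():
            errors.append("Name is mandatory")
        if self.by == "percent":
            if not 0 < self.value <= 100:
                errors.append("Percent must be greater than 0 and at most 100")
        elif self.by == "rows_per_region":
            if self.value < 1:
                errors.append("Rows per Region must be at least 1")
        else:
            errors.append("By must be 'percent' or 'rows_per_region'")
        if (self.row_count_from is not None and self.row_count_to is not None
                and self.row_count_from > self.row_count_to):
            errors.append("Row count range start must not exceed its end")
        return errors


# Example: sample 1000 rows from each region.
config = SamplingConfig(name="per-region-1000", by="rows_per_region", value=1000)
print(config.validate())
```

Checking the values this way before clicking Add is optional; the screen itself remains the source of truth for which inputs are accepted.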
For the remaining steps, refer to step 3 of Create a HBase task.