Before push the data to a data flow it is better to profile and cleanse or maintain quality of the data. For data profiling, Information Steward is the best tool which we will discuss later in a different blog.
My intention in each and every blog is to a point explanation of any object which is only required for any developer to start development. For more details, there are many places where it is described.
Today I will discuss three important Quality Transformation in SAP data service. These are given below –
By seeing the name you can predict that this transformation match the records and Identify the records. But Match transformation do more than that.
For an example in one of the cases It will identify that potential identical data may be duplicate from data those do not have primary key to identify the duplicated. Based on matching method an criteria it can identify those duplicated, then –
Source Table :
There are many kind of MATCH transformation is there like Match Base, Address Match , Different type of consumer match.
By changing the configuration need to set rules for matching records then establish the thresholds and scores that BODS uses to determine near matches and matching records. it is based on the below rules-
We will use simple Base_Match for that.
Configuration in Match Wizard:
Expected Result Table :
There are multiple scenarios where we can use Match Transformation different types.
Used to clean the data to prepare for other transformation . The below option you can use to clean the data , replace a pattern of string using the below option –
For more detail you can visit the below SAP URL in which it is described in very clear and details .
Global Address Cleanse transformation are used to parse the input data ,cleanse the data , correct and standardize address data for different countries.
The address proposed field are given below –
In the panel we need to configure the scenario and customized accordingly.
In each scenario we need to customized as per retirement.