SAP Information Steward is a data quality tool. We use the solution to build data quality rules to detect issues in source systems. The solution has many components, the data inside that is being used for data quality and data that is being used for the business glossary and metadata management.
Information Steward has a component called, 'Metapedia' that describes technical fields in business terms. It is equivalent to a business glossary in other systems.
When we see a business intelligence report, we want to be sure that the figures we are seeing are consistent with the transformation. We want to be able to see where the number is coming from. For example, we want to know if the data is from an ERP and is being transformed by an ETL and then loaded into a warehouse. Using metadata management in the Information Steward, we can see all of these steps.
SAP Information Steward also has a cleansing package that helps in the matching process in data services. For example, the name of a person. In many systems, the same person can have their name displayed in many different ways. In one system an individual could be Bob Smith, in another system it could be Bobby Smith and another could show Robert Smith. The cleansing package will try to get all of these synonyms in one place and link it to a standard one. By applying these cleansing packages in data services, we are able to consolidate all the different terms into just one.
For most SAP customers, Information Steward is enough because it is able to build quality data rules to detect issues in the source systems like SAP HANA, Business Warehouse, or other systems.
A business user can first organize their data into several data domains. For example, procurement, human resources, and logistics setup. The domains can build data quality dimensions where you can describe the kind of rule that you are going to use. The user then can immediately see if something is wrong with their data using traffic lights.
Another great feature of SAP Information Steward is the accuracy that the content is followed by validating against the reference tool. With the solution, you are creating data quality dimensions. Within these dimensions, you are creating business data quality rules that are looking for specific fields. From these rules, you can create a scorecard. The scorecard will highlight the percentage of good data and ensure the user can feel confident that the data is accurate within predetermined limits.
SAP tables have field names that are very cryptic, making them hard to understand the meaning of the fields. Metapedia helps describe these fields in business terms.