Part of IBM? View this page on W3 for additional options.
Authors: Whei-Jen Chen
Reference data is a key aspect of any application integration. Today, many enterprises have no centralized enterprise governance and management over reference data. Reference data variations and inconsistencies can be a major source of data quality issues within the enterprise and can cause business losses through system downtime, incorrect transactions, and incorrect reports. IBM® InfoSphere® Master Data Management Reference Data Management Hub (InfoSphere MDM Ref DM Hub) is designed as a ready-to-run application that provides the governance, process, security, and audit control for managing reference data as an enterprise standard, resulting in fewer errors, reduced business risk, and cost savings. This IBM Redbooks® Solution Guide highlights the value of this powerful solution and describes how to implement it in your organization
Reference data refers to data that is used to categorize other data within enterprise applications and databases. Reference data includes the lookup table and code table data that is found in virtually every enterprise application, such as country codes, currency codes, and industry codes.
Reference data is distinct from transactional data and master data. Transactional data is the data that is produced by transactions within applications; master data is the data that represents the key business entities that participate within transactions. Reference data is also distinct from metadata, which describes the structure of an entity. Transactional data, master data, and reference data, when combined, comprise the key business data within an enterprise.
Most enterprise applications contain reference data, built into code tables, to classify and categorize product information, customer information, and transaction data. Reference data changes relatively infrequently, but it does change over time, and given its ubiquity, synchronizing reference data values and managing changes across the enterprise is a major challenge (Figure 1).
Figure 1. Reference data is found everywhere
Did you know?
Reference data has been part of enterprise applications from the beginning of the modern computing era. However, despite this fact and the fact that it constitutes a fundamental class of enterprise data, there is relatively little focus on reference data and its importance as an enterprise data asset.
Ad hoc management of reference data without a formal governance policy can create significant operational risk. For many enterprises, reference data is a major contributor to enterprise data quality problems and has a high support cost. The demands of complying with national and international industry regulations are causing companies to rethink reference data management, and compelling enterprises to manage and control their reference data by using sound data governance principles. IBM® InfoSphere® Master Data Management Reference Data Management Hub (InfoSphere MDM Ref DM Hub) is an ideal solution for reference data management.
Business value
Today, many companies have no centralized enterprise governance over reference data; critical reference data is managed using spreadsheets and manual, ad hoc methods. The difficulty of managing change across the complex web of reference data variations is not systematically addressed; errors in reference data mappings and inconsistencies are accepted and tolerated as an everyday reality. Reference data variations and inconsistencies can be a major source of data quality issues within the enterprise and cause business losses through system downtime, incorrect transactions, and incorrect reports.
InfoSphere MDM Ref DM Hub provides a robust solution for centralized management, stewardship, and distribution of enterprise reference data. It supports defining and managing reference data as an enterprise standard. It also supports maintaining mappings between the various application-specific representations of reference data that are used within the enterprise. The InfoSphere MDM Ref DM Hub supports formal governance of reference data, putting management of the reference data in the hands of the business users, reducing the burden on IT, and improving the overall quality of data used across the organization.
The IBM InfoSphere Master Data Management Reference Data Management Hub was released as a separately chargeable component under the IBM Master Data Management Product ID (PID) in July 2012. The hub was developed as a stand-alone reference data domain on the InfoSphere MDM Custom Domain Hub Platform, which itself is the foundation for the InfoSphere MDM Advanced Edition. The InfoSphere MDM Ref DM Hub implements its own specialized domain model specifically for reference data, that is, reference data is supported as a first-class domain entity. The InfoSphere MDM Ref DM Hub includes a dedicated stewardship interface that is designed for managing reference data. The web-based user interface (UI) runs in the browser and no special code is required on the client. The UI is designed for business users, with intuitive and familiar navigation and controls. A flexible data model supports dynamic modeling of reference data properties through the UI, ensuring a quick implementation and minimizing the need for IT involvement on an ongoing basis.
Figure 2. InfoSphere MDM Ref DM Hub logical architecture
The InfoSphere MDM Ref DM Hub user interface is a web application UI that supports collaborative authoring of reference data. Reference Data Stewards use the RDM web UI for the importing, managing, and publishing of reference data sets. The role-based UI allows a stewardship team to view, author, map, and approve reference data sets within a central repository. With this approach, reference data sets can be created and managed in a controlled manner. User actions on the web UI trigger requests, which are handled by appropriate service controllers present in the Representational State Transfer (REST) layer. The REST layer services invoke the server-side transactions to manage create, read, update, and delete (CRUD) procedures on RDM database.
The server-side is implemented on the proven InfoSphere Master Data Management Custom Domain Hub engine (the same engine that powers InfoSphere MDM Server and InfoSphere MDM Advanced Edition).
The reference data domain model elevates reference data to be a first class domain entity within MDM. By implementing the InfoSphere MDM Ref DM Hub as a new domain on the InfoSphere MDM platform, the InfoSphere MDM Ref DM Hub benefits from a wide range of base services and ready-to-use frameworks that InfoSphere MDM provides, such as business rules, event notification, data quality, and audit history. In addition, several reference data management specific services are implemented to achieve key functionality, such as import and export, reference data set lifecycle management, transcoding, distribution, and versioning.
The client and server enterprise archives reside in an IBM WebSphere® Application Server instance. The currently supported databases are IBM DB2® and Oracle.
Solution architecture
The InfoSphere MDM Ref DM Hub serves as an integration, management, and distribution point in the enterprise for reference data sets, maps between reference data sets, and hierarchies over reference data.
Figure 3 shows an overall view of where RDM fits into an enterprise reference architecture.
Figure 3. Reference Data Management Hub in an enterprise architecture
Reference data sets and hierarchies that InfoSphere MDM Ref DM Hub provides are consumed by enterprise information systems (such as InfoSphere MDM, SAP, data warehouses, business intelligence systems, and so on) to ensure that business objects are accurately and consistently described across the enterprise. Reference data maps are used by data integration layers (such as IBM InfoSphere Information Server, or an enterprise service bus) to map reference data values between source systems and target systems.
There are three core reference data domain objects: sets, maps, and hierarchies (Figure 4). Each object supports the standard CRUD operations. Each object also supports the notion of a validity period (the time when an object becomes active, and the time when an object is no longer valid). Sets and maps also support extensibility and lifecycle.
Figure 4. Reference data domain objects
In addition to the core reference data domain objects, there are some supporting objects utilized in the reference data domain. These objects range from providing underlying support for core objects (types), to providing objects that link to organizational containers (folders), and finally, to a set of objects that are linked to the core objects (subscriptions, managed systems) as part of the reference data ecosystem.
Figure 5 illustrates an InfoSphere MDM Ref DM Hub data model with various reference data objects.
Figure 5. InfoSphere MDM Ref DM Hub data model
Usage scenarios
One of the critical functions of InfoSphere MDM Ref DM Hub is to interact with the reference data found in other enterprise systems. InfoSphere MDM Ref DM Hub can obtain reference data from key enterprise information systems and the managed reference data objects are then used in conjunction with those enterprise information systems.
IBM InfoSphere MDM Reference Data Management Hub supports IBM AIX®, Sun Solaris, and Linux Red Hat operating systems. The database systems supported are DB2 Enterprise Server Edition Version 9.7, Version 10.1, and Oracle Database 11g Enterprise Edition.
For complete hardware and software requirement information, refer to the InfoSphere MDM Reference Data Management Hub Installation Guide and the readme document.
The hardware and software requirements for InfoSphere MDM Reference Data Management Hub might be updated. To obtain the most current information for supported hardware, visit
http://www.ibm.com/software/data/infosphere/mdm_server/requirements.html
Ordering information
IBM InfoSphere MDM Reference Data Management Hub V10 is only available via IBM Passport Advantage®. It is not available as shrinkwrap. This product can only be sold directly by IBM or by authorized IBM Business Partners for Software Value Plus.
To locate IBM Business Partners for Software Value Plus in your geography for a specific Software Value Plus portfolio, contact your IBM representative.
Related information