I believe that every Business Intelligence report or analysis should have an informational data handicap score (IDHS) listed as a reporting element. The handicap includes the sum total of data scores for accuracy of context, standardization, structure of use, completeness and ability to extract the information for reporting. The Informational Data Handicap Score should be applied to all reporting and analytics used in every business decision where data is the foundation of information. The cold hard fact is that BI reports and analyses are used in critical business decisions, budgets and plans and are made from data that may be inaccurate, incomplete or unavailable. A report or analysis with your IDHS is a true informational element for BI.
I spend a lot of time analyzing the product data quality, missing data elements, system accessibility because the data elements are impossible to pull out of the system or not collected to support our clients’ enterprise requirements for purchasing, engineering and maintenance decisions. I have to admit, I am always astonished by what I see (or don’t see) and the time and cost to pull data from a system. The reality is the data entered in these systems and the systems themselves are considered a support function (indirect or non-product activity) and not the core revenue generating stream for the business however the data is the life support of BI, accurate and available data is critical for smart and efficient business decisions. The missing gap in most business intelligence programs is a foundational flaw, referred to as data integrity and data quality or the lack thereof.
A business has two options, augment their BI decisions with a data quality scoring model, IDHS, a simple example “I am confident that our inventory budget should be 1 million dollars this year, based on the IDHS (+/- 30%) the actual budget could range from 700,000 to 1.3 million.” The easiest reality is to budget the 1.3 million, with the plan to come in under budget, .3 million provides a safe cushion. This also alleviates the over budget spending and the tedious tasks of re-budgeting or canceling other important initiatives mid quarter / year.
The other option is to incorporate a structured and standardized Master Data Management process with Data Governance to collect, manage, cleanse (legacy / new data), enrich and disseminate information to the various systems. The goal is to create one master record set to ensure that decisions are based on accurate and complete data sets to implement meaningful BI reporting and analytics.
The results of data quality improvements are because of the requirements and processes of MDM. My definition is “An MDM program includes the Data Governance to define data requirements (structure, format and content), and the data processes to manage data activities such as collecting (extraction of BOM data or the data request web form), evaluating, matching (auto and mismatch), structuring, verifying and enriching to minimum data requirements, tracking history of change and data use, quality-assurance, reporting and distributing data (MAXIMO, ORACLE, SAP or another client’s systems) throughout an enterprise to ensure consistency and control. The MDM program will also include an on-going data maintenance process to manage data updates for this information.”
The following elements of data quality should be part of the governance program for your master data. This is critical to support a global enterprise. The discussions and metrics should include:
Accuracy: We intellectually understand the meaning of accuracy. An email address is either right or wrong, however in the product information world it can be a little more complex, this is where data governance is instrumental. The same spare part can be purchase from the manufacturer (one part number) or maybe a supplier (another part number)? A part number can be many different versions; for instance, a master org record is setup with a part number to purchase safety gloves, except one part number can’t buy you safety gloves; you must include the size as a description element in order to purchase. The result of an inaccurate glove record is you may receive all small gloves, but you really wanted large or you may not receive any gloves. Different manufacturers and suppliers have different ordering and purchasing rules.
Standardization: Is absolutely critical to BI reporting. Standardization is the map to how data is entered, referenced and stored to support ease of data access. The data elements should include classification naming, attributes, part numbers including formats, unit of measures, manufacturer and supplier names, addresses, web urls, relationships to parent companies and so forth.
Structured to support multiple uses: If you have one master organization and are only concerned with purchasing systems then structure may not be a concern, but to a global enterprise with multi-systems, the structure of use is extremely important as the data is disseminated to maintenance or inventory systems. In a purchasing system a ’Bearing, Ball’, part number ‘12345’ should only be set up once but in an “end use” structured environment, that ’Bearing, Ball’ is referenced to many pieces of equipment located and used on other equipment and in other plants, it is also listed in engineering drawings, etc. If the multiple use structure is set up correct you can report “where used” for inventory sharing, internal purchasing programs supporting reduction in inventory.
Completeness: Having all data elements entered into the system required for the safe and efficient use of each item. If your data set has some missing prices and a report is provided the value of the inventory, obviously the report is inaccurate. The governance requirements include minimum required data elements. In the world of product data, the process may require a special speedy set up for a critical item that is urgent, however the MDM processes includes going back to acquire the missing information.
Accessibility: The ability to pull information from a system is the foundation of reporting. This is a continual struggle when I am working with a new client. I often ask the questions, “Is the expertise available to be able to query and pull data as needed from existing systems?”, “Is the data stored parametrically or as concatenated text fields?”, “is the table structure extremely complicated?” Accessing the businesses information and providing the ability to slice / dice the information critical to BI.
In this fast moving, big data intense world of collecting and storing information for businesses, the reporting and analytics to enable meaningful decision making is critical, so I ask the question “What does data have to do with business intelligence? EVERYTHING”
