Posts Tagged ‘masterdata’

Informational Data Handicap Score (IDHS) for your BI analysis and reporting

Thursday, October 20th, 2011

I believe that every Business Intelligence report or analysis should have an informational data handicap score (IDHS) listed as a reporting element. The handicap includes the sum total of data scores for accuracy of context, standardization, structure of use, completeness and ability to extract the information for reporting.  The Informational Data Handicap Score should be applied to all reporting and analytics used in every business decision where data is the foundation of information. The cold hard fact is that BI reports and analyses are used in critical business decisions, budgets and plans and are made from data that may be inaccurate, incomplete or unavailable. A report or analysis with your IDHS is a true informational element for BI.

I spend a lot of time analyzing the product data quality, missing data elements, system accessibility because the data elements are impossible to pull out of the system or not collected to support our clients’ enterprise requirements for purchasing, engineering and maintenance decisions. I have to admit, I am always astonished by what I see (or don’t see) and the time and cost to pull data from a system. The reality is the data entered in these systems and the systems themselves are considered a support function (indirect or non-product activity) and not the core revenue generating stream for the business however the data is the life support of BI, accurate and available data is critical for smart and efficient business decisions. The missing gap in most business intelligence programs is a foundational flaw, referred to as data integrity and data quality or the lack thereof.

A business has two options, augment their BI decisions with a data quality scoring model, IDHS, a simple example “I am confident that our inventory budget should be 1 million dollars this year, based on the IDHS (+/- 30%) the actual budget could range from 700,000 to 1.3 million.”  The easiest reality is to budget the 1.3 million, with the plan to come in under budget, .3 million provides a safe cushion. This also alleviates the over budget spending and the tedious tasks of re-budgeting or canceling other important initiatives mid quarter / year.

The other option is to incorporate a structured and standardized Master Data Management process with Data Governance to collect, manage, cleanse (legacy / new data), enrich and disseminate information to the various systems. The goal is to create one master record set to ensure that decisions are based on accurate and complete data sets to implement meaningful BI reporting and analytics.

The results of data quality improvements are because of the requirements and processes of MDM. My definition is “An MDM program includes the Data Governance to define data requirements (structure, format and content), and the data processes to manage data activities such as collecting (extraction of BOM data or the data request web form), evaluating, matching (auto and mismatch), structuring, verifying and enriching to minimum data requirements, tracking history of change and data use, quality-assurance, reporting and distributing data (MAXIMO, ORACLE, SAP or another client’s systems) throughout an enterprise to ensure consistency and control. The MDM program will also include an on-going data maintenance process to manage data updates for this information.”

The following elements of data quality should be part of the governance program for your master data. This is critical to support a global enterprise. The discussions and metrics should include:

Accuracy: We intellectually understand the meaning of accuracy. An email address is either right or wrong, however in the product information world it can be a little more complex, this is where data governance is instrumental. The same spare part can be purchase from the manufacturer (one part number) or maybe a supplier (another part number)? A part number can be many different versions; for instance, a master org record is setup with a part number to purchase safety gloves, except one part number can’t buy you safety gloves; you must include the size as a description element in order to purchase. The result of an inaccurate glove record is you may receive all small gloves, but you really wanted large or you may not receive any gloves. Different manufacturers and suppliers have different ordering and purchasing rules.

Standardization: Is absolutely critical to BI reporting. Standardization is the map to how data is entered, referenced and stored to support ease of data access. The data elements should include classification naming, attributes, part numbers including formats, unit of measures, manufacturer and supplier names, addresses, web urls, relationships to parent companies and so forth.

Structured to support multiple uses: If you have one master organization and are only concerned with purchasing systems then structure may not be a concern, but to a global enterprise with multi-systems, the structure of use is extremely important as the data is disseminated to maintenance or inventory systems. In a purchasing system a ’Bearing, Ball’, part number ‘12345’ should only be set up once but in an “end use” structured environment, that ’Bearing, Ball’ is referenced to many pieces of equipment located  and used on other equipment and in other plants, it is also listed in engineering drawings, etc. If the multiple use structure is set up correct you can report “where used” for inventory sharing, internal purchasing programs supporting reduction in inventory.

Completeness: Having all data elements entered into the system required for the safe and efficient use of each item. If your data set has some missing prices and a report is provided the value of the inventory, obviously the report is inaccurate. The governance requirements include minimum required data elements. In the world of product data, the process may require a special speedy set up for a critical item that is urgent, however the MDM processes includes going back to acquire the missing information.

Accessibility: The ability to pull information from a system is the foundation of reporting. This is a continual struggle when I am working with a new client. I often ask the questions, “Is the expertise available to be able to query and pull data as needed from existing systems?”, “Is the data stored parametrically or as concatenated text fields?”, “is the table structure extremely complicated?” Accessing the businesses information and providing the ability to slice / dice the information critical to BI.

In this fast moving, big data intense world of collecting and storing information for businesses, the reporting and analytics to enable meaningful decision making is critical, so I ask the question “What does data have to do with business intelligence? EVERYTHING”


 

The Master Data Management and Governance of Maintenance Data

Monday, March 14th, 2011

My strong belief in Master Data Management (MDM) incorporates the management of data from the entry point and multi-channel uses throughout the enterprise. This philosophy results in a holistic understanding of the data content and uses achieving data quality enterprise wide. Yes, an overwhelming task but it can be achieved if you take a step back from the one-dimension software thought process . . . . centered around one software product. Through my experiences, the lack of ownership within the enterprise results in a chain of isolated data islands with only the concerns to perform the isolated activity. MDM is much more than a single data activity or transaction within the operation or a software system to perform said activity.

In the perfect MDM world, naturally not only does the data (product, services, spare parts) adhere to governance, structure of classification, quality and content but also a data structure of location of use. An example of structure could incorporate naming standards for location of use, for example plant or office. Within the plant, the use could be referenced to a department, referenced to a piece of equipment and to a specific location within the department. This type of structure is preset in a MDM plan and will benefit the maintenance data structure. The MDM data plan and structure meets the requirements of the complete enterprise, the purchasing department may only require 5 or 6 data elements but the maintenance department will require 10 or more. This is why Master Data Management requires a complete view of all data concepts and use.

Think of how powerful the analytics are if the enterprise is set up with established standards through governance for plant / facilities location structure, location names, equipment location structure and equipment naming standards. The benefits include the ability to view equipment and spare parts enterprise wide enabling the initiation of common spare parts strategies, spare parts sharing programs supporting inventory planning and reduction.

This type of MDM planning also supports equipment moves or disposals with the view of spare parts associated to the equipment. The spare parts can be packaged and moved or disposed of at the time of the disposition of the equipment. I can’t count the number of times that I have been told that I am not even sure if we still have this piece of equipment that these inventoried spare parts are used on.

Now the beauty, yes I said beauty, is that the required data structure can be set up with templates, written into requirements and contracts to equipment suppliers and when the bill of material data deliverables are sent to the engineering department of the enterprise (entry point) ensuring the data location governance structure is audited and at that point accepted to start the data cleansing and purchasing setup or rejected to fix the data structure errors. Other key data elements are classification, verification, enrichment and translation before the data is setup in any of the enterprise systems.

The by-product of the well executed MDM governance plan is that once the spare parts data is processed, the cleansed record is then propagated into the purchasing system, engineering library and maintenance system. The maintenance system is fully loaded with spare parts information associated to equipment and locations of use ready for the maintenance staff to set up their tasks for the equipment maintenance and planning strategies.

View Jackie Roberts's profile on LinkedIn

The Act of Data Migration is not Master Data Management

Tuesday, March 1st, 2011

Let’s face it, if an organization is spending millions of dollars to purchase and integrate an ERP system, then the project requirements and schedule will be driven by the IT department. Unfortunately, in the definition of the scope of the project most will only focus on moving the “dirty” data from the legacy system to the bright new and shiny ERP system. The IT implementation then moves to the next integration and the users of the data will have the same data issues that plagued them in the legacy now also in the new ERP system.

A flawed philosophy is to migrate the legacy data, meet the deadline, call the project green and successful while the users must figure out to how to correct and update the data. These ERP systems are not designed to handle the volume of change to data, provide a simple method to track change, obsolete a record with history view and archive functionality is non-existent.  Another reason this is a flawed philosophy is that purchasing contracts are set up based on the “bad” data, a unit of measure, part number or manufacturer change will void a contract resulting in wasted time of valuable resources and at the end of the day an inability to source the item, this could set in motion a critical manufacturing line shut down. Let face it, an ERP system is designed store a product or service record providing the business a method to transact, not to cleanse a record to a single master version of an accurate classified, verified and technically described Master Record. Therefore the activity of migrating data to a new system is not Master Data Management.

Master Data Management needs to be independently structured and separately managed in the organization not through IT. It is critical that within the Master Data Management organization to properly represent the business assets of the data (engineering, purchasing, customer, etc). The data is the core information used as the foundation to run the operations, sometimes referred to as the BI for the analytics of sound decision making processes. If the data is incorrect in the new systems, how is the BI improved? How is the business case ever calculated and successfully achieved? I can’t even imagine trying to tally up the potential “cost savings” when bad data is migrated to a new system.

Establishing a MDM program will need to have clear and well defined ownership, stake in the end user organizations and representation in the design and schedule of the software roll outs with full participation in all the projects with data involved. They should also participate in the project design strategy for systematically cleansing, classifying and migration of the data to the new system. Strategy should include an audit of data in the legacy system, let’s face it there maybe 20 year old records with no transactional history or balance on hand in inventory. Should this data be moved to the new system? The answer is NO.

An MDM data strategy to support the IT team can encompass a number of options. A simple option is the publishing of a long term schedule establishing adequate time for the data group to meet the data cleansing and classification requirements. This is not always possible, so what about a phased strategy? Some of the possible steps should include

  • Evaluation of data to review transactional use 
  • Evaluation of the stock on the shelf and confirm that none of the inventory should be obsolete and disposed of.
  • Review of data related to the equipment but is not inventoried
  • Identify data that should not be moved to the new system
  • Establish data priorities for cleansing starting with high transaction use and stock items classified and cleansed first.

An ongoing maintenance and new set up process is imperative to be established with an easy method to request an urgent record during the data migration to support the day to day operations of the business.

We need to get out of the mindset that MDM is simply a data migration to a new system. MDM is a business process to establish the single version of accurate information which is then propagated throughout the organization, part of which is the proper migration of data from legacy systems.

View Jackie Roberts's profile on LinkedIn

Data Quality: An essential tool to facilitate national security?

Monday, June 28th, 2010

When I think of how to best protect the nations infrastructure two broad categories of national security come to mind, physical security and electronic security.

 Physical protection of our nation’s infrastructure happens at many levels, namely, National which includes all the United States armed forces protecting our land, sea and airspace borders. At the state level which includes state police forces, state highway patrols, and the National Guard forces. The local protection is handled by each and every local police force in addition local community civilian efforts to monitor suspicious activity around neighborhoods more commonly known as neighborhood watches.

 These amounts to hundreds of thousands of human resources that need to be analyzed managed and deployed properly to ensure the nation’s entire critical “brick and mortar” infrastructure is protected adequately and to ensure those individuals are given the proper tools to execute their assigned tasks. The question becomes how do we effectively inventory all of the nation’s human and physical resources in order to facilitate real-time assessments of our ability to protect our critical infrastructure?

 That question presents several seemingly insurmountable problems… Potentially tens of thousands of local level sources of data, thousands of sources for state level data, thousands of sources of federal data in addition to thousands of data sources contained within the firewalls and control of private and publicly owned companies as is the case with major power distribution and communication corporations. To make matters worse there could be hundreds of different software vendors that support the collection, storage, maintenance, and reporting of this information. Each software package has a unique and often proprietary data model and most likely a unique and proprietary meta data schema used to tag the meaning of each database field and values within those fields. Without consistent and easily discernable database schema from each data source, integrating the mass data becomes an impossible task to do in a way that the resulting information can be trusted to make decisions in what amounts to life or death situations.

 One potential solution to this problem is a combination approach which would include a National Master Data Management initiative with the ultimate goal of achieving a level of data quality sufficient enough to accurately derive infrastructure intelligence information to be used to assess the vulnerability any given infrastructure asset has to attack and the potential damage and casualty fallout if an attack were to occur at a location such as any bridges that carry large amount of cargo across the Mississippi. Not just any data will do when allocating resources to secure and protect such important national resources; the data needs to be quality data, data which consists of profiled and standardized vocabulary, the syntax or format of the data, the provenance or source of the data as well as the accuracy and completeness of data. Data that does not meet minimum requirements for the metrics I listed previous can not be used to make decisions that are supported by accurate data.

 My proposal of a National Master Data Management Program would in its most basic description include the creation of a national mashup of these thousands of data sources into one Infrastructure Security Data Warehouse that would be used to govern, analyze and report the readiness of the nation’s infrastructure if an attack were to occur.       

 A mashup is a relatively new term used to describe a database application that pulls information from tens to thousands of data sources and integrates the data together so it can be analyzed over and over again. At the same time would not require the replacement of the thousands of legacy systems the local, state, federal, and private entities use to run their day to day operations.

 I have oversimplified the problem and solution in this explanation, the actual solution is very technical and requires professionals who have managed data integration, data cleansing, data governance and master data management programs in the past. An initiative like we are proposing is not a short term project with a defined beginning and end date, in reality the project began more than 30 years ago with the collection of electronic data. A master data management program for something as critical as the United States infrastructure is not simply a program; it is a complete change in the thinking and the way we interact with the data we spend so much time, money and resources archiving and cataloging. It is a cultural change.

 More technically speaking the idea is to create a classified open technical dictionary which will contain all the terms and definitions needed to describe, at their most atomic level, every single data field we require to generate the information needed to assess and prioritize potential infrastructure targets. We will then tag or associate one of the classified numbers and terms to every single piece of asset data (Master Data) determined necessary to implement the infrastructure protection strategy. We will use a combination of publicly available meta data and newly created meta data to tag all the data elements. This allows us to store and report on them quickly and accurately from a central location with a centralized team, resulting in a true Infrastructure Intelligence Master Data Program. Once all the target data is in a standard format we can further national security goals of increasing the level of quality related to information and reporting, increase the level of interaction between local state and federal authorities and provide threat advisories to citizens and the law enforcement community. As well as providing data that can be used in a variety of training exercises and computer simulations of potential attacks.

 Core to the proposed data quality/master data management program are the business processes used to carry out the data cleansing and enrichment processes. The methods used NEED to be vetted and thoroughly tested with large datasets and algorithms comparable to the complexity of the algorithms that will need to be developed to predict things such as; How many people will be effected if a particular power substation is physically or cyber attacked?, Which railroad infrastructure is key to keeping the flow of military supplies to ports for distribution to our forces worldwide?, Once a threat is identified how long will it take to mobilize enough manpower to properly defend any given location?, Which power generation facilities are large enough that if damaged would results in blackout for 5% or more of the population?, Which open air water sources, if poisoned will cause the most death?. The processes and methods used to execute the project in its initial phase and on an ongoing basis are just as if not MORE important than the quality of the initial data sets, they are mutually independent on each other.

 The implementation of an ongoing and permanent national Infrastructure Master Data Management initiative would be of great benefit and could be essential to the long term growth and protection of the United States of America.

Hey baby, what is your material type and material status . . .

Tuesday, June 15th, 2010

You would never believe the discussions around the “ho-hum” or “don’t sweat the small details” elements of a data cleansing project. Believe it or not, understanding your material type and material status is critical to be able to automate system updates. I have a firm belief that data updates to legacy systems should be completed as a night job or direct feed based a series of programmed templates. In one recent example we created an Oracle system update process for a new item referencing a material type template or another update process if the item is already set up for another location of use but is new to the requesting location, this is sometimes referred to as a location setup or purchasing organization update. You can start to imagine the amount pre-planning work and data mapping that is required for a data cleansing program.

The first fundamental rule is that the customer business doesn’t stop. For all you data purists out there that believe that one day a switch to turn on the cleansed database is in the near future, please include me, I would like to see it. Most master data management projects included years and years of legacy data; therefore there is an acceptance to draw a line in the database by last used date. When I design a data cleansing project, I will have a new item setup process referenced to legacy items, this way the client business continues and as the new items are analyzed and setup, we can reference and update the legacy item information. Independently, we will always have the legacy data cleansing parallel the new set up process.

As the data cleansing project is designed, let’s start to explore the data elements and classifications. Every client will have their material types and material status set up but generally during the data / systems assessment there should be a thorough review of industry standards vs. company processes. I find that our clients appreciate the opportunity to bench mark their processes and data structure elements such as material types and status.  We will start with material type and material status.

Material Type

Material types can be as simple as goods and services or as complicated as service, critical spare, spare part, commodity, generic, blueprint, etc. The material type is a critical element to classify which template is used for setup in the downstream legacy systems with an inventory stocking strategy applied.

Obviously a service can be standardized by the class type to describe the service where a cost for the service can be standardized. The definition of the service is described by the properties, for instance a service class of CLEANING, OFFICE can be set up with descriptive elements such as 10,000 square feet, light cleansing (dusting / vacuuming), etc. From a purchasing perspective, the buyer can run the reports globally to determine how much is spent for office cleaning then evaluate the costs and utilize best practice sourcing strategies and other global supply chain processes to lower costs. The purpose of the standard naming conventions of classes and property are to provide enough standardize information to provide the ability to compare and cost services or products.

If a critical spare is being set up for sourcing and inventory, then the part has been evaluated by maintenance or engineering and determined that the spare is critical for production uptime. An inventory plan is developed for stocking the critical spare including an initial buy quantity, plan for stores (inventory) setup of item’s unit of measure (each, assembly, package, etc.), min / max, reorder quality, stocking location, etc.

Material Status

In addition to applying a “material type” to the item records, due to the longevity of materials used in the manufacturing operation, a material status should be utilized as a long term data maintenance process. In dealing with component manufacturers and suppliers, a component may be active from a plant use perspective; however the component manufacturer no longer manufactures the item. How is that possible? A piece of equipment can have a 10 year or a 50 year life span, to maintain a piece of equipment, a list of recommended spare parts is identified and set up for equipment maintenance. If the spare part component is obsolete by the manufacturer but the piece of equipment is still in use on the production line, the material status would be “obsolete active”. A different buy / stock strategy would be implemented, such as purchase all available stock from the manufacturer or another alternative is to source with unconventional methods such as through eBay or maybe contract the item to be built by a local shop.

Typical material statuses that I have experienced are active, inactive item referenced to an active item, obsolete active, obsolete inactive (typically the status to start the disposal process) and archive. The archive status is a classification used by the analysts to allow the viewing of the item information but is not visible to the client or the item record is not exported to the client systems.

I would appreciate any input or better yet a discussion of the different material types and material status used in Product Information Management (PIM) or Master Data Management (MDM). As an industry we inherited material types and material status used in a purchasing system or maintenance systems designed to meet business function but not from the data quality or master data management perspective. What are the proper data requirements for a material type or material status? The MDM or PIM software companies and data quality consultants need to provide input from the data management perspective to provide long term data management functionality.

View Jackie Roberts's profile on LinkedIn

Enterprise Information Management 2010 via DAMA Management International

Friday, June 11th, 2010

Presentation proposals are now being accepted for the second Enterprise Information Management Conference scheduled for September 21-23, 2010 at the Hilton Toronto in Toronto, Canada.

Speaker submission guidelines can be found here: Online Proposal Form.

All questions regarding speaking may be directed to Wilshire Conferences at maya@wilshireconferences.com. The deadline for submitting your proposal is June 4, 2010, and we anticipate being able to notify accepted speakers by June 14, 2010.

Thanks and we look forward to hearing from you!

Did we forget the old adage “Garbage In, Garbage Out” I mean Garbage Extracted, Garbage Migrated

Friday, April 23rd, 2010

When it comes to Master Data Management, the implied definition is an à la carte of detailing and normalizing activities including data cleansing, data verification, data profiling, data governance, de-duplication, data enrichment and data provenance among other tasks. If you are managing or participating in the activities of a Master Data Management program, you are progressing in the right direction of achieving data quality. If you are NOT participating in the activities of MDM then you are part of a company wide initiative of “Garbage In, Garbage Out (GIGO)”. By the way, GIGO, in this case is not environmentally responsible or a “green” behavior.

Wikipedia’s definition for “Garbage In, Garbage Out, is a phrase in the field of computer science or information and communication technology. It is used primarily to call attention to the fact that computers will unquestioningly process the most nonsensical of input data (Garbage in) and produce nonsensical output (Garbage out).”

If you enter “garbage in” to a computer system, having the data passed through some very expensive ERP or CMMS software, isn’t going to change the data quality, the business results are equivalent to “garbage out”, which will be apparent in the day to day business activities and subsequent reporting used to determine the health of your business. Is it obvious that data should just not be moved from one system to a new system without a MDM program?

Let us now explore the concept of data migration. Wikipedia’s definition for Data Migration is the process of transferring data between storage types, formats, or computer systems. Data migration is usually performed programmatically to achieve an automated migration, freeing up human resources from tedious tasks. It is required when organizations or individuals change computer systems or upgrade to new systems, or when systems merge.

If an MDM program is not in process when implementing a new software or upgrading an existing software, the project should include an evaluation of the data and/or an evaluation of the additional functionality of the “to be” model of the new software identifying the new data required for improved business processes, reporting and the plan for legacy data clean up. A data migration project needs to be more than moving data from a legacy system to the new system.

I asked the question to one user of a maintenance software implemented a number of years earlier as I had the opportunity during a site visit at a plant. The software had awesome abilities to create and manage the relationships between equipment and spare parts, supplier contacts as well as the potential to improve processes, reporting and streamlining the information required for a maintenance organization. The company invested in the software / hardware, understood the ROI but lack the understanding of the data needs or management. The software was implemented however the majority of the functionality was not used, therefore the ROI was never achieved. When I asked why, I was told “no data and we don’t have time to add the data.”

Another scenario I came across, purchasing moved data from a legacy system to a new ERP system. The data wasn’t set up to a data governance or MDM procedure, legacy data riddled with duplication, obsolete information, unstructured descriptions and so forth. Different system, same legacy data quality and the ROI was never achieved.

I have one simple question, why invest in a software product if the data is not going to be treated as an asset? The results of a successful implementation are that the business processes are streamlined; simplified and reporting capabilities are enhanced through enabling both Master Data Management and Software functionality.

Garbage In, Garbage Out or Garbage Extracted, Garbage Migrated as we are moving to the next generation of technology. Are we relying on a skewed nonsensical output based on low quality data to make our critical business decisions?

View Jackie Roberts's profile on LinkedIn

ECCMA’s 11th Annual Data Quality Conference Oct 12-14, 2010

Friday, April 23rd, 2010

Whether you are new to data quality or a seasoned professional, this conference will provide you with a unique opportunity to discuss the latest trends, technologies and software available to the data quality industry. You’ll experience top level speakers discussing how to manage, catalog, clean and standardize your data. It will introduce you to the international standard for data quality , ISO 8000-110. An exhibition will showcase the latest data quality software from companies not only in the U.S. but all over the world.

PROGRAM OF EVENTS  

Tuesday October 12, 2010

  • Pre-conference ISO 8000-110:2009 Master Data Quality Certification Workshop 
  • Welcome Reception (includes open bar and hors d’ oeuvres)

Wednesday October 13, 2010

  • Opening Address
    Overview: The critical need to maintain the quality of master data.
  • Panel PresentationsFundamental updates on the progress of the practical application of the eOTD (ECCMA Open Technical Dictionary), ISO 22745 and ISO 8000 for the collection, validation, and distribution of master data in support of the procurement of goods and services as well as inventory and asset management initiatives. The panels will address the importance to using the standards to define and manage data requirements as well as the latest trends in spend analysis, cataloguing at source and data cleansing and rendering.
  • Exhibition
    A unique opportunity to see the latest offerings from leading data service and software application providers.
  • Annual Awards Dinner
    Celebrate and share achievements with colleagues and friends.

Thursday October 14, 2010

  • Workshops

 

Workshops will cover new technology and practical examples of vendor specific data quality application software and data cleaning services.

*Content subject to change.

Data Cleansing to Achieve Information Quality

Wednesday, March 10th, 2010

Those of us that work around or manage the day to day operations of an MDM, data governance, or data cleansing projects understand the challenges and efforts needed to transform “raw” data though multiple stages of analytics and processes to achieve information quality to be used in our customer’s CRM, CMMS, PIM and ERP systems. The result of an un-cleansed product record can cause a production line to stay off line because an inventory item wasn’t ordered due to incomplete information or added inventory cost of ordering an incorrect item (we can be talking about a $10,000 motor) or multiple entries and setups in the material master due to data duplication.

Data vs. Information definition: to simplify the concept, data is managed by a combination of a team of analysts and software to achieve the goal of a cleansed record or useable information. Data is imported and profiled, classified, structured, verified, enriched, translated and reports generated; we create useable information from low quality data for use in decision making related to engineering, purchasing, maintenance, marketing, sales, etc. The data that is exported into client systems is information that will meet a predetermined set of data governance rules and information quality requirements.

Data Quality Experts, let have a discussion on the definitions of data quality, does an address or a product detail meet the requirement if only classified? Or should verification at source (contact for address or manufacturer / supplier for product) be required at initial setup of the data in the system or maintenance scheduled as part of the data governance program? Is the data incomplete? Does the MDM process include a question / answer scenario to complete the data?

MDM software designers and developers can we also have a discussion on the software’s ease of use to manage the stages of data cleansing to support a MDM philosophy and using advanced techniques to automate the management, add intelligence in processing data imports, workflows and data cleansing stages of classifying, profiling, matching, translation, data audit analytics, exception reports and status reporting of a data record?

I believe these are great discussion points and will serve as great blog topics.

View Jackie Roberts's profile on LinkedIn

It Is Not So Easy to Build a Data Cleansing Logic

Tuesday, March 2nd, 2010

During my morning data quality, MDM and data cleansing reading, I happened upon this on a help site and the million $$ question:

I have a scenario to build a data flow task for Data Cleansing.

Logic 1 to be build:
Source data would be like 1050 and I should convert it to 1.050
Source data would be like 085 and I should convert it to 0.85

Profiling, structuring or normalizing data without any referential information risks errors in business use, especially if the data is use for purchasing or maintenance. If the goal is to automate the data normalization, the data needs to be referenced to metadata, 1050 could be a part number? Or a quantity? It could be an attribute representing a measurement such as length or diameter. Is it an inch or foot or meter?

View Jackie Roberts's profile on LinkedIn

Open Letter to Gartner

Thursday, February 4th, 2010

Dear Andrew White,

Thank you for your comments in “Something beyond MDM is coming your way – would MDM 2.0 fly?” and starting the discussion to expand the definition of MDM to include data integrity, data quality, entity resolution, matching, data integration, governance, metrics and analysis. The topics discussed should also include work flow (management of data and analysts), translation management, data structuring, data profiling, duplication removal, data change management, verification contact management, etc.

The MDM and PIM software industry needs to take a step back to understand actual day to day business requirements of data management to achieve Master Data Quality. Lesson one is that data is created and supplied by many sources in many different formats at various quality levels. Data is created by engineering, submitted by integrators, manufacturers and suppliers. To add to the complexity of the information flow, data is introduced into businesses systems in different departments (engineering or purchasing or maybe plant from maintenance) with different data requirements to meet the needs of that job function. Now the next dynamic is mashing new data to existing legacy data in a number of systems to ensure no duplicates are created, managing obsolete / recommended use and functional equivalents. The old philosophies of a PIM or MDM software to “hold, provide search functionality and maybe a shopping cart” isn’t going to meet the true requirements of the new definitions of Master Data Management.

To meet the new definitions the MDM or PIM software needs to provide horse power to electronically and intelligently processing data to identify exceptions for manual intervention by an analyst. Data should be processed one time to ensure that the data record will be enriched to meet the requirements of the enterprise and then the record is moved to a maintenance program (managed also by the MDM or PIM software). The processing of data needs to be efficient and cost effective, from my perspective the cost of data management should be covered by the cost saving achieved by MDM management.

I look forward to the discussions as the definition of MDM is expanded to include data quality, data governance, data provenience as the software industry provides the intelligence, functionality and business processes to cleanse, enrich and management data for my client to ensure their ability to make confident business decisions based on data integrity and accuracy.

Here is to the future of PIM and MDM!

Jackie Roberts

View Jackie Roberts's profile on LinkedIn

New Data Management System Implementation Common Sense

Friday, January 8th, 2010

With the ever increasing emphasis on finding ways to reduce cost, one of the clear targets is IT and more specifically data management systems. On the surface it can seem like there is real fat to trim, and many times this is true. But it is easy to become lost in the details and eliminate or negate some of the potential savings. Some of these ideas may seem obvious but are often forgotten. The evidence is clear with missed timing and over budget issues seen.

If we’re talking about a large company then inevitably with this new system comes the monolith project with whole organizations of people and processes, projects and documentation. The compulsion is to be sure that everyone, everywhere who has any relationship to it has their input and their needs accounted for. Along the way, the cost of implementation and other peripheral indirect costs have likely negated a great deal of at least any short term savings. Not to mention the potential increase in continuous maintenance costs and loss in performance. These are a few things I’ve learned from experience and I welcome yours.

Always have a specific objective when planning for development or evaluating software to purchase that overrides all others. Start with something like a mission statement, “We need this new system for….”

Determine the Real Needs. Try to separate the “must haves” from the “nice to haves”. Bells and whistles are great but there needs to be a true benefit. Seek a balance between development time, software performance, hardware performance and user experience. I always try to put special emphasis on the user group which stands to benefit the most. Having many users who can do their job faster and more efficiently can add up to real savings versus the few users who have a special need which bogs down the project and performance.

Change is inevitable. If some requests for additional features come along, evaluate them against the mission objective. There is nothing wrong with listening and investigating ideas for project add-ons as long as the benefits outweigh the costs in time and money, but there needs to be a limit or you’ll never complete the project. Good ideas can always be implemented later if it makes sense then you’ll have the benefit of the research already done, but be quick with the research. Evaluate the impact for doing it now or waiting. Here are some good questions to start with: 1) How much more money?  2) Would this be faster/cheaper for programming to do it now versus waiting and doing a more complicated enhancement?  3) Is the impact to the users great enough to warrant it?

Know the roles. Good ideas can come from anyone. Every project must have a project champion who makes the final decisions (and live with them) and also eliminate roadblocks. You need a user advocate who has done the job and knows what it takes. Have programmers who possess both talent and vision, not just code crunchers, and listen to them.

Have good documentation, and “Good” is subject to interpretation. This is another area where the KISS principle is very often not utilized. If you have to hire ten people to sit in meetings just to maintain your documentation you’ve probably overcomplicated it and certainly increased your project cost. I try to start with these principles:

  1. Document the people on the project and their responsibilities. Let there be no question as to who does what.
  2. Everyone who has a job to do needs to understand what they need to do and have the documentation to reference.
  3. Keep the language simple. Focus on getting the point across. If it takes a rocket scientist to understand it you’ve failed.
  4. Of course, document the issues, decisions made, by whom etc. but be sensible. Document enough to cover for the “he said/she said” but content is most important. No bonus points for flash.
  5. Know who is supposed to have what done and when. Another obvious one here but I see too often where target dates are determined top down with little or no thought to cost or the tasks. Don’t let the tail wag the dog. Pushing hard to get the job done is fine but be realistic. Listen to the people who know before making bold predictions.

Data Quality: Classify and Describing

Wednesday, December 2nd, 2009

As the Master Data Management industry matures, the industry focus is not only on the development of software to collect product records but software to implement the data quality process solutions supporting data governance and provenance including record history, structure, completeness and accuracy to ensure our customers are able to make confident, informed and accurate business decisions based on data accuracy. The first step of implementing a data governance program is implementing a naming classification system.

I have had experience working with single business home-grown classification structures and third party developed structures for purchase, currently I have chosen an open and public classification structure provided by ECCMA (www.eccma.org). This is beneficial to the customers that I support ensuring that they will always have access to the classification structure sometime referred to as the schema used to classify their data.

Implementing a classification requires setting up Identification Guide (IG) to establish the template definition to technically describe the product or service with enough information to support engineering, maintenance or purchasing while recognizing the limitation of software short and long description required character lengths. The IG template supports and simplifies the required information request to the manufacturer and suppliers to verify all information by our analysts to standardize the description.

To create an IG, we search the ECCMA class list; fortunately many of the classes are established. As the IG is set up we will use the ECCMA established class name convention; this will ensure that every item will be setup with the same name and format, every ball bearing item submitted will be classified as a BEARING, BALL.

The next step is to set up the properties required to describe the BEARING, BALL and for each property designated the data type requirements such as numeric, text string or designated unit of measure. The property value requirements for a BEARING, BALL might include TYPE, BORE DIAMETER, OUTSIDE DIAMETER, WIDTH, DYNAMIC LOAD CAPACITY, STATIC LOAD CAPACITY, MATERIAL and so forth. Our analysts will verify the data to the original manufacturer sometimes using xml to exchange the product information referred to as “Cataloging at Source”, the information requests are standardized and remove much of the quality issues commonly found in a non-standardized data verification or description process.

The property value description build is controlled by the sequence number of each property Item data that will make it’s way into a length restricted description field we place the most important information in the begin of the auto generated description.

Setting up the Identification Guides requires upfront strategic planning and detailed work, as you can imagine that a classification schema can be up to 10,000 classes depending on the industry but it provides a multitude of benefits including standardized requirements, a road map for our analysts to facilitate the process, improved data management reporting / metrics and enhances language translation for the global organization.

View Jackie Roberts's profile on LinkedIn

Implementation and Use of MRO Naming Standards

Friday, October 23rd, 2009

With all the discussion focusing on Master Data Management and Data Quality, I always come back to these questions: How is the data structured and how is the accuracy and content completeness measured? In our business of managing the coding and verification of items and spare part information needed to keep manufacturing plants running, a structured schema of naming conventions (class), descriptive attribute standardization (properties) and verification at the sources of manufacture (coding @ source) is “key” to quality and completeness measurement. We are managing the ECCMA eOTD for the Automotive Industry Content Standards Council (AICSC) focusing on MRO naming definitions which is the foundation to a spare part description, just as a table of contents is the foundation of a text book.

The first step is to develop the Identification Guide (IG) in order to baseline the properties needed to best describe the class. For example, let’s take the class of SCREW, SHOULDER and the properties TYPE, MATERIAL, FINISH, THREAD SIZE, DRIVE SIZE, SHOULDER DIAMETER, SHOULDER LENGTH, THREAD LENGTH, HEAD DIAMETER, HEAD HEIGHT, SHOULDER LENGTH TOLERANCE, MINIMUM TENSILE STRENGTH, CLASS, HARDNESS RATING and PACKAGE QUANTITY. The IG also provides the information needed for our analysts to acquire properties and our applications to sequence the properties within the short and long descriptions that are built:

SCREW,SHOULDER – | TYPE: HEX HEAD | MATERIAL: 18-8 STAINLESS STEEL | FINISH: PLAIN | HEAD STYLE: HEX | THREAD SIZE: 3/8-16 INCHES | DRIVE SIZE: 3/4 INCHES | SHOULDER DIAMETER: 1/2 INCHES | SHOULDER LENGTH: 2-1/2 INCHES | THREAD LENGTH: 3/4 INCHES | HEAD DIAMETER: 3/4 INCHES | HEAD HEIGHT: 1/4 INCHES | SHOULDER LENGTH TOLERANCE: ±0.005 INCHES | MINIMUM TENSILE STRENGTH: 80.000 POUND-FORCE PER SQUARE INCH | CLASS: 2A | HARDNESS RATING: B85 TO B95 ROCKWELL A | PACKAGE QUANTITY: 2

Each time an item is submitted for coding or processing the item is imported into a master database. Through intervention by our data analysts, the item navigates its way through a number of checkpoints including an auto-suggest to propose a class. The class and properties via the IG are the requirements our coding analysts use to verify the accuracy of the information submitted, to verify the completeness and to acquire the additional information needed to enhance and build an item or spare part description for our clients to base real business decisions.

The implementation of the eOTD is a two process scenario when working with our clients. First, the legacy data is mapped to the class, the item data is profiled, cleansed and enhanced to meet the requirements of eOTD IG, ensuring the client’s data quality goals are met. The updated item information needs to be applied to existing client item data. It is critical that all changes to data be tracked and logged. A properly planned and executed update to legacy ERP and CMMS systems should be initiated to incorporate the enhanced and corrected item information into the user facing systems. This is an extremely critical step as the downstream information flow will affect systems and uses such as inventory re-distribution, purchasing and contract management, engineering bills of materials and maintenance schedules. A thorough and complete mapping of data through the enterprise should be used to understand data flow across all business units. The mapping should include data entry points and data use points through all departments which set up all of the cost saving pay points as the data processing is streamlined.

The second process is an on-going data maintenance plan for new items that are introduced into the organization. This process should start at the introduction of item information into the system. All items and spare part information should be verified with the manufacturer and classified to the eOTD before setup or use in any system. The length of time the coding process requires is a critical element as the item or spare part information should be as complete as possible while at the same time be ready and waiting for the buyer to put the item on a contact or a maintenance employee to setup the tasking information in the CMMS for a piece of equipment. The only requirement for the employees who use the information after its initial entry into the system is to perform the actual requirement of their job and not to decipher a cryptic unstructured description.

If the items are pre-processed using the eOTD and the associated ISO standards, every item and spare part will be structured and standardized. The engineering, purchasing and maintenance departments will focus on the core of their day to day specialized responsibilities instead of searching for parts or dealing with trying to purchase items that a supplier does not recognize or have to acquire the missing information.

We all agree on some of the basic benefits both in process and cost such as reducing inventory with the identification of duplicate items, facilitation of inventory sharing and internal purchasing programs, reduced employee time searching for parts, common spare part usage strategies, reduced downtime in manufacturing equipment due to lack of information availability and ability to manage using a just in-time inventory model. The eOTD and its Identification guides are the building blocks and the roadmap to achieving structured and accurate data that can be reliably used to base real world decisions.

For more information on the eOTD please visit www.eccma.org.

View Jackie Roberts's profile on LinkedIn

Decision and decision

Tuesday, October 6th, 2009

The difference between a Decision and a decision is simple. A Decision spelled with an uppercase “D” is one based on data, information and real-life experience. A decision spelled with a lowercase “d” is one made without data, information or a real-life experience. All too often I have seen decisions based on an individual’s feelings or opinions of a given situation. It makes me shake my head. In some cases, a business will use an algorithm or spreadsheet with embedded formulas to “choose” the best decision based on a set of desired requirements and associated weights applied to each requirement. For certain decisions, this might be the most appropriate way to assess the situation. For me, working to develop web-based software applications, it just doesn’t work.
The most common method of decision making during a development cycle I have come in contact with is the “committee” driven requirements analysis. During this process a group of usually high level managers (far from end users) sit down and work their way through a spreadsheet of requirements to decide which ones should be included in the next six-month iteration of development. In my experience, the only information included in the requirement column is the perceived expected behavior or outcome of the given feature or change. The end of this type of development cycle is usually followed by hundreds of hours of testing and arguing about how each feature should work, how it actually works and how we ‘thought” it would work. As well as two strokes, three heart attacks and a combined three square inches of newly exposed scalp for the male members of the team…

Every day I make decisions. Some turn out to be the right ones. Some turn out to be horribly wrong. Since I accepted the role of product manager rather than simply a project team member, I have put a great deal of thought around decision making. I ask myself questions like: “What information do I need to make a decision the right one?,” “At what point do I have enough information to make the decision?,” “Does the outcome of each decision I make effect the remainder of the product launch in a positive way?” I still can not answer all the questions I have about decision making. However, I have used the following principles to aid in making the right decision most of the time:

1.) Make lots and lots of small Decisions. When you send your developers off on a mission to complete a large section of code or forge an entire revision to an app in one shot, there are inevitably a lot of decisions that have to be made along the way. If your development team has to make these decisions on the fly, against an imaginary timeline, there is a large chance the decisions will be made without all pertinent information. It’s more likely each small Decision will be the right one if you use all the information available from the complete team at each point a decision is required.
2.) Keep the communication channels open between developer and subject matter experts. I recommend daily touch points of less than 15 minutes. Meetings are expensive, time consuming and often attendees are never prepared. I prefer discussions to take place at the programmer’s workstation while he or she is working on changes. This allows demonstrations of current and expected behavior to be shown immediately. Real information and real code turns into a visual aid. It is important all team members understand the purpose is not to meet the schedule. The real purpose is to launch an application people love to use. I might go as far as saying it should be expected that as you move from design to development, you’ll need to make lots of little changes along the way. I question any development cycle where there is little difference between the original designs and the product at launch.
3.) Test, Test, Test. After each Decision there should be some time spent to test it. Testing does not have to be a project in itself. Testing should be performed by both developer and the team member who is in the best position to interpret what will and will not benefit the end user. Testing at each available opportunity is essential to minimizing the amount of change required after a bad decision is made. For example, if a change is made and not tested, each change implemented from that point forward could require revisiting if it’s found the original change was in error.

These principles also contribute to a pleasure filled work environment by allowing each team member to work on what they love. Development does not have to sit in endless conceptual meetings, nor does the product management team need to wait months or weeks to debut new features. The three principles I illustrated above can and should be applied to any development scenario. Using these principles to govern product testing and design reduces our development cost and gets our product to users faster. And in the end, that’s what it’s all about.

Data Quality: Software Innovation Please

Thursday, October 1st, 2009

I am all about the data, location management (to location and equipment), data quality, and methods to improve auto-processing, enhancing data, providing data reports and results that support our customer’s data requirements in their day to day activities.

Here is the million dollar question, this is one scenario: Over a million records in a year, legacy and new records submitted for processing from 2,500 different users and two different business processes (single submit and BOM extract). What technology would be required to intelligently automate the processing of these records to a Master Data Quality Standard?

Remember this is an on-going maintenance process, not a one time migration of non-cleansed data to a new ERP or maintenance system, nor am I referring to parsing the records into different fields of the new ERP system but ensuring that the records are verified, structured, properly attributed with full descriptions and additional information to support the business needs.

First, let’s look at the Wikipedia definition of Product Information Management (PIM) “PIM systems generally need to support multiple geographic locations, multi-lingual data, and maintenance and modification of product information within a centralized catalog to provide consistently accurate information to multiple channels in a cost-effective manner.”

Future PIM software purchasers, what evaluation methods are you using to ensure that your PIM software purchase will support the continuous update and flow of data for your entire enterprise system? Here are some items to take into consideration during your evaluation, these are all items that I ask about and would recommend that you request the answers in writing:

1. How is the change history of the data stored in the system and how easily can it be retrieved?
2. Has the performance of all modules of the software been tested and what is the base line?
3. Request references (at least three) for each module of the software.
4. What is the software product work flow and how is the data processing assigned to employees?
5. Ask to review the documentation and take the time to review; this should be a window into the complexity of the system.
6. Request the design process model and how the software company incorporates customer feedback?
7. What is the bug fix process? What is the quality system to implement a bug fix?
8. What is the software company’s philosophy on customizations at your cost?
9. How is language handled? Translations referenced to a master record?
10. If the software solution is multi module system, how are the master records referenced through
the entire solution?
11. What are the long term design strategies or road maps for each module of the software solution? Ask for the earlier road maps and the software release note to evaluate the how well the software company plans and implement updates to the systems.

And I can go on and on, the licensing; customizing and implementing software in your environment can be extremely costly and time consuming, does Caveat emptor “Let the buyer beware” work in the business world or is there a “Lemon Law” when purchasing software?

View Jackie Roberts's profile on LinkedIn

Budget Time

Thursday, October 1st, 2009

My company’s fiscal year is based on the calendar year as many others are. So, customarily we start the budget planning process in October. It is a detailed process that all of my managers and business units participate in. We usually do a few iterations before it is finalized in mid December. Sound familiar? So here’s the question, after 2009, how do you plan for 2010? Everything we knew and could usually predict with some certainty in recent years went out the window in 2009. Where do you start to plan for the next year? Is it too early to plan for growth, if not, at what pace? What certainty can we count on when developing our plans? The simple fact is, for most of us, we don’t know enough at this stage in the recovery to forecast with certainty where our businesses will be, at least, through mid next year.

So what can be done to insure profitability, or least stability, until growth returns? Control and further reduce costs. Already been there, done that? You have cut staff, benefits, wages, renegotiated prices and terms with suppliers, cut services, slowed production, cut inventories, everything you can think of. Are you sure? How well do you manage your Enterprise wide Master Data Indirect Materials / Commodities spend? What? Everything you buy that supports your facilities and the build of your products. Most large manufactures manage direct material precisely but don’t have an organized approach to their full advantage throughout the Enterprise to strategically manage indirect materials. A solution, fully implemented, provides a number of benefits:

1. “Cleansed” data, eliminating duplication of the same item coded to several different part numbers.

2. Consistent pricing for each and every part / component verified to the OEM level with lead time and warranty information. Minimizing your need to buy spare parts / commodities from distributors or your build sources.

3. Enterprise-wide material management to the department level in every Manufacturing Operation.

4. A reuse or repurposing of excess inventory in Manufacturing Engineering.

5. Able to search inventory with standardized part naming conventions and in multiple languages.

Bottom-line, an aggressive Enterprise wide well executed strategy can and will save your company significant dollars in the first 12 months of implementation. That’s 2010 folks….

“What’s the difference?”

Monday, September 21st, 2009

I have worked for many years supporting major manufacturing clients with operations throughout the world. Often times it has been centered around product engineering support and product documentation. Everything from initial development, prototyping, testing, production, parts (production and after-market), operator and service documentation – soup to nuts. I have always been impressed by the great lengths companies go to ensuring that when the product is ready for market nothing has been left to chance. They know every part that is needed, whether custom built or purchased (supported by engineering drawings), the best price, lead time, how much inventory is needed, sourcing risks to consistent part numbering schema. Virtually every detail that needs to be done to get product successfully out the door and supported has been thought through numerous times.

As I have been working with indirect or non-production spare parts and commodities, I am equally surprised at how little thought of organization goes into the activities that supports the product build or even the facilities. Usually, I find that this whole issue is not dealt with in an organized fashion and is somewhat left to chance. All of the same thought that goes into product development should go into the manufacturing of the product. Why isn’t a Master Database of all indirect materials / commodities required for the Enterprise so the information can be commonly shared? With lead time, common pricing, warranty information, vendor or vendors, etc? First, no one individual owns the enterprise information across the different functional teams. Secondly, it is a decentralized task. Each individual manufacturing facility handles its own needs to get product out the door. In the meantime corporate purchasing is trying to support or at least get its arms around what the Enterprise needs.

By managing this spend consistently throughout the Enterprise, corporations can help ensure product gets out the door 24/7 and reduce their manufacturing cost substantially.

Who Represents the Data in your Master Data Management Software Systems Designs?

Thursday, September 17th, 2009

Those of us that are representatives of Master Data Management initiatives, data quality projects and the users working the processes developed by software makers have a difficult journey in front of us. It seems that for years software developers have designed cumbersome transactional data management systems that do not begin to understand real time data management and what effort it really takes to achieve an on-going Master Data Management program. I have two initial questions: Do these software companies toting one press release after another about Master Data Quality Management even understand the importance of on-going change management to a master data record? How does a business stay in front of the information flow if the software system does not dynamically adapt to the ebb and flow of data volumes and requirements? Software companies track updates and revisions to software code, data is of the same importance sometimes it is of greater importance; the number of data level updates can be monumental depending on the size of the company. Isn’t the end result of a multi-million dollar software system implementation supposed to drive efficiencies and streamline the activities to support their businesses? Cost saving and real time data management is the name of the game.

Here are a few data management tips:

1. Data needs a simple way to be imported into the system. Data comes from a number of sources so a dynamic mapping and import procedure to an internal processing area is useful for data analysis.
2. Yes, there needs to be an area to work on data before it is promoted to a Master Data Status. Software developers need to understand that data is never in a pristine state ready to be entered as a Master Data Record. Never!
3. Data processing requires a managed work flow through the system. Imagine the issue to have thousands of records for analyzing and many employees trying to manage who has what records outside the system. Just not functional work scenario.
4. Never copy data from one software module or grid to another, always reference. Cost per record to manage the data is increased every time a person needs to manually update an aspect of a record more than once.
5. Performance of the software is imperative. To really capitalize on software and technology reporting and analysis need to be done on thousands of records at a time. Time is money.
6. Provenance tracking is extremely imperative especially when “Cataloging @ Source” is the foundation to the quality of the record. Data should be identified with history: where the data originated, contact information, data and time, a revision level, file name, all associated records on the file, etc. MDM system developers, can you start to see the importance of this information?
7. Data needs to be cleansed and profiled; it is important that the software processing tools understand all aspects of the data. For instance search rules should not be so rigid that it takes an analyst manual actions to find a duplicate record because of an extra space or a slash. A worse case scenario is to take the data out of the system to work the data in excel, I am not going to even comment any more on that scenario except that it is totally unacceptable to remove data from a system to try to normalize it. Remember there is a lot of data brought into the business and the cost to manage the data is not core to the primary business, it is an indirect cost. The solution is not outsourcing to a “low cost, low skilled” worker in another country when much of the preprocessing can be done at the expense of CPU time.
8. Data changes, if you have a number of different modules in your software package what is the strategy to support aggregation of the changes to the different business units using the data? Does your software only update in one module and the other modules are in an out of sync situation? Again remember software should be designed to simplify the processes to support the business needs.
9. We live in a global economy language translation and localization of data is more important now than ever. What are the methods translate and maintain localized data?
10. Reporting and exporting of information is critical. It is a requirement to export a segment data set to send to a business customer or run a report of the activities of the work. A MDM system must be able audit data activities through the complete process of import through promotion to a master record.

I am a firm believer that software should not dictate a business process but should be designed to streamline and add efficiency to lower the cost the activity. If you are designing MDM systems, your team should include experts in data management, data quality and business process expertise with applicable experience. Businesses should not be paying for customizations to your software to be support basic 101 management of data.

View Jackie Roberts's profile on LinkedIn

Events DATAForge will be presenting at

Friday, September 11th, 2009

October 2009 is going to be a busy and exciting month for DATAForge. We are scheduled to present at two events and hope to see you all at both

DATAForge will be presenting at the FMMUG 2009 Best Practices to be held October 11th and 12th. This years event is to be hosted by Purdue University. The mission of the Facilities Management Maximo User Group (FMMUG) is to provide a forum for Maximo users in the facilities management industry to exchange information, methods and experiences. This exchange of information is designed to optimize the use of Maximo’s capabilities. For more information visit http://www.fmmug.org.

DATAForge will also be presenting on behalf of the Automotive Industry Content Standardization council at the 10th Annual ECCMA ISO 8000 Data Quality Conference on October 27th, 28th and 29th. This years event will be held at the historic Hotel Bethlehem in Bethlehem, Pennsylvania. Whether you are new to data quality or a seasoned professional, this conference will provide you with a unique opportunity to discuss the latest trends and check out the latest technology. If you have an interest in improving the accuracy of your vendor, material, service or asset masters, improving the descriptions in your ERP software or buy-side or sell-side catalogs or if you are looking for solutions to data integration challenges, the ECCMA conference is the place to be! Please visit the ECCMA website for more information.

It’s Complicated

Wednesday, September 9th, 2009

At DATAForge we pride ourselves on designing simple, elegant, easy to use, web based software for a manufacturing demographic that has been flooded with overly complicated software, abound with options and restrictions, screens to control those options,restrictions and configurations. I’m tired of it. I don’t want you to get me wrong, there is certainly a time, place, and need for software that is configurable in every conceviable way. For example when a multi-state and international corporation is required by law to comply with one of the most complicated tax codes in the recorded history of Earth, then you get a pass for making an application complicated. In this case complication can and has saved many organizations millions or hundreds of millions of dollars, issues like The Sarbanes-Oxley Act of 2002 are not to be taken lightly.

The same logic of presenting every imaginable, option, configuration, button, screen, step, radio button, piece of information has been applied to many software packages. You would think in a large organization, simplicity would be king…not so…I am currently consulting with a large multi-national organization to help in the deploymentof a centralized system to house all product information for their MRO or Maintenance, Repair and Operations. Which, in practical terms, means that they are centralizing their databases of information required to order, maintain, and use any item that can potentially be purchased but does not go into their final product.

Not a small task by any measuring stick. Master Data Management, data cleansing, data normalization, intra-organization de-duplication are on the radar of most if not all large businesses. The most important part of the process is to choose application(s) that are the best fit for your organization, not the one that is made or owned by the largest company, and not the one who has the most clever marketing, not the one that appears in the latest report by the best marketed research firm (think about the ratings agencies who rated toxic subrime mortgage backed securities AA or AAA)

The software that was chosen xxxxxx (contractually obligated not to say the name) has one main screen for entering most of the data related to any given item, this screen contains no less than 50 possible fields in tabular form. There are also 3 additional screen each with less than 50 fields for data entry, these subsequent screens are used to associate ansillary information such as pictures to an item. The screens that DATAForge uses – one screen with 25 or less (depending on the type of data). The remainder of the information is gathered organically and seamlessly based on the way the application is used and who is using it.

When we design a solution the question on each team members mind is “How can I make this easier and faster to do for the end user?”

When evaluating an application force the vendor to show you how it will be used (not tell you), make them show you their solution is faster and more efficient. Lots of options, inputs, and fields are not always the users friend.

Life Cycle Data Management Strategy

Thursday, September 3rd, 2009

Life Cycle Management implies a single “cradle to grave” plan that integrates production support planning, acquisition and sustainment strategies. Think about the importance of data flow and the criticality of accurate data throughout the complete life cycle of a piece of equipment: design, build, install, spare part acquisition, inventory management, maintenance, spare parts sharing and finally, asset disposal. From a data perspective, remember the old computer motto: “Garbage In, Garbage Out”.

What is your Life Cycle Data Management Strategy?

1) Drawing Libraries – The items in the library need to be cleansed and profiled to a classification schema. The schema requires standard naming conventions and technical descriptions. The schema can be designed within your company, priority purchased from another vendor or you can opt for using an open classification dictionary for public use such as the ECCMA eOTD.

2) Common Component Listing – provides a listing of preferred components that support the inventory management strategies for your organization. All equipment designers and builder are required to use the common components identified. Note: common components are set up in the drawing libraries.

3) Spare Part Acquisition – Place the components on purchasing contacts at the beginning of design, this will facilitate the ease of spare parts planning and purchasing. An item on contract provides purchasing the data needed to run analytical algorithms in order to better negotiate pricing organization wide. If the item is set up accurately to a standardized classification dictionary with technical descriptions only one time the whole organization can realize the benefits of the Life Cycle Data Management Strategy.

4) Inventory – supports optimal inventory management by promoting the ability to plan stocking levels and strategies with nearby facilities. Think about the implementation of spare parts sharing or an internal purchase first program. The most important requirement is the standardization or normalization of the data; the part needs to be classified only one-way and should be shown in every system the same way.

5) Maintenance –The use of standardized components coupled with a data management strategy allows the organization to streamline the number of different components used to serve the same function on different equipment. Also reducing the number of parts in inventory and maintenance management tasks.

Life Cycle Data Management Plans starts with component standardization and cleansing the data in your equipment drawing libraries and all downward systems including maintenance. This strategy avoids duplicate inventory items and at the same time promotes an internal purchase philosophy that puts a priority on inventory sharing before issuing supplier purchase orders. Standardizing inventory with information elements such as predefined stocking levels, identification of critical inventory, functionally equivalent item identification and purchasing analytics as well as enhanced vendor management are all necessary steps for a manufacturing business to remain competitive in today’s world of lean low overhead manufacturing.

View Jackie Roberts's profile on LinkedIn

Why Data Cleansing?

Thursday, August 27th, 2009

The statistics around data cleansing are overwhelming and there are mountains of discussions, white papers and tweets available pertaining to Data Quality, Data Profiling and Master Data Management. I think we need to take a step back and try to understand how and why data cleansing has become such a hot topic. You may have realized that business data typically isn’t as streamlined and efficiently maintained as we thought it was. Your organization may have shipped purchased items back because they were not what you thought you had ordered. In some cases another department was found to have the item in inventory, even though we have the item on urgent delivery status from a supplier because the item is set up under a different number or description, you couldn’t have possibly known the item was actually available from existing inventory.

The data quality issues that industries around the world are experiencing have occurred as a result of many years of manual inventory and purchasing record maintenance, through mergers and acquisitions of companies and business units as well as data migrations from various legacy systems into new fangled ERP black holes. There are a number of reasons why.

A common data trap frequently fallen into is assuming that just because you are implementing a new ERP system your organization will now have quality data. Remember the old computer motto – “Garbage In, Garbage Out”. Let me tell you based on first hand experience that there is nothing “sexy” about bad data when the production line is down or any other time.

Data Cleansing and Data Profiling is a very tedious and detailed oriented service. There are a number of key rules to follow whether the profiling and cleansing work is done internally or outsourced to someone who specializes in data cleansing. Here are some rules to consider before a project is started:

1) Conduct a detailed and comprehensive data mapping through all internal systems including engineering, purchasing, asset management, plant inventory management, etc. The goal is to standardize and document all data sources within the enterprise one time and ensure that each department is accounted for and determines what data elements are required to complete their business required tasks.

2) Build a central data cleansing database and make sure all locations using each item are referenced. This ensures that updated information will be passed back to the various legacy systems. You will need old information and updated information for this stage of the process.

3) The data cleansing database should include a balance of electronic scripting for data corrections and manual auditing. A solid process for answering questions needs to be set up. My preference is that the system should use a web utility that tracks data change history and other data related information such as contact information, issue resolution status, classification, questions and answers, etc.

4) The data needs to be referenced to a classification schema and a standard implemented for descriptions and properties. The schema can be designed within your company, priority purchased from another vendor or you can opt for using an open classification dictionary for public use such as the ECCMA eOTD.

5) Free text is not our friend in the data standardization world. If all possible use a system that has built in data rules and ensure anyone entering data into the system understands the standards and the importance of quality data in addition to the high cost to businesses using bad data.

6) Data Cleansing and Profiling the proper way is not “cheap”, but the cost of cleaning the bad data is always less than the expenditures incurred by cleansing your data multiple times or continuing to operate your organization based on erroneous information generated from one or multiple dirty databases.

Cleansed data permits the removal of duplicated inventory items, an internal purchase philosophy that puts a priority on inventory sharing before issuing supplier purchase orders, standardizing inventory with predefined stocking levels, identifying critical pieces of inventory, identifying functionally equivalent items, use of engineering component standardization libraries and facilitates purchasing analytics as well as enhanced vendor management.

View Jackie Roberts's profile on LinkedIn

The Spare Parts World and What It Could Be

Thursday, August 20th, 2009

The conundrum of spare parts management at a high level is perceived and often approached as a process that should be simple. Looking at it from the perspective of the many different entities that form the supply chain and are required to work together symbiotically—component manufacturers, Tier One and Tier Two suppliers, and OEM manufacturers—the logistical expertise needed to coordinate the information flow is anything but simple.

To realize cost savings from new process efficiencies, these separate legal entities need to integrate the information flow and internal groups within each entity such as purchasing, manufacturing engineering, plant maintenance, facilities management, warehousing, commodity management, and asset recovery. Each area must share the mission-critical master data related to the spare parts. Truly integrating the information flow within the conceivably 50-plus business units that indirectly work together across the automotive supply chain to deliver just one item to an OEM sounds literally impossible and cost prohibitive. However, your opinion may change when you read the next couple sentences.

It is estimated that process failures and bad information cost business $1.5 trillion or more in the U.S. alone (Larry English, 2007). A study of large companies, a majority of which have revenues of more than $1 billion, found that 31 percent believe that their costs for incorrect data are $1 million or more per year (Dave Waddington, 2008). The most common element needed by (and from) all involved in the supply chain of the spare parts that keep our lines running is data quality and content as information is transmitted from one organization to another.

Figure 1. Typical Supply Chain Spare Part Data and Information Flow

There is a lot of activity and even more information available around Master Data Management (MDM). MDM and data quality initiatives have become an industry trend these days. To champion a successful MDM effort, formal strategies regarding data standardization in content and structure, as well as import, storage, display, and transmission from your enterprise resource planning (ERP) systems to industry partners are mandatory.

Every supplier, OEM, and manufacturer is using a unique set of data standards to attempt to achieve true “quality” for their data. But how powerful, efficient and beneficial to the automotive industry can the use of silo developed standards be? If all partners were using the same data standards, naming conventions, and requirements to describe spare parts, we can greatly streamline the process needed to exchange the information and at the same time reduce the number of physical and business process failures resulting from the low-quality descriptions contained in our legacy systems, and in most cases, new state of the art ERP systems.

The elements required to achieve a symbiotic information flow for the automotive industry are the same:

A common understanding of what data is needed for a particular class or type of item;
A common method to store the data;
A common method to display the data; and
A common method to transmit the data to those entities that do business together.
The answer is to simplify and standardize the methods used for the exchange of structured, accurate, and efficient data-sharing in an automated fashion, rather than manually sharing as it has traditionally been done. The Electronic Commerce Code Management Association (ECCMA) and DATAForge LLC have formed the Automotive Industry Content Standardization Council (AICSC). The purpose of the council is to facilitate the addition of automotive industry specific terminology to the electronic Open Technical Dictionary (eOTD), create identification guides for quality descriptions, or data requirement statements for individuals, organizations, locations, goods and services.

This also helps develop an automotive supply chain specific spend analysis classification. The dictionary being maintained by ECCMA and the AICSC is ISO standard and public domain; any organization can benefit from its use. ECCMA and the AICSC work with automotive-centric businesses to standardize the way data and information is stored, viewed, and exchanged.

Figure 2. Quality Description:

ECCMA has brought together thousands of experts from around the world and provides them a means of working together in the fair, open, and extremely fast environment of the Internet to build and maintain the global, open-standard dictionaries that are used to unambiguously label information. ISO 22745 spare parts data is capable of being used in any ISO 8000 computer application (neutral exchange), is easily translated, and must stand the test of time (long-term data retention) by using a public domain concept identifier.

Jacqueline Roberts is vice president of DATAForge LLC. For more information about ECCMA, visit the ECCMA Web site.

View web publication:

 http://aiag.informz.net/admin31/content/template.asp?ps=4683&sid=4683&brandid=4002&ptid=406&uid=0&mi=390242

ISO 22745 Standard Based Exchange of Product Data

Thursday, August 20th, 2009

When a spare parts list, bill of material, or other product information for your ERP or inventory system is received, what processes do you follow to make sure the data is accurate and complete?

Typically, maintenance or inventory information is not given any due diligence until it is needed. For instance, a bill of material (BOM) is received, all the parts are set up in your ERP system, and the item record sits untouched until you need to place an order or set the item up in your maintenance system. Then you find that the part number is inaccurate and the supplier doesn’t recognize it, or there is an essential piece of information missing from the description needed to complete the order and bring the line back up. There is a solution: ISO-22745.

The ISO-22745 standard provides the framework needed for any organization to conduct business with internationally recognized data quality. Its most basic purpose is to provide a means to realize the benefits of ISO-8000, which is the ability to specify syntax, semantic encoding, and specification of data requirements for messages containing master data that is exchanged between organizations in the supply chain. Once an organization begins to standardize the descriptions it uses to describe materials, the organization can also begin to see cost savings and cost avoidance by implementing business intelligence algorithms to identify conditions such as duplicate items in inventory, purchase price disparities between facilities, vendor reductions, and identification of functional equivalent items.

ISO-22745’s primary facilitator is the open technical dictionary (OTD), a database of concept IDs and associated descriptive words used to “tag” individual data elements. Once each element is tagged with the concept ID from the OTD, the descriptive elements can be stored, sent, received, and displayed by different organizations without losing any meaning. This is done for multiple languages at once, with no need to translate into multiple languages independently.

ISO-22745 also includes guidelines for the use of identification guides (IG). An identification guide is a statement of requirements describing what data is needed about an item. If all elements are included in the description, this IG facilitates the machine-aided analysis of data quality because we have a clear understanding of what data is required without a person having to review the data.

ISO-22745 describes XML formats that can be used to automate the exchange of ISO-8000 master data.

i-xml is used to specify the data requirements or IG.
q-xml is used to query another organization for the data elements specified in the IG.
r-xml is used to reply to requests for specific data elements.
Together, these formats allow for the machine aided exchange of master data.

The Electronic Commerce Code Management Association (ECCMA) provides a very mature OTD, known as eOTD, which contains more than 440,000 terms that can be used to generate descriptions. ECCMA and DATAForge have also formed the Automotive Industry Content Standardization Council (AICSC). The AICSC is here to help organizations move from proprietary methods of managing descriptions to an ISO method that includes working together as an industry to meet the common goal of lowering operating overhead related to catalog maintenance.

Chris Roberts is an associate product manager at DATAForge™ LLC

For more information on AIAG’s activities and initiatives in electronic commerce, visit the AIAG Web Site or contact Mohammad Abidi.

View web publication:
http://aiag.informz.net/admin31/content/template.asp?sid=4762&brandid=4002&uid=0&mi=396973&ptid=415

The Spare Parts World

Tuesday, August 11th, 2009

Spare parts management at a high level is perceived and often approached as a process that should be simple. Looking at it from perspectives of the many different entities that form the supply chain and are required to work together – component manufacturers, tier 1 suppliers, tier 2 suppliers, and manufacturers, the logistical expertise needed to coordinate the information flow is anything but simple.

To realize cost saving from new process efficiencies, these separate legal entities need to “integrate” the information flow to manufacturers and within each manufacturer to internal groups such as purchasing, manufacturing engineering, plant maintenance, facilities management, warehousing, commodity management, and asset sharing / recovery need to share the mission critical master data related to the spare parts. A truly integrated information flow could conceivably touch a number of business units that indirectly work together across the supply chain to deliver just one item to a manufacturer. The most common element needed by (and from) all involved in the supply chain of the spare parts that keeps the equipment running is data standardization, data quality and an electronic method of transmittal. A study of large companies, a majority of which have revenues of more than $1 billion, found that 31% believe that their costs for incorrect data are $1 million or more per year.1

Data standardization and data cleansing cost should be covered with cost saving initiatives. In addition to the initial data cleanup; strong data governance processes should be implemented for on-going data setups.

1Dave Waddington, “Growing Adoption of Master Data Management by Business?” citing an Information Difference survey of 112 companies, 65% of which had revenues of more than $1 billion, IT-Director.com, IT Analysis Communications Ltd., June 23, 2008.

View Jackie Roberts's profile on LinkedIn

Outsourcing; how do I compete?

Tuesday, July 28th, 2009

I get it, you operate globally and the cost of labor in the states is 4 to 5 times higher than the wages in the countries that typically receive outsourced work. I have only one question; is the only factor taken into account when deciding to outsource from the US to a foreign country cost? When the RFP is evaluated does intellectual property protection and security, quality of work product, time zone communication issues, the geopolitical climate or increasing price trends enter into the decision making process?

I once spoke with a purchasing agent employed by a Fortune 500 company and this is how outsourcing was explained to me…”even if takes someone in a foreign low wage country 3 attempts to get the work correct, we are still are saving 25% over their competitors in the US.” Of course, I had a number of responses, including: Was the cost to manage and audit the work 3 times included in the cost saving analysis? Of course not, the cost savings estimate is only documented at the RFP phase.

Each day our company evaluates our internal and customer processes to build automation and intelligent software applications that increase throughput, improve accuracy without manual intervention and provide our customers with a continuous stream of process improvements. I believe long term our cost are competitive, the challenge is educating new customers to understand the unique and beneficial processes that allow them to capitalize long term implementing  our data quality solutions.

My hope is that I will never see another response to an RFP “Need more competitive pricing or to include “off shore” solution – This is required for more competitive proposal and for further consideration”

How long will it take US salaries to race to the bottom so work can be outsourced back to the states? I hope that this is not the answer, let’s discuss what US vendors need to do offer the long term value add processes that off shore options do not?

               View Jackie Roberts's profile on LinkedIn

DATAForgeTM LLC Managing the Automotive Industry Content Standardization Council (AICSC) Through ECCMA

Tuesday, July 21st, 2009

SOLON, OH–(Marketwire – July 21, 2009) – The ECCMA (Electronic Commerce Code Management Association) awarded DATAForge LLC the distinct honor of managing the Automotive Industry Content Standardization Council (AICSC).  Read More…

 

 

What is the Cost of Bad Data?

Friday, July 10th, 2009

How does a company apply a “cost” to bad data when the costs are so fragmented across the organization? There are obvious costs such as a part not being in inventory, purchasing has tried to buy the part but the supplier didn’t recognize the part number, now production is down and everyone is scrambling to find the replacement part. In this case the cost of the bad data can be assigned.

What about the other costs? What does it cost a global manufacturer the lack of visibility of the “spend” or the inability to manage vendors selling like or equivalent products?

It’s estimated that process failures and bad information cost $1.5 trillion or more in the U.S. alone.[i]


[i] Larry English, “Information Quality Tipping Point: Plain English about Information Quality,” DM Review, July 2007.

View Jackie Roberts's profile on LinkedIn

Master Data Ownership

Friday, June 19th, 2009

Master Data ownership is a hot topic these days inside most organizations, large and small. Business or IT?  The correct answer is both! For your companies master data to be managed in a way that is best for the company, your customers, and suppliers it is imperative that both the business and IT units take shared responsibility for its maintenance. James MacLennan states it simply. Who owns master data in your company?

Data Integrity – How is this really achieved?

Thursday, May 21st, 2009

Data integrity is the assurance that data is consistent and correct. Spare parts, sounds fairly simple?
What are the basic elements of a part record; name, part number, description? Data Integrity is used way too much but is a very vague concept. Let’s just look at the purchasing department; it is easy if the part records are only used by the purchasing department where the main objective is to purchase the item. This example is all the data that the buyer will need to purchase this switch.

 untitled

How would a buyer know that these are the same parts? Two different manufacturer names and two different part numbers; this scenario will cause duplication in a purchasing system. The result is the additional work of creating and maintaining two contracts but also cause downstream effects such as excess inventory with more than 1 stocking location, lack of a volume purchase or a global view.

 Question: Is the answer to always to confirm the actual manufacturer and set up supplier references?

View Jackie Roberts's profile on LinkedIn