<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments for </title>
	<atom:link href="http://www.dataforge.com/wpblog/index.php/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dataforge.com/wpblog</link>
	<description>Business Intelligence Redefined</description>
	<lastBuildDate>Mon, 14 Mar 2011 23:49:04 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>Comment on The Master Data Management and Governance of Maintenance Data by Garnie Bolling</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/the-master-data-management-and-governance-of-maintenance-data/comment-page-1/#comment-7725</link>
		<dc:creator>Garnie Bolling</dc:creator>
		<pubDate>Mon, 14 Mar 2011 23:49:04 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=464#comment-7725</guid>
		<description>Jackie, awesome post, and this just illustrates the importance that MDM is a growth strategy, not just a point solution for one view of the data... your finance / maintenance scenario plays right into that.  

Sometimes it is easier to go with the low hanging fruit of financial data / meta data, but as time grows, the benefits outside of finance (like your Maintenance example having to know more elements than finance) can be clearly articulated.   Imagine the start of an MDM project with one focus, then growing that data model to include other areas of the business like Maintenance or shipping, channels, vendors, retail locations... 

Thanks for the clear example, scenario.  Really drives home the value of MDM in our world.</description>
		<content:encoded><![CDATA[<p>Jackie, awesome post, and this just illustrates the importance that MDM is a growth strategy, not just a point solution for one view of the data&#8230; your finance / maintenance scenario plays right into that.  </p>
<p>Sometimes it is easier to go with the low hanging fruit of financial data / meta data, but as time grows, the benefits outside of finance (like your Maintenance example having to know more elements than finance) can be clearly articulated.   Imagine the start of an MDM project with one focus, then growing that data model to include other areas of the business like Maintenance or shipping, channels, vendors, retail locations&#8230; </p>
<p>Thanks for the clear example, scenario.  Really drives home the value of MDM in our world.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Hey baby, what is your material type and material status . . . by Tweets that mention Hey baby, what is your material type and material status . . . « -- Topsy.com</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/hey-baby-what-is-your-material-type-and-material-status/comment-page-1/#comment-1266</link>
		<dc:creator>Tweets that mention Hey baby, what is your material type and material status . . . « -- Topsy.com</dc:creator>
		<pubDate>Wed, 16 Jun 2010 05:54:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=415#comment-1266</guid>
		<description>[...] This post was mentioned on Twitter by Jim Harris, Henrik L Sørensen. Henrik L Sørensen said: Reading @JackieMRoberts Hey baby, what is your material type and material status http://bit.ly/8ZVneM #MDM #dataquality #datacleansing #PIM [...]</description>
		<content:encoded><![CDATA[<p>[...] This post was mentioned on Twitter by Jim Harris, Henrik L Sørensen. Henrik L Sørensen said: Reading @JackieMRoberts Hey baby, what is your material type and material status <a href="http://bit.ly/8ZVneM" rel="nofollow">http://bit.ly/8ZVneM</a> #MDM #dataquality #datacleansing #PIM [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Did we forget the old adage &#8220;Garbage In, Garbage Out&#8221; I mean Garbage Extracted, Garbage Migrated by Phil Simon</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/did-we-forget-the-old-adage-garbage-in-garbage-out-i-mean-garbage-extracted-garbage-migrated/comment-page-1/#comment-878</link>
		<dc:creator>Phil Simon</dc:creator>
		<pubDate>Tue, 27 Apr 2010 18:04:36 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=391#comment-878</guid>
		<description>&lt;b&gt;Why invest in a software product if the data is not going to be treated as an asset?&lt;/b&gt;

It&#039;s an excellent question, Jackie. Maybe one day I&#039;ll be smart enough to actually to know the answer.

Give me an older system with good data over a newer one with garbage any day of the week and twice on Sunday.</description>
		<content:encoded><![CDATA[<p><b>Why invest in a software product if the data is not going to be treated as an asset?</b></p>
<p>It&#8217;s an excellent question, Jackie. Maybe one day I&#8217;ll be smart enough to actually to know the answer.</p>
<p>Give me an older system with good data over a newer one with garbage any day of the week and twice on Sunday.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Did we forget the old adage &#8220;Garbage In, Garbage Out&#8221; I mean Garbage Extracted, Garbage Migrated by Ken O'Connor</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/did-we-forget-the-old-adage-garbage-in-garbage-out-i-mean-garbage-extracted-garbage-migrated/comment-page-1/#comment-874</link>
		<dc:creator>Ken O'Connor</dc:creator>
		<pubDate>Tue, 27 Apr 2010 13:20:22 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=391#comment-874</guid>
		<description>Hi Jackie,

Expectation management is one of the challenges of &quot;new systems&quot; and new &quot;software products&quot;.  The marketing material, and the business case always highlights what the new system is, undoubtedly, CAPABLE of doing.

The marketing demo shown to the senior execs, always uses pre-populated sample data, to illustrate the wonderful capabilities of the new system. 

The fact that a new system is CAPABLE of accepting perfect data, and CAPABLE of generating wonderful results, on the basis of perfect data, is no indication of what the new system will ACTUALLY do.      

Most new systems need to be populated by a Data Migration project.  Failing to plan data cleansing of the old data is shockingly common place.     

Then senior execs express surprise when the &quot;new system&quot; fails to live up to their expectations....

Some &quot;new systems&quot; are required for regulatory compliance reasons (Anti Money Laundering, Solvency II, etc. ).  Regulators are beginning to seek evidence of the data population processes - I cite some examples on my blog.  

Rgds Ken</description>
		<content:encoded><![CDATA[<p>Hi Jackie,</p>
<p>Expectation management is one of the challenges of &#8220;new systems&#8221; and new &#8220;software products&#8221;.  The marketing material, and the business case always highlights what the new system is, undoubtedly, CAPABLE of doing.</p>
<p>The marketing demo shown to the senior execs, always uses pre-populated sample data, to illustrate the wonderful capabilities of the new system. </p>
<p>The fact that a new system is CAPABLE of accepting perfect data, and CAPABLE of generating wonderful results, on the basis of perfect data, is no indication of what the new system will ACTUALLY do.      </p>
<p>Most new systems need to be populated by a Data Migration project.  Failing to plan data cleansing of the old data is shockingly common place.     </p>
<p>Then senior execs express surprise when the &#8220;new system&#8221; fails to live up to their expectations&#8230;.</p>
<p>Some &#8220;new systems&#8221; are required for regulatory compliance reasons (Anti Money Laundering, Solvency II, etc. ).  Regulators are beginning to seek evidence of the data population processes &#8211; I cite some examples on my blog.  </p>
<p>Rgds Ken</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Cleansing to Achieve Information Quality by Recently Read: March 22, 2010 &#171; Reblogger Memo Links</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-cleansing-to-achieve-information-quality/comment-page-1/#comment-831</link>
		<dc:creator>Recently Read: March 22, 2010 &#171; Reblogger Memo Links</dc:creator>
		<pubDate>Tue, 30 Mar 2010 18:25:40 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=377#comment-831</guid>
		<description>[...] Data Cleansing to Achieve Information Quality &#8211; Jackie Roberts raises some interesting questions regarding the efforts needed to cleanse data though multiple stages of analytics and processes to achieve appropriate information quality. [...]</description>
		<content:encoded><![CDATA[<p>[...] Data Cleansing to Achieve Information Quality &ndash; Jackie Roberts raises some interesting questions regarding the efforts needed to cleanse data though multiple stages of analytics and processes to achieve appropriate information quality. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Quality Open Issues and Questions? by Recently Read: March 6, 2010 &#171; Reblogger Memo Links</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-quality-open-issues-and-questions/comment-page-1/#comment-830</link>
		<dc:creator>Recently Read: March 6, 2010 &#171; Reblogger Memo Links</dc:creator>
		<pubDate>Tue, 30 Mar 2010 18:23:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=366#comment-830</guid>
		<description>[...] Data Quality Open Issues and Questions? &#8211; Jackie Roberts of DATAForge issues the blogosphere challenge of discussing real-world best practices for MDM, data governance, and data quality.&#160; This blog post received some great comments. [...]</description>
		<content:encoded><![CDATA[<p>[...] Data Quality Open Issues and Questions? &ndash; Jackie Roberts of DATAForge issues the blogosphere challenge of discussing real-world best practices for MDM, data governance, and data quality.&nbsp; This blog post received some great comments. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Cleansing to Achieve Information Quality by William Sharp</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-cleansing-to-achieve-information-quality/comment-page-1/#comment-714</link>
		<dc:creator>William Sharp</dc:creator>
		<pubDate>Thu, 11 Mar 2010 15:57:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=377#comment-714</guid>
		<description>@Jackie &amp; @Ken ... Were you guys on the same projects I was a few years ago?  :)  I was an ETL developer on several data warehousing projects when I developed this philosophy.  I&#039;ve even been asked to reto-ETL a broken warehouse were the original ETL developers forgot to add -- primary keys! I&#039;ve learned to never rule anything out as far as data is concerned. I&#039;ve seen every special character on the keyboard in a dataset before.  Which leads me to believe someone loaded their delimiters.  

I think people take data for granted like we do hardware.  Not to mention, how often and long do people talk about the UI features they want?  It&#039;s more tangible to business users, somehow?

In the end, it&#039;s always going to be there and we need to be slick in the ways we try to prevent it and clean it up when it does happen.</description>
		<content:encoded><![CDATA[<p>@Jackie &amp; @Ken &#8230; Were you guys on the same projects I was a few years ago?  <img src='http://www.dataforge.com/wpblog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />   I was an ETL developer on several data warehousing projects when I developed this philosophy.  I&#8217;ve even been asked to reto-ETL a broken warehouse were the original ETL developers forgot to add &#8212; primary keys! I&#8217;ve learned to never rule anything out as far as data is concerned. I&#8217;ve seen every special character on the keyboard in a dataset before.  Which leads me to believe someone loaded their delimiters.  </p>
<p>I think people take data for granted like we do hardware.  Not to mention, how often and long do people talk about the UI features they want?  It&#8217;s more tangible to business users, somehow?</p>
<p>In the end, it&#8217;s always going to be there and we need to be slick in the ways we try to prevent it and clean it up when it does happen.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Cleansing to Achieve Information Quality by Ken O'Connor</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-cleansing-to-achieve-information-quality/comment-page-1/#comment-713</link>
		<dc:creator>Ken O'Connor</dc:creator>
		<pubDate>Thu, 11 Mar 2010 15:02:38 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=377#comment-713</guid>
		<description>Hi Jackie, 

I completely agree with your reply to William.

The attitude too often taken is: &quot;Our job is to implement the new system, not clean up someone else&#039;s mess&quot;.

Ken</description>
		<content:encoded><![CDATA[<p>Hi Jackie, </p>
<p>I completely agree with your reply to William.</p>
<p>The attitude too often taken is: &#8220;Our job is to implement the new system, not clean up someone else&#8217;s mess&#8221;.</p>
<p>Ken</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Cleansing to Achieve Information Quality by Jackie Roberts</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-cleansing-to-achieve-information-quality/comment-page-1/#comment-712</link>
		<dc:creator>Jackie Roberts</dc:creator>
		<pubDate>Thu, 11 Mar 2010 14:37:15 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=377#comment-712</guid>
		<description>William, I believe that the topic of verification or sometimes referred to a &quot;catalog @ source&quot; is a critical step of data quality but is usually overlooked during the traditional data migration phase of a software implementation which results in bad data moved from a legacy system to a new system . . . only providing the business with limited data accuracy for reporting.</description>
		<content:encoded><![CDATA[<p>William, I believe that the topic of verification or sometimes referred to a &#8220;catalog @ source&#8221; is a critical step of data quality but is usually overlooked during the traditional data migration phase of a software implementation which results in bad data moved from a legacy system to a new system . . . only providing the business with limited data accuracy for reporting.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Cleansing to Achieve Information Quality by William Sharp</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-cleansing-to-achieve-information-quality/comment-page-1/#comment-711</link>
		<dc:creator>William Sharp</dc:creator>
		<pubDate>Thu, 11 Mar 2010 14:05:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=377#comment-711</guid>
		<description>1. &quot;does an address or a product detail meet the requirement if only classified?&quot; - not in my opinion due to the fact that without verification the data cannot become information.

2. should verification at source (contact for address or manufacturer / supplier for product) be required at initial setup of the data in the system or maintenance scheduled as part of the data governance program? - yes, otherwise what are we governing and by what laws/rules.

3. Is the data incomplete? - yes, often address data is incomplete or invalid.  Either is insufficient to become information.

4. Does the MDM process include a question / answer scenario to complete the data? - I believe so.  This is the voting portion of the governance initiative, functioning as the business&#039; chance to manage its own information.

Great post Jackie!!  Keep it up!  I&#039;ll be mulling these questions over and forming a blog post soon!</description>
		<content:encoded><![CDATA[<p>1. &#8220;does an address or a product detail meet the requirement if only classified?&#8221; &#8211; not in my opinion due to the fact that without verification the data cannot become information.</p>
<p>2. should verification at source (contact for address or manufacturer / supplier for product) be required at initial setup of the data in the system or maintenance scheduled as part of the data governance program? &#8211; yes, otherwise what are we governing and by what laws/rules.</p>
<p>3. Is the data incomplete? &#8211; yes, often address data is incomplete or invalid.  Either is insufficient to become information.</p>
<p>4. Does the MDM process include a question / answer scenario to complete the data? &#8211; I believe so.  This is the voting portion of the governance initiative, functioning as the business&#8217; chance to manage its own information.</p>
<p>Great post Jackie!!  Keep it up!  I&#8217;ll be mulling these questions over and forming a blog post soon!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Quality Open Issues and Questions? by Jackie Roberts</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-quality-open-issues-and-questions/comment-page-1/#comment-665</link>
		<dc:creator>Jackie Roberts</dc:creator>
		<pubDate>Wed, 03 Mar 2010 19:10:16 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=366#comment-665</guid>
		<description>Julian, 
Just curious did you maintain the two different naming structures and providing a dynamic referencing system? Were each of the classification home grown or were they public classification structures? 

The classification of maintenance items is a challenge especially when you are working at multiple manufacturing facilities where the terminology and classifications are localized. We set up the standards, provided a clean classified and attributed record verified to the manufacturer of the component to the plant which improved our ability to standardize the MRO data up front. Part of our work with the maintenance and engineering departments was to set up synonym search capabilities which enhance the ability to find a part. 
Jackie</description>
		<content:encoded><![CDATA[<p>Julian,<br />
Just curious did you maintain the two different naming structures and providing a dynamic referencing system? Were each of the classification home grown or were they public classification structures? </p>
<p>The classification of maintenance items is a challenge especially when you are working at multiple manufacturing facilities where the terminology and classifications are localized. We set up the standards, provided a clean classified and attributed record verified to the manufacturer of the component to the plant which improved our ability to standardize the MRO data up front. Part of our work with the maintenance and engineering departments was to set up synonym search capabilities which enhance the ability to find a part.<br />
Jackie</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Quality Open Issues and Questions? by Steve Sarsfield</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-quality-open-issues-and-questions/comment-page-1/#comment-664</link>
		<dc:creator>Steve Sarsfield</dc:creator>
		<pubDate>Wed, 03 Mar 2010 15:28:21 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=366#comment-664</guid>
		<description>If that&#039;s your goal, it&#039;s groundbreaking and important work. Go get &#039;em. Keep a journal, because when it&#039;s all done, you should write the book.</description>
		<content:encoded><![CDATA[<p>If that&#8217;s your goal, it&#8217;s groundbreaking and important work. Go get &#8216;em. Keep a journal, because when it&#8217;s all done, you should write the book.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Quality Open Issues and Questions? by Jackie Roberts</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-quality-open-issues-and-questions/comment-page-1/#comment-663</link>
		<dc:creator>Jackie Roberts</dc:creator>
		<pubDate>Wed, 03 Mar 2010 15:17:50 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=366#comment-663</guid>
		<description>Steve,
Interesting enough, a standardized MRO schema is exactly what my goal is. Currently we classify (name) the MRO items to the manufacturer verified item name and attributes to describe the items. The standardization permits accuracy in purchasing and set up of tasking for maintenance. I have actually had an introductory meeting with Automotive Industry Action Group (AIAG), discussing the options of driving standard MRO naming conventions to support manufacturers, tier 1, tier 2, etc. We are working with ECCMA&#039;s Open Technical Dictionary to drive standard naming conventions and attributes providing a technical description to support not only purchasing but engineering and manufacturing. 

Jackie</description>
		<content:encoded><![CDATA[<p>Steve,<br />
Interesting enough, a standardized MRO schema is exactly what my goal is. Currently we classify (name) the MRO items to the manufacturer verified item name and attributes to describe the items. The standardization permits accuracy in purchasing and set up of tasking for maintenance. I have actually had an introductory meeting with Automotive Industry Action Group (AIAG), discussing the options of driving standard MRO naming conventions to support manufacturers, tier 1, tier 2, etc. We are working with ECCMA&#8217;s Open Technical Dictionary to drive standard naming conventions and attributes providing a technical description to support not only purchasing but engineering and manufacturing. </p>
<p>Jackie</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Quality Open Issues and Questions? by Jackie Roberts</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-quality-open-issues-and-questions/comment-page-1/#comment-662</link>
		<dc:creator>Jackie Roberts</dc:creator>
		<pubDate>Wed, 03 Mar 2010 15:00:10 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=366#comment-662</guid>
		<description>Dave,
If you are participating in Linked In, I have a Data Cleansing User Group to discuss and understand how to improve data quality. I also just created a parallel Google group. The statistics around Data Quality are overwhelming and there are mountains of discussions, white papers and tweets available pertaining to Data Quality, Governance, Profiling and Master Data Management. I think we need to take a step into the details to discuss and understand how to improve data quality by reviewing processes, sharing lessons learned and tips n&#039; tricks of data cleansing. 

Jackie</description>
		<content:encoded><![CDATA[<p>Dave,<br />
If you are participating in Linked In, I have a Data Cleansing User Group to discuss and understand how to improve data quality. I also just created a parallel Google group. The statistics around Data Quality are overwhelming and there are mountains of discussions, white papers and tweets available pertaining to Data Quality, Governance, Profiling and Master Data Management. I think we need to take a step into the details to discuss and understand how to improve data quality by reviewing processes, sharing lessons learned and tips n&#8217; tricks of data cleansing. </p>
<p>Jackie</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Quality Open Issues and Questions? by Steve Sarsfield</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-quality-open-issues-and-questions/comment-page-1/#comment-661</link>
		<dc:creator>Steve Sarsfield</dc:creator>
		<pubDate>Wed, 03 Mar 2010 14:53:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=366#comment-661</guid>
		<description>What makes it so difficult is that business processes ultimately drive many of the things you mentioned, Jackie. So, in your example, the fields included in an MRO project would be driven by the business processes that are in play.  We have to start by asking &quot;what does the business need in order to accomplish its mission?&quot; In my experience from the vendor side of things, the process would not only vary across industry, but vary within an industry.  Ford wouldn&#039;t use the same MRO schema as Chrysler because they have different business processes.
In Julian&#039;s response, you can see how the business processes of two similar companies challenged him - great insight into how processes drive schema.
Ultimately, we&#039;re dependent upon factors like corporate history, business processes, partnerships, the corporation&#039;s position as a leader, and future business expansion plans to help us build a custom job every time.  It certainly doesn&#039;t make the job of a consultant easy.
Steve</description>
		<content:encoded><![CDATA[<p>What makes it so difficult is that business processes ultimately drive many of the things you mentioned, Jackie. So, in your example, the fields included in an MRO project would be driven by the business processes that are in play.  We have to start by asking &#8220;what does the business need in order to accomplish its mission?&#8221; In my experience from the vendor side of things, the process would not only vary across industry, but vary within an industry.  Ford wouldn&#8217;t use the same MRO schema as Chrysler because they have different business processes.<br />
In Julian&#8217;s response, you can see how the business processes of two similar companies challenged him &#8211; great insight into how processes drive schema.<br />
Ultimately, we&#8217;re dependent upon factors like corporate history, business processes, partnerships, the corporation&#8217;s position as a leader, and future business expansion plans to help us build a custom job every time.  It certainly doesn&#8217;t make the job of a consultant easy.<br />
Steve</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Quality Open Issues and Questions? by Dave Pawson</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-quality-open-issues-and-questions/comment-page-1/#comment-659</link>
		<dc:creator>Dave Pawson</dc:creator>
		<pubDate>Wed, 03 Mar 2010 13:18:41 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=366#comment-659</guid>
		<description>So are you going to set up a mailing list Jackie? Google groups or something? I guess the interest is there.

DaveP</description>
		<content:encoded><![CDATA[<p>So are you going to set up a mailing list Jackie? Google groups or something? I guess the interest is there.</p>
<p>DaveP</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on It Is Not So Easy to Build a Data Cleansing Logic by Ken O'Connor</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/it-is-not-so-easy-to-build-a-data-cleansing-logic/comment-page-1/#comment-658</link>
		<dc:creator>Ken O'Connor</dc:creator>
		<pubDate>Wed, 03 Mar 2010 12:33:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=368#comment-658</guid>
		<description>Totally agree Jackie...

Before considering any form of &quot;Transformation&quot;, the &quot;business rules&quot; for the field should be fully understood: 

I would want to have:

1. Business name of the data field(s):
2. Permitted values:
3. Business meaning of each permitted value:
4. Interdependencies with other data:
5. Field precedence:

The &quot;business rules&quot; should be centrally maintained, and visible to the business.
All &quot;Transformation&quot; rules should also be centrally maintained (and not just coded into an ETL script).  
Actual transformations, and exception processing (when input data could not be transformed) should be recorded in an audit trail.   

Regulators will increasingly look for a &quot;Data Audit Trail&quot; - be warned. 

Ken</description>
		<content:encoded><![CDATA[<p>Totally agree Jackie&#8230;</p>
<p>Before considering any form of &#8220;Transformation&#8221;, the &#8220;business rules&#8221; for the field should be fully understood: </p>
<p>I would want to have:</p>
<p>1. Business name of the data field(s):<br />
2. Permitted values:<br />
3. Business meaning of each permitted value:<br />
4. Interdependencies with other data:<br />
5. Field precedence:</p>
<p>The &#8220;business rules&#8221; should be centrally maintained, and visible to the business.<br />
All &#8220;Transformation&#8221; rules should also be centrally maintained (and not just coded into an ETL script).<br />
Actual transformations, and exception processing (when input data could not be transformed) should be recorded in an audit trail.   </p>
<p>Regulators will increasingly look for a &#8220;Data Audit Trail&#8221; &#8211; be warned. </p>
<p>Ken</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Quality Open Issues and Questions? by Ken O'Connor</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-quality-open-issues-and-questions/comment-page-1/#comment-657</link>
		<dc:creator>Ken O'Connor</dc:creator>
		<pubDate>Wed, 03 Mar 2010 11:48:11 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=366#comment-657</guid>
		<description>Jackie,

Count me in too.  

I will be happy to share real world examples from my experience in requirements definition, sourcing, migrating, profiling, structuring, mismatching and re-verifying legacy system data.

I look forward to learning from the experience of others.  Working together we can achieve far more. 

Ken</description>
		<content:encoded><![CDATA[<p>Jackie,</p>
<p>Count me in too.  </p>
<p>I will be happy to share real world examples from my experience in requirements definition, sourcing, migrating, profiling, structuring, mismatching and re-verifying legacy system data.</p>
<p>I look forward to learning from the experience of others.  Working together we can achieve far more. </p>
<p>Ken</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Quality Open Issues and Questions? by Dylan Jones</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-quality-open-issues-and-questions/comment-page-1/#comment-656</link>
		<dc:creator>Dylan Jones</dc:creator>
		<pubDate>Wed, 03 Mar 2010 09:38:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=366#comment-656</guid>
		<description>Hi Jackie

I hear you.

One of my goals for Data Quality Pro was to create a channel for the many practitioners who can&#039;t find the time to get to conferences or even blog about how they are delivering these initiatives on the ground.

I think the DQ/MDM/DG blogosphere has really expanded in the last 12 months but I agree we could do with a lot more case studies and practical accounts.

Shameless plug coming up...

I&#039;m launching www.DQDirections.com late in Q2 this year to provide a virtual presentation platform for many of these case study organisations, many of them I&#039;m sure will find their way out onto the blogosphere in the form of interviews and podcasts.

In the meantime, I&#039;m also keen to gather and collaborate on more &quot;how&quot; stories, theory is great but I agree we need more stories from the coalface.</description>
		<content:encoded><![CDATA[<p>Hi Jackie</p>
<p>I hear you.</p>
<p>One of my goals for Data Quality Pro was to create a channel for the many practitioners who can&#8217;t find the time to get to conferences or even blog about how they are delivering these initiatives on the ground.</p>
<p>I think the DQ/MDM/DG blogosphere has really expanded in the last 12 months but I agree we could do with a lot more case studies and practical accounts.</p>
<p>Shameless plug coming up&#8230;</p>
<p>I&#8217;m launching <a href="http://www.DQDirections.com" rel="nofollow">http://www.DQDirections.com</a> late in Q2 this year to provide a virtual presentation platform for many of these case study organisations, many of them I&#8217;m sure will find their way out onto the blogosphere in the form of interviews and podcasts.</p>
<p>In the meantime, I&#8217;m also keen to gather and collaborate on more &#8220;how&#8221; stories, theory is great but I agree we need more stories from the coalface.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Quality Open Issues and Questions? by Julian Schwarzenbach</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-quality-open-issues-and-questions/comment-page-1/#comment-655</link>
		<dc:creator>Julian Schwarzenbach</dc:creator>
		<pubDate>Wed, 03 Mar 2010 08:37:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=366#comment-655</guid>
		<description>Jackie,

To pick up on one of your questions - classification of maintenance items, I have a real world example that demonstrates just how challenging this can be!
On a recent project I needed to develop the requirements for an asset register for a major rail project. The complication was the fact that different organisations would be managing the assets on completion of the project. The two main organisations have different asset classification structures which are totally incompatible. Due to their different origins, they reflect different classes of assets in different ways, have different levels of granularity, and different &#039;branches&#039; in the tree structure. Additionally, both organisations are continuing to refine their classifications on an ongoing basis, with no synchronisation and no driver to adopt a more consistent approach.

A further complication is that the CAD team use the Uniclass coding system, which is fine for buildings etc. but is light on the particular assets required.

I was able to develop an approach which all parties are happy with, but this did make the overall process more complex than a typical infrastructure project.

Hope this example is of some use.

Julian</description>
		<content:encoded><![CDATA[<p>Jackie,</p>
<p>To pick up on one of your questions &#8211; classification of maintenance items, I have a real world example that demonstrates just how challenging this can be!<br />
On a recent project I needed to develop the requirements for an asset register for a major rail project. The complication was the fact that different organisations would be managing the assets on completion of the project. The two main organisations have different asset classification structures which are totally incompatible. Due to their different origins, they reflect different classes of assets in different ways, have different levels of granularity, and different &#8216;branches&#8217; in the tree structure. Additionally, both organisations are continuing to refine their classifications on an ongoing basis, with no synchronisation and no driver to adopt a more consistent approach.</p>
<p>A further complication is that the CAD team use the Uniclass coding system, which is fine for buildings etc. but is light on the particular assets required.</p>
<p>I was able to develop an approach which all parties are happy with, but this did make the overall process more complex than a typical infrastructure project.</p>
<p>Hope this example is of some use.</p>
<p>Julian</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Quality Open Issues and Questions? by William Sharp</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-quality-open-issues-and-questions/comment-page-1/#comment-651</link>
		<dc:creator>William Sharp</dc:creator>
		<pubDate>Tue, 02 Mar 2010 20:55:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=366#comment-651</guid>
		<description>Jackie,

I&#039;m in!  As the name of my blog implies, The Data Quality Chronicle was setup to do just as you propose; get into the details of how data quality is implemented.  With regard to the specific questions you asked above, I&#039;m most experienced with cleansing/profiling and implementation of CDI with CRM packages (Microsoft Dynamics CRM to be specific).

I look forward to the exchange!

William</description>
		<content:encoded><![CDATA[<p>Jackie,</p>
<p>I&#8217;m in!  As the name of my blog implies, The Data Quality Chronicle was setup to do just as you propose; get into the details of how data quality is implemented.  With regard to the specific questions you asked above, I&#8217;m most experienced with cleansing/profiling and implementation of CDI with CRM packages (Microsoft Dynamics CRM to be specific).</p>
<p>I look forward to the exchange!</p>
<p>William</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Quality Open Issues and Questions? by Henrik Liliendahl Sørensen</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-quality-open-issues-and-questions/comment-page-1/#comment-649</link>
		<dc:creator>Henrik Liliendahl Sørensen</dc:creator>
		<pubDate>Tue, 02 Mar 2010 20:01:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=366#comment-649</guid>
		<description>Jackie, I am with you on this one.

No doubt that we need discussions on why data governance, master data management and data quality is important – and sure we actually also have that in the blogosphere.

Also we need to exchange ideas on how to establish ROI for these disciplines. That exist too.

But we certainly also need to share concrete opinions, experiences and observations on exactly how to achieve improvements – may they be of general nature or be specific to the single entity types and lines of business.

This will actually help in going round the circle. If we know how to solve the issues we will be much better in clarifying the investment part of the ROI. In doing that we will see that most people actually already know that data quality etc. is important, but many people, not at least senior management, don’t want to go to action if there is no facts for how to succeed.

In the realm of data quality tools and services there seem to be some trend of having it look like a kind of magic sorcery. It is a kind of an art – but as you have pointed out, it is mostly a question of ensuring efficient knowledge driven business processes, having the right data classifications and metadata definitions, activating translations and so on.

There is a whole world of real world examples we can share.</description>
		<content:encoded><![CDATA[<p>Jackie, I am with you on this one.</p>
<p>No doubt that we need discussions on why data governance, master data management and data quality is important – and sure we actually also have that in the blogosphere.</p>
<p>Also we need to exchange ideas on how to establish ROI for these disciplines. That exist too.</p>
<p>But we certainly also need to share concrete opinions, experiences and observations on exactly how to achieve improvements – may they be of general nature or be specific to the single entity types and lines of business.</p>
<p>This will actually help in going round the circle. If we know how to solve the issues we will be much better in clarifying the investment part of the ROI. In doing that we will see that most people actually already know that data quality etc. is important, but many people, not at least senior management, don’t want to go to action if there is no facts for how to succeed.</p>
<p>In the realm of data quality tools and services there seem to be some trend of having it look like a kind of magic sorcery. It is a kind of an art – but as you have pointed out, it is mostly a question of ensuring efficient knowledge driven business processes, having the right data classifications and metadata definitions, activating translations and so on.</p>
<p>There is a whole world of real world examples we can share.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Before or After Headache Business Decisions by Cheap Breakdown Cover</title>
		<link>http://www.dataforge.com/wpblog/index.php/chris-roberts/before-or-after-headache-business-decisions/comment-page-1/#comment-535</link>
		<dc:creator>Cheap Breakdown Cover</dc:creator>
		<pubDate>Thu, 18 Feb 2010 07:09:57 +0000</pubDate>
		<guid isPermaLink="false">http://websrv1/wpblog/?p=30#comment-535</guid>
		<description>Super-Duper search engines internet site! I am loving it!! Will arrive again again - obtaining you feeds also, Provides Many thanks.</description>
		<content:encoded><![CDATA[<p>Super-Duper search engines internet site! I am loving it!! Will arrive again again &#8211; obtaining you feeds also, Provides Many thanks.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Data Integrity &#8211; How is this really achieved? by Kenneth Izarraras</title>
		<link>http://www.dataforge.com/wpblog/index.php/jackie-roberts/data-integrity-how-is-this-really-achieved/comment-page-1/#comment-518</link>
		<dc:creator>Kenneth Izarraras</dc:creator>
		<pubDate>Wed, 17 Feb 2010 17:45:50 +0000</pubDate>
		<guid isPermaLink="false">http://websrv1/wpblog/?p=26#comment-518</guid>
		<description>Found your site on Google and I thought that I would drop by and take a look. Nice work!</description>
		<content:encoded><![CDATA[<p>Found your site on Google and I thought that I would drop by and take a look. Nice work!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on It&#8217;s Complicated by Marco Polo</title>
		<link>http://www.dataforge.com/wpblog/index.php/chris-roberts/its-complicated/comment-page-1/#comment-476</link>
		<dc:creator>Marco Polo</dc:creator>
		<pubDate>Thu, 11 Feb 2010 00:30:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.dataforge.com/wpblog/?p=186#comment-476</guid>
		<description>This is a strongest place for such kind of articles, your site is a inspiration for me. i got so much benefits and nice results after visiting here and the grace is increasing day by day in your posts. The above information is extremly essential.</description>
		<content:encoded><![CDATA[<p>This is a strongest place for such kind of articles, your site is a inspiration for me. i got so much benefits and nice results after visiting here and the grace is increasing day by day in your posts. The above information is extremly essential.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

