Coming clean on dirty data

Share |

It's pretty certain that most CSPs are mindful that their data is neither clean nor consistent across their operations. Anyone embarking in any form of transformation project finds data cleansing the first major hurdle. It is therefore good to see that Subex and Heavy Reading conducted some reasonably in-depth research into the problem by interviewing 95 operators worldwide.

Conducting research on probably what Subex, and most other vendors of RA and data cleansing technology already know, does highlight just how serious the problem is. After all, they can't exactly publicize their customer's issues, can they? Analysys Mason also conducted research earlier this year and determined the market for Data Integrity Management, the fancy name for cleaning up 'dirty' data, will reach US$137M by 2014 from US$98M in 2010.

For many years the preferred or default method of measuring data quality and  consistency was a series of manual ad hoc audits. With the introduction of revenue assurance processes and automated tools, CSPs became exposed to data anomalies and process shortfalls they had not expected. RA vendors grasped this information to repackage and enhance their products in order to capture part of the extended data integrity market, Subex with its aptly named ROC Data Integrity Management solution.

Part of any RA discovery is to determine that order to bill processes are complete and accurate and anybody that has been involved in one of these exercises will tell you that the order process, often only partly automated, is an area of prime concern. That was also one of the key findings of the report. It found that six out of 10 operators (of the 95 that were interviewed) experienced up to 30 percent order fallouts. Of those, 40 per cent admitted that over 10 per cent of the fallouts were due to inaccurate data.

Approximately 50 per cent of operators still use ad hoc manual audits to keep inventory data in sync with the network. The message must be sinking in because the survey found that a majority of operators plan to invest in discovery and reconciliation solutions within the next 12 months. Chief amongst the reasons for this is a large discrepancy between an operator’s inventory data and what’s actually on their network. The survey estimates that 40 per cent of operators have 20 per cent of their inventory out of synch with their network. That's a serious issue when you consider the cost of network elements like switches and routers and way too much capex investment to leave sitting there as 'stranded assets'.

As European Communications commented, aside from the obvious savings and improvements to customer service, there is a wider shift that sees data as perhaps the most important commodity in a connected world. What use is a heavy investment in data analytics and business intelligence gathering if the core data is rubbish?

Aside from the obvious savings and improvements to customer service, there is a wider shift that sees data as perhaps the most important commodity in a connected world. Operators have an opportunity to play a central role, but they will only be as strong as their data is clean.

The really surprising thing about dirty data, apart from the time it has taken for someone to research it adequately, is that every CSP knows it us an issue. Perhaps denial is part of our cultural DNA, just as the first RA proponents discovered. Time to get those heads out of the sand? Most definitely!


Posted 12-05-2011 8:12 AM by The Insider

Comments

Michele Drgon wrote re: Coming clean on dirty data
on 12-05-2011 10:09 PM

Excellent post!

Just to add a Cloud spin to this: when the US White House issued their Cloud First initiative over a year ago, which included a mandate to consolidate data centers, one of the major hurdles to meeting the 18-month deadline, was that the entire ETL/Data Governance piece was, in itself, a major task.

When you think about the numerous attempts just by enterprises to  consolidate their federated data stores into  "single source or truth" data centers  - and the number of these efforts that have failed due to budgetary and time constraints -  it is definitely highly-estute to underscore the magnitude of this issue of "dirty data" as outlined in the post above.

Comparing what could be considered  "easier" pan-enterprise efforts to consolidate data through data quality processes against the myriad scenarios that we can expect when enterprises, governments and CSPs broaden their data governance strategies into the cloud ecosystem, one can indeed see the critical role that data integrity plays.

Additionally, depending upon where the cloud hosting service resides, many of the countries have Data Protection/Privacy laws that include data integrity and quality as crotoca; Privacy Principles mandated in regional (EU, APEC, Canada, US etc.) as well as country-level DPA legislation. Additionally, the HIPAA variables that must be anonymized in healthcare databases is often cleaned up during this ETL or consolidation process. So what IF a healthcare provider procures cloud services (including brokerage) from a CSP? Are there finite times when new data will be added to the cloud after it has been cleansed? Or are these continual transactions which run the risk of enabling "dirty data" to get back into the "one source of truth" database if it's hosted in the cloud?

Just points to ponder from the Cloud viewpoint... Thank you for highlighting such an important point!

Michele

We welcome your feedback! To comment on this blog post please either Log-In or Register to the TM Forum Community

Paid Advertisement
About TM Forum
Introduction, History, Board, Management Team...
Membership
How to Join, Benefits, Member List...
Community
Community Home, Groups & Teams, Blogs...
Conferences
Event Calendar, Management World, Supported Events...
Training & Webcasts
Upcoming Training Courses, Upcoming Webinars, Podcasts, On-Demand Webcasts...
Initiatives
Cable, Enabling Cloud Services, Government and Defense...
Best Practices & Standards
Frameworx, Business Process Framework (eTOM), Information Framework (SID)...
Resources
Document Library, Case Studies, White Papers
Research & Publications
Business Benchmarking, Newsletters, Insights Research...
Copyright © 1988-2012, TeleManagement Forum. All Rights Reserved
Contact Us
Careers with TM Forum
News Room
Privacy Policy
Terms of Use
Sitemap