Saturday, March 26, 2016

Hidden Costs / Hidden Factory

It seems one of the more common refrains within EA is the need to eliminate redundant applications and associated processes.  Every day at work I am reminded of my company’s own redundant applications and wonder why they were allowed to live on.  I’ve decided it must be the cost associated with retaining the legacy version(s) simply cannot be seen; they are too well hidden.  It’s these hidden costs that I want to explore more deeply, with the hope that I can create a sharper message for eliminating redundant applications up front before it’s too late.

Gartner’s IT Glossary (n.d.) defines total cost of ownership as a comprehensive assessment of IT or other costs across enterprise boundaries over time. For IT, this includes:
  • Hardware and software acquisition (hard cost)
  • End-user licenses (hard cost)
  • Management and support (partial allocation)
  • Communications (partial allocation)
  • Opportunity cost of downtime, training and other productivity losses.

The hard costs are easy to determine and should encompass licenses and initial setup, data migration and the like.   The allocations are harder to calculate but still relatively easy.  The opportunity costs are the hardest to calculate, and many organizations may not even try.  However, when forced, some reasonably defendable calculation can probably be devised.  These costs are all visible costs, although it could be argued that the opportunity costs are somewhat hidden because of how difficult they can be to calculate.

Naturally, redundant applications have visible costs and hidden costs.  The visible costs will frequently drive decision making, and, to be a bit cynical, since so many businesses seem to have redundant applications, the visible costs must not be compelling or painful enough to motivate businesses to get rid of them up front.  The decisions to get rid of redundant applications seem to come after years of painful experience.  Maybe if we can understand the hidden costs better, we can change the decision-making dynamic.

Redundant systems require training that is more complicated, longer, and harder for staff to learn, and it requires operational staff that can handle the same processes two different ways (I say the same processes, because the inputs and outputs are the same, it’s just that everything in the middle is different).

This leads to productivity issues, which, if we were so motivated, could probably be quantified.  However, where things really break down and the hidden costs soar, is with the resulting quality issues.  The Lean Six Sigma (LSS) world defines the hidden costs associated with quality issues as the “Hidden Factory”, describing these hidden costs as overhead which is smoothed over or thrown together with other costs, making the cost of the quality defects difficult to identify (SixSigmaTutorial.com, 2011).

These hidden costs still don’t have much meaning or definition to them, so I’ll try to address that based on my observations.  When quality becomes a problem, the following bullets start to frequently repeat themselves.
  • Large amounts of time are spent addressing customer complaints (and some customers are lost).
  • Large amounts of time are spent writing internal emails describing what happened and why.
  • Numerous meetings (from executives on down) are held to discuss why quality is suffering.
  • Tasks forces are established to identify the most acute quality problems.  These meetings are often notoriously unproductive and involve raised voices and lots of finger pointing.
  • Teams of subject-matter experts are assembled to analyze what it will take to retire one of the redundant systems.  They will likely conclude that they do not have the time, so the downward cycle continues.
  • In addition to the external, customer-oriented quality issues, there are numerous internal quality issues which also lead to time-eating emails, investigations, meetings, and improvement initiatives.
    • Inaccurate management reporting due to the multiple systems.
    • Greater likelihood of IT support failures which will lead to Operations not being able to get their work completed on time, overtime, upset employees, and hurried work that will likely lead to more customer quality issues.
    • Finance has redundant invoicing processes, making it tough to keep the books balanced, which requires them to spend a lot of time with IT and Operations conducting investigations into what work was actually completed.

All of the above fit the definition of hidden costs or the LSS term of “Hidden Factory”. Naturally, these costs are never part of the up-front decision to retain a redundant application.  There are methods for measuring the hidden costs associated with quality problems, which I hope to explore in a future posting.  For now, I hope EA teams can start to craft a sharper message, urgently warning of the hidden costs/factory up front, rather than relying on years of painful experiences to force a decision to eliminate the redundancy.

References

Gartner. (n.d.). Gartner IT Glossary. Retrieved from http://www.gartner.com/it-glossary/total-cost-of-ownership-tco

3 comments:

  1. We have worked on APM or application rationalization a fair bit at our organization. I agree, it often seems like a mystery how the redundancy and cost of applications seems overlooked or ignored. One of the mitigating factors I think is "emotional". Your comment on hard and soft dollars made me think about it because "staff" or "people" are also hard dollars. If, for example, you have 2 content management systems and each has an administrator, rationalizing one of those systems away could potentially lead to the difficult conclusion that you don't require as much staff (hard dollars savings). I used to say that, as much as it is difficult, the job of computers is often to eliminate the need for as many workers and I think this is also true of rationalization and lean process engineering.

    The other side of TCO that is often difficult to get agreement on is the true cost of a system. On a shared server, where the hardware is sunk cost, how much does a particular system really leverage. Is it a percentage of the overall cost, or do all systems use the cost of the whole server in their TCO model.

    I have found that those "scare diagrams" that show the inherit complexity around a specific function and why it is difficult to be nimble and responsive to change are helpful. It is also helpful to map to capabilities for showing overlap. Sometimes just having that conversation can highlight that even though a system looks redundant it really isn't.

    I especially appreciate your outlined series of events at the bottom of your post. I've personally seen subject matter experts discuss how impossible it would be to retire existing systems hitting any initiative you're proposing like an Atom Bomb. It's a great point.
    Thanks for sharing!

    ReplyDelete
  2. This is a thoughtful analysis Scott. I have also seen the problem of the costs not being allocated against the appropriate decision makers or what might be called an externality in economics. Even if the TCO to the organization in total is higher from operating an overlapped, inefficient portfolio of applications; it may not necessarily be higher for the individual business units. It is unfortunate that people cannot be expected to 'do the right thing' with respect to the costs to the larger organization, but expecting that would be as Mr. Spock from Star Trek says, "highly illogical."

    We see evidence of this behavior in the establishment of technology standards and the 'policing' efforts from IT. Those efforts are the result of being unable to put those costs back on the stakeholders who might otherwise select common hardware and software to capture economies of scale, etc... Since we do not have the mechanisms in place to allocate those costs effectively, we establish standards.

    The DevOps practice is one attempt to reallocate costs to the right people. Werner Vogels at Amazon is often credited with the "you build it, you run it" model. He says that Amazon found a strong correlation between developers getting paged at 3am, and the sudden increase in software quality!

    Illuminating the TCO is a great, required first step. The follow-up is ensuring that there is a structure to constructively pass along those costs to incentives rational decision makers to make a change.

    ReplyDelete
  3. Scott,

    Great post, I've seen a lot of the issues you've mentioned in a bullet point, but I did not have formal terminology for them, so thanks for including the link to six sigma.

    I think one of the issues is that a lot of managers and supervisors come from a time where we did not have to limit the extra use of resources. The way to get ahead in the market was to have wield your resources more efficiently than your peers. Now, we're moving to a period where organizations must still use their resources efficiently, but that entails sharing their resources with other groups. Sadly, I think most of our issues in EA will go away when the old guard leaves.

    -Larry

    ReplyDelete