The Real Problem with Data Center Efficiency
In the past week, The New York Times (and Greenpeace previously) called attention to the inefficiency in data centers. These stories bring light to a serious issue, but are a bit misguided and do not include the whole story. Generally speaking, are data centers inefficient? Absolutely! Read on to fully understand the significance of the situation, the reasons why they are inefficient and opportunities that lie ahead.
Data centers are large consumers of power. According to a 2007 U.S EPA report, data centers in 2006 accounted for a full 1.5% of the US energy consumption. That number was expected to double to 3% by 2011. At the time, 38% was attributable to the nation’s largest data centers. However, these numbers do not represent the entire footprint of data centers. Smaller facilities, closets and lab spaces were not included in the study. From experience, these represent a significant aggregate footprint.
Organizations like The New York Times and Greenpeace have called out the issues around inefficiency in data centers. Good points are made, but the focus is misdirected and backfiring. The companies called out in their reports are operating some of the most efficient data centers on the planet. So, what’s the problem? The vast majority of data centers operated by everyone else.
In my post The Future Data Center Is… Part II, I breakdown the importance of understanding differences between SMB, Mid-Tier, Enterprise and Very Large Enterprise data centers. There is a very significant difference between the different tiers, their requirements and ability to run an efficient data center. My good friend and fellow Data Center Pulse board member Mark Thiele provided additional color in his SwitchScribe post Measuring the Size of a Data Center – Yes, it Matters. Unfortunately, the majority of articles focus on the Very Large Enterprise data centers. These facilities have very specific requirements and do not represent the vast majority of data centers in use today.
Sadly, the SMB, mid-tier and enterprise (to a smaller degree) data centers are some of the most inefficient operations on the planet. And that doesn’t account for the closets and rooms that house IT equipment. Is there something to learn from the larger enterprises? Absolutely. Should (or can) the rest of data center operators mimic them? No. The very large enterprises are getting more and more efficient every day. The SMB, mid-tier and enterprise data centers (to a smaller degree) are simply not able to keep up.
Different Purposes of Data Centers
In my post A Workload is not a Workload, is not a Workload, I delve into the differences that drive data center architecture and operation. The fundamental premise behind the story is that the larger providers have two things that differentiate them from the rest.
- Monolithic Application: Organizations like eBay, Zynga, Google and Facebook run very specialized applications. These apps (and the infrastructure they run on) are highly tuned. In most other organizations, the workload is very mixed and presents challenges (if not impossible) to effectively tune.
- Scale: The same companies run their monolithic applications at web-scale, which is much larger than typical enterprise applications. The very nature of the scale requires and allows specialization in the tuning of the application and related infrastructure.
These two factors lead to a much different situation than exists in the typical enterprise, mid-tier or SMB environment.
Beyond the technical specifics, the organizational complexities cannot go unmentioned. For the above-mentioned companies, they have teams of people with specific jobs dedicated to supporting the data center and its operation. This dedication allows specialization that drives further efficiency in the facility and operations. The staff understands the PUE of the data center and how each specific each tweak impacts the number. They often run their facilities as close to maximum efficiency and capacity as possible. Why? They clearly understand the business impact.
The typical enterprise, mid-tier or SMB presents a much different situation. In this case, the responsibility of the data center is often one of many requirements in a person’s job description. That’s on top of everything else. These organizations simply don’t have the scale to justify specialization of data center operations.
There is no question that data centers are huge energy consumers. That will not change. The opportunity is to run more efficient facilities that leverage renewable energy sources. Some larger (and newer) facilities are being located near renewable power sources. Yahoo, VMware and others recently built data centers in Wanatchee, WA near the Grand Coulee dam’s hydroelectric power source. In other cases, wind and solar farms are being built near larger data centers.
It should be noted that these decisions do not always make good business sense. Renewable energy sources are not available at reasonable costs everywhere. And moving data centers near renewable power sources is not always feasible either. Staffing, backbone network connectivity and a host of other factors influence the decision. Regardless of the interest in social and environmental responsibility, a data center is an expensive business asset that requires analysis of many factors.
Other issues present challenges for data centers including costs, knowledge, legacy applications, governance and security. Data centers are complex ecosystems that require attention, understanding and specialized management. Over time, data centers are getting more complex…not simple. There needs to be an appreciation and acceptance to these issues.
In summary, the very large data center operators are running some of the most efficient facilities and operations. The data center (and broader IT) industry needs to learn from their examples. However, because of articles in the NYT and Greenpeace calling out their flaws, there is growing hesitation to share what they’re doing for fear of bad PR. Can you blame them? But leaders like Dean Nelson (VP at eBay and fellow Data Center Pulse board member) are still fighting the trend in order to benefit the industry. The data center and IT industry needs an environment where ideas and experiences can be freely shared without concern of misguided criticism in the mass media.
Data centers are huge consumers of energy. Their demand is increasing and not expected to shrink. So, what can be done to address the issues? There are several immediate changes that are needed.
- Strategic Differentiation: Organizations need to take a hard look at their own focus. Is the company in the data center business? Is the company willing to invest in the data center to truly run it efficiently? Is the data center a strategic differentiator? For the vast majority of companies currently running data centers the clear answer to this will be no.
- Efficient Facilities: The very large enterprises are already running efficient facilities and driving hard toward greater efficiency. Let’s encourage them to continue to do so! Those not able to run at this level of efficiency need to stop running their own facilities. SMB, mid-tier and some enterprise organizations need to develop plans to eliminate their own data centers and leverage more efficient ones. Consider colocation, hosted infrastructure and other options.
- Efficiency Programs: Several power utilities have offered programs with incentives to offset costs to implement energy efficient solutions. The problem is that programs were not consistent across power utilities and some do not offer the programs. The industry needs a consistent program similar to the National Data Center Power Reduction Incentive Program proposed by Data Center Pulse in 2009.
- Virtualization: Virtualization is not new and a very mature technology. However, the adoption rates are still very anemic. Depending on which analyst organization you believe, the numbers average roughly 30-40%. This isn’t the whole story though. From experience, organizations sit at opposite ends by being either heavily virtualized or virtualizing only a few servers. The excess, unused capacity is simply wasteful. There are a number of reasons for this including cost, legacy applications, knowledge, fear, and an already overwhelming plate of issues to address.
- Cloud Computing: The cloud services market is maturing very quickly. Even highly regulated industries with compliance requirements are leveraging cloud computing for even their most sensitive applications. Cloud based services is a good solution to leverage where possible.
In summary, let’s commend those that are trying hard to help our industry move forward. Let’s bring the focus to the real issues preventing the efficient operation of data centers. There are a number of viable and immediate solutions available today to help the larger contingent of organizations.
- Glanz, James. “Power, Pollution and the Internet.” The New York Times. 22 Sep 2012. <
- Glanz, James. “Data Barns in a Farm Town, Gobbling Power and Flexing Muscle.” The New York Times. 23 Sep 2012. <
- Fehrenbacher, Katie. “NYT’s data center power reports like taking a time machine back to 2006.” GigaOM. 24 Sep 2012. <
- Weinman, Joe. “The Power of IT (it’s not all in energy consumption).” GigaOM. 26 Sep 2012. <
- “How Clean is Your Cloud?” Greenpeace. 17 Apr 2012. <
- “National Data Center Power Reduction Incentive Program.” Data Center Pulse. 6 May 2009. <
- “The Green Grid Power Efficiency Metrics: PUE & DCiE.” The Green Grid. 23 Oct 2007. <
- “Report to Congress on Server and Data Center Energy Efficiency.” U.S. Environmental Protection Agency. 2 Aug 2007. <
- Thiele, Mark. “Measuring the Size of a Data Center – Yes, it Matters.” SwitchScribe. 30 Jan 2012. <