Special Coverage Series

Network Computing

Commentary

Howard Marks, Network Computing Blogger

Solving VDI Problems with SSDs and Data Deduplication

When it comes to VDI, users don’t want to sacrifice a rich desktop experience, and IT doesn’t want to get crushed by the storage costs and management efforts required to provide that experience. Data dedupe and SSDs solve this dilemma.

It's been amusing to watch numerous storage vendors announce that they, and they alone, have the magic feature (derived from unicorn tears) that will allow you to build a VDI environment to satisfy two cranky constituencies: the users, who will break out torches and pitchforks if their thin-client experience is inferior to their old PCs, and the bean counters, who've drunk the Kool-Aid® and think VDI will let them save capex and opex.

In fact, you don't need unicorn tears to make this work (you can save them for SDN). Data deduplication and solid state drives may be all the magic required.

Storage makes up a significant fraction of the cost of most VDI projects. To help contain storage costs, most VDI implementations use linked clones, which reduce the storage capacity that any given set of virtual desktops will consume.

Linked clones are a great solution for non-persistent desktops, which are the low-hanging fruit of VDI workloads. But once you move past the call centers, school computer labs, hospitals and other environments where non-persistent desktops make sense, linked clones become less and less attractive.

Knowledge workers have a more intimate relationship with their personal computers than people who run one or two applications on a shared PC. They want their Hello Kitty wallpaper and the 300 icons on their desktops, which frankly can be managed as part of their roaming profile or persona. They also want to be able to install new applications once, like the WebEx plug-in for their favorite browser, so they don't have to do it every time they join a conference.

You can give users a persistent desktop image with linked clones, but you have to give up most of the advantages of linked clones to do it. The disk space savings fade away as each user's linked clone grows with all the changes from the "golden master" over time.

Consider that in VMware's internal VDI implementation, the average user's clone grows by 1 Gbyte a week. Linked clones typically save 30 to 60 Gbytes per user, but if a clone grows by 1 Gbyte a week, those savings are gone in 30 to 60 weeks. In about a year, your linked clone implementation will actually take more space than if you had just created full clones for all the users.
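The break-even arithmetic is simple enough to sketch. The snippet below is only an illustration: the function name is made up for this example, and it assumes the clone grows at a constant rate, which real desktops won't do exactly.

```python
def weeks_to_break_even(savings_gb: float, growth_gb_per_week: float) -> float:
    """Weeks until a linked clone's growth has eaten its initial space savings.

    Assumes constant growth, which is a simplification.
    """
    if growth_gb_per_week <= 0:
        raise ValueError("growth must be positive")
    return savings_gb / growth_gb_per_week


# The 30-60 Gbyte savings range and 1 Gbyte/week growth come from the article.
for savings in (30, 60):
    weeks = weeks_to_break_even(savings, 1.0)
    print(f"{savings} Gbytes saved: break-even after {weeks:.0f} weeks")
```

Plug in your own measured growth rate; a slower-growing population pushes the break-even point out, a faster one pulls it in.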

The other big advantage of linked clones is that updates can be posted once to the "golden master," rather than having to be installed on each desktop image. To apply Tuesday's patches, just update the master and recompose the linked clones to include the changes.

The problem is that recomposing the desktops discards the delta file that made each user's linked clone different from the master, so every user's clone reverts to the state of the master. Users then have to reinstall the applications and browser plug-ins that made their desktops comfortable.
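To make the recompose problem concrete, here is a toy model of a linked clone as a shared master plus a private delta. It is a sketch only: the class and method names are hypothetical, it works on named files in a dictionary rather than on disk blocks, and it is not how any vendor's product is actually implemented.

```python
from dataclasses import dataclass, field


@dataclass
class LinkedClone:
    """Toy model: a clone is a shared master plus a private delta of changes."""
    master: dict                                # shared "golden master" (name -> content)
    delta: dict = field(default_factory=dict)   # this user's changes only

    def write(self, name, content):
        # User changes land in the delta; the master is never touched.
        self.delta[name] = content

    def read(self, name):
        # The delta overrides the master when both hold the same name.
        return self.delta.get(name, self.master.get(name))

    def recompose(self, new_master):
        # Pick up the patched master and throw away the user's delta.
        self.master = new_master
        self.delta = {}


clone = LinkedClone(master={"os.dll": "v1"})
clone.write("webex_plugin", "installed")
before = clone.read("webex_plugin")

clone.recompose({"os.dll": "v2"})
print(before, clone.read("os.dll"), clone.read("webex_plugin"))
```

After the recompose the clone sees the patched file from the new master, but the plug-in the user installed is gone, because it lived only in the discarded delta.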

I know of one university where the faculty and staff revolted after the IT group recomposed their desktops, and forced IT to limit recomposition to once a semester. Because those systems couldn't go a whole semester without security patches, IT had to go back to installing patches on individual desktops.

Data Dedupe: A Better Way

A better solution is to recognize that linked clones are a primitive mechanism for doing periodic data deduplication, and to replace that mechanism with a more sophisticated deduplication technology in the storage system.

As I discussed in a now-classic post, "Data Deduplication And SSDs: Two Great Tastes That Taste Great Together," the performance problems deduplication can create on disk storage don't apply to solid state storage.

That means we can use data deduplication to store the common data, like WINSOCK.DLL, across all our desktops just once. All-solid-state storage systems with deduplication, like those from Pure Storage and GreenBytes, or hybrids from Tintri or Tegile, can dedupe data and still deliver sufficient performance to support thousands of VDI users.
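The idea of storing common data just once can be sketched as a content-addressed block store. This is a toy under stated assumptions, not any vendor's design: the `DedupStore` class, the fixed 4,096-byte block size, the SHA-256 fingerprints and the synthetic desktop images are all choices made for this example.

```python
import hashlib


class DedupStore:
    """Toy content-addressed store: identical blocks are kept only once."""

    def __init__(self, block_size=4096):
        self.block_size = block_size
        self.blocks = {}   # fingerprint -> block bytes (stored once)
        self.images = {}   # image name -> ordered list of fingerprints

    def write_image(self, name, data: bytes):
        recipe = []
        for i in range(0, len(data), self.block_size):
            block = data[i:i + self.block_size]
            digest = hashlib.sha256(block).hexdigest()
            self.blocks.setdefault(digest, block)   # keep only the first copy
            recipe.append(digest)
        self.images[name] = recipe

    def read_image(self, name) -> bytes:
        return b"".join(self.blocks[d] for d in self.images[name])

    def physical_bytes(self) -> int:
        return sum(len(b) for b in self.blocks.values())


# 100 full clones: ten blocks of shared "OS" data plus one unique block each.
common = b"".join(bytes([i]) * 4096 for i in range(10))
store = DedupStore()
for user in range(100):
    unique = f"user-{user}".encode().ljust(4096, b"\0")
    store.write_image(f"desktop-{user}", common + unique)

logical_bytes = 100 * 11 * 4096
print(f"logical: {logical_bytes} bytes, physical: {store.physical_bytes()} bytes")
```

Each desktop is written as a full clone, yet the shared blocks occupy space only once, and every image still reads back in full. Unlike a linked clone, nothing has to be discarded when one desktop's blocks change.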

Full clones on deduplicated storage can be managed with the same tools you use to manage your physical desktops and give the users the same rich experience they're used to.

Disclaimer: Data Domain (the originator of data deduplication), GreenBytes and Tegile are or have been clients of DeepStorage, LLC.


