What the Growth in Multi-Data Centers Means for Databases and Big Data
A recent article in InfoWorld contained some pretty interesting statistics about the rise and growth of multi-data centers. In their latest poll of data center managers, the Uptime Institute discovered that 80 percent of respondents have built a new data center or upgraded an existing facility within the past five years. Another study of the North American data center market done by Digital Realty Trust found that 92 percent of respondents said their companies will definitely or probably expand their data center space in 2012, which was the highest percentage reported in six years. This news, coupled with the fact that data centers are primarily put in place to hold (gasp!) data, makes it not hard to see that the need for databases that easily span and interact between multiple data centers is only going to escalate, and likely escalate at a rapid clip. But what does a multi-data center database look like? Does it just equate to log shipping, mirroring between data centers, master-slave replication, something else? To be sure, there are use cases where the above options will work just fine, but increasingly, I’m seeing the following short list requirements:
- The ability for a single, logical database to span 1-N datacenters; not just two.
- Multi-directional syncs between data centers; not just one way. Or, put another way: the desire to have truly location independent, read and write anywhere freedom.
- Built in network intelligence so that data is smartly transferred between data centers to minimize bandwidth overload and latency issues.
- The ability to support all key types of data traffic across data centers (e.g. real-time, analytic, search, etc.)