Six Points to Consider for Disaster Recovery and Big Data

February 07, 2013
With all the attention companies pay today to collecting more data to analyze, not enough business and IT leaders are considering the critical issues of backup and disaster recovery and the big data question: What if it breaks?

Any time a company loses data, it creates difficulties. But the stakes are higher when big data is involved, simply because of its sheer volume, variety and velocity. If a natural disaster strikes or your data is corrupted and you lose that information, the cost to your business could prove devastating. If a cybercriminal breaches your defenses, it could also devastate your revenue and your reputation.

Consider what it would cost you if your online business suddenly went down for a day. Amazon is estimated to have lost about $30,000 a minute when its site went down for two hours in June 2008, according to media reports at the time. Just imagine the loss today, when Amazon's annual revenue (more than $61 billion at the end of 2012) is roughly three times what it was at the end of 2008.

These stakes are reason enough for IT chiefs to re-evaluate their companies’ data-recovery plans to address any loss of big data.  Here are six points to consider when evaluating or establishing data-recovery policies and procedures that cover big data.

1. Each situation is different. Remember that each disaster recovery plan is company- and industry-specific. You must consider your unique needs and business requirements. Use proven Business Impact Analysis (BIA) tools to assess the impact of losing your big data application and its data. For some companies, big data is a mission-critical application that requires high levels of uptime and data retention (such as a large bank's fraud detection application). For others, the big data app doesn't require that level of uptime (for example, a retailer's consumer sentiment application on Twitter and Facebook).
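
To make the BIA exercise concrete, here is a minimal sketch in Python that maps each big data application to a coarse recovery tier based on an assumed hourly downtime cost and a tolerance for data loss. The applications, thresholds, and tier names are illustrative assumptions, not a standard.

```python
# Minimal BIA-style sketch: map each big data application to a recovery tier
# based on an assumed hourly downtime cost and tolerable data loss.
# Tier names, thresholds, and example apps are illustrative assumptions.

from dataclasses import dataclass

@dataclass
class Application:
    name: str
    downtime_cost_per_hour: float          # estimated loss per hour of downtime
    max_tolerable_data_loss_hours: float   # how much recent data you can afford to lose

def recovery_tier(app: Application) -> str:
    """Assign a coarse recovery tier from the impact estimates."""
    if app.downtime_cost_per_hour >= 100_000 or app.max_tolerable_data_loss_hours <= 1:
        return "Tier 1: near-zero RTO/RPO (replication, hot standby)"
    if app.downtime_cost_per_hour >= 10_000 or app.max_tolerable_data_loss_hours <= 24:
        return "Tier 2: same-day restore (frequent disk backups)"
    return "Tier 3: best-effort restore (periodic tape/cloud archive)"

apps = [
    Application("fraud-detection", downtime_cost_per_hour=500_000, max_tolerable_data_loss_hours=0.25),
    Application("social-sentiment", downtime_cost_per_hour=2_000, max_tolerable_data_loss_hours=72),
]

for app in apps:
    print(f"{app.name}: {recovery_tier(app)}")
```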

2. No overhaul required. Follow the general principles of data recovery and don’t abandon your existing framework for business continuity and information lifecycle governance. For some businesses, big data will prove a business-critical issue but, for others, it won’t be as crucial. Determine the proprietary nature—your intellectual property—of that data. And just as your standard disaster-recovery process involves understanding the impact of losing data on your business, big data requires the same levels of specificity.

3. Use the 80/20 rule. Admit it: not all those petabytes of data are that critical. So, if your big data is breached, what do you want to make sure stands the best chance of being secure? (It's costly to "insure" so much data.) This is where you can apply the 80/20 rule, which says that 20 percent of your data accounts for 80 percent of the value. It won't always be the best measure, since even the loss of a small amount of data could have an enormous effect on, say, the output of an analytical process or a company's reputation. But if you use the familiar rule, you must determine which 20 percent of your data is crucial to protect.
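
One way to put the 80/20 rule to work is sketched below: assign each data set an assumed business-value score, then protect the smallest group of data sets whose combined value reaches 80 percent of the total. The data set names and scores here are hypothetical.

```python
# Sketch of applying the 80/20 rule to backup priority.
# Each data set gets an assumed business-value score (hypothetical numbers);
# we protect the smallest group of data sets covering 80 percent of total value.

datasets = {
    "customer-transactions": 55.0,
    "fraud-model-features": 25.0,
    "clickstream-raw": 10.0,
    "social-media-archive": 6.0,
    "old-log-files": 4.0,
}

def top_value_datasets(values: dict[str, float], target_share: float = 0.80) -> list[str]:
    """Return data sets, highest value first, until target_share of total value is covered."""
    total = sum(values.values())
    covered, selected = 0.0, []
    for name, value in sorted(values.items(), key=lambda kv: kv[1], reverse=True):
        if covered / total >= target_share:
            break
        selected.append(name)
        covered += value
    return selected

print("Protect first:", top_value_datasets(datasets))
# -> the few data sets that carry roughly 80 percent of the value
```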

4. Consider the value window. IT leaders must gauge how long the big data they're storing will be of value. It depends, of course, on what you're gathering and how you're using it. If it's weather information, perhaps you retain data for years. If it's social media data that tracked one event, you probably don't need to retain much beyond the essentials.
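
A value window can be enforced with a simple retention check like the sketch below; the per-source windows and the default are assumptions for illustration.

```python
# Sketch of a value-window check: decide whether a record is still worth retaining.
# Retention windows per data source are illustrative assumptions.

from datetime import datetime, timedelta, timezone

RETENTION = {
    "weather-observations": timedelta(days=365 * 10),  # long-term analytical value
    "event-social-media": timedelta(days=30),          # short-lived, event-specific data
}

def should_retain(source: str, created_at: datetime, now: datetime | None = None) -> bool:
    """True if the record is still inside its source's retention window."""
    now = now or datetime.now(timezone.utc)
    window = RETENTION.get(source, timedelta(days=90))  # assumed default window
    return now - created_at <= window

record_time = datetime(2013, 1, 1, tzinfo=timezone.utc)
check_time = datetime(2013, 2, 7, tzinfo=timezone.utc)
print(should_retain("event-social-media", record_time, now=check_time))    # False: past 30 days
print(should_retain("weather-observations", record_time, now=check_time))  # True: kept for years
```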

5. Choose the right medium. What format and media will you use to store your data? Disk or tape? Cloud or on-premises? Raw or de-duplicated? Key to this choice is restore speed. The least expensive method is offsite, on tape, and de-duplicated. Of course, you pay the tax of having to wait days to restore your data. Can you wait that long?
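
The trade-off comes down to rough arithmetic: restore time is approximately data size divided by effective restore throughput. The sketch below uses assumed, order-of-magnitude throughput figures, not vendor benchmarks.

```python
# Back-of-the-envelope restore-time estimate: hours ~= data size / effective throughput.
# Throughput numbers are assumed, order-of-magnitude placeholders, not benchmarks.

RESTORE_THROUGHPUT_TB_PER_HOUR = {
    "local disk": 4.0,
    "cloud over WAN": 1.0,
    "offsite tape (incl. recall and transport)": 0.5,
}

def restore_hours(data_tb: float, medium: str) -> float:
    """Estimated hours to restore data_tb terabytes from the given medium."""
    return data_tb / RESTORE_THROUGHPUT_TB_PER_HOUR[medium]

data_tb = 50  # hypothetical size of the critical slice of a big data store
for medium in RESTORE_THROUGHPUT_TB_PER_HOUR:
    print(f"{medium}: ~{restore_hours(data_tb, medium):.0f} hours")
```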

6. Choose the right format. Is the data cleansed or raw? Summarized? Aggregated or non-aggregated? Here is a simple example that shows the power of summarizing: if you gather one sample of data every second for one year, you have more than 31 million records. Summarizing that data into hourly averages, min/max, and standard deviation leaves fewer than 40,000 summary records, a reduction of roughly 99.9 percent; doing the same for each day leaves fewer than 1,500, a reduction of more than 99.99 percent. Do you really need all of that detail?
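
The record-count arithmetic is easy to verify; the sketch below rolls one day of synthetic per-second readings (an assumed input for illustration) into hourly summary statistics and reports the reduction, and a full year works the same way.

```python
# Sketch: roll per-second readings up into hourly summary statistics
# (mean, min, max, standard deviation) and measure the record-count reduction.
# Uses one day of synthetic readings as an assumed, illustrative input.

import random
import statistics

SECONDS_PER_HOUR = 3600
HOURS = 24  # one day of per-second samples; a full year scales identically

readings = [20.0 + random.gauss(0, 1) for _ in range(HOURS * SECONDS_PER_HOUR)]

hourly_summary = []
for hour in range(HOURS):
    chunk = readings[hour * SECONDS_PER_HOUR:(hour + 1) * SECONDS_PER_HOUR]
    hourly_summary.append({
        "hour": hour,
        "mean": statistics.fmean(chunk),
        "min": min(chunk),
        "max": max(chunk),
        "stdev": statistics.stdev(chunk),
    })

reduction = 1 - len(hourly_summary) / len(readings)
print(f"{len(readings):,} raw records -> {len(hourly_summary):,} summary rows "
      f"({reduction:.2%} reduction)")
# One summary row per hour: 86,400 raw records become 24 rows (a 99.97% reduction).
```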

If this seems overwhelming, that's understandable, but don't fret. Big data recovery will be one of the major IT topics in the years ahead. That's inevitable, given the gargantuan databases more businesses are mining to grow their revenues and bottom lines. Expect seminars, dialogue about best practices, and rules of thumb like that 80/20 rule.

Michael de la Torre is vice president of Product Management at SunGard Availability Services, a company that developed disaster-recovery services for banks and other companies in 1978.  

 
