Replatforming is the process of rewriting existing enterprise applications to be compatible with a modern data warehouse, and is the most common approach to migrating applications to the cloud.
Today, many enterprises are looking at cloud databases and warehouses (Gartner believes that the majority of new licensing buying likely to be for cloud deployments by 2020) because of the advantages they offer: ongoing technical innovation, security, lower CAPEX and OPEX, better scalability, less maintenance, and little-to-no tuning. This means many of them will be replatforming soon.
In this post, I’ll talk about the importance of planning and considerations that must be kept front and center before migrating to the cloud. Some of these considerations include assessing current approaches and limitations, the importance of understanding the complexity of existing database workloads before making a replatforming plan, why replatforming is so difficult and expensive, and the benefits of virtualizing the data warehouse.
There are many ways to move from an on-premise data warehouse to a cloud-native data warehouse, including rehosting, refactoring, replatforming, and repurchasing, among others. Each approach has its own pros and cons, some are easy but don’t support every kind of application, some strategies are more complete but are incredibly time-consuming and difficult to pull off.
The approaches described above could fit the needs of smaller databases or an application or two but simple data warehouses are not the norm for Global 2000 enterprises: data warehouses are the crown jewels of the enterprise and can contain millions of lines of code painstaking built over decades to serve business needs. For example, one Datometry customer had a 250TB database with a daily average query volume of 14 million. In this situation, none of the three options listed above will work. The enterprise would need to rewrite all the applications line by line for the destination cloud data warehouse and test them: migrating to the cloud would be a multi-year, expensive initiative with costs running into the tens of millions of dollars.
In fact, the complexity and challenge of enterprise replatforming initiatives is confirmed by Gartner which found that 50% percent of all migrations fail, go over budget, or run late because the effort of rewriting and testing applications to work with the new database is severely underestimated.Data warehouses are the crown jewels of the enterprise, and can contain millions of lines of code painstaking built over the course of decades. Click To Tweet
Before migrating to the cloud, enterprises need to extensively plan and prepare for a successful application rewrite. A thorough understanding of the enterprise’s applications and creating a detailed plan before replatforming are paramount to success. Mistakes made when rewriting applications could cause downtime and create significant risk for the business. A deep dive into existing workloads includes getting familiar with the application stack and figuring out which applications are already compatible with the new cloud destination and which need to be rewritten.
Typically, a system integrator is tasked to review the applications, assess the business user needs, create the architecture for the new cloud-native data warehouse, figure out how long the project will take, and determine the overall cost. This is a mostly manual effort and can take months to complete depending on the complexity and size of the data warehouse. Another approach can be using a tool or software solution, such as Datometry qInsight to determine the workload compatibility with the target cloud data warehouses. With qInsight, you can receive a detailed report summarizing the functional, operational, and performance characteristics of data warehouse workloads, assessing the compatibility of the workloads with the target data warehouse, prioritization of workload hygiene, and more.
Analyst firm Gartner found that 50% percent of all migrations fail, go over budget, or run late because the effort of rewriting and testing applications to work with the new database is severely underestimated.
Interestingly, a recent survey conducted by Datometry found that the technical effort and cost of rewriting applications were the biggest challenges in replatforming for the majority of respondents. Where does all this time and risk come from?
Replatforming can be broken down into a few parts. First, enterprises need to develop a good understanding of their application stack and determine which applications will need to be rewritten. Then, the schema will need to be generated for the cloud database or data warehouse. After that, the enterprise rewrites their applications, and finally, the enterprise can move their data into the cloud data warehouse. Most of the difficulty, risk, and cost are centered in the first three steps. It’s surprisingly hard to get a complete view of the workloads and queries that run every day, and of course, the process of generating the new schema often takes at least a few months and the costs can run into several hundred thousand dollars. But, rewriting the applications remains the biggest challenge. In fact, one customer said their plan to rewrite the applications would take five years and cost USD $35M.
New technologies—like Datometry Adaptive Data Virtualization—are making the adoption of cloud databases easy by making the application rewriting process obsolete. Virtualizing the database in weeks, not years, with minimal risk to the business, and at a fraction of the traditional replatforming costs and the competitive advantage offered by modern cloud databases is truly a winning combination!