Data Migration - Strategy and Best Practices - Talend
Data Migration - Strategy and Best Practices - Talend
Best Practices
Knowledge center » Data integration » What is Data Integration? » Understanding Data Migration:
Strategy a…
Big data is what drives most modern businesses, and big data never sleeps. That
means data integration and data migration need to be well-established, seamless
processes — whether data is migrating from inputs to a data lake, from one
repository to another, from a data warehouse to a data mart, or in or through the
cloud. Without a competent data migration plan, businesses can run over budget,
end up with overwhelming data processes, or find that their data operations are
functioning below expectations.
In the context of the extract/transform/load (ETL) process, any data migration will
involve at least the transform and load steps. This means that extracted data needs to
go through a series of functions in preparation, after which it can be loaded in to a
target location.
Organizations undertake data migrations for a number of reasons. They might need
to overhaul an entire system, upgrade databases, establish a new data warehouse, or
merge new data from an acquisition or other source. Data migration is also necessary
when deploying another system that sits alongside existing applications.
Less successful migrations can result in inaccurate data that contains redundancies
and unknowns. This can happen even when source data is fully usable and adequate.
Further, any issues that did exist in the source data can be amplified when it’s
brought into a new, more sophisticated system.
A strategic data migration plan should include consideration of these critical factors:
Cleanup — Once you identify any issues with your source data, they must be
resolved. This may require additional software tools and third-party resources
because of the scale of the work.
The draw of this method is, of course, that it all happens in one time-boxed event,
requiring relatively little time to complete. The pressure, though, can be intense, as
the business operates with one of its resources offline. This risks a compromised
implementation.
If the big bang approach makes the most sense for your business, consider running
through the migration process before the actual event.
“Trickle” Migration
Trickle migrations, in contrast, complete the migration process in phases. During
implementation, the old system and the new are run in parallel, which eliminates
downtime or operational interruptions. Processes running in real-time can keep data
continuously migrating.
Compared to the big bang approach, these implementations can be fairly complex in
design. However, the added complexity — if done right — usually reduces risks,
rather than adding them.
Talend Connect '22
In uncertain times, be certain about your data
Learn more
Back up the data before executing. In case something goes wrong during the
implementation, you can’t afford to lose data. Make sure there are backup
resources and that they’ve been tested before you proceed.
Stick to the strategy. Too many data managers make a plan and then abandon
it when the process goes “too” smoothly or when things get out of hand. The
migration process can be complicated and even frustrating at times, so prepare
for that reality and then stick to the plan.
Test, test, test. During the planning and design phases, and throughout
implementation and maintenance, test the data migration to make sure you will
eventually achieve the desired outcome.
Beyond meeting the requirements for data fields to be transferred, run an audit on
the actual data contained within. If there are poorly populated fields, a lot of
incomplete data pieces, inaccuracies, or other problems, you may reconsider
whether you really need to go through the process of migrating that data in the first
place.
If an organization skips this source review step, and assumes an understanding of the
data, the result could be wasted time and money on migration. Worse, the
organization could run into a critical flaw in the data mapping that halts any progress
in its tracks.
Considering the design, the data to be pulled over, and the target system, you can
begin to define timelines and any project concerns. By the end of this step, the whole
project should be documented.
During planning, it’s important to consider security plans for the data. Any data that
needs to be protected should have protection threaded throughout the plan.
6. Audit
Once the implementation has gone live, set up a system to audit the data in order to
ensure the accuracy of the migration.
Connectivity — Does the solution support the systems and software you
currently use?
Scalability — What are the data limits for the software, and will data needs
exceed them in the foreseeable future?
Security — Take time investigating a software platform’s security measures.
You’re data is one of your most valuable resources, and it must remain
protected.
Speed — How quickly can processing occur on the platform?
In the past, data architects were tasked with deploying sizeable server farms on-
premises to keep data within the organization’s physical resources. Part of the reason
for pushing ahead with on-site servers had been a concern for security on the cloud.
However, as major platforms adopt security practices putting them on par with
traditional IT security (and necessarily in compliance with the GDPR), this barrier to
migration has largely been overcome.
The right cloud integration tools help customers accelerate cloud data migration
projects with a highly scalable and secure cloud integration platform-as-a-service
(iPaaS). Talend’s suite of open source, cloud-native data integration tools enable
drag-and-drop functionality to simplify complex mapping, and our open-source
foundations make our solution cost-effective and efficient.
Talend’s platform includes free, open source data tools that can streamline every
step in the data migration process, from Data Preparation, to Integration, to
continued Data Streaming. Kickstart your data migration process by exploring the
software that can help you get it done. Try Talend Data Fabric today.
Contact sales