It is the process of extracting data that you have already stored on some medium. For example, a hospital has to store a huge amount of data on its patients' health, and this data may be needed again many years later. In such cases, the data has to be retrieved for further processing. You can store it on any medium, such as storage devices or the cloud. When you store data in the cloud, you will have to use cloud-based data extraction.
How is it done?
In the case of structured data, the data can be processed and extracted within the source system. You can retrieve it using one of the following methods:
Complete extraction: this extracts the complete data set. You don't need to track changes in the data, but the load on the system is much higher.
Incremental extraction: this extracts only the data that has changed. You are required to track changes in the source data, but you do not need to go through a complete extraction each time.
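The difference between the two methods can be sketched in a few lines of Python. This is only an illustration under assumptions: the `records` table, its `updated_at` column, and the function names are hypothetical, and an in-memory SQLite database stands in for the real source system.

```python
import sqlite3


def full_extract(conn):
    """Complete extraction: read every row; no change tracking needed."""
    return conn.execute(
        "SELECT id, name, updated_at FROM records ORDER BY id"
    ).fetchall()


def incremental_extract(conn, last_seen):
    """Incremental extraction: read only rows changed after the last run.

    Assumes the source keeps an `updated_at` timestamp per row (hypothetical).
    """
    return conn.execute(
        "SELECT id, name, updated_at FROM records "
        "WHERE updated_at > ? ORDER BY id",
        (last_seen,),
    ).fetchall()


def demo():
    # In-memory database used purely as a stand-in source system.
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE records (id INTEGER PRIMARY KEY, name TEXT, updated_at TEXT)"
    )
    conn.executemany(
        "INSERT INTO records VALUES (?, ?, ?)",
        [
            (1, "alpha", "2024-01-01"),
            (2, "beta", "2024-02-01"),
            (3, "gamma", "2024-03-01"),
        ],
    )
    everything = full_extract(conn)
    changed = incremental_extract(conn, "2024-01-15")
    conn.close()
    return everything, changed
```

The incremental version reads fewer rows, but it only works if the source records when each row was last changed, which is the tracking cost mentioned above.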
In the case of unstructured data, you have to prepare the data that you want to extract. To do this, you will have to remove the noise present in your data by cleaning up whitespace, removing duplicate results, and handling values that are missing.
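As a rough sketch of that preparation step, the Python below normalizes whitespace, fills in missing values, and drops duplicate rows. The set of "missing" markers, the `"unknown"` placeholder, and the function names are assumptions made for the example; real data will need its own rules.

```python
import re

# Strings treated as "value not found" (an assumption for this sketch).
MISSING = {"", "n/a", "null", "none"}


def clean_field(value, default="unknown"):
    """Collapse runs of whitespace and replace missing values with a default."""
    if value is None:
        return default
    text = re.sub(r"\s+", " ", str(value)).strip()
    return default if text.lower() in MISSING else text


def clean_rows(rows, default="unknown"):
    """Clean every field, then drop rows that are exact duplicates."""
    seen = set()
    result = []
    for row in rows:
        cleaned = tuple(clean_field(value, default) for value in row)
        if cleaned not in seen:
            seen.add(cleaned)
            result.append(cleaned)
    return result
```

Duplicates are detected after cleaning, so two rows that differ only in spacing count as the same row.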
When you have stored the data in the cloud, you can use cloud drawing scanning services.
What are the challenges faced when extracting data?
When you have to extract data from both structured and unstructured sources, you may face some challenges. First, you have to decide which data to extract, and then you will have to perform ETL (extract, transform, load) to combine the data from the different sources. The challenge is deciding what to combine with what. Will mixing this content with the other cause any trouble?
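To make the "what to combine with what" problem concrete, here is a minimal sketch that joins a structured source with free-text notes on a shared patient id. The field names, the `patient <id>: text` note format, and the function names are all hypothetical; they only illustrate that a common key has to exist before two sources can be combined.

```python
import re


def extract_structured(rows):
    """Index structured rows by patient id (field names are hypothetical)."""
    return {row["patient_id"]: {"name": row["name"]} for row in rows}


def extract_from_notes(notes):
    """Pull (id, text) out of notes shaped like 'patient 12: some text'."""
    pattern = re.compile(r"patient\s+(\d+)\s*:\s*(.+)", re.IGNORECASE)
    found = {}
    for note in notes:
        match = pattern.match(note.strip())
        if match:
            patient_id = int(match.group(1))
            found.setdefault(patient_id, []).append(match.group(2).strip())
    return found


def combine(structured, notes_by_id):
    """Attach notes to matching records and report ids with no match."""
    merged = {
        patient_id: {**record, "notes": notes_by_id.get(patient_id, [])}
        for patient_id, record in structured.items()
    }
    unmatched = sorted(pid for pid in notes_by_id if pid not in structured)
    return merged, unmatched
```

The `unmatched` list is where trouble from mixing sources shows up: notes that refer to an id the structured source does not know about.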
The next challenge is the security of the data. Some data will be sensitive, and you may have to encrypt it. So you may find it hard to select the right data and to avoid problems.
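One way to reduce that confusion is to decide up front which fields count as sensitive and to separate them before anything else happens. The sketch below shows only that selection step; the field names are hypothetical, and the encryption itself is deliberately left out, since it should come from a vetted library chosen for your environment.

```python
# Fields treated as sensitive in this sketch (hypothetical names).
SENSITIVE_FIELDS = frozenset({"name", "diagnosis"})


def split_sensitive(record, sensitive=SENSITIVE_FIELDS):
    """Split one record into (public_part, sensitive_part).

    The sensitive part is what would be passed on to an encryption step;
    the public part can be stored or combined as-is.
    """
    public = {k: v for k, v in record.items() if k not in sensitive}
    private = {k: v for k, v in record.items() if k in sensitive}
    return public, private
```

Keeping the list of sensitive fields in one place makes it easier to review and to change when new kinds of data are added.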