Maintaining meta data is as critical as keeping your DataLake fresh, and the data in it current and avoid stale data, irrelevant data over time. This requires insight, curation and annotation capabilities.
We can start put manually, but then venture into a more semi-automated approach and then ultimately move , as the ranking and probability confidence goes up, engage in the use of more automated processes.