ISSN 2394-5125
 

Research Article 


DATA PROFILING ON BIG DATA USING KEY DEPENDENCIES TO IMPROVE QUALITY OF META DATA

K. Makesh Babu, Dr. K. Mohan Kumar.

Abstract
Analyzing the big data for an organization is a big challenge, because it may contain
unstructured data. Data profiling is an important discipline which analyzes the given dataset and find the
metadata. This metadata is used to store, access and modify the unstructured data. In this case, the multi-purpose
fields present in the data source is a big issue. The Functional dependency of these data fields is the most
important for metadata structures. Most datasets do not provide their metadata explicitly, so that dataset must be
profiled. Discovering the metadata manually is more complex. Various profiling algorithms are used to
automate the discovering process of Meta data. This paper proposes a profiling algorithm that automatically
discovers the metadata using different kinds of key dependencies. The proposed Functional Dependency
Discovery Conversion Algorithm (FDDCA), work faster than the previous algorithms. Also, this algorithm will
do its works with optimum memory utilization and speed.

Key words: Big data, Data quality, Meta data, Data Profiling, Key dependency


 
ARTICLE TOOLS
Abstract
PDF Fulltext
How to cite this articleHow to cite this article
Citation Tools
Related Records
 Articles by K. Makesh Babu
Articles by Dr. K. Mohan Kumar
on Google
on Google Scholar


How to Cite this Article
Pubmed Style

K. Makesh Babu, Dr. K. Mohan Kumar. DATA PROFILING ON BIG DATA USING KEY DEPENDENCIES TO IMPROVE QUALITY OF META DATA. JCR. 2020; 7(17): 1423-1430. doi:10.31838/jcr.07.17.182


Web Style

K. Makesh Babu, Dr. K. Mohan Kumar. DATA PROFILING ON BIG DATA USING KEY DEPENDENCIES TO IMPROVE QUALITY OF META DATA. http://www.jcreview.com/?mno=29477 [Access: August 17, 2021]. doi:10.31838/jcr.07.17.182


AMA (American Medical Association) Style

K. Makesh Babu, Dr. K. Mohan Kumar. DATA PROFILING ON BIG DATA USING KEY DEPENDENCIES TO IMPROVE QUALITY OF META DATA. JCR. 2020; 7(17): 1423-1430. doi:10.31838/jcr.07.17.182



Vancouver/ICMJE Style

K. Makesh Babu, Dr. K. Mohan Kumar. DATA PROFILING ON BIG DATA USING KEY DEPENDENCIES TO IMPROVE QUALITY OF META DATA. JCR. (2020), [cited August 17, 2021]; 7(17): 1423-1430. doi:10.31838/jcr.07.17.182



Harvard Style

K. Makesh Babu, Dr. K. Mohan Kumar (2020) DATA PROFILING ON BIG DATA USING KEY DEPENDENCIES TO IMPROVE QUALITY OF META DATA. JCR, 7 (17), 1423-1430. doi:10.31838/jcr.07.17.182



Turabian Style

K. Makesh Babu, Dr. K. Mohan Kumar. 2020. DATA PROFILING ON BIG DATA USING KEY DEPENDENCIES TO IMPROVE QUALITY OF META DATA. Journal of Critical Reviews, 7 (17), 1423-1430. doi:10.31838/jcr.07.17.182



Chicago Style

K. Makesh Babu, Dr. K. Mohan Kumar. "DATA PROFILING ON BIG DATA USING KEY DEPENDENCIES TO IMPROVE QUALITY OF META DATA." Journal of Critical Reviews 7 (2020), 1423-1430. doi:10.31838/jcr.07.17.182



MLA (The Modern Language Association) Style

K. Makesh Babu, Dr. K. Mohan Kumar. "DATA PROFILING ON BIG DATA USING KEY DEPENDENCIES TO IMPROVE QUALITY OF META DATA." Journal of Critical Reviews 7.17 (2020), 1423-1430. Print. doi:10.31838/jcr.07.17.182



APA (American Psychological Association) Style

K. Makesh Babu, Dr. K. Mohan Kumar (2020) DATA PROFILING ON BIG DATA USING KEY DEPENDENCIES TO IMPROVE QUALITY OF META DATA. Journal of Critical Reviews, 7 (17), 1423-1430. doi:10.31838/jcr.07.17.182