Abstract

Hadoop Based Parallel Framework for Feature Subset Selection in Big Data

Revathi.L, A.Appandiraj

We are in the era of Big Data. As the scale of data grows every minute, handling massive datasets has become a central concern. Massive data poses a great challenge for classification, and the high dimensionality of modern massive datasets poses a considerable challenge to clustering approaches: the curse of dimensionality can make clustering very slow, and the presence of many irrelevant features can obscure the relevant underlying structure in the data. Feature selection is a key step in the clustering process: it identifies a subset of features that produces results as accurate and consistent as those obtained with the original feature set. Redesigning traditional machine learning and data mining algorithms with the MapReduce programming model is necessary for dealing with massive datasets. MapReduce is a parallel processing framework for large datasets, and Hadoop is its open-source implementation. The objective of this paper is to implement the FAST clustering algorithm with MapReduce programming to remove irrelevant and redundant features. Following preprocessing, a cluster-based MapReduce feature selection approach is implemented to produce an effective set of features.
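The relevance filtering stage described above can be illustrated with a minimal sketch. FAST-style feature selection typically scores each feature's relevance to the class using symmetric uncertainty and discards features below a threshold; in a MapReduce setting, the mapper can emit one `(feature, score)` pair per feature and the reducer can filter them. The sketch below simulates those two phases in plain Python; the function names, the toy dataset, and the threshold value are illustrative assumptions, not the paper's actual implementation.

```python
import math
from collections import Counter

def entropy(values):
    """Shannon entropy of a discrete value sequence."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())

def symmetric_uncertainty(x, y):
    """SU(X, Y) = 2 * (H(X) + H(Y) - H(X, Y)) / (H(X) + H(Y))."""
    hx, hy = entropy(x), entropy(y)
    if hx + hy == 0:
        return 0.0
    hxy = entropy(list(zip(x, y)))  # joint entropy H(X, Y)
    return 2.0 * (hx + hy - hxy) / (hx + hy)

def map_phase(dataset, labels):
    """Mapper analogue: emit (feature_index, SU score against the class)."""
    n_features = len(dataset[0])
    for j in range(n_features):
        column = [row[j] for row in dataset]
        yield j, symmetric_uncertainty(column, labels)

def reduce_phase(pairs, threshold=0.1):
    """Reducer analogue: keep only features above the relevance threshold."""
    return {j: su for j, su in pairs if su > threshold}

# Toy dataset (assumed for illustration): feature 0 matches the label
# exactly, feature 1 is near-random noise.
data = [(0, 1), (0, 0), (1, 1), (1, 0), (0, 1), (1, 0)]
labels = [0, 0, 1, 1, 0, 1]
selected = reduce_phase(map_phase(data, labels))
```

In a real Hadoop job these two functions would run as distributed `Mapper` and `Reducer` tasks over dataset splits; the local version above only shows the scoring and filtering logic.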

Disclaimer: This abstract was translated with the help of artificial intelligence and has not yet been reviewed or verified.
