Semi supervised clustering for Text Clustering

N.Saranya

Abstrakt

Semi supervised clustering for Text Clustering

N.Saranya

Based on clustering algorithm Affinity Propagation (AP) I present this paper a semisupervised text clustering algorithm, called Seeds Affinity Propagation (SAP). There are two main contributions in my approach: 1) a similarity metric that captures the structural information of texts, and 2) seed construction method to improve the semisupervised clustering process. To study the performance and efficiency of the new algorithm, I applied it to the benchmark data and compared it to two state-of-the-art clustering algorithms, namely, k-means algorithm and the original AP algorithm. Furthermore, I have analyzed the individual impact of the two proposed contributions. Results show that the proposed similarity metric is more effective in text clustering and the proposed semisupervised strategy achieves both better clustering results and faster convergence. The complete SAP algorithm obtains higher F-measure and lower entropy, improves significantly clustering execution time (25 times faster) in respect that k-means, and provides enhanced robustness compared with all other methods.

Haftungsausschluss: Dieser Abstract wurde mit Hilfe von Künstlicher Intelligenz übersetzt und wurde noch nicht überprüft oder verifiziert

Zeitschriften-Highlights

Ad-hoc-Netzwerk Adaptiv Agentenbasierte Middleware Autonomes und kontextbewusstes Computing Bioinformatik und Computerbiologie Breitband und intelligente Netzwerke CDMA/GSM-Kommunikationsprotokoll Data Warehousing Datenbanksicherheit Datenstruktur Drahtlose Sensoren Fortgeschrittene Computerarchitekturen Fortgeschrittene numerische Algorithmen Grid-Computing Muster-/Bilderkennung durch künstliche Intelligenz Quelloffene Software Radartechnologie Robotik Ruhige Technologie Sicherheitssysteme

Indiziert in

Index Copernicus

Academic Keys

CiteFactor

Kosmos IF

RefSeek

Hamdard-Universität

Weltkatalog wissenschaftlicher Zeitschriften

International Innovative Journal Impact Factor (IIJIF)

Internationales Institut für organisierte Forschung (I2OR)

Kosmos

Mehr sehen

Internationale Zeitschriften

Allgemeine Wissenschaften Medizinische Wissenschaften Pharmazeutische Wissenschaften Инженерное дело

Internationale Zeitschrift für innovative Forschung in der Computer- und Kommunikationstechnik

Abstrakt

Semi supervised clustering for Text Clustering

Zeitschriften-Highlights

Indiziert in

Internationale Zeitschriften

Adresse