Simple and Efficient Way to Cluster
Documents for Growing Database

Dikhtiarenko Oleks; r; Biloshchytskyi Andrii

Abstrakt

Simple and Efficient Way to Cluster Documents for Growing Database

Dikhtiarenko Oleksandr, Biloshchytskyi Andrii

In this article we described a new method of clustering text documents. A frequency table of words from the documents was used as a characteristic of each document. These tables were created using term frequency which were cleaned from words that do not characterize a specific document and are common to the entire set of documents or for most of it. For the identification of such words, we calculated the percentage of documents in which this word occurs (inverse document frequency). The objectives of this publication were to determine the possibility of using frequency dictionary documents as their semantic characteristics and determine clustering method using frequency tables.

Haftungsausschluss: Dieser Abstract wurde mit Hilfe von Künstlicher Intelligenz übersetzt und wurde noch nicht überprüft oder verifiziert

Zeitschriften-Highlights

Ad-hoc-Netzwerk Adaptiv Agentenbasierte Middleware Autonomes und kontextbewusstes Computing Bioinformatik und Computerbiologie Breitband und intelligente Netzwerke CDMA/GSM-Kommunikationsprotokoll Data Warehousing Datenbanksicherheit Datenstruktur Drahtlose Sensoren Fortgeschrittene Computerarchitekturen Fortgeschrittene numerische Algorithmen Grid-Computing Muster-/Bilderkennung durch künstliche Intelligenz Quelloffene Software Radartechnologie Robotik Ruhige Technologie Sicherheitssysteme

Indiziert in

Index Copernicus

Academic Keys

CiteFactor

Kosmos IF

RefSeek

Hamdard-Universität

Weltkatalog wissenschaftlicher Zeitschriften

International Innovative Journal Impact Factor (IIJIF)

Internationales Institut für organisierte Forschung (I2OR)

Kosmos

Mehr sehen

Internationale Zeitschriften

Allgemeine Wissenschaften Medizinische Wissenschaften Pharmazeutische Wissenschaften Инженерное дело

Internationale Zeitschrift für innovative Forschung in der Computer- und Kommunikationstechnik

Abstrakt

Simple and Efficient Way to Cluster Documents for Growing Database

Zeitschriften-Highlights

Indiziert in

Internationale Zeitschriften

Adresse