Abstract

EXTRACTING PLAIN TEXT FROM CORRUPTED WORD DOCUMENT

Saptarshi Naskar, Souvik Sarkar and Krishnendu Basuli

Text conversion is a process, implemented in some programming language, whose main task is to extract the plain text from a source file in one format and write it to a file in a different format. The conversion changes the file extension, but the actual data keeps its format and size. The conversion program (which may be written in C, C++, or another language) reads the source file line by line (in many systems, character by character), and whenever it finds appropriate text (i.e. valid data or characters) it copies that text into the output file, leaving the formatting as it was in the source file. Here, we design a text converter in the C programming language that accepts files with the extensions .doc, .rtf, and .txt, extracts the plain text from them, and writes it to a text file while leaving the formatting of the source text unchanged.
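The character-by-character copying described in the abstract can be illustrated with a minimal sketch in C. This is not the authors' converter. It assumes that a "valid" character is a printable ASCII character, a tab, or a newline, and it ignores the internal structure of .doc and .rtf files (including any text stored in a multi-byte encoding). The names `is_plain_char`, `filter_plain_text`, and `extract_plain_text` are hypothetical and chosen only for this illustration.

```c
#include <stddef.h>
#include <stdio.h>

/* Assumption for this sketch: a "valid" character is printable ASCII
   (0x20..0x7E), a tab, or a newline. Everything else is skipped. */
static int is_plain_char(int c)
{
    return c == '\n' || c == '\t' || (c >= 0x20 && c <= 0x7E);
}

/* Copy the valid characters of src[0..n) into dst, NUL-terminate dst,
   and return the number of characters kept.
   dst must have room for at least n + 1 bytes. */
size_t filter_plain_text(const unsigned char *src, size_t n, char *dst)
{
    size_t kept = 0;
    for (size_t i = 0; i < n; i++) {
        if (is_plain_char(src[i]))
            dst[kept++] = (char)src[i];
    }
    dst[kept] = '\0';
    return kept;
}

/* Stream variant: read `in` one byte at a time and write every valid
   character to `out`. Returns the number of characters written,
   or -1 if a read or write error occurs. */
long extract_plain_text(FILE *in, FILE *out)
{
    long kept = 0;
    int c;
    while ((c = fgetc(in)) != EOF) {
        if (is_plain_char(c)) {
            if (fputc(c, out) == EOF)
                return -1;
            kept++;
        }
    }
    return ferror(in) ? -1 : kept;
}
```

A caller would typically open the source with `fopen(path, "rb")`, so that no byte translation happens on read, open the destination .txt file for writing, and pass both streams to `extract_plain_text`. Filtering on a fixed character range is the simplest possible rule. A real converter for .doc or .rtf input would also need to skip format-specific control data that happens to consist of printable characters.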

Disclaimer: This abstract was translated with the help of artificial intelligence and has not yet been reviewed or verified.

Indexed in

Google Scholar
Academic Journals Database
Open J Gate
Academic Keys
ResearchBible
CiteFactor
Elektronische Zeitschriftenbibliothek
RefSeek
Hamdard University
Scholar
International Innovative Journal Impact Factor (IIJIF)
International Institute of Organized Research (I2OR)
Cosmos
