Automatic Language Identification from
Written Texts ? An Overview

H L Shashirekha

Abstrakt

Automatic Language Identification from Written Texts ? An Overview

H L Shashirekha

Language Identification is the task of automatically identifying the language(s) in which the content is written in a document (web page, text document). Due to the widespread use of internet, identification of languages has become an important preprocessing step for a number of applications such as machine translation, Part-of-Speech tagging, linguistic corpus creation, supporting low-density languages, accessibility of social media or user-generated content, search engines and information extraction in addition to processing multilingual documents. In a multilingual country like India, Language Identification has wider scope to bridge the digital divide between different language users. This paper presents a brief overview of the challenges involved in automatic language identification, existing methodologies and some of the tools available for language identification.

Haftungsausschluss: Dieser Abstract wurde mit Hilfe von Künstlicher Intelligenz übersetzt und wurde noch nicht überprüft oder verifiziert

Zeitschriften-Highlights

Ad-hoc-Netzwerk Adaptiv Agentenbasierte Middleware Autonomes und kontextbewusstes Computing Bioinformatik und Computerbiologie Breitband und intelligente Netzwerke CDMA/GSM-Kommunikationsprotokoll Data Warehousing Datenbanksicherheit Datenstruktur Drahtlose Sensoren Fortgeschrittene Computerarchitekturen Fortgeschrittene numerische Algorithmen Grid-Computing Muster-/Bilderkennung durch künstliche Intelligenz Quelloffene Software Radartechnologie Robotik Ruhige Technologie Sicherheitssysteme

Indiziert in

Index Copernicus

Academic Keys

CiteFactor

Kosmos IF

RefSeek

Hamdard-Universität

Weltkatalog wissenschaftlicher Zeitschriften

International Innovative Journal Impact Factor (IIJIF)

Internationales Institut für organisierte Forschung (I2OR)

Kosmos

Mehr sehen

Internationale Zeitschriften

Allgemeine Wissenschaften Medizinische Wissenschaften Pharmazeutische Wissenschaften Инженерное дело

Internationale Zeitschrift für innovative Forschung in der Computer- und Kommunikationstechnik

Abstrakt

Automatic Language Identification from Written Texts ? An Overview

Zeitschriften-Highlights

Indiziert in

Internationale Zeitschriften

Adresse