Evaluation of TnT Tagger for Spanish

Carrasco, R.M.; Gelbukh, A.

doi:10.1109/ENC.2003.1232869

Full metadata record

DC Field	Value	Language
dc.contributor.author	Carrasco, R.M.	-
dc.contributor.author	Gelbukh, A.	-
dc.date.accessioned	2023-03-09T01:48:36Z	-
dc.date.available	2023-03-09T01:48:36Z	-
dc.date.issued	2003-09	-
dc.identifier.issn	1550-4069	-
dc.identifier.uri	https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/65617	-
dc.description.abstract	A part-of-speech (POS) tagger is a necessary module in many natural language text processing tasks. A POS tagger is a program that accepts unprepared raw text as input and adds to each word a tag specifying its grammatical properties, such as part of speech, number, person, etc. One popular POS tagger, TnT, has been extensively tested for English and some other languages. This paper reports on its evaluation for Spanish. An error analysis explains how some specific features of Spanish affect tagger performance. On Spanish texts, TnT shows overall tagging accuracy between 92.5% and 95.84%: between 95.47% and 98.56% on known words and between 75.57% and 83.49% on unknown words. The results show that TnT has reached a good level of maturity and is helpful enough for NLP tasks. © 2003 IEEE.	-
dc.format.extent	8	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	IEEE Computer Society	-
dc.title	Evaluation of TnT Tagger for Spanish	-
dc.type	Article	-
dc.identifier.doi	10.1109/ENC.2003.1232869	-
dc.identifier.bibliographicCitation	Proceedings of the Mexican International Conference on Computer Science, v.2003-January, pp 18 - 25	-
dc.description.isOpenAccess	N	-
dc.identifier.scopusid	2-s2.0-79952272117	-
dc.citation.endPage	25	-
dc.citation.startPage	18	-
dc.citation.title	Proceedings of the Mexican International Conference on Computer Science	-
dc.citation.volume	2003-January	-
dc.type.docType	Conference Paper	-
dc.subject.keywordAuthor	Character recognition	-
dc.subject.keywordAuthor	Error analysis	-
dc.subject.keywordAuthor	Mood	-
dc.subject.keywordAuthor	Natural languages	-
dc.subject.keywordAuthor	Speech processing	-
dc.subject.keywordAuthor	Speech recognition	-
dc.subject.keywordAuthor	Tagging	-
dc.subject.keywordAuthor	Testing	-
dc.subject.keywordAuthor	Text processing	-
dc.subject.keywordAuthor	Text recognition	-
dc.subject.keywordPlus	Character recognition	-
dc.subject.keywordPlus	Error analysis	-
dc.subject.keywordPlus	Industrial plants	-
dc.subject.keywordPlus	Natural language processing systems	-
dc.subject.keywordPlus	Software testing	-
dc.subject.keywordPlus	Speech processing	-
dc.subject.keywordPlus	Speech recognition	-
dc.subject.keywordPlus	Testing	-
dc.subject.keywordPlus	Text processing	-
dc.subject.keywordPlus	ITS evaluation	-
dc.subject.keywordPlus	Mood	-
dc.subject.keywordPlus	Natural languages	-
dc.subject.keywordPlus	Natural-language text processing	-
dc.subject.keywordPlus	Part Of Speech	-
dc.subject.keywordPlus	Spanish language	-
dc.subject.keywordPlus	Tagging	-
dc.subject.keywordPlus	Text recognition	-
dc.subject.keywordPlus	Computational linguistics	-
dc.description.journalRegisteredClass	scopus	-
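
The abstract reports accuracy separately for known and unknown words. The sketch below illustrates how such a split is commonly computed. It is not the authors' evaluation script. It assumes that a "known" word is one that occurred in the tagger's training data, which is the usual convention but is not stated in this record. The function name `tagging_accuracy`, the sample sentence, and the tag labels are all hypothetical.

```python
from typing import Dict, Iterable, Set, Tuple


def tagging_accuracy(
    tokens: Iterable[Tuple[str, str, str]],
    known_vocab: Set[str],
) -> Dict[str, float]:
    """Return accuracy on known words, unknown words, and overall.

    tokens      -- (word, gold_tag, predicted_tag) triples
    known_vocab -- words assumed to have been seen in training
    """
    # For each bucket keep [correct, total].
    counts = {"known": [0, 0], "unknown": [0, 0]}
    for word, gold, predicted in tokens:
        bucket = "known" if word in known_vocab else "unknown"
        counts[bucket][1] += 1
        if gold == predicted:
            counts[bucket][0] += 1

    def ratio(correct: int, total: int) -> float:
        # An empty bucket has no defined accuracy.
        return correct / total if total else float("nan")

    result = {name: ratio(c, t) for name, (c, t) in counts.items()}
    result["overall"] = ratio(
        sum(c for c, _ in counts.values()),
        sum(t for _, t in counts.values()),
    )
    return result


# Hypothetical data: three known tokens (two tagged correctly)
# and two unknown tokens (one tagged correctly).
vocab = {"la", "casa", "es"}
sample = [
    ("la", "DET", "DET"),
    ("casa", "NOUN", "NOUN"),
    ("es", "VERB", "NOUN"),
    ("azulada", "ADJ", "NOUN"),
    ("grandota", "ADJ", "ADJ"),
]
scores = tagging_accuracy(sample, vocab)
print(scores)
```

Overall accuracy is the token-weighted combination of the two buckets, so a corpus with a larger share of unknown words pulls the overall figure toward the lower unknown-word accuracy.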


Appears in Collections: College of Software > School of Computer Science and Engineering > 1. Journal Articles
