Evaluation of TnT Tagger for Spanish
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Carrasco, R.M. | - |
dc.contributor.author | Gelbukh, A. | - |
dc.date.accessioned | 2023-03-09T01:48:36Z | - |
dc.date.available | 2023-03-09T01:48:36Z | - |
dc.date.issued | 2003-09 | - |
dc.identifier.issn | 1550-4069 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/65617 | - |
dc.description.abstract | Part of speech (POS) tagger is a necessary module in many natural language text processing tasks. A POS tagger is a program that accepts an unprepared raw text in input and to each word adds a tag specifying its grammatical properties, such as part of speech, number, person, etc. One of popular POS taggers - TnT tagger - has been extensively tested for English and some other languages. This paper reports on its evaluation for Spanish language. Error analysis is reported, explaining how some specific features of Spanish language affect tagger performance. It is reported that on Spanish texts TnT shows overall tagging accuracy between 92.5% and 95.84%, specifically, between 95.47% and 98.56% on known words and between 75.57% and 83.49% on unknown words. Results show that TnT has reached a good level of maturity and is helpful enough for NLP tasks. © 2003 IEEE. | - |
dc.format.extent | 8 | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | IEEE Computer Society | - |
dc.title | Evaluation of TnT Tagger for Spanish | - |
dc.type | Article | - |
dc.identifier.doi | 10.1109/ENC.2003.1232869 | - |
dc.identifier.bibliographicCitation | Proceedings of the Mexican International Conference on Computer Science, v.2003-January, pp 18 - 25 | - |
dc.description.isOpenAccess | N | - |
dc.identifier.scopusid | 2-s2.0-79952272117 | - |
dc.citation.endPage | 25 | - |
dc.citation.startPage | 18 | - |
dc.citation.title | Proceedings of the Mexican International Conference on Computer Science | - |
dc.citation.volume | 2003-January | - |
dc.type.docType | Conference Paper | - |
dc.subject.keywordAuthor | Character recognition | - |
dc.subject.keywordAuthor | Error analysis | - |
dc.subject.keywordAuthor | Mood | - |
dc.subject.keywordAuthor | Natural languages | - |
dc.subject.keywordAuthor | Speech processing | - |
dc.subject.keywordAuthor | Speech recognition | - |
dc.subject.keywordAuthor | Tagging | - |
dc.subject.keywordAuthor | Testing | - |
dc.subject.keywordAuthor | Text processing | - |
dc.subject.keywordAuthor | Text recognition | - |
dc.subject.keywordPlus | Character recognition | - |
dc.subject.keywordPlus | Error analysis | - |
dc.subject.keywordPlus | Industrial plants | - |
dc.subject.keywordPlus | Natural language processing systems | - |
dc.subject.keywordPlus | Software testing | - |
dc.subject.keywordPlus | Speech processing | - |
dc.subject.keywordPlus | Speech recognition | - |
dc.subject.keywordPlus | Testing | - |
dc.subject.keywordPlus | Text processing | - |
dc.subject.keywordPlus | ITS evaluation | - |
dc.subject.keywordPlus | Mood | - |
dc.subject.keywordPlus | Natural languages | - |
dc.subject.keywordPlus | Natural-language text processing | - |
dc.subject.keywordPlus | Part Of Speech | - |
dc.subject.keywordPlus | Spanish language | - |
dc.subject.keywordPlus | Tagging | - |
dc.subject.keywordPlus | Text recognition | - |
dc.subject.keywordPlus | Computational linguistics | - |
dc.description.journalRegisteredClass | scopus | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
84, Heukseok-ro, Dongjak-gu, Seoul, Republic of Korea (06974)02-820-6194
COPYRIGHT 2019 Chung-Ang University All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.