Notes:
ACOPOST (A COllection of POS Taggers) is a collection of part-of-speech taggers, which are programs that assign a part of speech (such as noun, verb, or adjective) to each word in a piece of text. These taggers are used to automatically analyze and annotate text with information about the word categories to which the words belong. This can be useful for a variety of natural language processing tasks, such as information retrieval, machine translation, and text summarization.
ACOPOST part-of-speech taggers are modeled after well-known techniques and written in C. The taggers are intended to be portable and code-safe. ACOPOST currently consists of four taggers: the Maximum Entropy Tagger, the Trigram Tagger, the Transformation-based Tagger, and the Error-driven Tagger. These taggers are used to assign symbols from a particular set to words in a natural language text. ACOPOST is a free and open-source resource that is intended to provide a uniform environment for evaluating and comparing different POS tagging techniques.
Resources:
See also:
Tasks of Natural Language Processing
Comparing a TBL Tagger with an HMM Tagger: Time Efficiency, Accuracy, Unknown Words JH Stehouwer – Capita Selecta Paper (HMI Group, University of Twente …, 2006 – Citeseer … size tagset Brill Acopost 321 tags 9.8% ERR 229 tags 4.8% ERR 150 tags 4.7% 3.6% 55 tags 3.6% 3.1% 14 tags 2.5% 2.7% Table 1: The error-rate for the Brill and the Acopost tagger in an experiment with different sized tagsets, using a training-set of 100.000 sentences. … Cited by 2 Related articles All 3 versions
Fine-grain morphological analyzer and part-of-speech tagger for Arabic text M Sawalha, ES Atwell – Proceedings of the Seventh …, 2010 – eprints.whiterose.ac.uk … ACOPOST 1 , A Collection Of POS Taggers, consists of four taggers of different frameworks; Maximum Entropy Tagger (MET), Trigram Tagger (T3), Error-driven Transformation-based Tagger (TBT) and Example-based tagger (ET). … Cited by 26 Related articles All 10 versions
META-MultilanguagE Text Analyzer P Basile, M de Gemmis, A Gentile, L Iaquinta… – Proceedings of the …, 2008 – di.uniba.it … We develop a JAVA version of ACOPOST tagger 3 using Trigram Tagger T3 algorithm. … mor- phological resource 4; 2http://snowball.tartarus.org/ 3http://acopost.sourceforge. net/ 4http://sslmitdev-online.sslmit.unibo.it/linguistics/morph-it.php … Cited by 22 Related articles All 4 versions
Enhancing Semantic Search using N-Levels Document Representation. P Basile, A Caputo, AL Gentile, M Degemmis, P Lops… – SemSearch, 2008 – academia.edu … Lemmatization. For POS-tagging, a JAVA version of ACOPOST tagger2 has been implemented; it adopts Trigram Tagger T3 algorithm based on Hidden Markov Models. … 1 http://snowball. tartarus. org/ 2 http://acopost. sourceforge. net … Cited by 17 Related articles All 9 versions
Combining Learning and Word Sense Disambiguation for Intelligent User Profiling. G Semeraro, M Degemmis, P Lops, P Basile – IJCAI, 2007 – aaai.org … The algorithm is based on three different procedures for nouns, verbs, adverbs and adjectives, called JIGSAWnouns, JIGSAWverbs, JIGSAWothers, respec- tively. The POS tag of each word is computed by the HMM- based tagger ACOPOST t34. … 4http://acopost.sourceforge.net/ … Cited by 62 Related articles All 12 versions
The JUMP project: domain ontologies and linguistic knowledge @work P Basile, M De Gemmis, AL Gentile… – SEMANTIC WEB …, 2007 – researchgate.net … In our project we adopt the Snowball stemmer 4; –POS-tagging: it is the process of assign a part-of-speech to each token. We develop a JAVA version of ACOPOST tagger 5 using Trigram Tagger T3 algorithm. … tartarus. org/ 5 http://acopost. sourceforge. net/ 65 Page 74. … Cited by 7 Related articles All 8 versions
A trigram part-of-speech tagger for the Apertium free/open-source machine translation platform ZMAW Sheikh, F Sánchez Martínez – 2009 – rua.ua.es … HMMs have al- ready been implemented in various projects, the most well-know implementations are the TnT tag- ger (Brants, 2000) and its open-source alternative, HunPos (Halácsy et al., 2007).4 There is also an implementation of a trigram tagger in Acopost (A Collection Of … Cited by 3 Related articles All 8 versions
A Maximum Entropy Approach to Kannada Part Of Speech Tagging BR Shambhavi, R Kumar… – International Journal of …, 2012 – search.proquest.com … ( ). 21 RDP Reduplication 22 UT Quotative. 23 NUM Numbers 24 ECH Echo words. 25 UNK Unknown Hello. 1 http://acopost.sourceforge.net/. Web End =http://acopost.sourceforge.net/. 10. International Journal of Computer Applications (0975 8887) Volume 41 No.13, March 2012 … Cited by 3 Related articles All 2 versions
Improving arabic part-of-speech tagging through morphological analysis M Albared, N Omar, MJ Ab Aziz – Intelligent Information and Database …, 2011 – Springer … from the relative frequencies. Many staistical taggers have been made available to the community for scien- tific purposes during the last years for example, the TnT tagger [10], HunPos[16] and ACOPOST[17]. However, NLP … Cited by 2 Related articles All 4 versions
Content-based recommendation services for personalized digital libraries G Semeraro, P Basile, M de Gemmis, P Lops – Digital Libraries: Research …, 2007 – Springer … The POS tag of each word to be disambiguated is computed by the HMM-based tagger ACOPOST t32. JIGSAW proceeds in several iterations by using the disambiguation results of the previous iteration to reduce the complexity of the next one. … 2 http://acopost.sourceforge.net/ … Cited by 6 Related articles All 4 versions
Part of speech tagging for bengali with hidden markov model S Dandapat, S Sarkar – Proceeding of the NLPAI Machine Learning …, 2006 – ltrc.iiit.ac.in … 1 http://acopost.sourceforge.net/ 2 The training data includes both the data provided in non-privileged and privileged mode. Page 3. 2.5 Tag Set and Corpus Ambiguity The training data has been manually annotated using a tag set consisting of 27 grammatical tags. … Cited by 16 Related articles All 6 versions
Fast-track Text Processing for Real-time Text-to-speech on Mobile Devices S Saychum, N Thatphithakkul, C Wutiwiwatchai… – saki.siit.tu.ac.th … POS is tagged automatically after word segmentation by using the ACOPOST tagger [13] trained by a Thai POS tagged corpus. … Technical report FBI-HH-M-314/02. Department of Computer Science, University of Hamburg [13] http://acopost.sourceforge.net Related articles
[BOOK] Discovering user profiles from semantically indexed scientific papers G Semeraro, P Basile, M de Gemmis, P Lops – 2007 – Springer … The POS tag of each word is computed by the HMM-based tagger ACOPOST t35. More details for each one of the above mentioned procedures follow. 4.1 JIGSAWnouns … The similarity between two 5 http://acopost.sourceforge.net/ Page 7. … Cited by 5 Related articles All 7 versions
Integrating Sense Discrimination in a Semantic Information Retrieval System P Basile, A Caputo, G Semeraro – Information Retrieval and Mining in …, 2011 – Springer … For POS- tagging, we implemented a JAVA version of ACOPOST tagger4 which adopts the Trigram Tagger T3 algorithm [7] based on Hidden Markov Models. For 3 http://snowball.tartarus.org/ 4 http://acopost.sourceforge.net/ Page 9. … Cited by 4 Related articles All 8 versions
A Modified Information Retrieval Approach to Produce Answer Candidates for Question Answering. J Leveling – LWA, 2007 – researchgate.net … About 80% accuracy for LOCATION and less than 60% for ORGANIZATION were achieved in ex- periments with the Acopost taggers6 and MBT7. … First, the variation detection tool DECCA9 was applied to spot inconsistent annotations. This variation 6http://acopost. sourceforge. … Cited by 4 Related articles All 5 versions
Using language resource independent detection for spanish named entity recognition Z Kozareva, O Ferrández, A Montoyo… – Proceedings of the …, 2005 – researchgate.net … GV04B-276 and GV04B-268. 1catalogues of names of people, locations, organizations etc. 2http://acopost.sourceforge.net/ For entity detection, a language resource in- dependent and portable set was used. Initially this set was … Cited by 4 Related articles
Semantic Web Personalization in a Scientific Congress Scenario. G Semeraro, M Degemmis, P Lops, P Basile… – SWAP, 2006 – Citeseer … The POS tag of each word to be disambiguated is computed by the HMM- based tagger ACOPOST t36. … Then, verbs are disambiguated by JIGSAWverbs by exploiting the 5http://wordnet.princeton.edu 6http://acopost.sourceforge.net/ … Cited by 1 Related articles All 5 versions
POS tagset design for Italian R Bernardi, A Bolognesi, C Seidenari… – Proc. 5th International …, 2006 – researchgate.net … the tagset proposed by TUT using three dif- ferent taggers: (a) the CORISTagger, an HMM-based tag- ger which embodies a powerful Italian morphological anal- yser based on a 100.000-lemma lexicon (Tamburini, 2000), (b) the HMM-based tagger ACOPOST t3 (Schröder … Cited by 2 Related articles All 10 versions
NLPBench: A Tool for Studying the Architectural Characteristics of Natural Langauge and Speech Applications A Mukhopadhyay – … : Theory and Applications, 2007. ICCTA’07. …, 2007 – ieeexplore.ieee.org … Application Domain Components Function Brill’s Tagger A Parts Of Speech Tagger based on transformation driven error learning Collin’s Parser A statistical natural language parser Acopost ET An example based tagger for natural languages Machine Translation … Related articles All 6 versions
Feature-rich part-of-speech tagging for morphologically complex languages: Application to bulgarian G Georgiev, V Zhikov, P Osenova, K Simov… – Proceedings of the 13th …, 2012 – dl.acm.org … trained and evalu- ated on the BulTreeBank:2 TreeTagger (Schmid, 1994), which uses decision trees, TnT (Brants, 2000), which uses a hidden Markov model, SVMtool (Giménez and M`arquez, 2004), which is based on support vector machines, and ACOPOST (Schröder, 2002 … Cited by 17 Related articles All 17 versions
Knowledge-Based Identification of the ERK2/STAT3 Signal Pathway as a Therapeutic Target for Type 2 Diabetes and Drug Discovery T Kinoshita, H Sugiyama, S Kinoshita… – Chemical biology & …, 2011 – Wiley Online Library … text mining process. At first, entities of genes/proteins or compounds were recognized by hidden Markov model part-of-speech tagger, ACOPOST h3 program (18), with three types of postprocessing (16). Then, phrases were … Cited by 2 Related articles All 4 versions
Automated Linguistic Analysis and Pedagogical Use of CLC Y Wang – Intelligent Human-Machine Systems and Cybernetics …, 2014 – ieeexplore.ieee.org … The main POS-tagging software includes: Stanford POS tagger, MBT, TreeTagger, SVMTool, ACOPOST, MXPOST, QTAG Part of speech tagger, The TOSCA/LOB tagger, Lingua-EN-Tagger, TATOO, TnT, Birmingham tagger, CLAWS tagger, The AMALGAM tagger, Xerox XRCE … Related articles
A Maximum Entropy Approach to Kannada Part Of Speech Tagging BB Bangalore – Citeseer Page 1. International Journal of Computer Applications (0975 – 8887) Volume 41– No.13, March 2012 9 A Maximum Entropy Approach to Kannada Part Of Speech Tagging Shambhavi.B. R Ramakanth Kumar P Revanth G Department … Related articles All 2 versions
Combining data-driven systems for improving named entity recognition Z Kozareva, O Ferrández, A Montoyo, R Munoz… – … and Information Systems, 2005 – Springer … classifier. For the case of B tags, B:I signifies that the NE is supposed to be B, but the classifier assigned an I tag 3 http://acopost.sourceforge.net/ Page 6. Combining Data-Driven Systems for Improving Named Entity Recognition 85 … Cited by 11 Related articles All 12 versions
Combining automatic acquisition of knowledge with machine learning approaches for multilingual temporal recognition and normalization E Saquete, O Ferrández, S Ferrández… – Information …, 2008 – Elsevier This paper presents an improvement in the temporal expression (TE) recognition phase of a knowledge based system at a multilingual level. For this purpose, the. Cited by 11 Related articles All 3 versions
Automatic part-of-speech tagging for Bengali: An approach for morphologically rich languages in a poor resource scenario S Dandapat, S Sarkar, A Basu – Proceedings of the 45th Annual Meeting …, 2007 – dl.acm.org … The accuracy achieved using TnT are 87.44% and 87.36% respectively with bigram and trigram model for 40K training data. The accuracy with ACOPOST is 86.3%. This reflects that the higher order Markov mod- els do not work well under the current experi- mental setup. … Cited by 32 Related articles All 17 versions
Patent Application Text Pre-processing for Patent Examination Procedure AG Kravets, AG Mironenko, SS Nazarov… – Creativity in Intelligent, …, 2015 – Springer … commercial products. English Language. We considered a number of morphological analyzers for English language: Stanford POS tagger, hunpos, SVMTool, ACOPOST, MXPOST, YamC- ha,TreeTagger etc. Stanford POS tagger …
Combining data-driven systems for improving Named Entity Recognition Z Kozareva, Ó Ferrández, A Montoyo, R Muñoz… – Data & Knowledge …, 2007 – Elsevier … equation(2). arg max T p ( T ) p ( W | T ). The toolkit we used is called ACOPOST 7 implemented for the task of POS tagging, but adapted to NER [30]. 2.2. Feature description. To perform the Named Entity Recognition task, different sets of features can be used. … Cited by 30 Related articles All 4 versions
Part-of-Speech Tagging for Bengali S Dandapat – 2009 – computing.dcu.ie Page 1. Part-of-Speech Tagging for Bengali Thesis submitted to Indian Institute of Technology, Kharagpur for the award of the degree of Master of Science by Sandipan Dandapat Under the guidance of Prof. Sudeshna Sarkar and Prof. Anupam Basu … Cited by 3 Related articles All 5 versions
A standard tag set expounding traditional morphological features for Arabic language part-of-speech tagging M Sawalha, E Atwell – Word Structure, 2013 – researchgate.net … ACOPOST,2 A Collection Of POS Taggers, consists of four taggers of different frameworks: Maximum Entropy Tagger (MET), Trigram Tagger (T3), Error-driven Transformation-Based Tagger (TBT) and Example-based tagger (ET). … Cited by 2 Related articles All 3 versions
Dependency Parsing with Lattice Structures for Resource-Poor Languages S Sudprasert, A Kawtrakul, C Boitet… – … on Information and …, 2009 – search.ieice.org Page 1. 2122 IEICE TRANS. INF. & SYST., VOL.E92–D, NO.10 OCTOBER 2009 PAPER Dependency Parsing with Lattice Structures for Resource-Poor Languages Sutee SUDPRASERT †a) , Asanee KAWTRAKUL † , Members … Cited by 2 Related articles All 6 versions
Personalization in Digital Library: An Intelligent Service based on Semantic User Profiles G Semeraro, P Lops, M Degemmis, P Basile, A Gentile – 2007 – delos2007.cab.unipd.it … The description of the synset is “give a certain impression or have a certain outward aspect She seems to be sleeping This appears to be a very difficult problem”. First, the JIGSAWverbs includes 6http://acopost.sourceforge.net/ Page 4. … Related articles All 3 versions
Mink: An Incremental Data-Driven Dependency Parser with Integrated Conversion to Semantics. R Cantrell – RANLP, 2009 – anthology.aclweb.org … practical task. We chose to use the parser to distinguish between different types of utterances, namely questions, commands, and state- ments. We chose not to use a more complex discourse 2 http://acopost. sourceforge. net … Cited by 5 Related articles All 8 versions
[BOOK] Open-source resources and standards for Arabic word structure analysis: Fine grained morphological analysis of Arabic text corpora MSS Sawalha – 2011 – etheses.whiterose.ac.uk Page 1. Open-source Resources and Standards for Arabic Word Structure Analysis: Fine Grained Morphological Analysis of Arabic Text Corpora By Majdi Shaker Salem Sawalha Submitted in accordance with the requirements for the degree of Doctor of Philosophy … Cited by 6 Related articles All 5 versions
A Robust Transformation-Based Learning Approach Using Ripple Down Rules for Part-Of-Speech Tagging DQ Nguyen, DQ Nguyen, DD Pham… – arXiv preprint arXiv: …, 2014 – arxiv.org Page 1. A Robust Transformation-Based Learning Approach Using Ripple Down Rules for Part-Of-Speech Tagging Dat Quoc Nguyena,d,?, Dai Quoc Nguyenb,d, Dang Duc Phamc, Son Bao Phamd aDepartment of Computing … Related articles All 2 versions
Literature review on patient-friendly documentation systems H Åhlfeldt, L Borin, N Grabar, C Hallett, D Hardcastle… – 2006 – pub.uni-bielefeld.de Page 1. Semantic Mining – Work Package 27 – Deliverable 1 Literature Review on Patient-Friendly Documentation Systems HansÅhlfeldt4, Lars Borin2, Philipp Daumke1, Natalia Grabar3, Catalina Hallett5, David Hardcastle5, Dimitrios Kokkinakis2, … Cited by 6 Related articles All 14 versions
Literature Review on Patient-Friendly Documentation Systems A Hans, L Borin, P Daumke, N Grabar, C Hallett… – 2006 – Citeseer Page 1. ISSN 1744-1986 Technical Report N o 2006/04 Literature Review on Patient-Friendly Documentation Systems Hans Ahlfeldt Lars Borin Philipp Daumke Natalia Grabar Catalina Hallett David Hardcastle Dimitrios Kokkinakis … Related articles All 3 versions
Active learning-an explicit treatment of unreliable parameters M Becker – 2008 – era.lib.ed.ac.uk Page 1. Active Learning An Explicit Treatment of Unreliable Parameters Markus Becker Doctor of Philosophy Institute for Communicating and Collaborative Systems School of Informatics University of Edinburgh 2008 Page 2. Page 3. Abstract … Cited by 11 Related articles All 8 versions