Apache OpenNLP & Coreference Resolution 2016


Resources:

  • deepnl .. a deep learning nlp pipeline
  • ixa-pipes .. ready to use NLP tools (coreference resolution tool available soon)
  • lappsgrid .. provides facilities to select from hundreds of NLP tools to create workflows
  • meaningcloud .. affordable way to extract the meaning of unstructured content
  • opener .. set of ready to use tools to perform some natural language processing tasks
  • opennlp .. machine learning based toolkit for the processing of natural language
  • pkde4j .. entity and relation extraction for public knowledge discovery
  • textserver .. platform of language analysis services that can be used to process texts

See also:

100 Best Apache OpenNLP Videos | Apache OpenNLP & Dialog Systems | Apache OpenNLP 2016 | OpenCCG (OpenNLP CCG Library) | Stanford CoreNLP & Coreference Resolution 2016


CORP: Coreference resolution for Portuguese
E Fonseca, R Vieira, A Vanin – Proceedings of the International …, 2016 – ontolp.inf.pucrs.br
CORP: Coreference Resolution for Portuguese Evandro Fonseca, Renata Vieira and Aline Vanin … OpenNLP provides POS tagging and named entities recognition, while Cogroo provides noun phrase chunks and shallow structure. …

Summ-it++: an enriched version of the Summ-it corpus
A Antonitsch, A Figueira, D Amaral, E Fonseca… – of the Language …, 2016 – inf.pucrs.br
… Relation extraction can be useful in many NLP tasks, in particular, for Coreference Resolution, the focus of which is to determine antecedent chains. … As a pre-processing phase, the POS tagging was provided through the use of the OpenNLP parser. …

TOP10 TOOLS FOR NATURAL LANGUAGE PROCESSING (NLP)-RESEARCH AND DEVELOPMENT
A Nayyar, V Puri – The CSI Vision: "IT for Masses" – csi-india.org
… Apache OpenNLP supports various NLP tasks like Tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing and coreference resolution. The Latest version available for download from www.opennlp.apache.org is 1.6. …

How to train dependency parsers with inexact search for joint sentence boundary detection and parsing of entire documents
A Björkelund, A Faleńska, W Seeker… – Proceedings of the 54th …, 2016 – aclweb.org
… They apply early update in a coreference resolution system and observe that the task is inherently so difficult that the correct item practically … detection on the token level using F-measure (F1).6 Typical sentence boundary detectors such as CORENLP or OPENNLP focus on …

What Happens Next? Event Prediction Using a Compositional Neural Network Model.
M Granroth-Wilding, S Clark – AAAI, 2016 – aaai.org
… The observed event is marked in bold. tagging and dependency parsing and OpenNLP1 for phrase-structure parsing and coreference resolution. … 1 https://opennlp.apache.org/ 2 http://mark.granroth-wilding.co.uk/\\papers/what\ happens\ next/ …

Open NLP based Refinement of Software Requirements
M Mohanan, P Samuel – … Journal of Computer Information Systems and …, 2016 – mirlabs.net
… part-of-speech (POS) tagger, the named entity recognizer (NER), the parser, the coreference resolution system, sentiment … Open NLP based Refinement of Software Requirements … The OpenNLP is a research area that aims to obtain how computer understands and process the …

LiMoSINe pipeline: Multilingual UIMA-based NLP platform
O Uryupina, B Plank, G Barlacchi, FV Albacete… – ACL 2016, 2016 – aclweb.org
… In addition, many research groups publicly release their pre- 1http://opennlp. … entity tagging syntactic/semantic parsing pos-tagged sentences entity mention detection named entities relation extraction parsed sentences opinion mining coreference resolution mentions entity …

Preprocessing Technology
O Uryupina, R Zanoli – Anaphora Resolution, 2016 – Springer
… It is therefore essential for any coreference resolution system to rely on a rich linguistic representation of a document to be analyzed. … State-of-the-art coreference resolution systems incorporate, therefore, a number of external Natural Language Processing (NLP) modules. …

New York University 2016 system for KBP event nugget: A deep learning approach
TH Nguyen, A Meyers, R Grishman – Proceedings of Ninth Text Analysis …, 2016 – tac.nist.gov
… EVENT COREFERENCE RESOLUTION (corefEN) … In order to prepare the input documents for neural networks, our preprocessing steps include sentence detection and tokenization using the OpenNLP toolkit1, and dependency parsing for the detected sentences using the …

Indonesian essay grading module using Natural Language Processing
T Ajitiono, Y Widyani – Data and Software Engineering …, 2016 – ieeexplore.ieee.org
… 2. Apache OpenNLP [8] has features regarding text processing such as sentence detector, tokenizer, name finder, document classifier, part-of-speech tagger, chunker, parser, and coreference resolution. However, it is not maintained as last update was on 2014. …

Cross Lingual Mention and Entity Embeddings for Cross-Lingual Entity Disambiguation
H Shahbazi, C Ma, X Fern, P Tadepalli – tac.nist.gov
… languages. Our annotator uses pre-trained models from Stanford CoreNLP [Manning et al., 2014] and OpenNLP imported in the Reconcile system [Stoyanov et al., 2010]. We … 5.2. 5.1 Within Document Coreference Resolution We …

QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages.
A Otegi, N Aranberri, A Branco, J Hajic, M Popel… – LREC, 2016 – di.fc.ul.pt
… model has been trained with the averaged Perceptron algorithm as described in Collins (2002) and as implemented in Apache OpenNLP. … 3.4. Coreference Basque ixa-pipe-coref-eu is an adaptation of the Stanford Deterministic Coreference Resolution (Lee et al., 2013), which …

Phrase Detectives Corpus 1.0 Crowdsourced Anaphoric Coreference.
J Chamberlain, M Poesio, U Kruschwitz – LREC, 2016 – lrec-conf.org
… the case of frequent errors: 1. A pre-processing step normalised the input, applied a sentence splitter and ran a tokeniser over each sentence (developed from the openNLP toolkit11); 2. A custom-developed processing step …

Visual Analytics for Narrative Text
M John, S Lohmann, S Koch, M Wörner, T Ertl – visualdataweb.org
… 3 http://nlp.stanford.edu/software/corenlp.shtml 4 http://opennlp.apache.org/ 5 https://gate.ac.uk/ie/annie.html … Coreference resolution is the task of resolving noun phrases to the entities that they refer to and there already exist robust methods (Raghunathan et al., 2010 …

Detecting Non-reference and Non-anaphoricity
O Uryupina, M Kabadjov, M Poesio – Anaphora Resolution, 2016 – Springer
… Keywords. Detecting non-reference and non-anaphoricity coreference resolution Non-reference Non-anaphoricity. 1 Introduction. … One of the most commonly used coreference resolution evaluation metric, the muc scorer [69], is particularly sensitive to the former type of errors. …

A quantitative and qualitative evaluation of sentence boundary detection for the clinical domain
D Griffis, C Shivade, E Fosler-Lussier… – AMIA Summits on …, 2016 – ncbi.nlm.nih.gov
… 11 is an NLP system designed for information extraction from clinical text, building on the general-domain OpenNLP toolkit. cTAKES performs a rich variety of NLP tasks, including parsing, extraction and annotation of UMLS concepts, and coreference resolution, among others. …

Review on Opinion Mining for Fully Fledged System
A Dhokrat, S Khillare… – Indonesian Journal of …, 2016 – section.iaesonline.com
… Semantic reasoning, Provides lexical resources such as WordNet Apache Open NLP [10] Tokenization … Part-of-speech tagging, Named entity extraction, Chunking, Parsing, Coreference resolution LingPipe [11 … ac.nz/ml/weka/ [9] http://www.nltk.org/ [10] https://opennlp.apache.org …

Natural Language Processing using Hadoop and KOSHIK
E Erturk, H Shi – arXiv preprint arXiv:1608.04434, 2016 – arxiv.org
… many functions for NLP, the model to process the document should be considered that Verspoor (2012) argued that OpenNLP has low … to analyse text that it provides most of the common core NLP steps, from tokenization through to coreference resolution (Manning, 2014). …

Improving Question-Answering for Portuguese Using Triples Extracted from Corpora
R Rodrigues, P Gomes – … Conference on Computational Processing of the …, 2016 – Springer
… Except for lemmatization and dependency parsing, these tasks are done using the Apache OpenNLP toolkit 1 , with some minor tweaks for better … Another aspect that should be considered is the use of coreference resolution in order to increase the number of extracted triples by …

Support for traceability management of software artefacts using Natural Language Processing
A Arunthavanathan, S Shanmugathasan… – Moratuwa …, 2016 – ieeexplore.ieee.org
… It uses multiple tools and techniques such as WordNet, OpenNLP parser, concept extraction engine, class extraction engine and a … Part-of-speech (POS) tagger, named entity recognizer (NER), parser, coreference resolution (anaphora analysis), parser and bootstrapped pattern …

Collective disambiguation and semantic annotation for entity linking and typing
M Chabchoub, M Gagnon, A Zouaq – Semantic Web Evaluation Challenge, 2016 – Springer
… OpenNLP 2 uses machine learning and maximum entropy models. LingPipe 3 uses n-gram character language models. … Every mention that contains a verb is removed from the list. Finally, we use Stanford Coreference resolution to find, for each pronoun, the coreferent mention. …

Grammatical case based IS-A relation extraction with boosting for Polish
P Łoziński, D Czerski… – Computer Science and …, 2016 – ieeexplore.ieee.org
… Using this knowledge in OpenNLP tagger reduces search space for this word 500 times. … The problem of detecting implicit references to earlier parts of text is known in natural language processing as coreference resolution and …

Opinion Mining on E-Commerce: Need of the Hour
JS Aravindan – ijtrd.com
… Some of the tasks of NLP in sentiment analysis are Stemming, POS- Part Of Speech, Coreference Resolution, Negation Handling, Word … Some of the NLP tools include LingPipe, OpenNLP, Stanford Parser, POS Tagger, OpenFST, NTLK, Opinion Finder, Tawlk/osae, GATE, textir …

Implementing Spoken Language Understanding
M McTear, Z Callejas, D Griol – The Conversational Interface, 2016 – Springer
… OpenNLP tools 23 support various natural language processing tasks including tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. There is a tutorial on how to use Apache OpenNLP through a …

State of the art in knowledge extraction from online polls: a survey of current technologies
M Stabauer, G Grossmann, M Stumptner – Proceedings of the …, 2016 – dl.acm.org
… brings together some of the aforementioned fields as well as numerous other aspects like coreference resolution or relationship … The available engines include a large number of OpenNLP-based services, OpenCalais integration, DBpedia Spotlight annotation and many …

Automatic analysis of textual hotel reviews
A García-Pablos, M Cuadros, MT Linaza – Information Technology & …, 2016 – Springer
… Some of the provided tools are based on already available third-party tools, like Apache OpenNLP library 2 or DBpedia Spotlight 3 that have been adapted and conveniently wrapped to achieve the scalability and modularity desired for the modules of the platform. …

Visual Analysis of Character and Plot Information Extracted from Narrative Text
M John, S Lohmann, S Koch, M Wörner… – … Joint Conference on …, 2016 – Springer
… The ViTA implementation offers three different analysis tools that users can choose from: Stanford CoreNLP 3 , OpenNLP 4 and ANNIE 5 … Also, methods for coreference resolution might be integrated to detect alternative occurrences of the entities in the text and compute more …

Text Segmentation using Named Entity Recognition and Co-reference Resolution in English and Greek Texts
P Fragkou – arXiv preprint arXiv:1610.09226, 2016 – arxiv.org
… Apache OpenNLP [NP ==_NN] =_SYM … [NP _NNP Vincent_NNP …

Knowledge graph construction for research literatures
A Oldoni – 2016 – raw.githubusercontent.com
… and that are relevant for Information Extraction are Coreference resolution and pronominal anaphora resolution. … The tool used to extract Figure 2.6 is the Stanford Coreference Resolution annotator [12], which is part of the latter group of tools that use cluster level features. …

Multilingual Automated Text Anonymization
FMC Dias – 2016 – inesc-id.pt
… Keywords: Text Anonymization, Privacy, Named Entity Recognition, Coreference Resolution, Sanitization. Palavras-Chave: Anonimização de Texto, Privacidade, Reconhecimento de Entidades Mencionadas … 3.1.3 Metrics for Coreference Resolution …

DISCOURSE ANALYSIS OF LYRIC AND LYRIC-BASED CLASSIFICATION OF MUSIC
F JIAKUN – 2016 – scholarbank.nus.edu.sg
DISCOURSE ANALYSIS OF LYRIC AND LYRIC-BASED CLASSIFICATION OF MUSIC JIAKUN FANG B.Eng., Northeastern University of China A THESIS SUBMITTED FOR THE DEGREE OF MASTER OF SCIENCE DEPARTMENT OF COMPUTER SCIENCE …

Data Quality Centric Application Framework for Big Data
VN Gudivada, D Rao, WI Grosky – ALLDATA 2016, 2016 – researchgate.net
… boundary and sentence detection in spoken text, parts-of-speech tagging, parsing, named entity recognition, and coreference resolution are fundamental … Open source libraries to consider for this task include Apache OpenNLP, Stanford NLP, NLTK, Apache Lucene, Apache Solr …

Extracting Information from Social Media with GATE
K BONTCHEVA, L DERCZYNSKI – Working with Text: Tools …, 2016 – books.google.com
… 5 http://alias-i.com/lingpipe/ 6 http://opennlp. … The main tasks carried out during information extraction are • named entity recognition, which consists on the identification and classification of different types of names in text; • coreference resolution, which is the task of deciding if …

Harnessing open information extraction for entity classification in a french corpus
F Gotti, P Langlais – Canadian Conference on Artificial Intelligence, 2016 – Springer
… While anaphoric facts may be partially sanitized by coreference resolution, measuring the informativeness of a fact is still an unresolved issue. … The preprocessing steps relying on Apache OpenNLP were adapted to use French statistical models for sentence segmentation, word …

The role of coreference resolution in extractive summarization
S Sonawane, P Kulkarni – Computing, Analytics and Security …, 2016 – ieeexplore.ieee.org
… IV. PROPOSED SYSTEM The proposed system architecture is shown in figure 1. Text document preprocessing is done using OPEN NLP model. Following preprocessing steps are applied. … The coreference resolution system consist of following three steps. …

Event Detection, version 3 Deliverable D4.2.3
ALM Vossen – kyoto.let.vu.nl
… We have improved the modules that perform tokenization, POS-tagging, parsing, time recognition and normalization, named entity recognition, word sense disambiguation, named entity disambiguation, coreference resolution, semantic role labeling, temporal and causal …

Benchmarking mi-pos: Malay part-of-speech tagger
BCM Xian, M Lubani, LK Ping… – International …, 2016 – umexpert.um.edu.my
… OpenNLP [14] is an open source NLP code library with pre-trained models to perform different NLP tasks such as tokenization, POS tagging, Named-Entity recognition (NER), chunking, parsing and coreference resolution. Although …

Automatic annotation of structured facts in images
M Elhoseiny, S Cohen, W Chang, B Price… – arXiv preprint arXiv …, 2016 – arxiv.org
… facts that we denote as <S, A>; see Fig 4. Similar to some existing systems OpenNLP (Baldridge, 2014 … eg, sentence segmentation, tokenization, part-of-speech tagging, named entity extraction, chunking, dependency and constituency-based parsing, and coreference resolution. …

Clinical Practice Ontology Automatic Learning from SOAP Reports
D Mendes, IP Rodrigues, CF Baeta – Handbook of Research on …, 2016 – books.google.com
… openNLP Natural Language Processing Library. Retrieved from http://opennlp.apache. … Improving machine learning approaches to coreference resolution. Proceedings of the 40th Annual Meeting on Association for Computational Linguistics-ACL ’02. …

Terminological inconsistency analysis of natural language requirements
J Misra – Information and Software Technology, 2016 – Elsevier
… Examples of the POS tags [19] are NN (noun, singular), NNP (proper noun, singular), VB (verb, base form), RB (Adverb), JJ (Adjective). For example, POS tagging for the Req1 using OpenNLP POS Tagging library [20] results into: …

Seed, an End-User Text Composition Tool for the Semantic Web
B Eldesouky, M Bakry, H Maus, A Dengel – International Semantic Web …, 2016 – Springer
… NER), coreference resolution, … etc. The implementation is carried out in a modular way that eases integrating or swapping various NLP toolkits as implied in Fig. 1. The current implementation of Seed specifically builds upon Stanford CoreNLP [27] and Apache OpenNLP [1] to …

Robust multilingual Named Entity Recognition with shallow semi-supervised features
R Agerri, G Rigau – Artificial Intelligence, 2016 – Elsevier
… Nowadays NERC systems are widely being used in research for tasks such as Coreference Resolution [51], Named Entity Disambiguation [19 … Our system learns Perceptron models [17] using the Machine Learning machinery provided by the Apache OpenNLP project 2 with our …

Entity Analysis with Weak Supervision: Typing, Linking, and Attribute Extraction
X Ling – 2016 – digital.lib.washington.edu
… The semantic types of the mentions are then predicted [45], commonly used as salient features for down-stream NLP applications, such as coreference resolution [54, … Other main tasks of IE include coreference resolution — detecting expressions referring to the same …

Context-based co-reference resolution for text document using graph model (cont-graph)
SS Sonawane, PA Kulkarni – International Journal of …, 2016 – inderscienceonline.com
… Stop word removal: removal of stop word from text document. • Stemming: root form of word used to represent inflected words in meaningful way. These pre-processing techniques are implemented using OPEN-NLP model. 3.2 Quote sentence attribution …

Textual Inference for Machine Comprehension
M Gleize – 2016 – theses.fr
… The reader would have to realize that the pronouns ‘his’ and ‘him’ refer to Peter to fully understand the meaning. What is usually called coreference resolution in the field of NLP is not the only phenomenon encompassed by bridging inferences. …

Agile in-litero experiments
RL Richardet – 2016 – infoscience.epfl.ch
… At the semantic level, NLP models deal with labeling entities like person or semantic proteins, clustering tokens that refer to the same entity (coreference resolution), relation and knowledge extraction (eg is-a relationships or protein-protein interaction). …

Natural language processing for the semantic web
D Maynard, K Bontcheva… – Synthesis Lectures on the …, 2016 – morganclaypool.com
MAYNARD • ET AL NATURAL LANGUAGE PROCESSING FOR THE SEMANTIC WEB MORGAN & CLAYPOOL Natural Language Processing for the Semantic Web Diana Maynard Kalina Bontcheva Isabelle Augenstein …

A study of relation extraction for biomedical text
Y Peng – 2016 – search.proquest.com
A study of relation extraction for biomedical text. Abstract. A crucial area of Biomedical Natural Language Processing is relation extraction, the study of identifying relations between entities. One main challenge of relation extraction is text variations. …

Cross-Platform Text Mining and Natural Language Processing Interoperability
RE de Castilho, S Ananiadou, T Margoni, W Peters… – 2016 – pdfs.semanticscholar.org
LREC 2016 Workshop Cross-Platform Text Mining and Natural Language Processing Interoperability PROCEEDINGS Edited by Richard Eckart de Castilho, Sophia Ananiadou, Thomas Margoni, Wim Peters, Stelios Piperidis 23 May 2016 …

Cross-Platform Text Mining and Natural Language Processing Interoperability-Proceedings of the LREC2016 conference
RE de Castilho, S Ananiadou, T Margoni, W Peters… – 2016 – eprints.gla.ac.uk
LREC 2016 Workshop Cross-Platform Text Mining and Natural Language Processing Interoperability PROCEEDINGS Edited by Richard Eckart de Castilho, Sophia Ananiadou, Thomas Margoni, Wim Peters, Stelios Piperidis 23 May 2016 …

Automatic Detection and Extraction of Event Locations in News Report to locate in Map.
D Shivakoti – 2016 – brage.bibsys.no
… Apache OpenNLP [29] uses a different underlying approach than Stanford’s library, the OpenNLP project is an Apache-licensed suite of tools to do tasks like tokenization, part of speech tagging, parsing, and named entity recognition. …

A computational framework for converting textual clinical diagnostic criteria into the quality data model
N Hong, D Li, Y Yu, Q Xiu, H Liu, G Jiang – Journal of biomedical informatics, 2016 – Elsevier
… as clinical guidelines, clinical notes, and electronic health records (EHRs) [5] and [6]. Typical clinical NLP tools that could support term recognition and text annotation from clinical text include Health Information Text Extraction tool (HITex) [7], MetaMap [8], OpenNLP [9], and …

Evaluation of NER systems for the recognition of place mentions in French thematic corpora
C Brando, C Dominguès, M Capeyron – Proceedings of the 10th …, 2016 – dl.acm.org
… The last column (# mentions) summarizes the number of place mentions in the corpus. It is noteworthy to indicate that we chose to not display the results of the test in the Oscar corpus using the OpenNLP tool combined with the NERC-fr model because the values are very low. …

Extracting temporal and causal relations between events
P Mirza – arXiv preprint arXiv:1604.08120, 2016 – arxiv.org
PhD Dissertation International Doctorate School in Information and Communication Technologies DISI – University of Trento Extracting Temporal and Causal Relations between Events Paramita Mirza Advisor: Dr. Sara Tonelli Università degli Studi di Trento April 2016 …

Design of multimodal mobile interfaces
K Brown, D Dahl, A Degani, A Rudnicky, B van Over… – 2016 – books.google.com
Nava Shaked, Ute Winter Design of Multimodal Mobile Interfaces Also of Interest Series: Speech Technology and Text Mining in Medicine and Healthcare Amy Neustein (Ed.) ISSN: 2329-5198 Published Titles …

Ideede sidumine toetamaks uuendajaid ja ettevõtjaid
K Hambardzumyan – 2016 – dspace.ut.ee
… JATE … Apache Open NLP … SNLP …

Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge Series
G Rizzo, B Pereira, A Varga, M van Erp… – semantic-web-journal.net
Semantic Web 0 (0) 1–35 1 IOS Press Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge Series Emerging Trends in Mining Semantics from Tweets Editor(s): Andreas Hotho, Julius-Maximilians …

Advanced natural language processing and temporal mining for clinical discovery
S Mehrabi – 2016 – search.proquest.com
… expressions. The next AE is the cTAKES sentence detector (Savova, et al., 2010). This is a UIMA wrapper around the openNLP sentence detector (OpenNLP), which was originally used during the first pass. Because sentences …

Parsing and Evaluation. Improving Dependency Grammars Accuracy. Anàlisi Sintàctica Automàtica i Avaluació. Millora de qualitat per a Gramàtiques de …
M Lloberes Salvatella – 2016 – diposit.ub.edu
Parsing and Evaluation. Improving Dependency Grammars Accuracy Anàlisi Sintàctica Automàtica i Avaluació. Millora de qualitat per a Gramàtiques de Dependències Marina Lloberes Salvatella Aquesta tesi doctoral …

Text Stemming: Approaches, Applications, and Challenges
J Singh, V Gupta – ACM Computing Surveys (CSUR), 2016 – dl.acm.org
Text Stemming: Approaches, Applications, and Challenges JASMEET SINGH and VISHAL GUPTA, Panjab University, Chandigarh Stemming is a process in which the variant word forms are mapped to their base form. …