Resources and linguistic processors (2013) .. by Rodrigo Agerri etc
Contents
1 Introduction 13
2 Processing Events in Text 15
2.1 Event detection from multilingual textual sources … 15
2.2 Progress on semantic processing … 16
2.3 Progress on event-detection … 19
3 Text Classification 20
3.1 Tools … 21
3.1.1 JEX … 21
3.1.2 Mahout … 22
3.1.3 OpenNLP Document Categorizer … 22
3.1.4 Classifier4j … 22
3.1.5 jTCat … 22
3.1.6 RTextTools … 23
3.1.7 TCatNG … 23
3.1.8 libTextCat … 23
3.1.9 TexLexAn … 23
3.1.10 Mallet … 24
4 Named Entity Recognition and Classification 24
4.1 Data Sources … 26
4.1.1 CoNLL 2002 datasets … 28
4.1.2 CoNLL 2003 datasets … 29
4.1.3 JRC Names … 29
4.1.4 Ancora … 30
4.1.5 Italian Content Annotation Bank (I-CAB) … 30
4.2 Tools … 30
4.2.1 OpenCalais … 31
4.2.2 Stanford CoreNLP … 32
4.2.3 Illinois Named Entity Tagger … 32
4.2.4 Freeling … 32
4.2.5 OpenNLP … 32
4.2.6 TextPro … 32
5 Coreference Resolution 34
5.1 Data Sources … 35
5.1.1 MUC … 35
5.1.2 ACE … 35
5.1.3 OntoNotes … 36
5.1.4 AnCora-Co … 36
5.2 Tools … 38
5.2.1 GUITAR … 38
5.2.2 BART … 38
5.2.3 Illinois Coreference Package … 39
5.2.4 ARKref … 39
5.2.5 Reconcile … 40
5.2.6 MARS … 40
5.2.7 CherryPicker … 40
5.2.8 Stanford CoreNLP … 41
5.2.9 RelaxCor … 43
5.2.10 JavaRAP … 43
6 Named Entity Disambiguation 43
6.1 Data Sources … 45
6.1.1 KBP at TAC … 47
6.1.2 Cucerzan 2007 … 48
6.1.3 Fader 2009 … 48
6.1.4 Dredze 2010 … 48
6.1.5 ACEtoWIKI … 48
6.1.6 AIDA CoNLL Yago … 49
6.1.7 Illinois Wikifier Datasets … 49
6.1.8 Wikipedia Miner … 49
6.1.9 DBpedia … 50
6.1.10 Freebase … 50
6.1.11 YAGO2 … 50
6.1.12 GeoNames … 50
6.1.13 LinkedGeoData … 51
6.2 Tools … 51
6.2.1 OKKAM … 51
6.2.2 The Wiki Machine … 53
6.2.3 Zemanta … 53
6.2.4 Illinois Wikifier … 53
6.2.5 DBpedia Spotlight … 53
6.2.6 WikiMiner … 54
6.2.7 TAGME … 54
7 Word Sense Disambiguation 54
7.1 Data Sources … 58
7.1.1 SemCor … 58
7.1.2 OntoNotes … 59
7.1.3 Ancora … 59
7.1.4 Senseval/SemEval corpora … 60
7.2 Tools … 61
7.2.1 SenseLearner … 61
7.2.2 IMS … 61
7.2.3 SuperSenseTagger … 61
7.2.4 GWSD … 61
7.2.5 UKB … 61
8 Sentiment Analysis 62
8.1 Data Sources … 62
8.2 Tools … 70
9 Semantic Role Labeling 72
9.1 Data Sources … 73
9.1.1 PropBank and Nombank … 73
9.1.2 VerbNet … 74
9.1.3 FrameNet … 74
9.2 Tools … 75
9.2.1 Mate-Tools … 75
9.2.2 SwiRL … 75
9.2.3 SENNA … 75
9.2.4 SEMAFOR … 76
9.2.5 Shalmaneser … 76
9.3 Implicit Semantic Role Labeling … 77
10 Recognising and Interpreting Time 79
10.1 Resources … 79
10.2 Tools … 83
11 Factuality Module for Events 85
11.1 Resources … 85
11.2 Tools … 85
12 Event Detection and Classification 85
12.1 Event types … 85
12.2 Tools … 88
13 Event Coreference 90
13.1 Data Sources … 92
13.2 Tools … 92
14 Event Relations 92
14.1 Data Sources … 92
14.1.1 Temporal relations … 92
14.1.2 Causal relations … 93
14.2 Tools … 94
15 Structured Data RDF 95
15.1 Tools … 95
15.1.1 Databases-to-RDF … 95
15.1.2 XML-to-RDF … 96
15.1.3 Spreadsheet-to-RDF … 96
16 Conclusions