Notes:
Calais is a service by Thomson Reuters that automatically extracts semantic information from web pages and other online content. Calais uses natural language processing (NLP) and other AI technologies to analyze the text and structure of web pages, and to identify and extract semantic information, such as named entities (people, organizations, locations, etc.), events, facts, or other information that can be used on the semantic web.
The semantic web is a vision of the World Wide Web in which the meaning of web content is explicit and can be understood and processed by machines, enabling them to perform tasks such as searching, reasoning, or decision making. Calais is designed to facilitate the use of semantic web technologies by automatically extracting semantic information from web pages, and providing it in a format that can be easily integrated into semantic web applications.
Calais is a powerful and versatile tool for extracting semantic information from web pages, and it is used by a variety of organizations and industries, including media, publishing, finance, and intelligence. By automatically extracting semantic information from web pages, Calais enables users to quickly and easily access and use the meaning and insights that are hidden in online content, making it a valuable tool for a wide range of applications.
Named Entity Recognition and Disambiguation (NERD) is a natural language processing task that involves identifying and classifying named entities in text, such as people, organizations, locations, and dates. NERD systems typically use machine learning algorithms to analyze text and extract these named entities, which are then labeled with their corresponding categories (e.g., “person,” “organization,” etc.).
The Calais service by Thomson Reuters is a natural language processing platform that provides a range of tools for text analysis, including named entity recognition and disambiguation. Calais uses machine learning algorithms to analyze text and extract named entities, which are then disambiguated and linked to a structured database of entities, such as Wikipedia and other publicly available data sources. This allows Calais to provide detailed, structured information about the named entities it extracts, such as their definitions, relationships, and other relevant information. Calais can be used for a variety of applications, including information extraction, content categorization, and text analysis.
Resources:
- a.nnotate.com .. online document review and collaboration, upload, annotate, share
- aemoo .. users can query aemoo about the linking network of any entity
- alias-i.com .. lingpipe is tool kit for processing text using computational linguistics
- any23.org .. anything to triples
- arcomem.eu .. european archives, museums and libraries in the age of the social web
- arrowsmith.psych.uic.edu .. identify meaningful links between two sets of medline articles
- athento.com .. smarter document management
- babylon-enterprise.com .. information access solutions
- basex.org .. the xml database
- code.google.com/p/rdfquery .. easy-to-use javascript library for rdf-related processing
- cs.waikato.ac.nz .. computer science department, university of waikato
- d2rq.org .. a system for accessing relational databases as virtual, read-only rdf graphs
- datahub.io .. share and find data, quickly and easily
- dbpedia-spotlight.org .. tool for automatically annotating mentions of dbpedia resources
- developer.nytimes.com .. times developer network, apis
- developer.yahoo.com .. measure, monetize, advertise and improve your apps with yahoo tools
- dose.sourceforge.net .. a distributed open semantic elaboration platform
- ellogon.org .. a multi-lingual, cross-platform, general-purpose language engineering environment
- expertsystem.com .. a semantic intelligence company that creates artificial intelligence
- finchcomputing.com .. solving complex, unstructured text challenges in real-time and at scale
- foaf-project.org .. a computer language defining a dictionary of people-related terms
- fox .. federated knowledge extraction framework
- gate.ac.uk .. general architecture for text engineering
- geonames.org .. geographical database covers all countries
- geotxt.org .. detects location name mentions in text
- iis.sinica.edu.tw .. academia sinica institute of information science, taiwan
- insemtives.eu .. incentives for semantics
- jquery.com .. fast, small, and feature-rich javascript library
- ksl.stanford.edu .. stanford knowledge systems, ai laboratory
- leafletjs.com .. a javascript library for mobile-friendly maps
- linkeddata.org .. connect distributed data across the web
- lucene.apache.org .. ultra-fast search library and server
- mahout.apache.org .. scalable machine learning and data mining
- medialab.di.unipi.it .. laboratorio multimediale
- medworm.com .. medical search engine and rss news
- metamap.nlm.nih.gov .. a tool for recognizing umls concepts in text
- mg4j.di.unimi.it .. managing gigabytes for java
- mongodb.org .. mongodb atlas database as a service
- nactem.ac.uk .. national centre for text mining, text mining tools and services
- nerd.eurecom.fr .. named entity recognition and disambiguation
- nlp.stanford.edu .. the stanford nlp (natural language processing) group
- nltk.org .. platform for building python programs to work with human language data
- ontos.com .. linking information from different heterogeneous data silos
- ontotext.com .. semantic technologies for smarter information retrieval and content management
- opencalais.com .. opencalais web service api
- opennlp.apache.org .. a machine learning based toolkit for the processing of natural language text
- poolparty.biz .. poolparty semantic suite
- rapidminer.com .. predictive analytics, data mining, self-service, open source
- rdf-translator.appspot.com .. a multi-format conversion tool for structured markup
- rdfs.org .. a home for ontologies / vocabularies on the semantic web
- reverb.cs.washington.edu .. open information extraction software
- sameas.org .. find co-references between different data sets
- schema.org .. promote schemas for structured data on the internet
- sensebot.com .. search engine that finds sense in a heap of web pages
- sig.ma .. semantic information mashup enterprise edition
- simile-widgets.org .. free, open-source data visualization web widgets, and more
- smiy.sourceforge.net .. an overview of a couple of semantic web ontologies and projects
- stanbol.apache.org .. a set of reusable components for semantic content management
- swoogle.umbc.edu .. swoogle semantic web search engine
- tagaroo.opencalais.com .. automatically suggests relevant tags and images
- timeml.org .. markup language for temporal and event expressions
- tools.seobook.com .. free seo tools and search engine optimization software
- topquadrant.com .. solutions enable a semantic ecosystem among people, applications and data
- ukp.tu-darmstadt.de .. ubiquitous knowledge processing (ukp) lab
- urbandictionary.com .. crowdsourced online dictionary of slang words and phrases
- viaf.org .. virtual international authority file
- virtuoso.openlinksw.com .. openlink virtuoso universal server
- wordnet.princeton.edu .. there are currently no plans for future wordnet releases
- xlwrap.sourceforge.net .. spreadsheet-to-rdf wrapper
- yago-naga .. a conveniently searchable, large-scale, highly accurate knowledge base of common facts
- zemanta.com .. an omni-channel programmatic native dsp for media agencies
Wikipedia:
References:
See also:
Text analytics APIs, Part 2: The smaller players
R Dale – Natural Language Engineering, 2018 – cambridge.org
… https://aylien.com Bitext https://www.bitext.com Dandelion https://dandelion.eu Geneea https://www.geneea.com Indico https://indico.io Intellexer https://www.intellexer.com MeaningCloud https://www.meaningcloud.com Open Calais http://www.opencalais.com ParallelDots https …
Content-driven, unsupervised clustering of news articles through multiscale graph partitioning
MT Altuncu, SN Yaliraki, M Barahona – arXiv preprint arXiv:1808.01175, 2018 – arxiv.org
… More information is available on https: //cloud.google.com/natural-language/ 5Thomson Reuters has an API service, accessible from http://www.opencalais.com/ 6Full list of Open Calais supported IPTC topics at http://www.opencalais.com/ wp-content/uploads/folder …
Identifying Fake News and Fake Users on Twitter
CS Atodiresei, A T?n?selea, A Iftene – Procedia Computer Science, 2018 – Elsevier
… Lecture Notes in Computer Science, 2015: 9283, p. 41-52. 12. Open Calais: http://www.opencalais.com/ accessed last time on March, 2018. 13. Sentiment140: http://www.sentiment140.com/ accessed last time on March, 2018. 14 …
Connectionlens: finding connections across heterogeneous data sources
C Chanial, R Dziri, H Galhardas, J Leblay… – Proceedings of the …, 2018 – dl.acm.org
… re- lationship extraction to identify in D occurrences of enti- ties (such as people, places, organizations etc.) and of rela- tionships (such as bornIn, worksFor etc.) Any off-the-shelf extractor (or set of extractors) can be used; Connection- Lens currently uses OpenCalais (http://www …
Newsmap: A semi-supervised approach to geographical news classification
K Watanabe – Digital Journalism, 2018 – Taylor & Francis
… Geoparser.io. 2 2. The Edinburgh Geoparser: https://www.ltg.ed.ac.uk/software/ geoparser/; CLAVIN: https://clavin.bericotechnologies.com; Open Calais: http://www.opencalais.com; Geoparser.io: https://geoparser.io.View all notes …
A Multilingual Information Extraction Pipeline for Investigative Journalism
G Wiedemann, SM Yimam, C Biemann – arXiv preprint arXiv:1809.00221, 2018 – arxiv.org
… DocumentCloud6 is an open-source tool specif- ically designed for journalists to analyze, annotate and publish findings from textual data. In addition to full-text search, it offers named entity recogni- tion (NER) based on OpenCalais7 for person and location names …
Estimating User Interest from Open-Domain Dialogue
M Inaba, K Takahashi – Proceedings of the 19th Annual SIGdial Meeting …, 2018 – aclweb.org
… For example, Abel et al. modeled Twitter users using the appearance frequencies of certain named entities (eg, people, events, or music groups), acquired using OpenCalais 1 (Abel et al., 2011) … Given an utterance set Us = (u1,u2, …, un) ut- 1http://www.opencalais.com …
New/s/leak 2.0 – Multilingual Information Extraction and Visualization for Investigative Journalism
G Wiedemann, SM Yimam, C Biemann – International Conference on …, 2018 – Springer
… 541–547. Lisbon, Portugal (2015)Google Scholar. 15. Thomson Reuters: Open Calais: API user guide (2017). http://www.opencalais.com/opencalais-api. 16. Yimam, SM, et al.: new/s/leak – information extraction and visualization for investigative data journalists …
Inferring event geolocation based on Twitter
Y Ying, C Peng, C Dong, Y Li, Y Feng – Proceedings of the 10th …, 2018 – dl.acm.org
… Yue Ying, Chen Peng, Chao Dong, Yang Li, and Yan Feng news in Twitter, use their tagging engine OpenCalais1 and some rules to identify locations … SpaCy [1] is an Industrial-Strength 1Thomson Reuters’s NLP Tool http://www.opencalais.com/ 2http://geonames.org …
Document clustering as a record linkage problem
N Pittaras, G Giannakopoulos, L Tsekouras… – Proceedings of the …, 2018 – dl.acm.org
… measures. The JedAI Record Linkage toolkit is employed for most of the record linkage pipeline tasks (ie prepro- cessing, scalable feature representation, blocking and clus- tering) and the OpenCalais platform for entity extraction …
Tagging Assistant for Scientific Articles
Z Nasar, SW Jaffry, MK Malik – faculty.pucit.edu.pk
… 102–107. 16. C. Mitre, “Callisto – Home Page,” 2013. [Online]. Available: https://mitre.github. io/callisto/index.html. [Accessed: 07-Jul-2018]. 17. “Open Calais,” Open Calais, 2008. [Online]. Available: http://www.opencalais.com/. [Accessed: 06-Sep-2017]. 18 …
Using Web Crawlers for Feature Extraction of Social Nets for Analysis
F Noor, A Shah, W Gill, SA Khan – Information Technology-New …, 2018 – Springer
… Saif et al. [12] investigated three services Zemanta, OpenCalais, and AlchemyAPI and finally used AlchemyAPI for the Semantic annotation of tweets. Abdel et al. [13] used OpenCalais API to detect named entities in tweets. While Steiner et al …
To cite this version
F Goasdoué, K Karanasos, Y Katsis, J Leblay… – hal.inria.fr
… When a document is opened within the client, the information extraction mod- ule (which relies on OpenCalais [3]) automatically finds the topics, entities and relationships it contains … http://www.w3.org/TR/rdf-mt/, 2004. [3] OpenCalais. http://opencalais.com/. [4] BaseX …
Document Enrichment using DBPedia Ontology for Short Text Classification
J Flisar, V Podgorelec – Proceedings of the 8th International Conference …, 2018 – dl.acm.org
… 3.1 Annotation tools Many tools for detecting meaningful terms in text and linking them to relevant knowledge base concepts, especially Wikipedia, have been developed (DBpedia Spotlight2, TagMe3, Wikify!, Zemanta4, OpenCalais5, AlchemyAPI6) [22] …
Using microtasks to crowdsource DBpedia entity classification: A study in workflow design
Q Bu, E Simperl, S Zerr, Y Li – Semantic Web, 2018 – content.iospress.com
… There is a number of existing tools and APIs such as DBpedia spotlight,7 Dandelion,8 Alchemy API,9 Open Calais,10 GATE,11 and NERD12 that can be used to classify entities based on a predefined ontology, but they very often fail to produce a type for some of the entities …
A Coherent Unsupervised Model for Toponym Resolution
E Kamalloo, D Rafiei – Proceedings of the 2018 World Wide Web …, 2018 – dl.acm.org
… We also examine three commercial prod- ucts including Reuters OpenCalais, Yahoo! YQL Placemaker, and Google Cloud Natural Language API. The … Yahoo! YQL Placemaker, OpenCalais and Google Cloud Natural Language API) …
A Comparative Study Of Word Representation Methods With Conditional Random Fields And Maximum Entropy Markov For Bio …
MT Abd, M Mohd – Malaysian Journal of Computer Science, 2018 – ajba.um.edu.my
… To date, most NER tools such as (Stanford NER, Illinois NET, and OpenCalais NER WS) have been able to capture the characteristics of different entity classes using feature engineering, which finds the set of features that best help distinguish entities of a specific type from …
Leveraging Web Data to Monitor Changes in Corporate-Government Interlocks in India
A Sen, A Agarwal, A Guru, A Choudhuri… – Proceedings of the 1st …, 2018 – dl.acm.org
… Entities are extracted from the article text using the OpenCalais service [21], which provides entities of type Person, Company, Or- ganization, City, Province, and Country from an article. Along with the entities, OpenCalais also …
Natural Language Processing for Information Extraction
S Singh – arXiv preprint arXiv:1807.02383, 2018 – arxiv.org
… toolkit for NLP), Stanford NER, GExp, Mallet (Machine learn- ing for language toolkit), Natural Language Toolkit (Suite of Python libraries for NLP), DBpedia Spotlight (Open source tool for Named Entity Recognition and Named Entity Linking) and Open- Calais (Automated IE …
From “Trust Me” to “Show Me” Journalism: Can DocumentCloud help to restore the deteriorating credibility of news?
N Mor, Z Reich – Journalism Practice, 2018 – Taylor & Francis
… The platform relies on online services (DocViewer and OpenCalais) which turn the files, stored as PDFs, into fully searchable texts and allow users to embed their 1092 NIV MOR AND ZVI REICH Page 3. documents directly into their news items …
GeoHbbTV: A framework for the development and evaluation of geographic interactive TV contents
D Luaces, JRR Viqueira, P Gamallo, D Mera… – Multimedia Tools and …, 2018 – Springer
… Some examples are Yahoo PlaceSpotter (a service of the Yahoo BOSS geo services, acces- sible through table Geo.Placemaker of Yahoo Query Language (YQL)2), the Cartographic Location And Vicinity Indexer (CLAVIN)3 and Open Calais.4 The identification of geographic …
Capisco: low-cost concept-based access to digital libraries
A Hinze, D Bainbridge, SJ Cunningham… – International Journal on …, 2018 – Springer
… 3.3.1 Semantic annotation Automatic annotation tools, such as OpenCalais,3 Zemanta,4 DBpedia Spotlight,5 and Cohse [91], are services for the semantic web community to increase the volume of intercon- nected data. Most …
Ontology based news extraction system using recurrent neural networks
SK George, SK Gopalan – Journal of Innovation in …, 2018 – innovationjournals.com
… Fig. 1. Seq2Seq Model In this work, Open Calais ontology is used which is created by Thomson Reuters and it follows IPTC (International Press Telecommunications Council) news classification standard. The dataset used to train and test the model is BBC news dataset [19] …
An automatic extraction method of static and dynamic spatial contexts from texts
L Moncla, M Gaio, E Egorova… – Atelier Science des …, 2018 – hal.archives-ouvertes.fr
… Page 6. L. Moncla et al. tools such as OpenCalais 3, OpenNLP 4 and Stanford-NER 5 consider only the entity ‘Aragon Region’, and therefore lead to inaccuracies in classification and/or disambiguation. ENER ? ? ? ? ? …
An Analysis of the Semantic Annotation Task on the Linked Data Cloud
G Michel, Z Amal, A Francisco, E Faezeh… – arXiv preprint arXiv …, 2018 – arxiv.org
… Open Calais6 is a service offered by Thomson Reuters … 3http://docs.aylien.com/ 4http://babelfy. org/ 5https://dandelion.eu/ 6http://www.opencalais.com/ 7https://github.com/dbpedia/dbpedia/ wiki 8http://tagme.di.unipi.it/ 9http://www.umbel.org/web-services/tagger-concept-noun …
The Effect of Corpora Size on Performance of Named Entity Recognition
Z Liaghat – Highlighting the Importance of Big Data Management …, 2018 – Springer
… Illinois NER (https://cogcomp.cs.illinois.edu/page/software_view/NETagger), Stanford NER (http://nlp.stanford.edu/ner/), GATE ANNIE (http://gate.ac.uk/ie/annie.html), Minor third (https://sourceforge.net/projects/minorthird), OpenCalsis (http://www.opencalais.com), Lingpipe …
An Extensible Event Extraction System With Cross-Media Event Resolution
F Petroni, N Raman, T Nugent, A Nourbakhsh… – Proceedings of the 24th …, 2018 – dl.acm.org
… We conceptualize an event as a semantic entity with four main dimensions: what, where, who, and when. We use OpenCalais4 to identify the first three dimensions (if present) in each tweet … 4http://www.opencalais.com modeled on tweet content [26] …
A natural language processing and geospatial clustering framework for harvesting local place names from geotagged housing advertisements
Y Hu, H Mao, G McKenzie – International Journal of Geographical …, 2018 – Taylor & Francis
Skip to Main Content …
What influence would a cloud based semantic laboratory notebook have on the digitisation and management of scientific research?
S Kanza – 2018 – eprints.soton.ac.uk
… 108 5.5 Semanti-Cat . . . . . 108 5.5.1 Architecture & Implementation . . . . . 109 5.5.1.1 OpenCalais API . . . . . 109 5.5.1.2 GATE . . . . . 111 Page 8. viii CONTENTS …
An efficient web search engine for noisy free information retrieval.
P Sahoo, R Parthasarthy – Int. Arab J. Inf. Technol., 2018 – iajit.org
… Document Entity And Resolution (DEAR) system [23] combines semantic similarity matching as provided by the open source Word Net database with the ability to recognize named entities through the Open Calais system. When used in concert, it provides a novel way …
An approach to extracting thematic views from highly heterogeneous sources of a data lake
C Diamantini, PL Giudice, L Musarella… – Atti del Ventiseiesimo …, 2018 – logiudice.eu
… metadata. It includes the source and target locations of the corresponding data, 4 RDF Concepts and abstract Syntax: http://www.w3.org/TR/2004/ REC-rdf-concepts-20040210/ 5 http://www.opencalais.com Page 4. the associated file size, the number of their records, and so …
A Novel Ensemble Method for Named Entity Recognition and Disambiguation Based on Neural Network
L Canale, P Lisena, R Troncy – International Semantic Web Conference, 2018 – Springer
… 0.49. 0.6. 0.77. 0.52. meaning cloud. 0.59. 0.91. 0.44. 0.72. 0.78. 0.69. opencalais. 0.56. 0.97. 0.39. 0.69. 0.71. 0.68. textrazor. 0.74. 0.86. 0.65. 0.77. 0.81 … 0.34. 0.5. 0.61. 0.45. meaning cloud. 0.82. 0.88. 0.77. 0.8. 0.87. 0.76. opencalais. 0.58. 0.81. 0.45. 0.81. 0.9. 0.76. textrazor. 0.81 …
ATR4S: toolkit with state-of-the-art automatic terms recognition methods in Scala
N Astrakhantsev – Language Resources and Evaluation, 2018 – Springer
… Some tools are limited by searching for mentions of (named) entities (for example, OpenCalais 6 ) or named entites and Wikipedia concepts [Texterra (Turdakov et al. 2014)]. Another tool 7 supports only supervised recognition of 1-word and 2-words terms …
A preliminary, structured review of how professional experience is detected in natural–language texts
A Williams, A Rainer – 2018 – researchgate.net
Page 1. A preliminary, structured review of how professional experience is detected in natural–language texts Technical Report Ashley Williams Department of Computer Science and Software Engineering University of Canterbury, NZ ashley.williams@pg.canterbury.ac.nz …
What’s missing in geographical parsing?
M Gritta, MT Pilehvar, N Limsopatham… – Language Resources and …, 2018 – Springer
… There are several related NER/NED tools such as OpeNER, 6 Thomson Reuters Open Calais, 7 however, these do not disambiguate down to the coordinates level, hence do not qualify as geoparsers. AIDA 8 (Yosef et al. 2011 …
Using linked data resources to generate web pages based on a BBC case study
L Zemmouchi-Ghomari, R Sefsaf, K Azni – Science and Information …, 2018 – Springer
… BBC developers use a feature retrieval system called Muddy Boots to process BBC articles. Products such as OpenCalais, Twine and Zemanta are based on Named Entity Recognition technique (NER) to extract terms from the text, using their own entity identifiers …
User interest prediction over future unobserved topics on social networks
F Zarrinkalam, M Kahani, E Bagheri – Information Retrieval Journal, 2018 – Springer
… 2014; Lu et al. 2012). For example, Abel et al. (2011b, c) have proposed to enrich Twitter posts by linking them to related news articles and then extracting the concepts mentioned in the enriched posts using Web services provided by OpenCalais …
Constructing Differentiated Educational Materials Using Semantic Annotation for Sustainable Education in IoT Environments.
Y Kim, J Moon, E Hwang – Sustainability (2071-1050), 2018 – search.ebscohost.com
… [17] proposed a method for linking Twitter posts with related news articles to contextualize Twitter activities. They used OpenCalais to extract entities and topics from tweets and news, and the extracted information is used for semantic enrichment by annotating tweets and news …
Programming and Pre-Processing Systems for Big Data Storage and Visualization
HU Rahman, RU Khan, A Ali – Handbook of Research on Big Data …, 2018 – igi-global.com
… visualizations. Like Datameer, it uses Hadoop behind the scenes to handle very large amounts of data along with services like OpenCalais to cope with extracting useful structured information from a soup of unstructured text …
A Practical Approach to Constructing a Knowledge Graph for Cybersecurity
Y Jia, Y Qi, H Shang, R Jiang, A Li – Engineering, 2018 – Elsevier
… This classifier uses the standard unigram bag-of-words vector model. Once a potential vulnerability description is identified, the framework extracts security-related entities and concepts using standard named entity recognition tools such as Open Calais …
Live Twitter Sentiment Analysis
D Sorvisto, P Cloutier, K Magnusson, T Al-Sarraj… – Applications of Data …, 2018 – Springer
… The authors of “Semantic Sentiment Analysis of Twitter” have tested their algorithm against the AlchemyAPI and a few other open-source solutions such as Open Calais 6 (a Reuter’s product), a web service that identifies entities in unstructured data …
NERD for NexGenTV
L Farinetti, R Troncy, L Canale – 2018 – webthesis.biblio.polito.it
… 11 2.1.5 DBpedia Spotlight . . . . . 11 2.1.6 Meaning Cloud . . . . . 13 2.1.7 Opencalais . . . . . 13 2.1.8 TextRazor . . . . . 15 2.2 Ensemble Approaches …
Diversity Checker: Toward recommendations for improving journalism with respect to diversity
J Peperkamp, B Berendt – Adjunct Publication of the 26th Conference on …, 2018 – dl.acm.org
… tensive ontology. Another tool called Enrycher [32] also links en- tities recognized to ontology concepts. Another tool in a similar vein is OpenCalais, see [14]. None of these tools currently works for Dutch, however. We notice …
LETTER OF APPROVAL
A Shakya – 2018 – researchgate.net
… different domains eg the Europe Media Monitor (EMM) NewsExplorer and the Thomson Reuters Open Calais. The EMM NewsExplorer automatically gathers news articles from … Reuters Open Calais uses Natural Language Processing (NLP) and machine learning …
Geotagging Text Data on the Web—A Geometrical Approach
MA Radke, N Gautam, A Tambi, UA Deshpande… – IEEE …, 2018 – ieeexplore.ieee.org
… FIGURE 1. Block diagram of the proposed system. In [10], the authors devised a simple method for finding the focus depending on the frequency of the place names in a document and evaluate their approach using Yahoo! Placespotter, Open Calais etc …
Data for Journalists: A Practical Guide for Computer-Assisted Reporting
B Houston – 2018 – books.google.com
… Boxes Chapter 1 Box 1.1: History of Computer-Assisted Reporting Box 1.2: The Basic Tools and the Advanced Tools Chapter 2 Box 2.1: Purposes of Online Resources Box 2.2: Different File Types Chapter 3 Box 3.1: Open Calais Box 3.2: Kinds of Social Media Box 3.3: Useful …
Big Data analytics in oil and gas industry: An emerging trend
M Mohammadpoor, F Torabi – Petroleum, 2018 – Elsevier
… visualization tools. BigSheets also utilizes Hadoop to process massive datasets. It also employs some additional tools such as OpenCalais to facilitate the extracting of structured data from a pool of unstructured data. This tool …
Integrated environments for designing, delivering and monitoring learning: extending problem-based learning with learning analytics and learning semantics
? ????? – 2018 – dspace.lib.uom.gr
… 106 Figure 2-49 Demo usage of AIDA ….. 107 Figure 2-50 Demo usage of Apache Stanbol ….. 108 Figure 2-51 Demo usage of Open Calais ….. 109 Page 14. xvi …
Corpus-driven Annotation Enrichment
F Kuhr, B Witten, R Möller – ifis.uni-luebeck.de
… Tipalo [8] is an automatic system identifying types from the text of Wikipedia documents for DBpedia entities. OpenCalais [14] is a knowledge extraction tool by Reuters, which automatically tags data in unstructured text using a large ontology …
A multimodal analytics platform for journalists analysing large-scale, heterogeneous multilingual and multimedia content
S Vrochidis, A Moumtzidou, I Gialampoukidis… – Frontiers in Robotics …, 2018 – frontiersin.org
… Regarding tools and APIs whose goal is deeper text understanding, some examples are churnalism, opencalais 7 , and AlchemyAPI 8 , having as main requirement the Natural Language Processing (NLP) paradigm. Opencalais …
Utility of social media and crowd-intelligence data for pharmacovigilance: a scoping review
AC Tricco, W Zarin, E Lillie… – BMC medical …, 2018 – bmcmedinformdecismak …
Skip to content Advertisement. Advertisement …
An information system for assessing the likelihood of child labor in supplier locations leveraging Bayesian networks and text mining
A Thöni, A Taudes, AM Tjoa – Information Systems and e-Business …, 2018 – Springer
… The goal of the next step is to extract these incident observations together with the corresponding attribute values from the text. DBpedia Spotlight and Open Calais by Reuters were used for geography tagging, and Open Calais for company tagging, yielding respective URIs …
Modular, Expandable Typologies
MK Bergman – A Knowledge Representation Practionary, 2018 – Springer
… 5]. Sekine put forward and refined over many years his extended entity types, which grew to about 200 types [6]. These ideas of extended entity types helped inform a variety of tagging services over the past decade, notably including OpenCalais, Zemanta, AlchemyAPI, and …
Exploiting semantic web knowledge graphs in data mining
P Ristoski – 2018 – ub-madoc.bib.uni-mannheim.de
Page 1. Exploiting Semantic Web Knowledge Graphs in Data Mining Inauguraldissertation zur Erlangung des akademischen Grades eines Doktors der Naturwissenschaften der Universität Mannheim presented by Petar Ristoski Mannheim, 2017 Page 2. ii …
Limiting the Spread of Fake News on Social Media Platforms by Evaluating Users’ Trustworthiness
O Balmau, R Guerraoui, AM Kermarrec… – arXiv preprint arXiv …, 2018 – arxiv.org
Page 1. Limiting the Spread of Fake News on Social Media Platforms by Evaluating Users’ Trustworthiness Oana Balmau EPFL oana.balmau@epfl.ch Rachid Guerraoui EPFL rachid.guerraoui@epfl.ch Anne-Marie Kermarrec Mediego, EPFL amk@mediego.com …
Entity linking of tweets based on dominant entity candidates
Y Feng, F Zarrinkalam, E Bagheri, H Fani… – Social Network Analysis …, 2018 – Springer
… Abel et al. (2011) have proposed to enrich Twitter posts by linking them to related news articles and then extracting the semantic entities mentioned in the enriched posts using OpenCalais. The identified semantic entities are then used to build user interest profiles …
A Combined Approach for Ontology Enrichment from Textual and Open Data
C Alec, C Reynaud-Delaître, B Safara – Advances in Knowledge Discovery …, 2018 – Springer
… We chose to use the Gate annotation software (Bontcheva et al. 2004; Cunningham et al. 2011) because it performs different text analysis tasks allowing the user to choose the ontology which will guide the process. Other tools like Open Calais 2 are not able to do that …
Social software infrastructure for e-participation
L Porwol, A Ojo, JG Breslin – Government Information Quarterly, 2018 – Elsevier
Skip to main content …
Extracting data governance information from Slack chat channels
S Quigley – 2018 – scss.tcd.ie
… better performance than other well-known publicly available NER tools such as Illinois Named Entity Tagger and OpenCalais Named Entity Recogniser in discursive texts such as biographies.(Atda? & Labatut 2013) Stanford …
Entity Linking
K Balog – Entity-Oriented Search, 2018 – Springer
Page 1. Chapter 5 Entity Linking Machine-understanding of text is an extremely challenging problem. The impor- tance of named entities in this regard has been acknowledged early on in natural language processing research …
Information extraction meets the Semantic Web: A survey
JL Martinez-Rodriguez, A Hogan… – Semantic …, 2018 – content.iospress.com
We provide a comprehensive survey of the research literature that applies Information Extraction techniques in a Semantic Web setting. Works in the intersection of these two areas can be seen from two overlapping perspectives: using Semantic Web reso.
Technologies of cross-border digital services in the EU, formalized ontologies and blockchain
V Kupriyanovsky, O Grinko, Y Volokitin… – International Journal of …, 2018 – injoit.ru
Page 1. International Journal of Open Information Technologies ISSN: 2307-8162 vol. 6, no.7, 2018
Ontology-based Approach for Semantic Data Extraction from Social Big Data: State-of-the-art and Research Directions
P Wongthontham, B Abu-Salih – arXiv preprint arXiv:1801.01624, 2018 – arxiv.org
… (Rizzo and Troncy 2011) evaluate five popular entity extraction tools on a dataset of news articles ie AlchemyAPI, Zemanta, OpenCalais, DBPedia Spotlight, and Extractiv. (Saif, He, and Alani 2012) chose to evaluate the first three of the five entity extraction tools on tweets …
Ontology-based approach for identifying the credibility domain in social Big Data
P Wongthongtham, BA Salih – Journal of Organizational Computing …, 2018 – Taylor & Francis
… Rizzo and Troncy (2011) evaluate five popular entity extraction tools on a dataset of news articles ie, AlchemyAPI, Zemanta, OpenCalais, DBPedia Spotlight, and Extractiv. Saif, He, and Alani (2012) chose to evaluate the first three of the five entity extraction tools on tweets …
Benchmarking and Optimization of OBDA Systems
D Lanti – 2018 – bia.unibz.it
Page 1. PHD THESIS davide lanti Benchmarking and Optimization of OBDA Systems PhilosophiæDoctor (PhD) Faculty of Computer Science Free University of Bozen-Bolzano Page 2. Davide Lanti: PhD Thesis, Benchmarking …
A Data Analytics Approach to the Cybercrime Underground Economy
J An, HW Kim – IEEE Access, 2018 – ieeexplore.ieee.org
… 2) COMPANY NAME EXTRACTION Named entity recognition is an information extraction tech- nique that classifies named entities based on a predefined dictionary. We used the Open Calais API to recognize com- pany and personal names. For example, Fig …
VoxEL: A benchmark dataset for multilingual Entity Linking
H Rosales-Méndez, A Hogan, B Poblete – International Semantic Web …, 2018 – Springer
The Entity Linking (EL) task identifies entity mentions in a text corpus and associates them with corresponding entities in a given knowledge base. While traditional EL approaches have largely…
Extraction Of Technical Information From Normative Documents Using Automated Methods Based On Ontologies: Application To The Iso 15531 Mandate Standard …
AF Cutting-Decelle, A Digeon, RI Young… – arXiv preprint arXiv …, 2018 – arxiv.org
Page 1. 1 EXTRACTION OF TECHNICAL INFORMATION FROM NORMATIVE DOCUMENTS USING AUTOMATED METHODS BASED ON ONTOLOGIES : Application to the ISO 15531 MANDATE standard – Methodology and first results …
Sensing and detecting traffic events using geosocial media data: A review
S Xu, S Li, R Wen – Computers, Environment and Urban Systems, 2018 – Elsevier
Social media platforms, or social networks, have allowed millions of users to post online content about topics related to our daily lives. Traffic is one of the.
Mining user interests over active topics on social networks
F Zarrinkalam, M Kahani, E Bagheri – Information Processing & …, 2018 – Elsevier
… For example, Abel et al. (2011) have proposed to enrich Twitter posts by linking them to related news articles and then extracting the semantic concepts mentioned in the enriched posts using web services provided by OpenCalais 1 . The identified semantic concepts are then …
Inferring user interests in microblogging social networks: a survey
G Piao, JG Breslin – User Modeling and User-Adapted Interaction, 2018 – Springer
… these topics. In Abel et al. (2011b, c, 2013a), the authors also used topics for representing user interests where those topics were extracted by ready-to-use NLP (Natural Language Processing) APIs such as OpenCalais. 24. Pros …
Transforming Open Data to Linked Open Data Using Ontologies for Information Organization in Big Data Environments of the Brazilian Government: the Brazilian …
M Victorino, MT de Holanda, E Ishikawa… – KO KNOWLEDGE …, 2018 – nomos-elibrary.de
… For example, Zeng et al. (2014) presents a study on computer-assisted semantic analysis, using Open- Calais, to verify its potential for generating access to the issue at the levels of “description” and “identification” for archival record groups and philosophy theses …
Mining location from social media: A systematic review
K Stock – Computers, Environment and Urban Systems, 2018 – Elsevier
During the last ten years, a large body of research extracting and analysing geographic data from social media has developed. We analyse 690 papers across 20 so.
Audiovisual Metadata Platform (AMP) Planning Project: Progress Report and Next Steps
JW Dunn, JL Hardesty, T Clement, C Lacinak… – 2018 – scholarworks.iu.edu
Page 1. This publication was made possible through a generous grant from The Andrew W. Mellon Foundation. Page 2. AMP Planning Project Report – March 2018 Report Publication Date March 27, 2018 Report Authors Jon …
A survey on mining stack overflow: question and answering (Q&A) community
A Ahmad, C Feng, S Ge, A Yousif – Data Technologies and …, 2018 – emeraldinsight.com
Analyzing the boundaries of balance theory in evaluating cause-related marketing compatibility
JT Yun – 2018 – ideals.illinois.edu
Page 1. ANALYZING THE BOUNDARIES OF BALANCE THEORY IN EVALUATING CAUSE- RELATED MARKETING COMPATIBILITY BY JOSEPH T. YUN DISSERTATION Submitted in partial fulfillment of the requirements for …
Linked Data
S Sakr, M Wylot, R Mutharaju, D Le Phuoc, I Fundulaki – Springer
Page 1. Linked Data Sherif Sakr · Marcin Wylot Raghava Mutharaju · Danh Le Phuoc Irini Fundulaki Storing, Querying, and Reasoning Page 2. Linked Data Page 3. Sherif Sakr • Marcin Wylot • Raghava Mutharaju • Danh Le Phuoc • Irini Fundulaki Linked Data …
Information Extraction
CC Aggarwal – Machine Learning for Text, 2018 – Springer
In its most basic form, text is a sequence of tokens, which is not annotated with the properties of these tokens. The goal of information extraction is to discover specific types of useful properties…
Images in Social Media: Categorization and Organization of Images and Their Collections
S Ørnager, H Lund – Synthesis Lectures on Information …, 2018 – morganclaypool.com
Page 1. Images in Social Media Categorization and Organization of Images and Their Collections Susanne Ørnager Haakon Lund IMA GE S IN SO CIAL ME DIA ØR N A GE R • L UND M O R GAN & CL A YPOO L Page 2. Page 3. Images in Social Media …
Values in technology and practice: Using Activity Theory to consider the role of values and technology in everyday activities
RC Gomer – 2018 – eprints.soton.ac.uk
Page 1. UNIVERSITY OF SOUTHAMPTON FACULTY OF PHYSICAL SCIENCES AND ENGINEERING ELECTRONICS AND COMPUTER SCIENCE Volume 1 of 1 Values in Technology and Practice: Using Activity Teory to consider …