Notes:
Sentence splitting, also known as sentence boundary detection or sentence segmentation, is the process of dividing a block of text into individual sentences. This is an important task in natural language processing (NLP) and is often used in dialog systems to help the system understand and respond to user input.
In a dialog system, sentence splitting is typically used to identify the boundaries between individual sentences within a user’s input, so that the system can parse and analyze each sentence separately. This can help the system to understand the structure and meaning of the user’s input, and to generate an appropriate response.
There are a variety of techniques and algorithms that can be used for sentence splitting, including rule-based approaches that use regular expressions or other patterns to identify sentence boundaries, and machine learning-based approaches that use training data to learn to identify sentence boundaries automatically.
Wikipedia:
See also:
FreeLing: An Open-Source Suite of Language Analyzers. X Carreras, I Chao, L Padró, M Padró – LREC, 2004 – hnk.ffzg.hr … is a need for most NL applications such as Machine Trans- lation, Summarization, Dialogue systems, etc … The first version of the suite includes the following functionalities for English, Spanish, and Catalan: ¢ Tokenization & sentence splitting ¢ Morphological analysis ¢ Multiword … Cited by 235 Related articles All 11 versions Cite Save More
Freeling 3.0: Towards wider multilinguality L Padró, E Stanilovsky – 2012 – upcommons.upc.edu … 2473 Page 2. as ca cy en es gl it pt ru Tokenization X X X X X X X X X Sentence splitting X X X X X X X X X Number detection X X X X X X X … Vi-Clone: Impressive virtual assistants for corporate web pages. Some FreeLing components are being in- tegrated in the dialog system. … Cited by 74 Related articles All 6 versions Cite Save
Emotion estimation and reasoning based on affective textual interaction C Ma, H Prendinger, M Ishizuka – Affective computing and intelligent …, 2005 – Springer … In order to improve textual methods such as e-mail, online chat systems and dialog system, some recent systems are based on like-like embodied agents as a new multi-modal … 2.3.1 Sentence Splitting In this step, multiple-sentence text is spited into single sentences. … Cited by 37 Related articles All 9 versions Cite Save
Data-driven strategies for an automated dialogue system H Hardy, T Strzalkowski, M Wu, C Ursu… – Proceedings of the …, 2004 – dl.acm.org … Galaxy is a distributed, message-based, hub-and-spoke infrastructure, optimized for spoken dialogue systems. … ANNIE includes customizable components necessary to complete the IE task – tokenizer, gazetteer, sentence splitter, part of speech tagger and a named entity … Cited by 22 Related articles All 8 versions Cite Save
Natural language question generation using syntax and keywords S Kalady, A Elikkottil, R Das – … of QG2010: The Third Workshop on …, 2010 – oro.open.ac.uk … questions is an important research area potentially useful in intelligent tutoring systems, dialogue systems, educational technologies … Input Document Appositive Processing Complex Sentence Splitting Prepositional Phrase Removal Individual sentences Subject NP QG Subject … Cited by 16 Related articles All 9 versions Cite Save More
Optimizing sentence segmentation for spoken language translation. S Rao, IR Lane, T Schultz – INTERSPEECH, 2007 – cs.cmu.edu … These applications include among others Spoken Language Translation systems (SLT), speech summarization and dialog systems. … Sentence splitting has been used to improve Example-Based Machine Translation (EBMT) performance where longer sentences do not yield … Cited by 12 Related articles All 7 versions Cite Save
Language Resources and Tools for Swedish: A Survey. K Elenius, E Forsbom, B Megyesi – LREC, 2008 – stp.lingfil.uu.se … The Other category includes: dialog systems, multimodal systems, translation, text production, language aids, building lexica, computer assisted … The most common tools reported are part-of-speech taggers, tokenizers, morphological segmenters, and sentence splitters. … Cited by 9 Related articles All 18 versions Cite Save More
Computational approaches for emotion detection in text H Binali, C Wu, V Potdar – Digital Ecosystems and …, 2010 – ieeexplore.ieee.org … Web blog text Tokeniser Sentence splitter POS Tagger Emotion Intelligence PR … 42, pp. 419-427, 2004. [11] A. Haag, S. Goronz, P. Schaich, and J. Williams, “Emotion Recognition Using Bio-sensors: First Steps towards an Automatic System ” in Affective Dialogue Systems. vol. … Cited by 8 Related articles All 5 versions Cite Save
Some experiments on clustering similar sentences of texts in portuguese ERM Seno, MGV Nunes – Computational Processing of the Portuguese …, 2008 – Springer … as paraphrase generation [1], automatic summari- zation [4] [5] [6], ontology building [11], digital library systems [13], dialogue systems [15], etc. … SiSPI is composed by two main processing modules named Sentence Splitting and Sentence Clustering (Figure 1). The former splits … Cited by 11 Related articles All 8 versions Cite Save
The Amitiés system: Data-driven techniques for automated dialogue H Hardy, A Biermann, RB Inouye, A McKenzie… – Speech …, 2006 – Elsevier … Galaxy is a distributed, message-based, hub-and-spoke infrastructure, optimized for spoken dialogue systems. … ANNIE includes customizable components necessary to complete the IE task––tokenizer, gazetteer, sentence splitter, part of speech tagger and a named entity … Cited by 24 Related articles All 5 versions Cite Save
DSim, a Danish Parallel Corpus for Text Simplification. S Klerke, A Søgaard – LREC, 2012 – lrec.elra.info … simplification has been applied to facilitate parsing, translation and generation of text for question generation and answering, summarizing and dialog systems. … short sentences aligning only to a part of a source sentence and enables us to rediscover cases of sentence splitting. … Cited by 6 Related articles All 5 versions Cite Save More
Ranking automatically generated questions as a shared task M Heilman, NA Smith – The 2nd Workshop on Question Generation, 2009 – cs.cmu.edu … On the other hand, a QG task in the context of a dialogue system for flight reservations might focus too much on … Many existing NLP transformations may be exploited in this stage, in- cluding extractive summarization, sentence compression, sentence splitting, sen- tence fusion … Cited by 5 Related articles All 6 versions Cite Save More
Augmenting variation of system utterances using corpora in spoken dialogue systems R Higashinaka, R Prasad… – … Speech Recognition and …, 2005 – ieeexplore.ieee.org … We focus on restaurant recommendation utterances in MATCH, a multi-modal dialogue system providing entertainment information for New York … keyword detection, we employed GATE [17], which is a suite of linguis- tic processors including a tokenizer, a sentence splitter, a part … Cited by 2 Related articles All 9 versions Cite Save
Communication with WWW in Czech L Svoboda, L Popelínský – Kybernetika, 2004 – dml.cz … SRI Highlight Information Extraction system [17] goes further and employs natural language processing techniques like sentence splitting, tagging, name entity … The most famous is the multilingual dialog system [2, 19] that has been developed at the University of West Bohemia … Cited by 4 Related articles All 6 versions Cite Save
Simplifying text in medical literature E Ong, J Damay, G Lojico, K Lu… – Journal of Research in …, 2007 – xsite.dlsu.edu.ph … translation, text generation, knowledge acquisition, and human-computer dialogue systems. Text simplification is an area of NLP that involves applying grammar rules to transform … 4 ANNIE handles sentence splitting, tokenizing, and part-of-speech tagging the input document, … Cited by 7 Related articles All 3 versions Cite Save More
Modeling the dative alternation with automatically extracted features H Zhong, A Stent, M Swift – … approaches for spoken dialogue systems, 2006 – aaai.org … destination is mislabeled or ambiguous. Also, some incomplete sentences are included in our dialog data set because of sentence splitting errors. Modeling The Dative Alternation (Bresnan et al. 2005) demonstrated that pragmatic … Cited by 2 Related articles All 10 versions Cite Save
Mobile agents architecture in data presentation domain A Laukaitis, O Vasilecas – Advances in Information Systems Development, 2006 – Springer … space is defined by the collection of all variables that characterize the state of the dialogue system at a … implements various elements from natural language processing area: named entities recognition, co- reference resolution, tokenisation, sentence splitting, gazetteer lookup … Cited by 1 Related articles All 3 versions Cite Save
Towards Constructing a Chinese Information Extraction System to Support Innovations in Library Services Z Zhixiong, L Sa, W Zhengxin, L Ying – IFLA journal, 2007 – ifl.sagepub.com … It is a sub-component of the SmartWeb multi-modal dialog system. … ANNIE consists of the main processing resources for information extraction such as: tokenizer, sentence splitter, POS tagger, gazetteer, finite state transducer and orthomatcher (see Figure 1). … Cited by 1 Related articles All 13 versions Cite Save
Using CBR for Portuguese Question Generation D Diéguez, R Rodrigues, P Gomes – Proceedings of the 15th …, 2011 – epia2011.appia.pt … of Natural Language Processing (NLP) [8], becoming an important research area with potential for use in advanced learn- ing technologies, such as intelligent tutoring systems [17], dialogue systems [5], and … These tools are a POS tagger, a lemmatizer and a sentence splitter. … Cited by 1 Related articles Cite Save More
Syntactic Filtering and Content-based Retrieval of Twitter Sentences for the Generation of System Utterances in Dialogue Systems R Higashinaka, N Kobayashi, T Hirano, C Miyazaki… – Proc. IWSDS, 2014 – uni-ulm.de … This filter is intended to remove sentence fragments caused mainly by sentence splitting errors. … Such sentences need Proceedings of 5th International Workshop on Spoken Dialog Systems Napa, January 17-20, 2014 115 Page 4. R. Higashinaka et al. … Related articles Cite Save More
Analysing Input in Dialog Systems D Scerri, A Dingli – System – ijclnlp.org … or wide-domain conversational agents, which in contrast to such existing agents (mainly online chatterbots), make use of dialog system aspects to … This could be achieved by the use of a trivial sentence splitter, but a valid way to handle multiple sentences simultaneously needs … Related articles Cite Save More
System for Generating Questions Automatically From Given Punjabi Text S Garg, V Goyal – hnk.ffzg.hr … State-of-the-art spoken dialogue systems can be considered as a good example. … Mannem, Prasad et al. (2010) has developed Question Generation from Paragraphs by using Sentence splitter, Part of Speech (POS) tagger, Semantic Role Labeler, NER and other NLP tools. … Related articles Cite Save More
Demonstration of a prototype for a Conversational Companion for reminiscing about images Y Wilks, A Dingli, R Catizone, W Cheng – Proceedings of the ACL 2010 …, 2010 – dl.acm.org … The basic processes carried out by GATE are: tokenizing, sentence splitting, POS tagging, parsing and Named Entity Recog- nition. … dialogue evaluations often use measures like “stickiness” to determine how much a user will stay with or stick with a dialogue system and not … Related articles All 8 versions Cite Save
An Ontological Representation of EU Consular Law E Schweighofer – Proceedings of LOAIT 2010 IV Workshop on Legal …, 2010 – ceur-ws.org … System) tool is very helpful for a more detailed analysis: segmentation of documents (tokenizer), words, gazetteer, sentence splitter and semantic … Dialogue system: Such a system is intended to converse with a human in a coherent structure (Wikipedia: Dialogue Systems 2010 … Related articles All 8 versions Cite Save More
Optimization Tasks in the Conversion of Natural Language Texts into Function Calls P Barabás, L Kovács – Applied Information Science, Engineering and …, 2014 – Springer … Our goal is to define and implement a natural language framework using a frame- based dialog system that can be applied to … Hungarian toolkit for linguistic processing called “magyarlanc” [9] developed by the University of Szeged, which contains a sentence splitter, a tokenizer … Related articles All 2 versions Cite Save
Corpus and Method for Identifying Citations in Non-Academic Text Y He, A Meyers – lrec-conf.org … punctuation marks, existing sentence splitters developed mainly for news text do no perform reliably. Therefore, we conduct training and testing on paragraphs instead of sentences. … The predicted “citation” in EX4 is in fact a list of commands acceptable by a dialog system. … Related articles Cite Save More
Multimedia indexing through multi-source and multi-language information extraction: the MUMIS project H Saggion, H Cunningham, K Bontcheva… – Data & Knowledge …, 2004 – Elsevier … JAPE uses a compiler that translates grammar rules into Java objects that target the GATE API (and a regular expression library). JAPE grammars were also used to develop the rule-based sentence splitter used in the system. … Cited by 50 Related articles All 4 versions Cite Save
A prototype for a conversational companion for reminiscing about images Y Wilks, R Catizone, S Worgan, A Dingli… – Computer Speech & …, 2011 – Elsevier … preliminary studies and results for these, in particular a novel approach to enabling reinforcement learning for open dialogue systems through the … The basic processes carried out by GATE are: tokenizing, sentence splitting, POS tagging, parsing and Named Entity Recognition. … Cited by 9 Related articles All 10 versions Cite Save
Survey on Swedish language resources K Elenius, E Forsbom, B Megyesi – Speech, Music and Hearing, …, 2008 – stp.lingfil.uu.se … 18 31,6% Search and knowledge mining 17 29,8% Language learning 12 21,1% Speech technologies 10 17,5% Other, please specify: 11 19,3% Others specified were: – dialog systems – multimodal systems – translation Page 15. Elenius, Forsbom and Megyesi 7 … Cited by 5 Related articles All 9 versions Cite Save More
Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives E Barker, R Higashinaka, F Mairesse… – Proceedings of the …, 2006 – researchgate.net … source or background texts to support the behaviours of the information provider; (5) system requirements for a dialogue system targeting the … Corpus We employed GATE (Cunningham et al., 2002), a suite of linguistic processors including a tokeniser, a sentence splitter, a part … Cited by 1 Related articles All 14 versions Cite Save More
Facilita: reading assistance for low-literacy readers WM Watanabe, AC Junior, VR Uzêda… – Proceedings of the 27th …, 2009 – dl.acm.org … These operations are: • Sentence splitting: consists in finding a point where to split the original sentence (such as conjunctions, relative clause limits, or apposition limits). Two new clauses are generated with their respective subject ad- justed accordingly. … Cited by 32 Related articles All 13 versions Cite Save
An Overview of the ILSP Unit Selection Text-to-Speech Synthesis System P Tsiakoulis, S Karabetsos, A Chalamandaris… – Artificial Intelligence: …, 2014 – Springer … 2.1 The NLP Frontend Module The NLP component is mainly responsible for the tokenization and sentence splitting, the analysis and the transformation of the input text into an intermediate symbolic format, appropriate to feed the DSP component. … Related articles All 3 versions Cite Save
Question generation via overgenerating transformations and ranking M Heilman, NA Smith – 2009 – DTIC Document … Many existing NLP transformations might be exploited in this stage, including extractive summarization, sen- tence compression, sentence splitting, sentence fu- sion, paraphrase, textual entailment, lexical se- mantics for word substitution. … Cited by 30 Related articles All 12 versions Cite Save
A very fast and efficient linear classification algorithm KI Diamantaras, I Michailidis… – Machine Learning for …, 2005 – ieeexplore.ieee.org … for creat- ing semantic representation of sentences like in the case of Information Extraction systems [13] and Human-Machine Dialogue systems, or simply … Engineering Platform [16] in order to: (a) split the two corpora into sentences and tokens (use of a Sentence Splitter and a … Cited by 6 Related articles All 3 versions Cite Save
Greek Named Entity Recognition using Support Vector Machines I Michailidis, K Diamantaras… – … Conference on Greek …, 2005 – icgl7.icte.uowm.gr … This kind of answers are very useful for creating semantic representations of sentences like in the case of Information Extraction systems [1][2] and Human-Machine Dialogue systems or simply for … split the two corpora into sentences and tokens5 (use of a Sentence Splitter and a … Cited by 3 Related articles Cite Save More
Dynamic QA Systems Using Knowledgebase S Hu – ru.nl … systems. 1.3. Interactive Dialogue System Another famous question-answering system is human-machine dialogue system. In 1950, AlanTuring put forward an idea to treat Q&A as the test toward machine intelligence. He assumed … Related articles Cite Save More
Parallel corpora, alignment technologies and further prospects in multilingual resources and technology infrastructure I Tufis, R Ion – … Conference on Speech and Dialogue Systems, 2007 – racai.ro … On the other hand, depending on the alignment granularity, required accuracy, and the purpose of the alignment, the input textual data might need pre-processing steps in all languages of the parallel corpus (eg sentence splitting, tokenization, POS-tagging and lemmatization … Cited by 9 Related articles All 5 versions Cite Save More
Artificial Conversational Companions A Requirements Analysis S Danilava, S Busemann… – … Conference on Agents …, 2012 – publications.uni.lu … Tokenizer, sentence splitter, POS tag- ger, parser and Named Entity Recogniser (NER) are involved in the process. … There is no unique measure for conversation qual- ity, and there is a large amount of work done on evaluating spoken dialogue systems eg, (Danieli and Gerbino … Cited by 5 Related articles All 3 versions Cite Save
Word Ordering with Phrase-Based Grammars M Tomalin, W Byrne – mi.eng.cam.ac.uk … The BLEU scores for bags of size 21 and higher are an artefact of our sentence splitting procedure. … These input representations are suitable for ap- plications such as dialog systems, where the sys- tem maintains the information needed to generate the input representation for … Related articles Cite Save More
Summarizing opinion-related Information for mobile devices G Di Fabbrizio, AJ Stent, R Gaizauskas – Mobile Speech and Advanced …, 2013 – Springer … Value 3 2 4 50 50 % Service 5 5 5 100 0 Overall 4 4 5 100 0 9 For this purpose we used a statistical sentence splitter trained on email data and using n-gram and word part-of-speech features. Page 9. 297 11 Summarizing Opinion-Related Information for Mobile Devices … Cited by 1 Related articles All 3 versions Cite Save
Text Processing DC Gibbon, Z Liu – Introduction to Video Search Engines, 2008 – Springer … named entity recognition, language modeling, story summariza- tion, topic segmentation, data mining, machine translation, speech recogni- tion, spoken dialogue system, etc. … 3. Sentence splitter, which is a cascade of finite-state transducers which segments text into sentences. … Related articles Cite Save
Evaluation of sentiment polarity prediction using a dimensional and a categorical approach I Muresan, A Stan, M Giurgiu… – Speech Technology and …, 2013 – ieeexplore.ieee.org … A lexical analyzer was employed to filter out stop words, highlight valence shifters and negations, while a sentence splitter divided the … Cai, “Modeling the Expressivity of Input Text Semantics for Chinese Text-to-Speech Synthesis in a Spoken Dialog System”, IEEE Transactions … Related articles All 4 versions Cite Save
Sentence-Based Sentiment Analysis for Expressive Text-to-Speech T Trilla, F Alías – Audio, Speech, and Language Processing, …, 2013 – ieeexplore.ieee.org … text Input (plain) XML text Input (tagged) Lexical Analyser Sentence splitter POS Tagger + Bounds Raw strings Tokens + POS + VAC EmoLib processing pipeline Fig. … 2) Sentence splitter: delimits the sentences through a bi- nary decision tree following [32]. … Cited by 3 Related articles All 3 versions Cite Save
[BOOK] Procedural Or Non-procedural: That is the Question KK Wu – 2007 – books.google.com … UNIX editing program, emacs, dating back to the 1970’s, or more famously, the psychotic HAL 9000 computer system in Stanley Kubrick’s 2001: A Space Odyssey; question answering researchers are likely to contend, quite legitimately, that these are dialogue systems and an … All 2 versions Cite Save
[BOOK] Multilingual speech processing T Schultz, K Kirchhoff – 2006 – books.google.com … 11 Multilingual Spoken Dialog Systems Helen Meng and Devon Li 11.1 Introduction . . . . . … Phrase alignment as sentence splitting. The concept of N-grams (a) in sequences (b) in trees. An alignment between an English phrase and its corresponding IF representation. … Cited by 122 Related articles All 7 versions Cite Save
A web corpus of Spanish automatically annotated with semantic roles M Pilar Valverde Ibdñez Universidade de Santiago de Compostela E Bick – A Mosaic of Corpus Linguistics: Selected Approaches, 2010 – books.google.com … questions like who, when, where or what happened, which is useful in systems that require the comprehen- sion of sentences, like dialogue systems, information retrieval … Other post- processing steps involved encoding normalisation (to ISO-latin-1) and sentence splitting. … Related articles Cite Save
Automatic generation of multiple choice questions using dependency-based semantic relations N Afzal, R Mitkov – Soft Computing, 2013 – Springer … documents are first pre-processed and then go through various sub-processes which include: text extrac- tion, sentence splitting, tagging and … question generation has the potential to be employed in various areas such as intelligent tutoring systems, dialogue systems (Walker et … Related articles Cite Save
Modelling space and time in narratives about restaurants ET Mueller – Literary and Linguistic Computing, 2007 – ALLC … GATE natural language processing architecture (Cunningham et al., 2002); we feed the text through the tokenizer, sentence splitter, part-of … They could be used to improve the effectiveness of dialogue systems, help systems, news-tracking services, question-answering systems … Cited by 36 Related articles All 8 versions Cite Save
Supporting domain experts to construct conceptual ontologies: A holistic approach R Denaux, C Dolbear, G Hart, V Dimitrova… – Web Semantics: Science, …, 2011 – Elsevier … The Rabbit Language Processor is implemented using GATE 4 and is inspired by the parsing in CLOnE [20]. During parsing, we use GATE to perform natural language processing tasks such as tokenizing, sentence splitting and part-of-speech tagging. … Cited by 23 Related articles All 17 versions Cite Save
Language Model Adaptation using Interlinked Semantic Data K Kilgour – 2009 – isl.anthropomatik.kit.edu … context free grammars (CFG). 26 Page 27. 2.1 Basics of Language Modeling CFG language models are successfully used in command interpreters and dialog systems where only simple sentences are encountered. They are not useful when working with continuous speech. … Related articles Cite Save More
Data Mining and Query Answer techniques applied to a bio-nutritional trials focused Expert System D Ria?o – … Research and Development: Proceedings of the …, 2012 – books.google.com … Once the previous setup is ?nished the following processes are run:(1) sentence splitter (split annotations),(2) tokenize (token annotations), eg, identify signi?cant words … For this, a dialog system has been built, in which a user is able to ask for some information or action in its … Related articles Cite Save
Determinación Automática de Roles Semánticos usando Preferencias de Selección sobre Corpus muy Grandes A GELBUKH, H CALVO – Computación y Sistemas – revistas.unam.mx … tokenizing, sentence splitting, tagging, and lemmatizing—and then apply the parsing algorithm 3.1 Preprocessing Tokenization and sentence splitting: The text is tokenized into words and punctuation marks and split into sentences. … Related articles Cite Save
Consolidation-Based Speech Translation and Evaluation Approach H Chiori, Z Bing, S Vogel, A Waibel… – IEICE transactions on …, 2009 – search.ieice.org Page 1. IEICE TRANS. INF. & SYST., VOL.E92–D, NO.3 MARCH 2009 477 PAPER Consolidation-Based Speech Translation and Evaluation Approach Chiori HORI †a) , Member, Bing ZHAO †† , Stephan VOGEL †† , Alex WAIBEL †† , … Cited by 1 Related articles All 5 versions Cite Save
EReXS: Event and Relations Extraction for SWHi P Olango – 2006 – odur.let.rug.nl … The sentence splitter is a cascade of finite-state transducers which segments the text into sentences. … Raciappa, and Melanie Siegel in their paper [3] entitled, ‘Ontology-based Information Extraction with SOBA’, as a sub-component of the SmartWeb multi-modal dialog system. … Related articles All 6 versions Cite Save More
The QALL-ME Architecture Design Issues and QA Framework G Neumann, C Spurk, B Sacaleanu – 2007 – qallme.fbk.eu Page 1. The QALL-ME Architecture Design Issues and QA Framework Authors: Günter Neumann, Christian Spurk, Bogdan Sacaleanu Affiliation: DFKI Keywords: QALL-ME architecture, QA framework, QA components Abstract … Related articles All 3 versions Cite Save More
Lexicons and grammars for named entity annotation in the National corpus of Polish A Savary, J Piskorski – Intelligent Information Systems, Siedlce, Poland, 2010 – nkjp.pl … based finite-state grammar parser and interpreter. The basic processing compo- nents include, ia, tokenizer, sentence splitter, morphological analyzer, gazetteer look-up component, etc. They can be flexibly combined into a … Cited by 8 Related articles All 7 versions Cite Save More
[BOOK] Text-to-speech synthesis P Taylor – 2009 – books.google.com … l Tokenisation 64 4.3.2 Tokenisation and punctuation 65 4.3.3 Tokenisation algorithms 66 4.3.4 Sentence splitting 67 4.4 Processing documents 68 4.4.1 Markup languages 68 4.4.2 Interpreting characters 70 4.5 TeXt-to-speech architectures 71 4.6 Discussion 75 4.6.1 Further … Cited by 161 Related articles All 13 versions Cite Save
A database design for complex linguistic data in collaborative web application CL Chan, HB Yan, SY Lee, J Webster… – … Technology and it’s …, 2012 – ieeexplore.ieee.org … Text in GATE format can be read and modified through a comprehensive Java library. The library provides functionalities area including but not limited to annotation, corpora visualization, tokenization, sentence splitting, alignment, etc. … Cited by 1 Related articles Cite Save
Indexing as an ontological support for legal reasoning E Schweighoferf – Technologies for Supporting Reasoning …, 2011 – books.google.com … Next steps will be a deep refinement ofthe ontology and the development of a dialogue system. … It is then up to legal practice to implement these “simplified syllo- gisms” in knowledge systems, dialog systems etc. 214 Page 239. … Cited by 1 Related articles All 3 versions Cite Save
Sammelwut und Wegwerfmanie. Zur Versprachlichung von, Laster’als Übertreibung im Deutschen und Schwedischen A Malmqvist – Korpuslinguistik deutsch: synchron-diachron- …, 2005 – books.google.com … 0-1, 1-0, 2-1, 1-2, 2-2) -Material may be inserted or deleted -In identifying sentences (sentence splitting), the ambiguity … Applications of HMMs: predicting phonetic sequences in speech recognition, speech acts in dialogue systems, and predicting parts of speech in parsing (NLP … Cited by 1 Related articles All 2 versions Cite Save
Linguistic structure prediction NA Smith – Synthesis Lectures on Human Language …, 2011 – morganclaypool.com … Semantic Role Labeling Martha Palmer, Daniel Gildea, and Nianwen Xue 2010 Spoken Dialogue Systems Kristiina Jokinen and Michael McTear 2009 Introduction to Chinese Natural Language Processing Kam-Fai Wong, Wenjie Li, Ruifeng Xu, and Zheng-sheng Zhang 2009 … Cited by 39 Related articles All 8 versions Cite Save
The Linguistics of Sentiment Analysis L Hart – 2013 – pdxscholar.library.pdx.edu … Beyond wordlevel tokenization, there is also the process of segmenting a document by its sentences, known as sentence splitting. This can typically be done by looking for punctuation such as a period, exclamation mark, or question mark. … Related articles Cite Save
Tense, Aspect And Mood Based Event Extraction For Situation Analysis And Crisis Management AL? HÜRR?YETO?LU – 2012 – etd.lib.metu.edu.tr … As a whole, this system is used in various natural language applications (question answering, dialog systems, database interface systems, etc.). The TimeML specification language mainly deals with event and temporal expressions in natural language texts. … Related articles All 2 versions Cite Save More
Methods for lean, precision-oriented, and targeted coreference resolution D Lindmark – 2012 – stp.lingfil.uu.se … such applications may include systems for Information Extraction (IE), Passage Retrieval, Question Answering (QA), Automatic Sum- marization and Abstracting, Machine Translation (MT) and Dialogue Systems. … 3. For each document, tokenization and sentence splitting is done. … Related articles Cite Save More
Synthetic Partner F Meijerink, SA Nijholt, B van Dijk, D Heylen – 2008 – hmi.ewi.utwente.nl Page 1. Page 2. Synthetic Partner The Design of a Relational Affective Diary Thesis Date April 11, 2008 Author Ferdi Meijerink <ferdi@ferdi.eu> Supervisors Anton Nijholt Betsy van Dijk Dirk Heylen External Supervisor Peter-Paul van Maanen Project Manager Tony van Vliet No. … Related articles All 3 versions Cite Save More
Automatic factual question generation from text M Heilman – 2011 – lti.cs.cmu.edu … tions deal with the analysis of student responses rather than the generation of instructional content. For example, tutorial dialogue systems (Litman and Silliman, 2004; Graesser et al., 2005; Boyer et … generation and dialogue systems. … Cited by 14 Related articles All 10 versions Cite Save More
Advanced Techniques for Personalized, Interactive Question Answering S Quarteroni – 2007 – pisa.cs.york.ac.uk … Page 18. 17 Question Answering in the late 1970s and until the end of the 1980s was tightly linked to human-computer dialogue systems, such as expert systems drawing information from structured knowledge bases. Indeed … Cited by 6 Related articles All 10 versions Cite Save More
Automatic Generation of Factual Questions from Video Documentaries Y Skalban – 2013 – wlv.openrepository.com … 67 4.4.2 Sentence splitting ….. 68 … Question Generation is an important component in dialogue systems, virtual environments, and learning technologies such as Intelligent Tutoring Systems, inquiry-based environments, … Related articles All 4 versions Cite Save
Document clustering on target entities using persons and organizations KEIJ MING, J RAPHAEL – 2004 – scholarbank.nus.sg Page 1. DOCUMENT CLUSTERING ON TARGET ENTITIES USING PERSONS AND ORGANIZATIONS JEREMY R. KEI National University of Singapore 2003 Page 2. DOCUMENT CLUSTERING ON TARGET ENTITIES USING PERSONS AND ORGANIZATIONS BY … Related articles All 3 versions Cite Save
Compositional question answering with collaborative strategies HJ Oh – 2008 – library.kaist.ac.kr … ELIZA (1966) by Weizenbaum et al [15] is the earliest dialogue system using QA. … answers. As same with question analysis, document analysis involves various linguistic processes such as sentence splitting, POS tagging, and chunk parsing. … Cited by 3 Related articles All 2 versions Cite Save
Domain-And Language-Adaptive Natural Language Controlling Framework P Barabás – 2013 – iit.uni-miskolc.hu … ix OWL Web Ontology Langauge POI Point Of Interests POS Part-Of-Speech QLF Quasi-Logical Form RDF Resource Description Framework SDK Software Development Kit SDS Speech Dialog System SNLP Stanford NLP SNLPG Stanford Natutal Language Processing Group … Related articles All 4 versions Cite Save More
[BOOK] Intuitive ontology authoring using controlled natural language R Denaux – 2013 – etheses.whiterose.ac.uk … 148 5.2 Dialogue Systems Overview . . . . . 151 … inputs in order to provide understandable feedback to domain experts? How can dialogue systems be used to formalise and improve ontology au- thoring interactions for better support of domain experts? … Related articles All 3 versions Cite Save
Open-domain question answering MA Greenwood – 2005 – nlp.shef.ac.uk … If we sidestep the issue of determining if a question is being asked by assuming that all input to a QA system is in fact a question (the problems of a wider dialogue are beyond the scope of this study although the history of dialogue systems, as they pertain to question answering … Cited by 10 Related articles All 7 versions Cite Save More
A Supervised Machine Learning Method for Word Sense Disambiguation of Portuguese Nouns M Zampieri – Bulletin de Linguistique Aplique et Gnrale-BULAG, 2010 – string.l2f.inesc-id.pt … more complex task for NLP systems: as stated by Leech and Weisser (2000), and their need is usually restricted to spoken dialogue systems (SDS). As the scope of this work is restricted to lexical meaning, it is important to point … Cited by 1 Related articles All 4 versions Cite Save More
Resources and courses M O’Donnell – Continuum companion to systemic functional …, 2009 – muele.mak.ac.ug Page 225. 12 Resources and courses Mick O’Donnell Universidad Autónoma de Madrid This chapter provides details of resources for those practicing Systemic Func- tional Linguistics (SFL), and where to get more information … Cited by 1 Related articles All 3 versions Cite Save More
Computational-Linguistic Approaches to Biological Text Mining AB Clegg – 2008 – biotext.org.uk … timeframe. The subfields of speech and gesture processing can be ignored wholesale, as can the large bodies of work dealing with multi-agent dialogue systems, emotional tone, colloquial and ungrammatical utterances, and so on. … Cited by 10 Related articles All 3 versions Cite Save More
Computational Humour JW Tinholt – 2007 – hmi.ewi.utwente.nl Page 1. Computational Humour Utilizing cross-reference ambiguity for conversational jokes JW Tinholt Faculty of Electrical Engineering, Mathematics and Computer Science Department of Human Media Interaction University of Twente … Related articles All 2 versions Cite Save More
Linguistics parameters for zero anaphora resolution SC Pereira – 2010 – sapientia.ualg.pt … Language Processing applications such as information extraction, automatic abstracting, dialog systems, machine translation and question answering. The main goal of this … applications such as information extraction, automatic abstracting, dialogue system, … Cited by 1 Related articles All 4 versions Cite Save
Anaphora Resolution in Portuguese An hybrid approach JS Marques – 2013 – inesc-id.pt … real-world NLP applications. Proper treatment of anaphoric relations shapes the performance of today’s applica- tions such as information extraction, machine translation, text summarization, or dialogue systems. Among many … Related articles All 2 versions Cite Save More
[BOOK] Open-source resources and standards for Arabic word structure analysis: Fine grained morphological analysis of Arabic text corpora MSS Sawalha – 2011 – etheses.whiterose.ac.uk Page 1. Open-source Resources and Standards for Arabic Word Structure Analysis: Fine Grained Morphological Analysis of Arabic Text Corpora By Majdi Shaker Salem Sawalha Submitted in accordance with the requirements for the degree of Doctor of Philosophy … Cited by 4 Related articles All 5 versions Cite Save
Toward Portable Information Extraction MV Tablan – 2009 – valentin.tablan.net … 139 A.6.1 The Unicode Tokeniser . . . . . 140 A.6.2 The Gazetteer Look-up Component . . . . . 143 A.6.3 The Sentence Splitter . . . . . 144 A.6.4 The Part of Speech Tagger . . . . . 144 A.6.5 The Named Entity Recogniser . . . . . … Cited by 1 Related articles All 3 versions Cite Save More
Discourse processing M Stede – Synthesis Lectures on Human Language …, 2011 – morganclaypool.com … Spoken Dialogue Systems Kristiina Jokinen and Michael McTear 2009 … Notice that the standard document pre-processing tasks of sentence splitting and tokenization can sensibly be integrated into the determination of logical document structure, so that the leaves of the tree … Cited by 22 Related articles All 5 versions Cite Save
Functional Specification Of Learner And Context Modeling Services D Dagger, V Nitti – wis.ewi.tudelft.nl Page 1. DELIVERABLE D4.1: FUNCTIONAL SPECIFICATION OF LEARNER AND CONTEXT MODELING SERVICES Project Acronym: ImREAL Project Name: Immersive Reflective Experience-based Adaptive Learning Call: FP7-ICT-2009.4.2: Technology-Enhanced Learning … Related articles Cite Save More
Combining Natural Language Processing and Statistical Text Mining: A Study of Specialized Versus Common Languages J Jarman – 2011 – scholarcommons.usf.edu Page 1. University of South Florida Scholar Commons Graduate School Theses and Dissertations Graduate School 1-1-2011 Combining Natural Language Processing and Statistical Text Mining: A Study of Specialized Versus Common Languages … All 3 versions Cite Save
EXCITEMENT open platform: Architecture and Interfaces TG Noh, S Pado, A Stern, O Bronstein, R Wang… – 2013 – hltfbk.github.io … Core. Linguistic Analysis Pipeline (LAP) A set of linguistic tools that analyze a given set of Text-Hypothesis pairs, typically performing steps such as sentence splitting, POS tagging, Named Entity Recognition etc. Entailment … Related articles All 2 versions Cite Save More
On Applying Controlled Natural Languages for Ontology Authoring and Semantic Annotation BP Davis – 2013 – vmserver14.nuigalway.ie Page 1. On Applying Controlled Natural Languages for Ontology Authoring and Semantic Annotation Brian Patrick Davis Submitted in fulfillment of the requirements for the degree of Doctor of Philosophy PRIMARY SUPERVISOR: Prof. … Related articles All 2 versions Cite Save
Otros portales de tesis N Català Roig, N Castell Ariño – tdx.cat … The complete process of construction makes an extensive use of linguistic knowledge as well as specific abilities implemented mainly by NLP tools. Some of these tools are of general-purpose in the NL field such as sentence splitters, syntactic analyzers or semantic taggers. … Related articles Cite Save
Specification of knowledge acquisition and modeling of the process of the consensus A Zhdanova, M Bonifacio, S Dasiopoulou, J Euzenat… – 2004 – hal.archives-ouvertes.fr Page 1. Copyright © 2004 The contributors D 2.3.2 Specification of Knowledge Acquisition and Modeling of the Process of the Consensus Coordinator: Anna V. Zhdanova (UIBK) Matteo Bonifacio (UniTn), Stamatia Dasiopoulou … Related articles All 6 versions Cite Save More
Network-based distributional semantic models E Iosif – 2013 – telecom.tuc.gr … I gratefully acknowledge the financial support I received from the PortDial project (“Language Resources for Portable Multilingual Spoken Dialog Systems”) supported by the EU Seventh Framework Programme (FP7), grant number 296170, and and the Basic Research … Related articles All 4 versions Cite Save More
Automatic evaluation of users’ short essays by using statistical and shallow natural language processing techniques DRP Mar?n – 2004 – eps.uam.es Page 1. Automatic evaluation of users’ short essays by using statistical and shallow natural language processing techniques Advanced Studies Diploma Work Diana Pérez Mar´?n under the supervision of Enrique Alfonseca and Pilar Rodr?guez May, 2004 Page 2. Contents … Cited by 3 Related articles All 7 versions Cite Save More