Notes:
The Gutenberg Corpus is a collection of texts compiled by Project Gutenberg, an organization that digitizes public domain books and makes them freely available. The corpus contains more than 53,000 books in English, German, French, Spanish, and other languages, and is commonly used in natural language generation research as training data for language models.
Natural language generation is the process of automatically producing written or spoken language, classically from structured data. By training a language model on a large text corpus such as the Gutenberg Corpus, researchers can build systems that generate text in many styles and languages. Applications include summarizing large amounts of text, producing personalized chatbot responses, and even writing entire books.
Translationese is the tendency of translated texts to sound unnatural or awkward in the target language. It arises when a translator renders idioms or syntax from the source language too literally, lacks fluency in the target language and its nuances and idioms, or is unfamiliar with the cultural context of the source text and so cannot accurately convey its meaning.
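The idea of training a language model on a corpus and then sampling text from it can be sketched with a toy bigram model. This is only an illustration: the function names and the one-line sample text are hypothetical stand-ins for a real Gutenberg text, and the systems in the literature listed below use far richer models.

```python
import random
from collections import defaultdict

def train_bigram_model(text):
    """Map each word to the list of words that follow it in the text."""
    words = text.split()
    model = defaultdict(list)
    for current, following in zip(words, words[1:]):
        model[current].append(following)
    return dict(model)

def generate(model, start, length=10, seed=0):
    """Sample up to `length` words by repeatedly picking a random successor."""
    rng = random.Random(seed)
    output = [start]
    while len(output) < length:
        successors = model.get(output[-1])
        if not successors:  # dead end: this word was never followed by another
            break
        output.append(rng.choice(successors))
    return " ".join(output)

# Hypothetical sample text standing in for a book from the corpus.
sample = "the cat sat on the mat and the dog sat on the rug"
model = train_bigram_model(sample)
print(generate(model, "the", length=8))
```

Because successors are stored with repetition, frequent continuations are sampled more often, which is the simplest form of estimating how likely a word sequence is from a corpus.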
- Computational storytelling refers to the use of computational tools and techniques to create interactive, narrative-based experiences. This can include the use of natural language processing and machine learning algorithms to generate text or dialogue, or the use of interactive media to create immersive, branching storylines.
- Generative literature is a type of literature that is created using computational tools and techniques, such as natural language generation algorithms or interactive media. Generative literature can take many forms, including stories, poems, and other written works, and can be interactive or non-interactive.
- Generative text is text produced by a computer program rather than written by a human. It can be created using natural language processing algorithms or other computational techniques, can take many forms, including stories, poems, and other written works, and can be interactive or non-interactive. It is used in a variety of contexts, such as natural language generation systems or generative literature.
- Neural language generation is a type of natural language generation that uses neural networks to generate language. These systems typically rely on large language models pre-trained on large datasets of human-written text, and can produce output similar in style and content to human writing. They are often used to answer questions, summarize texts, or produce other language-based outputs.
- Procedural generation refers to the use of algorithms to generate content, such as game levels, textures, or other media, in an automated or semi-automated way. Examples include terrain in video games, levels in puzzle games, and music and sound effects. Because the content is computed rather than hand-authored, procedural generation can produce a virtually limitless amount of it and can save time and resources when creating games or other interactive media.
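As a minimal sketch of procedural generation, the snippet below builds a one-dimensional terrain profile from a seeded random walk. The function names, parameters, and the ASCII rendering are assumptions made for this illustration; real games typically use more elaborate noise functions.

```python
import random

def generate_heightmap(width, seed, max_step=1, start=5, low=0, high=9):
    """Random-walk terrain: each column differs from its neighbour by at most max_step."""
    rng = random.Random(seed)  # same seed -> same terrain
    heights = [start]
    for _ in range(width - 1):
        step = rng.randint(-max_step, max_step)
        # Clamp so the terrain stays within [low, high].
        heights.append(min(high, max(low, heights[-1] + step)))
    return heights

def render(heights):
    """Draw the terrain as rows of '#' characters, highest level first."""
    rows = []
    for level in range(max(heights), 0, -1):
        rows.append("".join("#" if h >= level else " " for h in heights))
    return "\n".join(rows)

terrain = generate_heightmap(width=30, seed=42)
print(render(terrain))
```

Storing only the seed is enough to regenerate the same terrain, which is one reason the technique can yield large amounts of content at little storage cost.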
Resources:
- cwb.sourceforge.net .. open-source tools for managing and querying large text corpora
- gitenberg.org .. open source community curating ebooks with detailed metadata in a variety of formats
- gutenberg.org .. project gutenberg
- gutentag.sdsu.edu .. nlp-driven tool for digital humanities research in the project gutenberg corpus
- liwc.wpengine.com .. linguistic inquiry and word count
- receptiviti.ai .. enabling ai platforms with emotional intelligence
References:
- Aesthetics and Design for Game-based Learning (2015)
- Robots that Talk and Listen: Technology and Social Impact (2015)
See also:
Corpus Workbench | Generative Literature & Natural Language Processing | Generative Text & Natural Language Processing | Procedural Generation & Natural Language Processing | RiTa Toolkit
Data-driven alibi story telling for social believability
B Li, M Thakkar, Y Wang, MO Riedl – Social Believability in Games, 2014 – Citeseer
… N-Gram corpus [16] and books from Project Gutenburg (http://www.gutenberg.org), we offer … the selection of a subset of events from an alibi story— and natural language generation … corpora of social situations, the Google N-gram corpus, and Project Gutenberg corpus are all …
Dead Man Tweeting
D Nilsson, M Sahlgren, J Karlgren – Workshop on Collecting and …, 2016 – diva-portal.org
… there are substantial amounts of data freely available on sites such as Project Gutenberg.7 For … Most previous studies and applications of natural language generation has focused on closed domains, such as weather … Building a large-scale commercial NLG system for an EMR …
A Rhetorical Analysis Approach to Natural Language Processing
B Englard – arXiv preprint arXiv:1301.3547, 2013 – arxiv.org
… previous estimate. 6.4 Calculating Entropy When the original tests of the Natural Language Generation system were completed, it was … 8.2 Author Identification Problem Results and Discussion Using NLTK’s access to the works in the Project Gutenberg [14] corpus, the …
EloquentRobot: A Tool for Automatic Poetry Generation
JD McGovern, G Scott – gavinoscott.com
… CCS Concepts •Computing methodologies → Information extraction; Natural language generation; Lexical semantics; Language resources; … Similarly, Project Gutenberg has a poetry bookshelf2, but it had the same issues as Bartleby …
An analysis of game environments as measured by fractal complexity
D Della-Bosca, D Patterson, S Roberts – Proceedings of the Australasian …, 2017 – dl.acm.org
… The Myst series was crucial to this study as the series represented a significant milestone in games design, procedural generation and general advancement and experimentation with immersive principles … http://www.gutenberg.org/files/6081/6081-h/6081-h.htm#link2H_LIST …
On the art of writing with data
C Rodley, A Burrell – The future of writing, 2014 – Springer
… not static like Borges’s text; it is an infinitely scrolling list dynamically generated by searching Twitter and/or the Project Gutenberg literature library … not recognise this type of writing as a category,7 though she notes that the interrelated category of generative text – creating new …
Summarising news stories for children
I Macdonald, A Siddharthan – … International Natural Language Generation …, 2016 – aclweb.org
Proceedings of The 9th International Natural Language Generation conference, pages 1–10, Edinburgh, UK, September 5-8 2016. ©2016 Association for Computational Linguistics Summarising News Stories for Children …
FIGURE8: A Novel System for Generating and Evaluating Figurative Language.
S Harmon – ICCC, 2015 – computationalcreativity.net
… Within the context of creative natural language generation, a third criterion has been noted: a creative system must generate context-appropriate knowledge outside of its pre-existing knowledge base (Pérez y Pérez … Free ebooks – Project Gutenberg … http://www.gutenberg.org …
Zero-Shot Style Transfer in Text Using Recurrent Neural Networks
K Carlson, A Riddell, D Rockmore – arXiv preprint arXiv:1711.04731, 2017 – arxiv.org
… The problem of stylistic paraphrasing is clearly relevant for the creation of natural language generation systems … Names and rare words are difficult to handle in NLG … subword units from all of our Bible versions, a random sampling of modern works from gutenberg.org, and articles …
The case for being average: A mediocrity approach to style masking and author obfuscation
G Karadzhov, T Mihaylova, Y Kiprov… – … Conference of the Cross …, 2017 – Springer
… Finally, the values in the average column are calculated on the training dataset and on some texts from Project Gutenberg; these are the target values we want to … Kabbara, J., Cheung, JCK: Stylistic transfer in natural language generation systems using recurrent neural networks …
Text generation with Language models
AK Behera – 2016 – pdfs.semanticscholar.org
… Language generation from any machine represented model for any specific domain is known as Natural language generation. Random text generation is a branch of language … project Gutenberg. These texts are literary novels such as Emma by Jane Austen and The …
The Case for Being Average: A Mediocrity Approach to Style Masking and Author Obfuscation
G Georgiev, I Koychev, P Nakov – Experimental IR Meets …, 2017 – books.google.com
… Finally, the values in the average column are calculated on the training dataset and on some texts from Project Gutenberg; these are … doi: 10.1007/978-3-642-24212-09 Kabbara, J., Cheung, JCK: Stylistic transfer in natural language generation systems using recurrent neural …
Linked open data driven game generation
R Warren, E Champion – International Semantic Web Conference, 2014 – Springer
… While the procedural generation of game terrain through the detailed simulation of an ecosystem, as described by Dussel et al … The same can be done with books whose contents and covering images are available from LOD data-sets such as Project Gutenberg or Archive.org …
Adversarial Objectives for Text Generation
A Vani – nevitus.com
… [2015]). Although computer vision has seen a large fraction of related research in adversarial networks, adversarial objectives for natural language generation is less explored … Project Gutenberg. https://www.gutenberg.org/. A. Radford, L. Metz, and S. Chintala …
Neural Text Generation: A Practical Guide
Z Xie – arXiv preprint arXiv:1711.09534, 2017 – arxiv.org
… Consequently, while we focus on natural language, to be precise, this guide does not cover natural language generation (NLG), which entails generating documents or longer descriptions from structured data. The primary …
A theme-rewriting approach for generating algebra word problems
R Koncel-Kedziorski, I Konstas, L Zettlemoyer… – arXiv preprint arXiv …, 2016 – arxiv.org
… Recall that since our system needs no annotated thematic training data, we can easily generate from any theme where thematic text is available. To demonstrate this fact, we include generated examples in a Western theme from novels from the Project Gutenberg corpus …
Concept generalization and fusion for abstractive sentence generation
R Belkebir, A Guessoum – Expert Systems with Applications, 2016 – Elsevier
… Some of these are based on natural language generation … We find in the literature works that have tried to address it using operations like sentence compression, text simplification, text revision and natural language generation …
Improving the Reproducibility of PAN’s Shared Tasks
M Potthast, T Gollub, F Rangel, P Rosso… – … Conference of the Cross …, 2014 – Springer
Improving the Reproducibility of PAN’s Shared Tasks: Plagiarism Detection, Author Identification, and Author Profiling Martin Potthast, Tim Gollub, Francisco Rangel, Paolo Rosso, Efstathios Stamatatos, and Benno Stein …
Speech and language processing
D Jurafsky, JH Martin – 2014 – cs.colorado.edu
… research. This period also saw considerable work on natural language generation. 1.6.5 The Field Comes Together: 1994–1999 By the last five years of the millennium it was clear that the field was undergoing major changes …
Automatic generation of english reference question by utilising nonrestrictive relative clause
AY Satria, T Tokunaga – Proceedings of the 9th …, 2017 – pdfs.semanticscholar.org
… Consider the following example. 4https://www.gutenberg.org … As a source of texts, we used the texts from the Project Gutenberg … In Proceedings of the Fourth International Natural Language Generation Conference, pages 111–113. Association for Computational Linguistics …
Summarising News Stories for Children
A Siddharthan, I MacDonald – … on Natural Language Generation …, 2016 – aura.abdn.ac.uk
Summarising News Stories for Children Iain Macdonald Computing Science University of Aberdeen Scotland, UK iain.j.macdonald.12@aberdeen.ac.uk Advaith Siddharthan Computing Science University of Aberdeen Scotland, UK advaith@abdn.ac.uk Abstract …
Towards Data Submissions for Shared Tasks: First Experiences for the Task of Text Alignment.
M Potthast, S Göring, P Rosso, B Stein – CLEF (Working Notes), 2015 – Citeseer
… it has been frequently pointed out in the position papers submitted to a workshop organized by the natural language generation community on the … For the PAN plagiarism corpora 2009-2012, we employed documents obtained from the Project Gutenberg for the most part [50 …
Merits of Hierarchical Story and Discourse Planning with Merged Languages
DR Winer, RM Young – 2017 – liquidnarrative.cs.utah.edu
… the predominant strategies for designing automated storytelling systems is to adhere to the natural language generation (NLG) pipeline: start … examples and is used to generate text with a storytelling style by extracting style parameters from the Project Gutenberg book corpus …
Improvisational computational storytelling in open worlds
LJ Martin, B Harrison, MO Riedl – International Conference on Interactive …, 2016 – Springer
… Natural language generation is not yet a solved problem, let alone generating creative sentences … The likelihood of a sentence can be computed by constructing a language model over a large corpus such as Wikipedia or Google’s Project Gutenberg that estimates the how likely …
suppose and find
F D’hoedt, H Cuyckens – … at the Syntax-Semantics Interface: A …, 2017 – books.google.com
… While the generative literature on verb complementation has largely focused on synchronic, syntactic issues, such as the constituent structure of different … 0), a collection of texts drawn from the Project Gutenberg and the Oxford Text Archive and spanning the period from 1710 …
Digital Research
B Nicholson – Research Methods for History, 2016 – books.google.com
… Books Millions of out-of-copyright books have been digitised and made freely available online. The largest collections of historical publications are currently held by Google Books, the Internet Archive, the HathiTrust, the European Library/Europeana and Project Gutenberg …
D4. 1: Natural Multimodal Interaction Baseline
B Kiefer, HU Krieger, I Kruijff-Korbayová, S Raccioppa – pal4u.eu
… The existing linguistic resources need adaption to better support natural language generation and interpretation in the PAL context … We collected recordings from LibriVox https://librivox.org/ and the corresponding texts from project Gutenberg https://www.gutenberg … org …
SL-Bots: Automated and Autonomous
JO Turner, M Nixon, J Bizzocchi – New Opportunities for Artistic …, 2015 – books.google.com
Chapter 12 SL-Bots: Automated and Autonomous Performance Art in Second Life Jeremy Owen Turner Simon Fraser University, Canada Michael Nixon Simon Fraser University, Canada Jim Bizzocchi Simon …
Phrase Detectives
M Poesio, J Chamberlain, U Kruschwitz – Handbook of Linguistic …, 2017 – Springer
… narrative text from Project Gutenberg 23 including in particular a number of tales (eg, Aesop’s Fables, Grimm’s Fairy Tales, Beatrix Potter’s tales), and more advanced narratives such as several Sherlock Holmes short stories by A. Conan-Doyle, Alice in Wonderland, and several …
A study of how people infer social relationships from people’s behavior in simple economic games
N Blank, C Kemp, A Jern – ceur-ws.org
Short papers presented as posters at the 2017 Modern AI and Cognitive Science Conference. MAICS 2017. A study of how people infer social relationships from people’s behavior in simple economic games …
Double Topic Shifts in Open Domain Conversations: Natural Language Interface for a Wikipedia-based Robot Application
K Jokinen, G Wilcock – Proceedings of the Open Knowledge Base and …, 2016 – aclweb.org
… The frequency calculations currently use text corpora from Project Gutenberg, but we also plan to use larger corpora such as British … In Paris, CL, Swartout, WR and Moore, WC (Eds.) Natural Language Generation in Artificial Intelligence and Computational Linguistics, pp 103 …
Big data driven natural language processing research and applications
VN Gudivada, D Rao, VV Raghavan – Handbook of Statistics, 2015 – Elsevier
Using games to create language resources: Successes and limitations of the approach
J Chamberlain, K Fort, U Kruschwitz… – The People’s Web Meets …, 2013 – Springer
… from the ‘Featured Articles’ page 12 and the page of ‘Unusual Articles’ 13 ; and narrative text from Project Gutenberg 14 including … in support of the GIVE-2 challenge for generating instructions in virtual environments, initiated in the Natural Language Generation community [40] …
Transmedial Ekphrasis. From Analogic to Digital Formats
ALV Azcárate – International Journal of Transmedia Literacy (IJTL), 2015 – ledonline.it
International Journal of Transmedia Literacy – 1.1 – December 2015 http://www.ledonline.it/transmedialiteracy/ Transmedial Ekphrasis. From Analogic to Digital Formats Asunción López-Varela Azcárate Universidad Complutense de Madrid, Spain …
Is there a formula for formulaic language?
RS Forsyth, Ł Grabowski – Poznan Studies in Contemporary …, 2015 – degruyter.com
Poznań Studies in Contemporary Linguistics 51(4), 2015, pp. 511–549 © Faculty of English, Adam Mickiewicz University, Poznań, Poland doi: 10.1515/psicl-2015-0019 IS THERE A FORMULA FOR FORMULAIC LANGUAGE …
Opening up Utopia
S Duncombe – DIY Utopia: Cultural Imagination and the …, 2016 – books.google.com
… to be of great continuance.” 18 Indeed it will continue, for the very plan and structure of More’s Utopia makes it a generative text, one that … Although this translation can sound a bit stilted to modern ears, it is the one used in the copyright-free Project Gutenberg text, and thus, out of …
Artificial Intelligence’s Fair Use Crisis
BLW Sobel – 2017 – papers.ssrn.com
… 20 2. Google’s Natural Language Generation … 2,000-year-old criticism of written language as inferior to memorized language, see PLATO, PHAEDRUS (Benjamin Jowett trans., 1999), available at https://www.gutenberg.org/ebooks/1636 …
Generating natural questions about an image
N Mostafazadeh, I Misra, J Devlin, M Mitchell… – arXiv preprint arXiv …, 2016 – arxiv.org
… all ‘event’ and ‘process’ hyponyms in WordNet (Miller, 1995), 1,000 most frequent TimeBank events (Pustejovsky et al., 2003) and a set of manually curated 30 stereotypical events, from which we selected the top 1,200 queries based on Project Gutenberg word frequencies …
Tree recurrent neural networks with application to language modeling
X Zhang, L Lu, M Lapata – CoRR, abs/1511.00060, 2015 – pdfs.semanticscholar.org
… Previous work has shown that it is relatively straightforward to jointly train neural language models with other types of neural networks to perform tasks involving natural language generation such as translating sentences and generating descriptions for images (Kalchbrenner …
Generative Theatre of Totality
PA da Veiga – Journal of Science and Technology of the Arts, 2017 – artes.ucp.pt
… Balpe, J. (2005). Principles and Processes of Generative Literature – Questions to Literature. Retrieved November 07, 2017 from http://www.dichtung-digital.de/2005/1/Balpe/ Carter, A. & O’Shea, J. (2010). Performing Dance …
Visions and open challenges for a knowledge-based culturomics
N Tahmasebi, L Borin, G Capannini… – International Journal on …, 2015 – Springer
… Using the collections of projects like Project Gutenberg [67] or Google books [25], we can directly access the historical source rather than read modern interpretations. Access is offered online and often minimal effort is necessary for searching and browsing …
Towards a narrative based user-centered design of an immersive news environment
EO den Buysch, F Nack, B Brouwers – researchgate.net
Towards a narrative based user-centered design of an immersive news environment Eefje Op den Buysch Student number 10642552 Final version: 01.07.2016 Dr. F. Nack (supervisor) Prof. drs. B. Brouwers (second reader) …
Intrasentential Grammatical Correction with Weighted Finite State Transducers
J Goth III – 2013 – search.proquest.com
… To construct our language model, we use a corpus of childrens stories from Project Gutenberg. For the noise model, a corpus consisting of passages composed by middle school students obtained from corpus acquisition experiments is utilized … Gutenberg Corpus. N-gram ARPA …
Evaluation of Automatically Generated Pronoun Reference Questions
AY Satria, T Tokunaga – Proceedings of the 12th Workshop on …, 2017 – aclweb.org
… In this particular example, the Centering theory tells us that “legend” in the second sentence of (2) has a higher status than “knowledge” because the former is a 1https://www.gutenberg.org/ subject and the latter is an element in the prepositional phrase …
Designing user interfaces for curation technologies
G Rehm, J He, J Moreno-Schneider, J Nehring… – … Conference on Human …, 2017 – Springer
… We could, in addition, include sources from Project Gutenberg, Archive.org or other structured knowledge bases … In: Gangemi, A., Gardent, C. (eds.) Proceedings of the 2nd International Workshop on Natural Language Generation and the Semantic Web (WebNLG 2016), pp …
The People’s Web Meets NLP
I Gurevych, J Kim – 2013 – Springer
… applications of NLP techniques to gain insights into the use and functioning of language, as well as the use of language technology in applications that enable communication, knowledge management and discovery such as natural language generation, information retrieval …
Domain adaptive extraction of topical hierarchies for Expertise Mining
G Bordea – 2013 – aran.library.nuigalway.ie
Correcting Writing Errors with Convolutional Neural Networks
NA Dronen – 2016 – search.proquest.com
… 1036.9 Project Gutenberg … 104. 6.8 The models performance on contrasting cases derived from books available from Project Gutenberg. The drop in performance compared to the in-domain Wikipedia test set is shown in parentheses …
Implications of punctuation mark normalization on text retrieval
E Kim – 2013 – search.proquest.com
… Dale (1992) identified discourse structure primarily in the context of natural language generation … He reported some of the notable characteristics of punctuation marks based on a number of corpora including Philosophy, Project Gutenberg, and Usenet …
FORMULIB: Formulaic Language Software Library
I Why – richardsandesforsyth.net
FORMULIB: Formulaic Language Software Library (User Notes by Richard Forsyth, June 2016) This software helps to explore the somewhat nebulous concept of “formulaic language” as well as identifying …
Word Associations as a Language Model for Generative and Creative Tasks
O Gross – 2016 – helda.helsinki.fi
Department of Computer Science Series of Publications A Report A-2016-2 Word Associations as a Language Model for Generative and Creative Tasks Oskar Gross To be presented, with the permission of the Faculty of Science of the University of Helsinki, for public criticism in …
Virgil Aeneid X: A Selection
C Tanfield – 2016 – books.google.com
Introduction, notes and vocabulary by Christopher Tanfield. Virgil Aeneid X: A Selection. Bloomsbury. The following titles are available from Bloomsbury for the OCR specifications …
Spreading patterns: Diffusional change in the English system of complementation
H De Smet – 2013 – books.google.com
Spreading Patterns: Diffusional Change in the English System of Complementation. Hendrik De Smet. Oxford Studies …
Digital Technology and Innovative Poetry
T Jenks – 2018 – repository.edgehill.ac.uk
… 109 Combinatory and generative literature ….. 120 … (Flood, 2014). Project Gutenberg hosts more than fifty-thousand electronic books which are available for free (‘Project Gutenberg’, 2015) …
Inducing suffix and LCP arrays in external memory
T Bingmann, J Fischer, V Osipov – Journal of Experimental Algorithmics …, 2016 – dl.acm.org
Inducing Suffix and LCP Arrays in External Memory. Timo Bingmann, Karlsruhe Institute of Technology; Johannes Fischer, Technical University of Dortmund; Vitaly Osipov, Karlsruhe Institute of Technology. We …
Learning Domain-Specific, L1-Specific Measures of Word Readability.
S Bergsma, D Yarowsky – Traitement Automatique des Langues, 2013 – Citeseer
… cloze deletion 2. Available at www.gutenberg.org/dirs/etext02/mthes10.zip … tests and semantic association evaluations (Read, 1993; Laufer and Nation, 1995; Breland, 1996, inter alia). In …
Myths of Modern Education
C Mike – 2015 – dr.library.brocku.ca
Myths of Modern Education Mike Countryman Honours Bachelor of Arts, University of Toronto Department of Graduate and Undergraduate Studies in Education Submitted in partial fulfillment of the requirements for the degree of Master of Education …
Reflections and implications
T Cremin, M Mottram – Researching Literacy Lives: Building …, 2014 – books.google.com
10 CONCLUSION Reflections and implications Teresa Cremin and Marilyn Mottram In the face of the rapidly changing nature of literacy and the persistence of deficit discourses in education which conceive of some …
The People’s Web Meets NLP: Collaboratively Constructed Language Resources
N Calzolari – 2013 – books.google.com
… applications of NLP techniques to gain insights into the use and functioning of language, as well as the use of language technology in applications that enable communication, knowledge management and discovery such as natural language generation, information retrieval …
Stochastic Tree Search for Highly Coordinated Planning
B Kartal – 2016 – search.proquest.com
Stochastic Tree Search for Highly Coordinated Planning. Abstract. Coordination plays a key role in efficiently solving multi-agent problems such as exploration of unknown environments, search and rescue, and surveillance …
Text Summarization
D Sarkar – Text Analytics with Python, 2016 – Springer
… same. We will start by loading some necessary dependencies and a corpus on which we will be computing collocations. We will use the nltk Gutenberg corpus’s book, Lewis Carroll’s Alice in Wonderland as our corpus. We also …
International Journal of Transmedia Literacy (IJTL). 1.1 December 2015
C Matteo – 2015 – books.google.com
International Journal of Transmedia Literacy From Storytelling to Intercreativity in the Era of Distributed Authorship Edited by Matteo Ciastellardi, Giovanna Di Rosario Contributes: Amorós, L. Arama, E. Bazzarin, W. Bonachd …
Phrase detectives: Utilizing collective intelligence for internet-scale language resource creation
M Poesio, J Chamberlain, U Kruschwitz… – ACM Transactions on …, 2013 – dl.acm.org
Phrase Detectives: Utilizing Collective Intelligence for Internet-Scale Language Resource Creation. Massimo Poesio, Jon Chamberlain, and Udo Kruschwitz, University of Essex; Livio Robaldo, University …
Lexical simplification: optimising the pipeline
M Shardlow – 2015 – pdfs.semanticscholar.org
… Text Simplification Natural Language Generation Machine Translation Natural Language Processing … as an automated machine translation problem. It can also be considered as a natural language generation problem. TS is highly related to the field of text summarisation …
Early Cases of Bertillon, the Logic Programming Sleuth
SDJ Barbosa, ES Lima, AL Furtado… – Proceedings of the XII …, 2013 – sbgames.org
… 1 cf. http://www.gutenberg.org/files/3070/3070-h/3070-h.htm 2 http://en.wikipedia.org/wiki/Alphonse_Bertillon SBC – Proceedings of SBGames 2013 Computing Track – Full Papers XII SBGames – São Paulo – SP – Brazil, October 16th – 18th, 2013 …
Deconstructing “eco-terrorism”: rhetoric, framing and statecraft as seen through the insight approach
M Loadenthal – Critical Studies on Terrorism, 2013 – Taylor & Francis
Critical Studies on Terrorism, 2013 Vol. 6, No. 1, 92–117, http://dx.doi.org/10.1080/17539153.2013.765702 ARTICLE Deconstructing “eco-terrorism”: rhetoric, framing and statecraft as seen through the Insight approach Michael Loadenthal …
Nuevas tendencias en fundamentos teóricos y aplicaciones de la minería de datos aplicada a la clasificación de textos en lenguaje natural
JM Carmona Cejudo – 2013 – riuma.uma.es
Nuevas tendencias en fundamentos teóricos y aplicaciones de la minería de datos aplicada a la clasificación de textos en lenguaje natural José María Carmona Cejudo Directores: Rafael Morales Bueno Manuel Baena García …
The Content Machine: Towards a theory of publishing from the printing press to the digital network
M Bhaskar – 2013 – books.google.com
The Content Machine. ANTHEM SCHOLARSHIP IN THE DIGITAL AGE Anthem Scholarship in the Digital Age investigates the global impact of technology and computing on knowledge and society. Tracing transformations …
The Endgame or a Wake?: Tropes of Circularity in Literature Then and Now
D Marques – CounterText, 2016 – euppublishing.com
Narrative encoding for computational reasoning and adaptation
SM Harmon – 2017 – search.proquest.com
Narrative Encoding for Computational Reasoning and Adaptation. Abstract. Narratives provide a powerful means of making sense of our world. They are cognitive tools that we use to examine and shape ourselves, as well as the environment around us …
MMOs from the Inside Out
RA Bartle – 2016 – Springer
Richard A. Bartle. MMOs from the Inside Out: The History, Design, Fun, and Art of Massively-Multiplayer Online Role-Playing Games …
Natural Language Processing Through Different Mathematical And Statistical Tools
RK Bisht – 2014 – ir.inflibnet.ac.in
… Part-of-speech tagging: Annotating natural language sentences or texts with parts-of-speech. • Natural language generation: Producing natural language sentences or texts from non-linguistic representations. • Machine translation: Translating sentences or texts in a source …
Character modeling through dialogue for expressive Natural Language Generation
GI Lin – 2016 – search.proquest.com
… Second, significant technical work is needed to support procedural generation of character dialogue for any domain and … In addition, they used the Google N-Gram Corpus and Project Gutenberg to help select different … Figure 3.3: Natural Language Generation (NLG) Architecture …
Looking behind the text-to-be-seen: Analysing Twitter bots as electronic literature
V Lampi – 2017 – aaltodoc.aalto.fi
… Furthermore, the term ‘electronic’, or ‘digital’ which is used by some scholars, is called into question both because the history of procedural generation of text is much longer than the history of computers and because computers are by no means necessary for this kind of text …
Generation and Dramatization of Detective Stories
SDJ Barbosa, ES Lima, AL Furtado… – SBC Journal on Interactive …, 2014 – icad.puc-rio.br
… Section III exposes our view of the detective stories (footnote 1: cf. http://www.gutenberg.org/files/3070/3070-h/3070-h.htm; footnote 2: http://en.wikipedia.org/wiki/Alphonse_Bertillon). SBC Journal on Interactive Systems, volume 5, number 2, 2014. ISSN: 2236-3297 …
Technology and digital initiatives: innovative approaches for museums
J Decker – 2015 – books.google.com
Technology and Digital Initiatives: Innovative Approaches for Museums. Edited by Juilee Decker. About the Series: The Innovative …
Computational Linguistics and Intelligent Text Processing: 14th International Conference, CICLing 2013, Samos, Greece, March 24-30, 2013, Proceedings
A Gelbukh – 2013 – books.google.com
… Clustering and categorization: 6, 23, 26; Summarization: 5, 32, 16; Morphology: 5, 24, 21; Word sense disambiguation: 5, 19, 26; Named entity recognition: 4, 20, 20; Noisy text processing and cleaning: 4, 17, 24; Social networks and microblogging: 4, 13, 31; Natural language generation: 3, 11 …
MultiLingMine 2016: modeling, learning and mining for cross/multilinguality
D Ienco, M Roche, S Romeo, P Rosso, A Tagarelli – 2016 – agritrop.cirad.fr
MultiLingMine 2016: Modeling, Learning and Mining for Cross/Multilinguality. Salvatore Romeo (1), Andrea Tagarelli (2), Dino Ienco (3), Mathieu Roche (4), and Paolo Rosso (5). (1) Qatar Computing Research Institute, Doha …
RST-style discourse parsing and its applications in discourse analysis
WV Feng – 2015 – search.proquest.com
Abstract. Discourse parsing is the task of identifying the relatedness and the particular discourse relations among various discourse units in a text. In particular …
On the Helmholtz principle for text mining
B Dadachev – 2015 – orca.cf.ac.uk
A thesis submitted in partial fulfilment of the requirement for the degree of Doctor of Philosophy. On the Helmholtz Principle for Text Mining. Boris Dadachev, School of Mathematics, Cardiff University & Hewlett-Packard Laboratories, Bristol. October 2015 …
Multi-Document Summarization and Semantic Relatedness
O Mogren – 2015 – cse.chalmers.se
Multi-Document Summarization and Semantic Relatedness. Olof Mogren, Department of Computer Science and Engineering, Chalmers University of Technology, University of Gothenburg. Gothenburg, Sweden 2015 …
Tietoon perustuvan pulmapelin suunnittelu-Tapaus: After Now Archeology
J Kemppainen – 2014 – aaltodoc.aalto.fi
Designing a Knowledge Based Puzzle Game. Case: After Now Archeology. Jaakko Kemppainen, Aalto University School of Arts, Design and Architecture, Department of Media, Media Lab Helsinki. Master’s Thesis, April 2014 …
From Event to Story Understanding
N Mostafazadeh – 2017 – search.proquest.com
… corpora. Also they found out that a personal story (narrative) corpus generated from Weblogs resulted in better performance than a corpus they had acquired from Project Gutenberg. COPA was also a SemEval 2012 task [Gordon et al., 2012]. The only competing …
Zipf’s Law in L1 Attrition
JM Vicente Flores – 2016 – dspace.library.uu.nl
Zipf’s Law in L1 Attrition: A study of lexical diversity. Juan Miguel Vicente Flores. Supervisors: Marjolein van Egmond, Sergey Avrutin. Second reader: Gerrit Bloothooft. Thesis submitted in partial fulfillment of the requirements for the Research Master’s program …
Automated Extraction of Action Semantics for Embodied Virtual Agent Using Textual Knowledge Bases
JT Balint – 2017 – search.proquest.com
Abstract. In games, training environments, and other virtual simulations, non-player characters (virtual agents) perform actions …
Predicting text quality: metrics for content, organization and reader interest
AP Louis – 2013 – search.proquest.com
Abstract. When people read articles—news, fiction or technical—most of the time if not always, they form perceptions about its quality. Some articles are well-written and others are poorly written …