Notes: Stanford Classifier is a machine learning tool for text classification. It is designed to be highly flexible and can be used for a wide range of classification tasks. It is particularly…
Search Results for: Mallet
Sentence Planner
Notes: It is correct that the term “tactic planning” is sometimes used to refer to the process of generating the syntactic structure of individual utterances, which is also known as sentence planning….
What are good alternatives to NLTK?
What are good alternatives to NLTK? Wikipedia contains an extensive listing of Natural language processing toolkits, under Outline of natural language processing. See also my recent answer to: Which NLP Library is…
Natural Language Processing Toolkits
Notes: Natural Language Processing toolkits like NLTK, Stanford NLP, and OpenNLP are well-known and widely used in the NLP community, and researchers and practitioners frequently cite and use these libraries in their…
Which NLP Library is most suitable for use and further development for a text mining startup?
Which NLP Library is most suitable for use and further development for a text mining startup? See answers to virtually identical question from 2011: · Which NLP library among the ones below…
Sentence Extraction
Notes: There are several options for extracting complete sentences from a text, including: Regular expressions: A pattern-matching method that can be used to identify and extract complete sentences from a text. Natural…
Ontology Learning & Dialog Systems
Notes: Ontology learning is the process of constructing an ontology, which is a formal representation of a set of concepts within a domain and the relationships between those concepts. Ontologies are used…
SLP (Spoken Language Programming)
Notes: Spoken programming refers to the use of natural language, such as English, to interact with a computer or programming environment in order to write and execute code. This can be done…
Apache OpenNLP & Dialog Systems
Notes: The OpenNLP library is used to perform natural language processing tasks such as tokenization, part-of-speech tagging, and named entity extraction. It is used in various applications such as information extraction, language…
OpenCalais
Notes: Calais is a service by Thomson Reuters that automatically extracts semantic information from web pages and other online content. Calais uses natural language processing (NLP) and other AI technologies to analyze…