Natural Language Processing with Java


Natural Language Processing with Java (2015) .. by Richard Martin Reese


Table of Contents

1: Introduction To NLP

  • What is NLP?
  • Why use NLP?
  • Why is NLP so hard?
  • Survey of NLP tools
  • Overview of text processing tasks
  • Understanding NLP models
  • Preparing data
  • Summary

2: Finding Parts Of Text

  • Understanding the parts of text
  • What is tokenization?
  • Simple Java tokenizers
  • NLP tokenizer APIs
  • Understanding normalization
  • Summary

3: Finding Sentences

  • The SBD process
  • What makes SBD difficult?
  • Understanding SBD rules of LingPipe’s HeuristicSentenceModel class
  • Simple Java SBDs
  • Using NLP APIs
  • Training a Sentence Detector model
  • Summary

4: Finding People And Things

  • Why NER is difficult?
  • Techniques for name recognition
  • Using regular expressions for NER
  • Using NLP APIs
  • Training a model
  • Summary

5: Detecting Part Of Speech

  • The tagging process
  • Using the NLP APIs
  • Summary

6: Classifying Texts And Documents

  • How classification is used
  • Understanding sentiment analysis
  • Text classifying techniques
  • Using APIs to classify text
  • Summary

7: Using Parser To Extract Relationships

  • Relationship types
  • Understanding parse trees
  • Using extracted relationships
  • Extracting relationships
  • Using NLP APIs
  • Extracting relationships for a question-answer system
  • Summary

8: Combined Approaches

  • Preparing data
  • Pipelines
  • Creating a pipeline to search text
  • Summary