How do I extract data from news stories?
Information extraction in general is closely aligned with Automatic summarization, if not a sub-task. Information extraction may also be known as Text mining. Large scale, or “web scale”, information extraction may be known as “Open Information Extraction” (as in Open IE project), for which a number of tools are available. TEXminer might be a basic tool for news data extraction.
For details, see my quick and dirty webpages: