Zhang Maiwen, MSc in Computer Science, Computing Lab, University of Oxford
summaries for breaking news
By Maiwen Zhang
Supervisor: Dr. Stephen Clark
Automatic text summarization is an active field of research in both the Information Retrieval (IR) and Natural Language Processing (NLP) communities, since it provides an efficient way to access very large repositories of data. This dissertation combines information retrieval, clustering and extractive multi-document summarization to produce background summaries for a user's query (a piece of breaking news), drawing on a wired-news collection of 330,000 documents. Using the single-event clustering method and the event-based summarization model introduced in the dissertation, the system produced chronologically ordered summaries that gave good background information for the user's input query. The summarizer captured the important sentences and minimized internal redundancy. Moreover, the extracted sentences were arranged in their natural order.
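To make the retrieve, select and order flow described above more concrete, here is a minimal sketch in Python. It is not the dissertation's method: the single-event clustering step is omitted, and the event-based summarization model is replaced by a generic greedy selection that rewards similarity to the query and penalizes similarity to sentences already chosen. The function names (`bag`, `cosine`, `background_summary`), the parameters, and the `(date, text)` document format with ISO date strings are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def bag(text):
    """Lowercased bag-of-words term counts (hypothetical helper)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def background_summary(query, docs, top_docs=2, max_sentences=3,
                       redundancy_weight=0.7):
    """docs is a list of (iso_date, text) pairs; returns (date, sentence) pairs."""
    q = bag(query)

    # 1. Retrieval: keep the documents most similar to the query.
    ranked = sorted(docs, key=lambda d: cosine(q, bag(d[1])), reverse=True)
    ranked = ranked[:top_docs]

    # 2. Candidate sentences, remembering each one's date and position.
    candidates = []
    for date, text in ranked:
        sentences = re.split(r"(?<=[.!?])\s+", text.strip())
        for pos, sent in enumerate(sentences):
            if sent:
                candidates.append((date, pos, sent, bag(sent)))

    # 3. Greedy extraction: relevance to the query minus a penalty for
    #    overlap with sentences that were already selected.
    chosen = []
    while candidates and len(chosen) < max_sentences:
        def gain(c):
            overlap = max((cosine(c[3], s[3]) for s in chosen), default=0.0)
            return cosine(q, c[3]) - redundancy_weight * overlap

        best = max(candidates, key=gain)
        candidates.remove(best)
        chosen.append(best)

    # 4. Ordering: chronological across documents, original order within one.
    chosen.sort(key=lambda c: (c[0], c[1]))
    return [(date, sent) for date, _pos, sent, _bag in chosen]
```

Calling `background_summary("volcano eruption forces evacuation", docs)` on a small list of dated articles returns the selected sentences sorted by date. A real system over 330,000 documents would need an inverted index and a proper sentence splitter rather than the regular expressions assumed here.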