MINDS - Core Summarization Engine




The core summarization engine was originally developed in 1996 under the direction of Dr. Kavi Mahesh. It was called HyperGen as it generated summaries in hyper-text format, retaining, but hiding portions of a document under labelled links. We intend to explore this idea of generating hyper-text summaries further.

One of the key properties of the summarizer is its easy adaptation to other languages. The main requirments are word and sentence boundary recognition software, and a list of the most frequently occurring words in a language. The core engine now handles Spanish and English, and work is underway on Russian and Japanese.

The summarizer is written in Java. We intend to provide a simple interface to allow a user to modify the summarization parameters to suit a specific task and document set.


Updated on November 11, 1997. For problems with this Web site, send mail to webmaster@crl.nmsu.edu

Internal