A vector-based information retrieval engine that supports
fuzzy-term expansion, glob-style wildcard matching for 8-bit and
16-bit languages, proximity and Boolean matching operations, and IDF-weighted retrieval.
A document-signature information retrieval engine that supports fuzzy-term expansion, glob-style wildcard matching, phrase weighting, and rapid document deletion and replacement, with near-real-time indexing of new and modified documents.
8-bit clean indexing and querying.
Configurable server that can accommodate hundreds of resources for hundreds of clients.
Rapid prototyping of new resource indexes and client-server communication using the TCL/DP interfacing language.
Advanced tokenization tools for handling complex SGML markup of customer texts.
User-centered design of client interfaces for new resources.
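
As an illustration of the IDF-weighted vector retrieval named in the first item above, the following is a minimal Python sketch. It is not CRL's implementation: the function names (build_index, cosine, search), the whitespace tokenizer, and the sample documents are all assumptions made for the example. Each document becomes a sparse vector of term-frequency times IDF weights, and a query is ranked against the documents by cosine similarity.

```python
import math
from collections import Counter


def build_index(docs):
    """Build sparse TF-IDF vectors for a dict of doc_id -> text (hypothetical helper)."""
    tokenized = {doc_id: text.lower().split() for doc_id, text in docs.items()}
    n_docs = len(tokenized)
    # Document frequency: the number of documents in which each term appears.
    df = Counter(term for tokens in tokenized.values() for term in set(tokens))
    # Terms that occur in every document get an IDF of zero and so carry no weight.
    idf = {term: math.log(n_docs / count) for term, count in df.items()}
    vectors = {
        doc_id: {term: tf * idf[term] for term, tf in Counter(tokens).items()}
        for doc_id, tokens in tokenized.items()
    }
    return vectors, idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query, vectors, idf):
    """Return the ids of documents with a positive score, best match first."""
    q_tf = Counter(query.lower().split())
    q_vec = {term: tf * idf.get(term, 0.0) for term, tf in q_tf.items()}
    scores = {doc_id: cosine(q_vec, vec) for doc_id, vec in vectors.items()}
    hits = [doc_id for doc_id, score in scores.items() if score > 0.0]
    return sorted(hits, key=lambda doc_id: scores[doc_id], reverse=True)


# Illustrative sample data only; not taken from any CRL resource.
sample_docs = {
    "fr": "president of france",
    "de": "chancellor of germany",
    "us": "president of the united states",
}
sample_vectors, sample_idf = build_index(sample_docs)
ranked = search("president france", sample_vectors, sample_idf)
```

In this sketch the word "of" appears in every sample document, so its IDF is zero and it does not influence the ranking, which is the effect IDF weighting is meant to have on uninformative terms.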
Status
For the Department of Defense, CRL has developed Chiefs-of-State and Gazetteer resources that use the vector-based retrieval engine to supply breakdowns of country-specific information on demand to custom interfaces in Oleada.
In the Chiefs-of-State resource, queries can be made by country, person, or
office, with fuzzy matching strategies supplying near misses
automatically if an initial search fails. Using the Gazetteer
resource, place names can be rapidly associated with their country or region, and map coordinates can also be searched or retrieved.
In each case, the powerful full-text indexing strategies offer more flexibility in searching than a traditional database scheme could, while offloading the processing and memory overhead of the full-text indexes to a single central server.
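
The fallback behavior described above, where near misses are offered only after an exact search fails, can be sketched in Python as follows. This is an assumption-laden illustration rather than the engine's actual fuzzy-term expansion: it uses the standard library's difflib.get_close_matches as a stand-in matching strategy, and the lookup function, its parameters, and the sample place names are hypothetical.

```python
import difflib


def lookup(query, records, max_near=3, cutoff=0.6):
    """Try an exact match first; return near misses only if that fails.

    `records` maps a lowercase name to its associated value (hypothetical layout).
    """
    key = query.lower()
    if key in records:
        return [(key, records[key])]
    # Fallback: rank the known names by similarity to the failed query.
    near = difflib.get_close_matches(key, list(records), n=max_near, cutoff=cutoff)
    return [(name, records[name]) for name in near]


# Illustrative gazetteer-style data: place name -> country.
sample_places = {
    "albuquerque": "United States",
    "las cruces": "United States",
    "ciudad juarez": "Mexico",
}

exact = lookup("Las Cruces", sample_places)
near_miss = lookup("albuquerqe", sample_places)  # misspelled on purpose
```

Keeping the exact lookup ahead of the fuzzy pass means a correctly spelled query never pays the cost of similarity ranking, and the cutoff parameter controls how loose a near miss may be before it is suppressed.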