| Date |
Title
Instructor(s) |
Abstract |
Monday,
June 28 |
"Ecological"
Issues in Language Engineering
Jim Cowie and Bill Odgen
Guest Lecturer: Mark Leisher, Eugene Ludovik and Ron
Zacharski |
This course will cover issues related to
writing systems, encodings, input and output methods; treatment of punctuation, special
characters and symbols, including mark-up; processing of dates and numbers; and a variety
of issues connected with managing large multilingual collections of documents featuring
different mark-up styles. A number of computational tools will be introduced and
used in practical exercises. |
Tuesday/Wednesday,
June 29-30 |
Approaches to
Computational Morphology
Kemal Oflazer
Guest Lecturers: Stephen Helmreich, Karine Megerdoomian,
and Svetlana
Sheremetyeva |
After a presentation of several approaches
to computational morphology, with example systems for such widely different languages as
Spanish, Persian, Russian and Turkish, this course will concentrate on the engineering of
state-of-the-art morphological analysis and generation systems, especially for languages
other than English. The students will also get hands-on experience using sophisticated
development and testing tools, by building a morphological analyzer. |
Thursday/Friday,
July 1-2 |
Lexicon
Acquisition for NLP I:
Morphology and Syntax
Rémi Zajac
Guest Lecturer: Svetlana Sheremetyeva |
This course will describe the process of
design and acquisition of several types of lexicons for NLP systems: lexicons supporting
morphological and syntactic analysis of texts in a language, transfer lexicons for machine
translation and multilingual onomastica (lexicons of proper names). A number of
acquisition interfaces will be used in practical exercises. |
Monday/Tuesday,
July 5-6 |
Lexicon
Acquisition for NLP II:
Ontological Semantics
Sergei Nirenburg
Guest Lecturers: Hyopil Shin |
This course will present the design and
acquisition of static knowledge sources to support analysis of meaning in natural language
texts. In particular, it will cover designing and building ontologies, or world models,
for NLP and lexicons for the support of semantic analysis of particular languages.
Practical exercises will be supported by interactive acquisition interfaces. |
Wednesday/Thursday
July 7-8 |
Knowledge
Elicitation from Informants
Sergei Nirenburg
Guest Lecturers: Jim Cowie and David Farwell |
This course will present an environment for
eliciting grammatical and lexical knowledge about a language from a user who knows that
language and English but is not a trained linguist. This kind of environment is a
realistic alternative to experimenting with automatic elicitation of language knowledge.
It combines corpus-based, expectation-based and failure-driven acquisition of declarative
knowledge about a language and is most useful for the languages for which few
computational resources are available. The design of the acquisition process and system
will be discussed, and the interface, Boas, will be used in practical exercises. |
Friday
July 9 |
A Survey of
Language Engineering Applications
Jim Cowie, Sergei Nirenburg, Bill Odgen and Rémi Zajac |
This course will introduce language
engineering applications such as machine translation, information retrieval, information
extraction, text summarization and language instruction. The tasks and techniques learned
in the other courses will be put in their context and further illustrated in the
discussion of the various systems devoted to particular applications. The following
systems will be presented and made available for laboratory work: the Corelli machine
translation environment; the MINDS information retrieval and summarization system, the
URSA cross-language information retrieval engine, the Oleada language instruction
environment and translators tool set, the Mikrokosmos machine translation system,
and the Expedition environment for configuring machine translation systems for low-density
languages. |