next up previous
Next: About this document

Copyright Notice: The following set of slides are copyrighted by Kavi Mahesh, the Computing Research Lab at New Mexico State University, and its affiliated members of the Mikrokosmos project. No part of this may be used for commercial purposes (including consultance services). Any use of these slides must acknowledge the the authors and CRL, NMSU.

If you use the material in these slides in your own writings or presentations, we would appreciate if you can send us a copy of such articles.

Thank you!

Kavi Mahesh,
CRL, NMSU

in

A Situated Ontology for Practical NLP

Kavi Mahesh

What is an Ontology?

The branch of metaphysics that studies the nature of existence.

No........

An ontology is a database with information about

The Mikrokosmos View of an Ontology

{

Ontology Development: Desiderata

Terminology

Types of Ontologies

Why do we need an ontology?

What has an ontology got to do with NLP?

Why do we need an ontology?

What has an ontology got to do with Lexical Semantics?

Topology

Acquisition Methodology

Technology

Tools for acquisition:

Tools for lexicon interactions:

Tools for maintenance and quality control:

Guidelines: What Not to Add

  1. Do not add instances.
  2. Do not decompose unnecessarily.
  3. Do not add if there is already one ``close'' to it. E.g., ``suggest'' = ``urge.''
  4. Do not add collections; use the set notation.
  5. Do not add language-specific stuff.
  6. Do not add specialized events with particular arguments.

Guidelines: Naming a Concept

  1. Use ``scientific'' rather than lay terms.
  2. Use the English words.
  3. Use only alphabetic characters and `-'.
  4. Do not use plurals in concept names.
  5. Consistency across concepts is more important than conformance with a dictionary. E.g., for-profit and non-profit; not for-profit and nonprofit.
  6. Do not use names longer than three words.
  7. Avoid compound nouns. E.g., do not use time-unit; use unit-of-time instead.
  8. Use shorter names for more common word senses. E.g., bank and bank-river.
  9. For relation names, append typical prepositions. E.g., employed-by and employer-of.
  10. Use a word in only one sense. E.g., If grocery-store, then no store-medicine; use preserve-medicine instead; but then no peach-preserve.

Pathological Problems

MIKROKOSMOS: The Task

Automatic Natural-Language Interpretation

Applications:

MIKROKOSMOS: Research

Mikrokosmos: Engineering

Sample Input

Roche Compra Docteur Andreu

El grupo Roche, a través de su compañía en España, adquirió el laboratorio farmacéutico Doctor Andreu, se informó hoy aquí.

La comunicación oficial no precisó el monto de la operación realizada entre Productos Roche SA y Unión Explosivos Río Tinto SA, hasta ahora mayoritaria en el accionariado.

Fuentes financieras consultadas cifraron la operación en unos 10.000 millones de pesetas. Según el acuerdo firmado hoy en Madrid, los productos del Doctor Andreu continuarán siendo producidos y comercializados con el mismo nombre. Doctor Andreu, cuya fama la obtuvo a partir de las "pastillitas" para la tos, está bien introducido en las áreas de cardiología, reumatología y especialidades publicitarias.

Las actividades del grupo Roche, con sede central en Basilea (Suiza), incluyen el desarrollo, la producción y la comercialización de medicamentos, productos para el diagnóstico, así como de vitaminas y productos químicos.

A nivel mundial, cuenta con compañías en más de 50 países con casi 50.000 empleados. Doctor Andreu es una compañía farmacéutica dedicada a la producción y comercialización de fármacos y productos veterinarios. Con sede en Barcelona, cuenta con más de 400 empleados.

En el ejercicio pasado facturó unos 3.490 millones de pesetas.

En 1988, el Grupo Roche alcanzó unas ventas totales de 8.690 millones de francos suizos, de las que aproximadamente un 41 por ciento correspondieron a su división farmacéutica. El beneficio neto -el mejor de su historia- se elevó a 641,5 millones de francos suizos y la rentabilidad sobre las ventas aumentó del 6,3 al 7,4 por ciento.

El "cash flow" se incrementó en un 21 por ciento, alcanzando 1.179 millones de francos o el 14 por ciento de las ventas del grupo.

Las inversiones en investigación y desarrollo (I+D) fueron de 1.210 millones de francos suizos, el 14 por ciento del total de sus ventas.

Productos Roche cuenta con una plantilla de 600 personas y alcanzó unas ventas totales de 9.747 millones de pesetas, un 12,5 por ciento superiores al año 1987.

Sus beneficios fueron de 218 millones y el "cash flow" de 356 millones. Las inversiones realizadas totalizaron 223 millones de pesetas.

Sample Output: TMR

Contents of a TMR

Each proposition has:

Knowledge Acquisition for Large-Scale NLP

Lexicon acquisition:

Knowledge Acquisition for Large-Scale NLP

What is an Ontology?

Why do we need it?

Knowledge Acquisition for Large-Scale NLP

How do we acquire an ontology?

in

An Ontology for Multilingual NLP

The Mikrokosmos Ontology

Ontology and Lexical Semantics

Example: Swim and Float

English: Swim SWIM; Float FLOAT.

in

Spanish: Nadar SWIM; Flotar FLOAT.

in

Russian: Plyt' SWIM or FLOAT; Plavat' SWIM or FLOAT.

in

What concepts do we need to represent these verb meanings?

Example: Shovel and Clear

English:

Shovel SHIFT-MATERIAL;

Clear SHIFT-MATERIAL.

in

Spanish:

Limpiar (Clear) SHIFT-MATERIAL;

``Quitar con pala'' (Shovel): SHIFT-MATERIAL.

Ontology Acquisition

What concepts do we need:

Conceptual Dimensions: Swim, Float, ...

Why are we building an ontology?

What do we mean by an ontology?

An ontology for NLP purposes is a knowledge base that:

Nature of the ontology:

What have we built so far?

The Mikrokosmos ontology:

What do we care about our ontology?

The 5 commitments:

1.
Broad coverage
Why: Input texts are real-world, unedited, and unrestricted.
2.
Rich properties and interconnections
Why: need to check how well constraints are satisfied.
3.
Ease of understanding, searching and browsing
Why: Non-expert lexicographers need to find concepts given only a rough word "sense."
4.
NLP-oriented: developed for machine translation
Why: Purpose is (literal) meaning representation.
5.
Economy/cost-effectiveness/tractability
Why: we don't have several person-centuries; we only have several person months.

How to conform to the commitments?

Ontology for MT Encyclopedia

Types of knowledge we don't need:


Kavi Mahesh
Sun Nov 12 15:14:34 MST 1995