0% found this document useful (0 votes)

50 views20 pages

SEM-I: Why and What?

This document discusses the SEM-I, which is a semantic interface specification that defines the semantic representations output by natural language grammars. The SEM-I specifies things like the syntax of representations, naming conventions, and attributes of variables. It serves as an interface between grammars and applications, allowing applications to understand the expected representations. The document outlines plans to develop the SEM-I, including automatically generating parts from grammars, documenting it, and establishing a change protocol. It also discusses how the SEM-I could be extended in the future to include more semantics through a proposed SEM-I++.

Uploaded by

Raj Mehta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

50 views20 pages

SEM-I: Why and What?

Uploaded by

Raj Mehta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 20

SEM-I: why and what?

Overview
Interfacing grammars to other systems
via semantics: requirements
What is in the SEM-I?
SEM-I tools
Some modest proposals ...
SEM-I ++

Modular architecture
Language independent component
Meaning representation (MRS/RMRS)

Language dependent analysis/realization

(DELPH-IN grammar)

string

Semantics as interface

Applications need to know what

representations to expect / deliver:

Deep/shallow integration via RMRS

transfer component for MT

query answering
information extraction, etc

RMRS from shallow grammars is an underspecified

form of semantics from deep grammars
treats deep grammars as normative, so need to
know their output

Explaining what were doing!

What must be specified

Syntax of representation (XML)

Formalism (MRS/RMRS)
Naming conventions
Attributes and values on variables
Relations, features, constant values, variable
sorts, optionality

`grammar relations (e.g., udef_q_rel)

open-class relations (e.g., _interview_v_rel)

Hierarchy of relations (where motivated by

denotation)

Consultants were interviewed

by Abrams
<mrs>
<var vid='h1'/>
<ep><pred>prpstn_m_rel</pred><var vid='h1'/>
<fvpair><rargname>MARG</rargname><var vid='h3'/></fvpair></ep>
<ep><pred>udef_q_rel</pred><var vid='h6'/>
<fvpair><rargname>ARG0</rargname><var vid='x4'/></fvpair>
<fvpair><rargname>RSTR</rargname><var vid='h7'/></fvpair></ep>
<ep><pred>_consultant_n_rel</pred><var vid='h9'/>
<fvpair><rargname>ARG0</rargname><var vid='x4'/></fvpair></ep>
<ep><pred>_interview_v_rel</pred><var vid='h10'/>
<fvpair><rargname>ARG0</rargname><var vid='e2'/></fvpair>
<fvpair><rargname>ARG1</rargname><var vid='x11'/></fvpair>
<fvpair><rargname>ARG2</rargname><var vid='x4'/></fvpair></ep>
<ep><pred>_by_p_cm_rel</pred><var vid='h10'/>
<fvpair><rargname>ARG0</rargname><var vid='e13'/></fvpair>
<fvpair><rargname>ARG1</rargname><var vid='u12'/></fvpair>
<fvpair><rargname>ARG2</rargname><var vid='x11'/></fvpair></ep>
<ep><pred>proper_q_rel</pred><var vid='h14'/>
<fvpair><rargname>ARG0</rargname><var vid='x11'/></fvpair>
<fvpair><rargname>RSTR</rargname><var vid='h15'/></fvpair></ep>
<ep><pred>named_rel</pred><var vid='h17'/>
<fvpair><rargname>ARG0</rargname><var vid='x11'/></fvpair>
<fvpair><rargname>CARG</rargname><constant>abrams</constant></fvpair></ep>
<hcons hreln='qeq'><hi><var vid='h3'/></hi><lo><var vid='h10'/></lo></hcons>
<hcons hreln='qeq'><hi><var vid='h7'/></hi><lo><var vid='h9'/></lo></hcons>
<hcons hreln='qeq'><hi><var vid='h15'/></hi><lo><var vid='h17'/></lo></hcons>
</mrs>

Some issues

Specification/documentation:

treatment of bare plural, message relations

defining when such relations are present
arity and correspondence of arguments for
_interview_v_rel etc

`unwanted predicates such as _by_p_cm_rel

(some of these are going/gone can all be avoided?)
qeqs etc can be ignored for analysis for some
applications, not for realisation (currently)
changes to grammars: e.g., message relations?

SEM-I: semantic interface

Formal level: MRS/RMRS syntax and

semantics, naming conventions
(_lemma_POS[_sense])
Meta-level: variable feature values; manually
specified `grammar relations

udef_q_rel (construction)
named_rel, proper_q_rel (`fixed lexical
relations)

Object-level (e.g., _consultant_n_rel)

SEM-I and grammars

Object levels SEM-Is are auto-generated and distinct

for each grammar
Meta-level SEM-Is should be (partially) shared

object

Definition of `correct (R)MRS for developers

Documentation
Checking of test-suites

Online

SEM-I plus lexical link used in lexical lookup phase

of generation (already)
rejection of invalid (R)MRSs (input to generator,
deep/shallow integration)
patching up input to generation, fixing up output
from parser

SEM-I: implementation
(current and planned)

Database of relations, features, value sorts,

optionality:

Meta-level: plan to generate from grammars, with

manual identification of relations (some relations
are grammar-internal, see later) and manual
documentation
Object-level: auto-generated from lexical entries
in deep grammars (current version is based on
generator code optionality not there yet)

Semantic test suite exemplifying grammar

relations (partial for ERG, in progress for
other grammars)

SEM-I development

SEM-I development must be incremental

SEM-I eventually forms the `API: stable, changes
negotiated.

Grammar writers need flexibility to hide things, make

changes: SEM-I only constrains the external view

Shared meta-level SEM-I is presumably part of Matrix, but

negotiated with consumers
Management needs to be worked out

BUT: automate production of SEM-I from grammars as much

as possible

Documentation needs to be automated as much as

possible: documentation by example

Interface

External representation: (R)MRSSEM-I

public, documented
reasonably stable

Internal representation

mapping to feature structures (MRSFS)

MRSSEM-I to MRSFS mapping needed anyway, but may have to go via

MRSINTERNAL to MRSFS mapping

distinctions between relations which are irrelevant for denotation

are hidden: only some relations are public
e.g., `selected for relations are internal only

External/Internal inter-conversion

e.g., internal-only relation automatically converted to supertype in

output

BUT: want to minimize the discrepancies

relation hierarchies in SEM-I consistent with grammar hierarchies

Architecture with indirection

External LF (defined by SEM-I)

Internal LF

parser/generator

String

bidirectional
mapping

Semi-automated
documentation
[incr tsdb()]

Lex DB
grammar

Object-level
SEM-I

Documentation
strings

and semantic
test-suite
Auto-generate
examples

semi-automatic

examples,
autogenerated
on demand

Documentation
Meta-level
SEM-I

autogenerate

Hierarchies

Type hierarchies of relations in grammars are not there to support

inference
GLB condition not needed for SEM-I
Proposal: basic SEM-I hierarchy of grammar relations derived
automatically from grammar type hierarchy plus marking of relations
as in SEM-I. (Possibly augmented in SEM-I ++, see later)
type1

type1

type3

type2

type4

grammar

type2

type5

type4

SEM-I

type5

Proposals

Documentation on wiki, mailing list for SEM-I developers and

consumers
MRS code to support particular TFS encoding of MRSs and
enforce naming conventions, simplifying basic MRSFS to MRS
mapping and making grammars more consistent
Allow substantive MRSINTERNAL to MRSSEM-I mapping (via
transfer rule mechanism), but hope to keep this minimal since it
hinders deep/shallow integration.
Agreed procedure for adding/changing variable features and
values
Inventory of grammar predicates: extensions/changes by
grammar developers require notification and documentation

Change protocol (initial

proposal)
A developer (grammar developer or software developer)
implementing a change which will affect the SEM-I must follow
the protocol:
Consultation (meta-SEM-I only). Proposed changes to the
meta-SEMI-I must be discussed on the mailing list.
Notification. All changes to the SEM-I (meta and object) must
be posted on the website.
A script for conversion from new to old version must be posted
(unless an incompatible change is agreed by the list members)
Testing. For each grammar, there will be a semantic test suite,
with agreed SEM-I output (for a specified reading). All changes
to a grammar must be validated against the corresponding testsuite. All software changes must be validated against all testsuites. The conversion script must also be validated.
Commit changes.

Applications and the SEM-I

Application code will be isolated from

grammar changes
MT: semantic transfer mapping from one
SEM-I to another
IE: mapping from SEM-I to template (often
ignoring much of the detail in the original
MRS)
QA: matching RMRSs: SEM-I hierarchy
used for compatibility tests (also SEMI ++)

SEM-I++ (aka Floyd)

SEM-I++ is not built by grammar developers, depends on SEM-I, not

grammars
More semantics, domain-independent, shared between applications
Might include:

Definitions of grammar relations and closed-class relations to support

inference
Mapping to external resources (e.g., WordNet and FrameNet)
Enriched hierarchies
Word classes

word classes could support a richer encoding of thematic role e.g., experiencerstimulus psych verbs map ARG1 to EXP and ARG2 to STIM

Plan is to support specification of SEM-I++ in some version of OWL

SEM-I++ information is additional to grammars but DELPH-IN
community may agree to support it

Sem Tem
No ratings yet
Sem Tem
63 pages
Ccptoj
No ratings yet
Ccptoj
14 pages
NLP - Mid 2 Examination
No ratings yet
NLP - Mid 2 Examination
4 pages
Warpa N J. T' Ath: Aspects
No ratings yet
Warpa N J. T' Ath: Aspects
18 pages
AI Unit-5 PDF
No ratings yet
AI Unit-5 PDF
34 pages
Compiler Design 4
No ratings yet
Compiler Design 4
48 pages
Unit 1
No ratings yet
Unit 1
109 pages
Martial Arts - Taekwondo - Student Handbook
96% (25)
Martial Arts - Taekwondo - Student Handbook
54 pages
NLP Unit-3
No ratings yet
NLP Unit-3
37 pages
Semantic Analysis
No ratings yet
Semantic Analysis
108 pages
UNIT 4 New
No ratings yet
UNIT 4 New
14 pages
Name: Gapkwi S. Reuel REG NO: U21DLCS10193 Course: Cosc 408: A. What Is Analytic Grammar?
No ratings yet
Name: Gapkwi S. Reuel REG NO: U21DLCS10193 Course: Cosc 408: A. What Is Analytic Grammar?
8 pages
Unit 3 NLP New
No ratings yet
Unit 3 NLP New
15 pages
Natural Language Processing Unit 3
No ratings yet
Natural Language Processing Unit 3
55 pages
Unit 3-1
No ratings yet
Unit 3-1
66 pages
Unit 4
No ratings yet
Unit 4
107 pages
Unit III 1
No ratings yet
Unit III 1
11 pages
CC LL
No ratings yet
CC LL
15 pages
Lec04 Sematic Analysis
No ratings yet
Lec04 Sematic Analysis
83 pages
NLP JNTUH Unit 3
No ratings yet
NLP JNTUH Unit 3
19 pages
Setting Up The Oracle Warehouse Builder 11g Release 2 Tutorial Environment
No ratings yet
Setting Up The Oracle Warehouse Builder 11g Release 2 Tutorial Environment
36 pages
Note 5 PDF
No ratings yet
Note 5 PDF
7 pages
Computer SC Specimen QP Class Xi
0% (1)
Computer SC Specimen QP Class Xi
7 pages
Poc Syntax Directed
No ratings yet
Poc Syntax Directed
26 pages
Pert24 - NLP For Communication
No ratings yet
Pert24 - NLP For Communication
30 pages
Navigation Update User Guide: Business 1 Team Hyundai Mnsoft Dec, 2016
No ratings yet
Navigation Update User Guide: Business 1 Team Hyundai Mnsoft Dec, 2016
58 pages
Chapter 4 - 6
No ratings yet
Chapter 4 - 6
78 pages
Siva PHD Thesis
No ratings yet
Siva PHD Thesis
173 pages
Chapter 4 Semantic Analysis
No ratings yet
Chapter 4 Semantic Analysis
36 pages
Ravikant Maurya: Address: 602 D/19 F, Park View
No ratings yet
Ravikant Maurya: Address: 602 D/19 F, Park View
3 pages
Onsip Manual 1
No ratings yet
Onsip Manual 1
20 pages
Specifications
No ratings yet
Specifications
1 page
Natural Language Processing
No ratings yet
Natural Language Processing
41 pages
EV11 Whitepaper - Deploying IMAP Access To Enterprise Vault
No ratings yet
EV11 Whitepaper - Deploying IMAP Access To Enterprise Vault
25 pages
The Role of NNN in Zeolite Acidity and Activity
No ratings yet
The Role of NNN in Zeolite Acidity and Activity
25 pages
Quora Answers: India: What Can I Do To Change The People of India?
No ratings yet
Quora Answers: India: What Can I Do To Change The People of India?
2 pages
Outline of Chapter 9
No ratings yet
Outline of Chapter 9
24 pages
NLP Module3
No ratings yet
NLP Module3
27 pages
Run Time System and Intermediate Code Generation Eng 57
No ratings yet
Run Time System and Intermediate Code Generation Eng 57
10 pages
1507965390
No ratings yet
1507965390
17 pages
Citrix 1Y0-301 Deploying Citrix Xendesktop 7.6 Solutions
No ratings yet
Citrix 1Y0-301 Deploying Citrix Xendesktop 7.6 Solutions
33 pages
Unit 3 NLP
No ratings yet
Unit 3 NLP
7 pages
Apex Institute of Technology Bachelor of Engineering (Computer Science & Subject: Natural Language Processing Subject Code
No ratings yet
Apex Institute of Technology Bachelor of Engineering (Computer Science & Subject: Natural Language Processing Subject Code
18 pages
User's Manual: Stackable 8 / 16 Port Osd KVM Switch
No ratings yet
User's Manual: Stackable 8 / 16 Port Osd KVM Switch
17 pages
Baseball Simulator in C++
No ratings yet
Baseball Simulator in C++
7 pages
Traditional Approach
No ratings yet
Traditional Approach
18 pages
Hamish Whittal - Shell Scripting
No ratings yet
Hamish Whittal - Shell Scripting
272 pages
Training Certificates It
No ratings yet
Training Certificates It
25 pages
WP DGAs in The Hands of Cyber Criminals
No ratings yet
WP DGAs in The Hands of Cyber Criminals
6 pages
Chapter 4 Linked Stacks and Queues
No ratings yet
Chapter 4 Linked Stacks and Queues
56 pages
How To Compose Melodies
No ratings yet
How To Compose Melodies
2 pages
Mysql Friendster Casestudy
No ratings yet
Mysql Friendster Casestudy
4 pages
Sharepoint Developer
No ratings yet
Sharepoint Developer
3 pages
ST-Microcontrollers Tunis PFE 2010
No ratings yet
ST-Microcontrollers Tunis PFE 2010
14 pages
NLP Unit 2
No ratings yet
NLP Unit 2
48 pages
Apple Education Resource Guide - November 1996 - Volume Four Number Two
No ratings yet
Apple Education Resource Guide - November 1996 - Volume Four Number Two
24 pages
Aspect Oriented Programming With C++ and Aspect C++
No ratings yet
Aspect Oriented Programming With C++ and Aspect C++
39 pages
Semantic Analysis 231
No ratings yet
Semantic Analysis 231
53 pages
SR MR Iov
No ratings yet
SR MR Iov
63 pages
Online Shopping Mall
No ratings yet
Online Shopping Mall
17 pages
ONGC Blog - Job Openings
No ratings yet
ONGC Blog - Job Openings
2 pages
SSDDFJ v2 1 Luck Stokes
No ratings yet
SSDDFJ v2 1 Luck Stokes
13 pages
Control Engg
No ratings yet
Control Engg
80 pages
Pineapp™ Archive-Secure™: The Best Answer For All Businesses' Mail Archiving Needs
No ratings yet
Pineapp™ Archive-Secure™: The Best Answer For All Businesses' Mail Archiving Needs
6 pages
NLP Unit 3
No ratings yet
NLP Unit 3
20 pages
Cread Cwrite
No ratings yet
Cread Cwrite
67 pages
Anna University Tiruchirappalli Tiruchirappalli - 620 024
No ratings yet
Anna University Tiruchirappalli Tiruchirappalli - 620 024
64 pages
Etabs Tutorial Wall
100% (6)
Etabs Tutorial Wall
12 pages
Configuring AS400 For Emails
100% (2)
Configuring AS400 For Emails
10 pages
How To Convert A Numeric Value Into English Words in Excel
No ratings yet
How To Convert A Numeric Value Into English Words in Excel
4 pages
07 Semant
No ratings yet
07 Semant
36 pages
History of Cloud Computing Characteristic of Cloud Computing Layers of Cloud Computing Deployment Models
No ratings yet
History of Cloud Computing Characteristic of Cloud Computing Layers of Cloud Computing Deployment Models
21 pages
Computer Aided Design and Analysis
No ratings yet
Computer Aided Design and Analysis
25 pages
Semanti Roles PDF
No ratings yet
Semanti Roles PDF
105 pages
CH 4 - Semantic Analysis PDF
100% (1)
CH 4 - Semantic Analysis PDF
36 pages
Language Processors
No ratings yet
Language Processors
41 pages
Towards Creating Precision Grammars From Interlinear Glossed Text: Inferring Large-Scale Typological Properties
No ratings yet
Towards Creating Precision Grammars From Interlinear Glossed Text: Inferring Large-Scale Typological Properties
10 pages
SDT Material
No ratings yet
SDT Material
30 pages
Describing Syntax and Semantics: CSE 325/CSE 425: Concepts of Programming Language
No ratings yet
Describing Syntax and Semantics: CSE 325/CSE 425: Concepts of Programming Language
46 pages
Structure in Linguistics
No ratings yet
Structure in Linguistics
6 pages
Unit V Intelligence and Applications: Morphological Analysis/Lexical Analysis
No ratings yet
Unit V Intelligence and Applications: Morphological Analysis/Lexical Analysis
30 pages
SemanticsSpeechRecognitionUnderstanding PDF
No ratings yet
SemanticsSpeechRecognitionUnderstanding PDF
11 pages
The Parsing System "Palavras"
No ratings yet
The Parsing System "Palavras"
505 pages
This Research Was Supported in Part by The United States Air Force E!ectronic Systems Division, Under Contract FI96828-C-0035
No ratings yet
This Research Was Supported in Part by The United States Air Force E!ectronic Systems Division, Under Contract FI96828-C-0035
22 pages
Lecture1 - Compiler Design
No ratings yet
Lecture1 - Compiler Design
52 pages
Grammars: Before You Can Parse You Need A Grammar. So Where Do Grammars Come From?
No ratings yet
Grammars: Before You Can Parse You Need A Grammar. So Where Do Grammars Come From?
32 pages
Lexical Analyzer Synopsis Final
0% (1)
Lexical Analyzer Synopsis Final
20 pages
Centro Universitário de Barra Mansa Academic Pro-Rectory Computer Engineering Course
No ratings yet
Centro Universitário de Barra Mansa Academic Pro-Rectory Computer Engineering Course
34 pages
052 SyntaxDirectedTranslation
No ratings yet
052 SyntaxDirectedTranslation
57 pages
Unit - 5 Natural Language Processing
No ratings yet
Unit - 5 Natural Language Processing
66 pages
Descriptive Morphological Analysis in Montage
No ratings yet
Descriptive Morphological Analysis in Montage
56 pages
Natural Language Processing Artificial Intelligence
100% (2)
Natural Language Processing Artificial Intelligence
81 pages
Natural Language Processing
No ratings yet
Natural Language Processing
13 pages
Natural Language Processing
No ratings yet
Natural Language Processing
15 pages
Phases of Compiler
No ratings yet
Phases of Compiler
9 pages
Syntax-Directed Translation: Dewan Tanvir Ahmed Assistant Professor, CSE Buet
No ratings yet
Syntax-Directed Translation: Dewan Tanvir Ahmed Assistant Professor, CSE Buet
21 pages

SEM-I: Why and What?

Uploaded by

SEM-I: Why and What?

Uploaded by

SEM-I: why and what?

Language dependent analysis/realization

Applications need to know what

Deep/shallow integration via RMRS

transfer component for MT

RMRS from shallow grammars is an underspecified

Explaining what were doing!

What must be specified

Syntax of representation (XML)

`grammar relations (e.g., udef_q_rel)

Hierarchy of relations (where motivated by

Consultants were interviewed

treatment of bare plural, message relations

`unwanted predicates such as _by_p_cm_rel

SEM-I: semantic interface

Formal level: MRS/RMRS syntax and

Object-level (e.g., _consultant_n_rel)

SEM-I and grammars

Object levels SEM-Is are auto-generated and distinct

Definition of `correct (R)MRS for developers

SEM-I plus lexical link used in lexical lookup phase

Database of relations, features, value sorts,

Meta-level: plan to generate from grammars, with

Semantic test suite exemplifying grammar

SEM-I development must be incremental

Grammar writers need flexibility to hide things, make

Shared meta-level SEM-I is presumably part of Matrix, but

BUT: automate production of SEM-I from grammars as much

Documentation needs to be automated as much as

External representation: (R)MRSSEM-I

mapping to feature structures (MRSFS)

MRSSEM-I to MRSFS mapping needed anyway, but may have to go via

distinctions between relations which are irrelevant for denotation

e.g., internal-only relation automatically converted to supertype in

BUT: want to minimize the discrepancies

relation hierarchies in SEM-I consistent with grammar hierarchies

Architecture with indirection

Type hierarchies of relations in grammars are not there to support

Documentation on wiki, mailing list for SEM-I developers and

Change protocol (initial

Applications and the SEM-I

Application code will be isolated from

SEM-I++ (aka Floyd)

SEM-I++ is not built by grammar developers, depends on SEM-I, not

Definitions of grammar relations and closed-class relations to support

Plan is to support specification of SEM-I++ in some version of OWL

You might also like