Speaker specific utterance transcription with morphological analysis
entry ::= value+ punct value ::= mor_unit ["~" mor_unit]? mor_unit ::= [prefix "#"]? (basic_unit | compound_unit) [("&" | "-") suffix]? basic_unit ::= pos "|" word_type compound_unit ::= pos "|+" basic_unit "+" basic_unit+ word_type ::= english_word | foreign_word | letter | idiosyncratic_word english_word ::= <Any non-inflected English word> foreign_word ::= <Any non-English word> letter ::= <Any letter> idiosyncratic_word ::= <Any non-standard word used by the speaker in a regular linguistic context> pos ::= <See Part-of-Speech Code Table below> prefix ::= "anti" | "co" | "de" | "dis" | "mega" | "mini" | "mis" | "multi" | "non" | "out" | "over" | "pre" | "re" | "semi" | "super" | "un" | "under" | "up" suffix ::= <See Suffix Code Table below> punct ::= "." | "!" | "?" | "+..."
Prefix codes are selected from the following:
A series of words with accompanying morphological information.
Parts of speech codes have the following interpretations:
POS Code | Meaning |
---|---|
adj | adjective |
adj:n | adjective derived from noun |
adj:v | adjective derived from verb |
adv | adverb |
adv:adj | adverb derived from adjective |
adv:int | intensifying adverb |
adv:loc | locational adverb |
adv:tem | temporal adverb |
adv:wh | WH adverb |
aux | auxiliary verb |
co | communicator |
co:voc | vocative communicator |
conj:coo | coordinating conjunction |
conj:subor | subordinating conjunction |
det | determiner |
det:num | numeric determiner |
det:wh | WH determiner |
fil | filler |
inf | infinitive |
int | interjection |
n | noun |
n:adj | noun derived from adjective |
n:gerund | nominal gerund |
n:pt | plurale tantum noun |
n:v | noun derived from verb |
neg | negation |
on | onomatopoeia |
part | participle |
post | postposition |
prep | preposition |
pro | pronoun |
pro:dem | demonstrative pronoun |
pro:exist | existential pronoun |
pro:indef | indefinite pronoun |
pro:poss | possessive pronoun |
pro:poss:det | determiner possessive pronoun |
pro:refl | reflexive pronoun |
pro:wh | WH pronoun |
ptl | particle |
qn | quantifier |
rel | relativizer |
v | verb |
v:n | verb derived from noun |
Suffix codes have the following interpretations:
Suffix Code | Meaning |
---|---|
13S | first- and third-person singular |
1S | first-person singular |
3S | third-person singular |
ABLE | “able” verb to adj derivational suffix |
AGT | agentive suffix |
AL | “al” noun to adj derivational suffix |
COND | conditional verb suffix |
CP | comparative adj and adverb suffix |
DIM | diminutive suffix |
FULL | “full/ful” nominal suffix (spoonful) |
GERUND | gerund suffix |
ISH | “ish” adjective or noun to adj suffix |
LESS | “less” noun to adj suffix |
LIKE | “like” noun to adj suffix (winglike) |
LOOKING | “looking” adjective suffix (funnylooking) |
LY | “ly” adj to adverb suffix |
NESS | “ness” adj to noun suffix |
PAST | past tense marker |
PERF | perfect participle marker |
PL | plural marker |
POSS | possessive suffix |
PRES | present tense marker |
PROG | progressive participle marker |
SP | superlative adj and adverb marker |
Y | “y” verb or noun to adj suffix (burny) |
ZERO | null suffix for verbs with no explicit past tense |
None
Mor is dependent on the CHAT Column and the Enum Column.
Similar to the format of the Enum column, with the following differences:
Example:
CHAT: let's see, I'm going to make Dad's dinner &xxx .``
Enum: 1|let's 2|see 3|I 4|'m 5|going 6|to 7|make 8|Dad 9|'s 10|dinner 11|.``
Mor: aux|let's v|see pro|I~aux|be&1S aux|go-PROG inf|to v|make n:prop|Dad-POSS n|dinner .