Odd for TEI based dictionariesEduard Drenth (ed.)

Editor: Eduard Drenth2023-05

Schema tei_dictionaries: Elements

<article>

<article> An article (determiner) for a noun
Member of
Contained by
core: note
dictionaries: def
May contain Character data only
Content model
<content>
 <textNode/>
</content>
Schema Declaration
element article { text }

<body>

<body> A body contains a superEntry with entries (for example for homonyms) or one entry
Contained by
May contain
Content model
<content>
 <alternate>
  <elementRef key="superEntry"/>
  <elementRef key="entry"/>
 </alternate>
</content>
Schema Declaration
element body { superEntry | entry }

<cit>

<cit> A cit is used as a container holding either a quote, forms (@type='translation') or senses (@type='sensegroup'). When @type='translation', multiple quotes are allowed, when @type='collocation' nested cits of @type=example are allowed. All types can be described using a note, for @type='proverb' and @type='collocation' a definition may be added
Attributes
type
Status Required
Legal values are:
translation
A cit of this type contains a translation of either a form or a text
example
A cit of this type contains an example
collocation
A cit of this type contains collocations of a lemma, possibly with translations and examples
proverb
A cit of this type contains proverbs using a lemma, possibly with translations and examples
sensegroup
A cit of this type contains senses that can be seen as a group
Contained by
May contain
core: note
dictionaries: def
Schematron

<s:report test="(@type='sensegroup' and not(tei:sense)) or (tei:form and @type!='translation') or ((@type='example' or @type='collocation' or @type='proverb') and not(tei:quote))">
<s:value-of select="ancestor::tei:TEI/tei:teiHeader/tei:fileDesc/tei:sourceDesc/tei:msDesc/tei:msIdentifier/tei:idno"/>
: @type=sensegroup requires cit/sense, @type!=translation|sensegroup requires cit/quote,
form requires @type=translation
</s:report>
Schematron

<s:report test="@type='sensegroup' and *[not(self::tei:note or self::tei:sense)]">
<s:value-of select="ancestor::tei:TEI/tei:teiHeader/tei:fileDesc/tei:sourceDesc/tei:msDesc/tei:msIdentifier/tei:idno"/>
: @type=sensegroup may only contain sense and note
</s:report>
Schematron

<s:report test="count(tei:quote) > 1 and @type!='translation'">
<s:value-of select="ancestor::tei:TEI/tei:teiHeader/tei:fileDesc/tei:sourceDesc/tei:msDesc/tei:msIdentifier/tei:idno"/>
: multiple quotes requires @type=translation
</s:report>
Schematron

<s:report test="tei:def and @type!='collocation' and @type!='proverb'">
<s:value-of select="ancestor::tei:TEI/tei:teiHeader/tei:fileDesc/tei:sourceDesc/tei:msDesc/tei:msIdentifier/tei:idno"/>
: def is only for collocation and proverb
</s:report>
Schematron

<s:report test="@type='translation' and not(@xml:lang)">
<s:value-of select="ancestor::tei:TEI/tei:teiHeader/tei:fileDesc/tei:sourceDesc/tei:msDesc/tei:msIdentifier/tei:idno"/>
: translation requires @xml:lang
</s:report>
Schematron

<s:report test="tei:cit/tei:cit/tei:cit">
<s:value-of select="ancestor::tei:TEI/tei:teiHeader/tei:fileDesc/tei:sourceDesc/tei:msDesc/tei:msIdentifier/tei:idno"/>
: maximum nesting depth for cit is 2
</s:report>
Schematron

<s:report test="ancestor::tei:cit and @type!='translation' and @type!='example' and @type!='sensegroup'">
<s:value-of select="ancestor::tei:TEI/tei:teiHeader/tei:fileDesc/tei:sourceDesc/tei:msDesc/tei:msIdentifier/tei:idno"/>
: nested cit must be translation, example or sensegroup
</s:report>
Content model
<content>
 <alternate>
  <sequence>
   <elementRef key="sense"
    maxOccurs="unbounded"/>

   <elementRef key="noteminOccurs="0"/>
  </sequence>
  <sequence>
   <elementRef key="quote"
    maxOccurs="unbounded"/>

   <elementRef key="defminOccurs="0"
    maxOccurs="unbounded"/>

   <elementRef key="noteminOccurs="0"/>
   <elementRef key="citminOccurs="0"
    maxOccurs="unbounded"/>

  </sequence>
  <sequence>
   <elementRef key="form"
    maxOccurs="unbounded"/>

   <elementRef key="noteminOccurs="0"/>
  </sequence>
 </alternate>
</content>
Schema Declaration
element cit
{
   attribute type
   {
      "translation" | "example" | "collocation" | "proverb" | "sensegroup"
   },
   ( ( sense+, note? ) | ( quote+, def*, note?, cit* ) | ( form+, note? ) )
}

<def>

<def> A def contains a definition of a sense and may contain text (with hi, q and ref) followed by label, glosses or note.
Module dictionaries
Contained by
May contain
core: note
character data
Schematron

<s:report test="not(node())">
<s:value-of select="ancestor::tei:TEI/tei:teiHeader/tei:fileDesc/tei:sourceDesc/tei:msDesc/tei:msIdentifier/tei:idno"/>
: def should contain something
</s:report>
Content model
<content>
 <sequence>
  <macroRef key="macro.textLike"
   minOccurs="0"/>

  <elementRef key="etymminOccurs="0"/>
  <elementRef key="labelminOccurs="0"/>
  <elementRef key="glossminOccurs="0"
   maxOccurs="unbounded"/>

  <elementRef key="noteminOccurs="0"/>
 </sequence>
</content>
Schema Declaration
element def { macro.textLike?, etym?, label?, gloss*, note? }

<entry>

<entry> An entry contains at least a form, it optionally contains sense(s) of that form and translations, examples, proverbs or collocations in cit elements, also note and ref can be used. Cits directly under an entry are assumed to hold information valid for all senses.
Contained by
May contain
core: note
Schematron

<s:assert test="tei:form/@type='lemma'">
<s:value-of select="tei:form/tei:orth"/>: entry/form/@type must be of type "lemma"
</s:assert>
Schematron

<s:report test="tei:cit[@type='translation']/tei:quote or tei:sense/tei:cit[@type='translation']/tei:quote">
<s:value-of select="ancestor::tei:TEI/tei:teiHeader/tei:fileDesc/tei:sourceDesc/tei:msDesc/tei:msIdentifier/tei:idno"/>
: cit/@type='translation requires form, not quote'
<s:value-of select="ancestor::TEI/teiHeader/fileDesc/titleStmt/title"/>
</s:report>
Schematron

<s:report test="tei:cit[@type!='translation']/tei:form">
<s:value-of select="tei:cit[@type!='translation']/tei:form/tei:orth"/>: cit/form requires
cit/@type='translation'
</s:report>
Schematron

<s:report test="tei:sense/tei:cit[@type!='translation']/tei:form">
<s:value-of select="tei:sense/tei:cit[@type!='translation']/tei:form/tei:orth"/>: cit/form requires
cit/@type='translation'
</s:report>
Content model
<content>
 <sequence>
  <elementRef key="form"/>
  <alternate maxOccurs="unbounded"
   minOccurs="0">

   <elementRef key="note"/>
   <elementRef key="sense"/>
   <elementRef key="cit"/>
   <elementRef key="ref"/>
  </alternate>
 </sequence>
</content>
Schema Declaration
element entry { form, ( note | sense | cit | ref )* }

<form>

<form> A form contains at least an orth for the word, grammar information in gram elements may be added, for these universal dependencies terminology is used. Several other elements may be used to describe the form, as well as nested form elements for example for paradigm forms. A form may also contain text nodes, q and hi, use of which is discouraged preferably use note instead.
Attributes att.namekinds (@namekind)
type
Status Required
Legal values are:
lemma
This type indicates a headword, possibly with nested non-lemma forms
paradigm
This type indicates a paradigm form for the containing lemma, should be within a lemma form, normally doesn't contain nested forms
synonym
This type indicates a synonym form for the containing lemma, should be within a lemma form
variant
This type indicates a variant for the containing lemma, should be within a lemma form
compound
This type indicates a compound for the containing lemma, should be within a lemma form, shouldn't contain nested forms
Contained by
May contain
Schematron

<s:report test="@namekind and not(@type='lemma')">
<s:value-of select="tei:orth"/>: define @namekind on a lemma form
</s:report>
Schematron

<s:report test="@type='lemma' and parent::tei:form">
<s:value-of select="tei:orth"/>: A lemma form must be at the highest level of a form tree
</s:report>
Schematron

<s:report test="tei:form/tei:form/tei:form">
<s:value-of select="tei:orth"/>: maximum 2 levels of form nesting allowed
</s:report>
Content model
<content>
 <sequence>
  <elementRef key="orth"/>
  <alternate>
   <elementRef key="gramGrpminOccurs="0"
    maxOccurs="unbounded"/>

   <elementRef key="gramminOccurs="0"
    maxOccurs="unbounded"/>

  </alternate>
  <elementRef key="articleminOccurs="0"/>
  <elementRef key="hyphminOccurs="0"/>
  <alternate minOccurs="0"
   maxOccurs="unbounded">

   <elementRef key="pron"/>
   <elementRef key="stress"/>
   <elementRef key="form"/>
   <elementRef key="etym"/>
   <elementRef key="note"/>
  </alternate>
 </sequence>
</content>
Schema Declaration
element form
{
   att.namekinds.attributes,
   attribute type { "lemma" | "paradigm" | "synonym" | "variant" | "compound" },
   (
      orth,
      ( gramGrp* | gram* ),
      article?,
      hyph?,
      ( pron | stress | form | etym | note )*
   )
}

<gram>

<gram> A gram provides grammar information for a form using universal dependencies terminology. No @type is used for the category, this simplifies querying and makes an index possible
Member of
Contained by
core: note
dictionaries: def
May contain Empty element
Content model
<content>
 <valList type="closed">
  <valItem ident="islemma.yes">
   <desc>
    <ref target="https://universaldependencies.org/u/overview/morphology.html">https://universaldependencies.org/u/overview/morphology.html</ref> Boolean, is this a base form</desc>
  </valItem>
  <valItem ident="abbr.yes">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Abbr.html">https://universaldependencies.org/u/feat/Abbr.html</ref> Boolean feature. Is this an abbreviation?</desc>
  </valItem>
  <valItem ident="poss.yes">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Poss.html">https://universaldependencies.org/u/feat/Poss.html</ref> Boolean feature. Is this word possessive?</desc>
  </valItem>
  <valItem ident="reflex.yes">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Reflex.html">https://universaldependencies.org/u/feat/Reflex.html</ref> Boolean feature, typically of pronouns or determiners. It tells whether the word is reflexive, i.e. refers to the subject of its clause.?</desc>
  </valItem>
  <valItem ident="prefix.yes">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Prefix.html">https://universaldependencies.org/u/feat/Prefix.html</ref> Boolean feature, Is this a prefix word in a compound, that usually cannot stand on its own?</desc>
  </valItem>
  <valItem ident="prontype.prs">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/PronType.html">https://universaldependencies.org/u/feat/PronType.html</ref> personal pronoun or determiner</desc>
  </valItem>
  <valItem ident="prontype.rcp">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/PronType.html">https://universaldependencies.org/u/feat/PronType.html</ref> reciprocal pronoun</desc>
  </valItem>
  <valItem ident="prontype.art">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/PronType.html">https://universaldependencies.org/u/feat/PronType.html</ref> Article is a special case of determiner that bears the feature of definiteness</desc>
  </valItem>
  <valItem ident="prontype.int">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/PronType.html">https://universaldependencies.org/u/feat/PronType.html</ref> interrogative pronoun, determiner, numeral or adverb</desc>
  </valItem>
  <valItem ident="prontype.rel">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/PronType.html">https://universaldependencies.org/u/feat/PronType.html</ref> relative pronoun, determiner, numeral or adverb</desc>
  </valItem>
  <valItem ident="prontype.ind">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/PronType.html">https://universaldependencies.org/u/feat/PronType.html</ref> indefinite pronoun, determiner, numeral or adverb</desc>
  </valItem>
  <valItem ident="prontype.emp">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/PronType.html">https://universaldependencies.org/u/feat/PronType.html</ref> Emphatic pro-adjectives (determiners) emphasize the nominal they depend on.</desc>
  </valItem>
  <valItem ident="prontype.exc">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/PronType.html">https://universaldependencies.org/u/feat/PronType.html</ref> exclamative determiner</desc>
  </valItem>
  <valItem ident="prontype.dem">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/PronType.html">https://universaldependencies.org/u/feat/PronType.html</ref> Demonstrative pronouns are often parallel to interrogatives.</desc>
  </valItem>
  <valItem ident="case.nom">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Case.html">https://universaldependencies.org/u/feat/Case.html</ref> nominative</desc>
  </valItem>
  <valItem ident="case.acc">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Case.html">https://universaldependencies.org/u/feat/Case.html</ref> accusative</desc>
  </valItem>
  <valItem ident="case.dat">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Case.html">https://universaldependencies.org/u/feat/Case.html</ref> dative</desc>
  </valItem>
  <valItem ident="case.gen">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Case.html">https://universaldependencies.org/u/feat/Case.html</ref> genitive</desc>
  </valItem>
  <valItem ident="case.ins">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Case.html">https://universaldependencies.org/u/feat/Case.html</ref> instrumental / instructive</desc>
  </valItem>
  <valItem ident="case.par">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Case.html">https://universaldependencies.org/u/feat/Case.html</ref> partitive</desc>
  </valItem>
  <valItem ident="tense.past">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Tense.html">https://universaldependencies.org/u/feat/Tense.html</ref> past tense</desc>
  </valItem>
  <valItem ident="tense.pres">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Tense.html">https://universaldependencies.org/u/feat/Tense.html</ref> present tense</desc>
  </valItem>
  <valItem ident="tense.fut">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Tense.html">https://universaldependencies.org/u/feat/Tense.html</ref> future tense</desc>
  </valItem>
  <valItem ident="voice.act">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Voice.html">https://universaldependencies.org/u/feat/Voice.html</ref> The subject of the verb is the doer of the action (agent).</desc>
  </valItem>
  <valItem ident="voice.pass">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Voice.html">https://universaldependencies.org/u/feat/Voice.html</ref> The subject of the verb is affected by the action (patient).</desc>
  </valItem>
  <valItem ident="number.sing">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Number.html">https://universaldependencies.org/u/feat/Number.html</ref> A singular noun denotes one person, animal or thing.</desc>
  </valItem>
  <valItem ident="number.plur">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Number.html">https://universaldependencies.org/u/feat/Number.html</ref> A plural noun denotes several persons, animals or things.</desc>
  </valItem>
  <valItem ident="number.ptan">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Number.html">https://universaldependencies.org/u/feat/Number.html</ref> Plurale tantum, some nouns appear only in the plural form even though they denote one thing.</desc>
  </valItem>
  <valItem ident="number.coll">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Number.html">https://universaldependencies.org/u/feat/Number.html</ref> Collective or mass or singulare tantum applies to words that use grammatical singular to describe sets of objects.</desc>
  </valItem>
  <valItem ident="person.1">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Person.html">https://universaldependencies.org/u/feat/Person.html</ref> The first person refers just to the speaker / author and in plural one or more additional persons.</desc>
  </valItem>
  <valItem ident="person.2">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Person.html">https://universaldependencies.org/u/feat/Person.html</ref> The second person refers to the addressee(s).</desc>
  </valItem>
  <valItem ident="person.3">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Person.html">https://universaldependencies.org/u/feat/Person.html</ref> The third person refers to one or more persons that are neither speakers nor addressees.</desc>
  </valItem>
  <valItem ident="verbtype.mod">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/VerbType.html">https://universaldependencies.org/u/feat/VerbType.html</ref> Verbs that take infinitive of another verb as argument and add various modes of possibility, necessity etc.</desc>
  </valItem>
  <valItem ident="verbtype.tense">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/VerbType.html">https://universaldependencies.org/u/feat/VerbType.html</ref> Verb used to create periphrastic verb forms (tenses, passives etc.).</desc>
  </valItem>
  <valItem ident="verbform.inf">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/VerbForm.html">https://universaldependencies.org/u/feat/VerbForm.html</ref> Infinitive is the citation form of verbs in many languages.</desc>
  </valItem>
  <valItem ident="verbform.part">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/VerbForm.html">https://universaldependencies.org/u/feat/VerbForm.html</ref> Participle is a non-finite verb form that shares properties of verbs and adjectives.</desc>
  </valItem>
  <valItem ident="verbform.ger">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/VerbForm.html">https://universaldependencies.org/u/feat/VerbForm.html</ref> Gerund is a non-finite verb form that shares properties of verbs and nouns.</desc>
  </valItem>
  <valItem ident="verbform.conv">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/VerbForm.html">https://universaldependencies.org/u/feat/VerbForm.html</ref> The converb, also called adverbial participle or transgressive, is a non-finite verb form that shares properties of verbs and adverbs.</desc>
  </valItem>
  <valItem ident="polite.infm">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Polite.html">https://universaldependencies.org/u/feat/Polite.html</ref> usually meant for communication with family members and close friends.</desc>
  </valItem>
  <valItem ident="polite.form">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Polite.html">https://universaldependencies.org/u/feat/Polite.html</ref> usually meant for communication with strangers and people of higher social status.</desc>
  </valItem>
  <valItem ident="numtype.ord">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/NumType.html">https://universaldependencies.org/u/feat/NumType.html</ref> ordinal number (first, second,..)</desc>
  </valItem>
  <valItem ident="numtype.card">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/NumType.html">https://universaldependencies.org/u/feat/NumType.html</ref> cardinal number (one, two, many,....)</desc>
  </valItem>
  <valItem ident="degree.cmp">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Degree.html">https://universaldependencies.org/u/feat/Degree.html</ref> comparative, second degree</desc>
  </valItem>
  <valItem ident="degree.sup">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Degree.html">https://universaldependencies.org/u/feat/Degree.html</ref> superlative, third degree</desc>
  </valItem>
  <valItem ident="degree.dim">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Degree.html">https://universaldependencies.org/u/feat/Degree.html</ref> Added to features in universaldependencies. Diminutive.</desc>
  </valItem>
  <valItem ident="mood.imp">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Mood.html">https://universaldependencies.org/u/feat/Mood.html</ref> The speaker uses imperative to order or ask the addressee to do the action of the verb.</desc>
  </valItem>
  <valItem ident="mood.sub">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Mood.html">https://universaldependencies.org/u/feat/Mood.html</ref> The subjunctive mood is used under certain circumstances in subordinate clauses, typically for actions that are subjective or otherwise uncertain.</desc>
  </valItem>
  <valItem ident="mood.ind">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Mood.html">https://universaldependencies.org/u/feat/Mood.html</ref> A verb in indicative merely states that something happens, has happened or will happen.</desc>
  </valItem>
  <valItem ident="gender.masc">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Gender.html">https://universaldependencies.org/u/feat/Gender.html</ref> masculine gender</desc>
  </valItem>
  <valItem ident="gender.fem">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Gender.html">https://universaldependencies.org/u/feat/Gender.html</ref> feminine gender</desc>
  </valItem>
  <valItem ident="gender.neut">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Gender.html">https://universaldependencies.org/u/feat/Gender.html</ref> neuter gender</desc>
  </valItem>
  <valItem ident="gender.com">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/Gender.html">https://universaldependencies.org/u/feat/Gender.html</ref> Some languages do not distinguish masculine/feminine but they do distinguish neuter vs. non-neuter. The non-neuter is called common gender.</desc>
  </valItem>
  <valItem ident="hyph.yes">
   <desc>
    <ref target="https://universaldependencies.org/u/feat/all.html#hyph-hyphenated-compound-or-part-of-it">https://universaldependencies.org/u/feat/all.html#hyph-hyphenated-compound-or-part-of-it</ref> Is this part of a hyphenated compound? Depending on tokenization, the compound may be one token or be split to several tokens; then the tokens need tags.</desc>
  </valItem>
  <valItem ident="prodrop.yes">
   <desc>Added for Frisian to MISC in universaldependencies. pronoun drop, omission of pronouns because they can be inferred</desc>
  </valItem>
  <valItem ident="clitic.yes">
   <desc>Added for Frisian to features in universaldependencies. Most personal pronouns have a clitic form, which is the result of either vowel deletion, vowel reduction, monophthongization or schwa deletion, while there are also cases of suppletion.</desc>
  </valItem>
  <valItem ident="inflection.infl">
   <desc>Not in universaldependencies. inflected</desc>
  </valItem>
  <valItem ident="inflection.uninf">
   <desc>Not in universaldependencies. uninflected</desc>
  </valItem>
  <valItem ident="suffix.yes">
   <desc>Not in universaldependencies Boolean feature, Is this a suffix word in a compound, that usually cannot stand on its own?</desc>
  </valItem>
  <valItem ident="valency.1">
   <desc>An intransitive verb takes one argument (no object)</desc>
  </valItem>
  <valItem ident="valency.2">
   <desc>A monotransitive verb takes two arguments (of which one object)</desc>
  </valItem>
  <valItem ident="valency.3">
   <desc>A ditransitive verb takes three arguments (of which a direct and an indirect object)</desc>
  </valItem>
  <valItem ident="convertedfrom.adj">
   <desc>Not in universaldependencies. adjective used as another category</desc>
  </valItem>
  <valItem ident="convertedfrom.adv">
   <desc>Not in universaldependencies. adverb used as another category</desc>
  </valItem>
  <valItem ident="convertedfrom.ver">
   <desc>Not in universaldependencies. verb used as another category</desc>
  </valItem>
  <valItem ident="convertedfrom.num">
   <desc>Not in universaldependencies. numeral used as another category</desc>
  </valItem>
  <valItem ident="convertedfrom.pro">
   <desc>Not in universaldependencies. pronomen used as another category</desc>
  </valItem>
  <valItem ident="convertedfrom.part">
   <desc>Not in universaldependencies. verbform part used as another category</desc>
  </valItem>
  <valItem ident="predicate.yes">
   <desc>Not in universaldependencies. statement about the subject</desc>
  </valItem>
  <valItem ident="construction.attr">
   <desc>Not in universaldependencies. attributive</desc>
  </valItem>
  <valItem ident="pos.adj">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> Adjectives are words that typically modify nouns and specify their properties or attributes.</desc>
  </valItem>
  <valItem ident="pos.adp">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> Adposition is a cover term for prepositions and postpositions.</desc>
  </valItem>
  <valItem ident="pos.adv">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> Adverbs are words that typically modify verbs for such categories as time, place, direction or manner.</desc>
  </valItem>
  <valItem ident="pos.aux">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> An auxiliary is a function word that accompanies the lexical verb of a verb phrase and expresses grammatical distinctions not carried by the lexical verb, such as person, number, tense, mood, aspect, voice or evidentiality.</desc>
  </valItem>
  <valItem ident="pos.cconj">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> A coordinating conjunction is a word that links words or larger constituents without syntactically subordinating one to the other and expresses a semantic relationship between them.</desc>
  </valItem>
  <valItem ident="pos.det">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> Determiners are words that modify nouns or noun phrases and express the reference of the noun phrase in context.</desc>
  </valItem>
  <valItem ident="pos.intj">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> An interjection is a word that is used most often as an exclamation or part of an exclamation.</desc>
  </valItem>
  <valItem ident="pos.noun">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> Nouns are a part of speech typically denoting a person, place, thing, animal or idea.</desc>
  </valItem>
  <valItem ident="pos.num">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> A numeral is a word, functioning most typically as a determiner, adjective or pronoun, that expresses a number and a relation to the number, such as quantity, sequence, frequency or fraction.</desc>
  </valItem>
  <valItem ident="pos.part">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> Particles are function words that must be associated with another word or phrase to impart meaning and that do not satisfy definitions of other universal parts of speech.</desc>
  </valItem>
  <valItem ident="pos.pron">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> Pronouns are words that substitute for nouns or noun phrases, whose meaning is recoverable from the linguistic or extralinguistic context.</desc>
  </valItem>
  <valItem ident="pos.propn">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> A proper noun is a noun (or nominal content word) that is the name (or part of the name) of a specific individual, place, or object.</desc>
  </valItem>
  <valItem ident="pos.punct">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> Punctuation marks are non-alphabetical characters and character groups used in many languages to delimit linguistic units in printed text.</desc>
  </valItem>
  <valItem ident="pos.sconj">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> A subordinating conjunction is a conjunction that links constructions by making one of them a constituent of the other.</desc>
  </valItem>
  <valItem ident="pos.sym">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> A symbol is a word-like entity that differs from ordinary words by form, function, or both.</desc>
  </valItem>
  <valItem ident="pos.verb">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> A verb is a member of the syntactic class of words that typically signal events and actions.</desc>
  </valItem>
  <valItem ident="pos.x">
   <desc>
    <ref target="https://universaldependencies.org/u/pos/index.html">https://universaldependencies.org/u/pos/index.html</ref> The tag X is used for words that for some reason cannot be assigned a real part-of-speech category.</desc>
  </valItem>
 </valList>
</content>
Legal values are:
islemma.yes
https://universaldependencies.org/u/overview/morphology.html Boolean, is this a base form
abbr.yes
https://universaldependencies.org/u/feat/Abbr.html Boolean feature. Is this an abbreviation?
poss.yes
https://universaldependencies.org/u/feat/Poss.html Boolean feature. Is this word possessive?
reflex.yes
https://universaldependencies.org/u/feat/Reflex.html Boolean feature, typically of pronouns or determiners. It tells whether the word is reflexive, i.e. refers to the subject of its clause.?
prefix.yes
https://universaldependencies.org/u/feat/Prefix.html Boolean feature, Is this a prefix word in a compound, that usually cannot stand on its own?
prontype.prs
https://universaldependencies.org/u/feat/PronType.html personal pronoun or determiner
prontype.rcp
https://universaldependencies.org/u/feat/PronType.html reciprocal pronoun
prontype.art
https://universaldependencies.org/u/feat/PronType.html Article is a special case of determiner that bears the feature of definiteness
prontype.int
https://universaldependencies.org/u/feat/PronType.html interrogative pronoun, determiner, numeral or adverb
prontype.rel
https://universaldependencies.org/u/feat/PronType.html relative pronoun, determiner, numeral or adverb
prontype.ind
https://universaldependencies.org/u/feat/PronType.html indefinite pronoun, determiner, numeral or adverb
prontype.emp
https://universaldependencies.org/u/feat/PronType.html Emphatic pro-adjectives (determiners) emphasize the nominal they depend on.
prontype.exc
https://universaldependencies.org/u/feat/PronType.html exclamative determiner
prontype.dem
https://universaldependencies.org/u/feat/PronType.html Demonstrative pronouns are often parallel to interrogatives.
case.nom
https://universaldependencies.org/u/feat/Case.html nominative
case.acc
https://universaldependencies.org/u/feat/Case.html accusative
case.dat
https://universaldependencies.org/u/feat/Case.html dative
case.gen
https://universaldependencies.org/u/feat/Case.html genitive
case.ins
https://universaldependencies.org/u/feat/Case.html instrumental / instructive
case.par
https://universaldependencies.org/u/feat/Case.html partitive
tense.past
https://universaldependencies.org/u/feat/Tense.html past tense
tense.pres
https://universaldependencies.org/u/feat/Tense.html present tense
tense.fut
https://universaldependencies.org/u/feat/Tense.html future tense
voice.act
https://universaldependencies.org/u/feat/Voice.html The subject of the verb is the doer of the action (agent).
voice.pass
https://universaldependencies.org/u/feat/Voice.html The subject of the verb is affected by the action (patient).
number.sing
https://universaldependencies.org/u/feat/Number.html A singular noun denotes one person, animal or thing.
number.plur
https://universaldependencies.org/u/feat/Number.html A plural noun denotes several persons, animals or things.
number.ptan
https://universaldependencies.org/u/feat/Number.html Plurale tantum, some nouns appear only in the plural form even though they denote one thing.
number.coll
https://universaldependencies.org/u/feat/Number.html Collective or mass or singulare tantum applies to words that use grammatical singular to describe sets of objects.
person.1
https://universaldependencies.org/u/feat/Person.html The first person refers just to the speaker / author and in plural one or more additional persons.
person.2
https://universaldependencies.org/u/feat/Person.html The second person refers to the addressee(s).
person.3
https://universaldependencies.org/u/feat/Person.html The third person refers to one or more persons that are neither speakers nor addressees.
verbtype.mod
https://universaldependencies.org/u/feat/VerbType.html Verbs that take infinitive of another verb as argument and add various modes of possibility, necessity etc.
verbtype.tense
https://universaldependencies.org/u/feat/VerbType.html Verb used to create periphrastic verb forms (tenses, passives etc.).
verbform.inf
https://universaldependencies.org/u/feat/VerbForm.html Infinitive is the citation form of verbs in many languages.
verbform.part
https://universaldependencies.org/u/feat/VerbForm.html Participle is a non-finite verb form that shares properties of verbs and adjectives.
verbform.ger
https://universaldependencies.org/u/feat/VerbForm.html Gerund is a non-finite verb form that shares properties of verbs and nouns.
verbform.conv
https://universaldependencies.org/u/feat/VerbForm.html The converb, also called adverbial participle or transgressive, is a non-finite verb form that shares properties of verbs and adverbs.
polite.infm
https://universaldependencies.org/u/feat/Polite.html usually meant for communication with family members and close friends.
polite.form
https://universaldependencies.org/u/feat/Polite.html usually meant for communication with strangers and people of higher social status.
numtype.ord
https://universaldependencies.org/u/feat/NumType.html ordinal number (first, second,..)
numtype.card
https://universaldependencies.org/u/feat/NumType.html cardinal number (one, two, many,....)
degree.cmp
https://universaldependencies.org/u/feat/Degree.html comparative, second degree
degree.sup
https://universaldependencies.org/u/feat/Degree.html superlative, third degree
degree.dim
https://universaldependencies.org/u/feat/Degree.html Added to features in universaldependencies. Diminutive.
mood.imp
https://universaldependencies.org/u/feat/Mood.html The speaker uses imperative to order or ask the addressee to do the action of the verb.
mood.sub
https://universaldependencies.org/u/feat/Mood.html The subjunctive mood is used under certain circumstances in subordinate clauses, typically for actions that are subjective or otherwise uncertain.
mood.ind
https://universaldependencies.org/u/feat/Mood.html A verb in indicative merely states that something happens, has happened or will happen.
gender.masc
https://universaldependencies.org/u/feat/Gender.html masculine gender
gender.fem
https://universaldependencies.org/u/feat/Gender.html feminine gender
gender.neut
https://universaldependencies.org/u/feat/Gender.html neuter gender
gender.com
https://universaldependencies.org/u/feat/Gender.html Some languages do not distinguish masculine/feminine but they do distinguish neuter vs. non-neuter. The non-neuter is called common gender.
hyph.yes
https://universaldependencies.org/u/feat/all.html#hyph-hyphenated-compound-or-part-of-it Is this part of a hyphenated compound? Depending on tokenization, the compound may be one token or be split to several tokens; then the tokens need tags.
prodrop.yes
Added for Frisian to MISC in universaldependencies. pronoun drop, omission of pronouns because they can be inferred
clitic.yes
Added for Frisian to features in universaldependencies. Most personal pronouns have a clitic form, which is the result of either vowel deletion, vowel reduction, monophthongization or schwa deletion, while there are also cases of suppletion.
inflection.infl
Not in universaldependencies. inflected
inflection.uninf
Not in universaldependencies. uninflected
suffix.yes
Not in universaldependencies Boolean feature, Is this a suffix word in a compound, that usually cannot stand on its own?
valency.1
An intransitive verb takes one argument (no object)
valency.2
A monotransitive verb takes two arguments (of which one object)
valency.3
A ditransitive verb takes three arguments (of which a direct and an indirect object)
convertedfrom.adj
Not in universaldependencies. adjective used as another category
convertedfrom.adv
Not in universaldependencies. adverb used as another category
convertedfrom.ver
Not in universaldependencies. verb used as another category
convertedfrom.num
Not in universaldependencies. numeral used as another category
convertedfrom.pro
Not in universaldependencies. pronomen used as another category
convertedfrom.part
Not in universaldependencies. verbform part used as another category
predicate.yes
Not in universaldependencies. statement about the subject
construction.attr
Not in universaldependencies. attributive
pos.adj
https://universaldependencies.org/u/pos/index.html Adjectives are words that typically modify nouns and specify their properties or attributes.
pos.adp
https://universaldependencies.org/u/pos/index.html Adposition is a cover term for prepositions and postpositions.
pos.adv
https://universaldependencies.org/u/pos/index.html Adverbs are words that typically modify verbs for such categories as time, place, direction or manner.
pos.aux
https://universaldependencies.org/u/pos/index.html An auxiliary is a function word that accompanies the lexical verb of a verb phrase and expresses grammatical distinctions not carried by the lexical verb, such as person, number, tense, mood, aspect, voice or evidentiality.
pos.cconj
https://universaldependencies.org/u/pos/index.html A coordinating conjunction is a word that links words or larger constituents without syntactically subordinating one to the other and expresses a semantic relationship between them.
pos.det
https://universaldependencies.org/u/pos/index.html Determiners are words that modify nouns or noun phrases and express the reference of the noun phrase in context.
pos.intj
https://universaldependencies.org/u/pos/index.html An interjection is a word that is used most often as an exclamation or part of an exclamation.
pos.noun
https://universaldependencies.org/u/pos/index.html Nouns are a part of speech typically denoting a person, place, thing, animal or idea.
pos.num
https://universaldependencies.org/u/pos/index.html A numeral is a word, functioning most typically as a determiner, adjective or pronoun, that expresses a number and a relation to the number, such as quantity, sequence, frequency or fraction.
pos.part
https://universaldependencies.org/u/pos/index.html Particles are function words that must be associated with another word or phrase to impart meaning and that do not satisfy definitions of other universal parts of speech.
pos.pron
https://universaldependencies.org/u/pos/index.html Pronouns are words that substitute for nouns or noun phrases, whose meaning is recoverable from the linguistic or extralinguistic context.
pos.propn
https://universaldependencies.org/u/pos/index.html A proper noun is a noun (or nominal content word) that is the name (or part of the name) of a specific individual, place, or object.
pos.punct
https://universaldependencies.org/u/pos/index.html Punctuation marks are non-alphabetical characters and character groups used in many languages to delimit linguistic units in printed text.
pos.sconj
https://universaldependencies.org/u/pos/index.html A subordinating conjunction is a conjunction that links constructions by making one of them a constituent of the other.
pos.sym
https://universaldependencies.org/u/pos/index.html A symbol is a word-like entity that differs from ordinary words by form, function, or both.
pos.verb
https://universaldependencies.org/u/pos/index.html A verb is a member of the syntactic class of words that typically signal events and actions.
pos.x
https://universaldependencies.org/u/pos/index.html The tag X is used for words that for some reason cannot be assigned a real part-of-speech category.
Schema Declaration
element gram
{
   "islemma.yes"
 | "abbr.yes"
 | "poss.yes"
 | "reflex.yes"
 | "prefix.yes"
 | "prontype.prs"
 | "prontype.rcp"
 | "prontype.art"
 | "prontype.int"
 | "prontype.rel"
 | "prontype.ind"
 | "prontype.emp"
 | "prontype.exc"
 | "prontype.dem"
 | "case.nom"
 | "case.acc"
 | "case.dat"
 | "case.gen"
 | "case.ins"
 | "case.par"
 | "tense.past"
 | "tense.pres"
 | "tense.fut"
 | "voice.act"
 | "voice.pass"
 | "number.sing"
 | "number.plur"
 | "number.ptan"
 | "number.coll"
 | "person.1"
 | "person.2"
 | "person.3"
 | "verbtype.mod"
 | "verbtype.tense"
 | "verbform.inf"
 | "verbform.part"
 | "verbform.ger"
 | "verbform.conv"
 | "polite.infm"
 | "polite.form"
 | "numtype.ord"
 | "numtype.card"
 | "degree.cmp"
 | "degree.sup"
 | "degree.dim"
 | "mood.imp"
 | "mood.sub"
 | "mood.ind"
 | "gender.masc"
 | "gender.fem"
 | "gender.neut"
 | "gender.com"
 | "hyph.yes"
 | "prodrop.yes"
 | "clitic.yes"
 | "inflection.infl"
 | "inflection.uninf"
 | "suffix.yes"
 | "valency.1"
 | "valency.2"
 | "valency.3"
 | "convertedfrom.adj"
 | "convertedfrom.adv"
 | "convertedfrom.ver"
 | "convertedfrom.num"
 | "convertedfrom.pro"
 | "convertedfrom.part"
 | "predicate.yes"
 | "construction.attr"
 | "pos.adj"
 | "pos.adp"
 | "pos.adv"
 | "pos.aux"
 | "pos.cconj"
 | "pos.det"
 | "pos.intj"
 | "pos.noun"
 | "pos.num"
 | "pos.part"
 | "pos.pron"
 | "pos.propn"
 | "pos.punct"
 | "pos.sconj"
 | "pos.sym"
 | "pos.verb"
 | "pos.x"
}
Legal values are:
islemma.yes
https://universaldependencies.org/u/overview/morphology.html Boolean, is this a base form
abbr.yes
https://universaldependencies.org/u/feat/Abbr.html Boolean feature. Is this an abbreviation?
poss.yes
https://universaldependencies.org/u/feat/Poss.html Boolean feature. Is this word possessive?
reflex.yes
https://universaldependencies.org/u/feat/Reflex.html Boolean feature, typically of pronouns or determiners. It tells whether the word is reflexive, i.e. refers to the subject of its clause.?
prefix.yes
https://universaldependencies.org/u/feat/Prefix.html Boolean feature, Is this a prefix word in a compound, that usually cannot stand on its own?
prontype.prs
https://universaldependencies.org/u/feat/PronType.html personal pronoun or determiner
prontype.rcp
https://universaldependencies.org/u/feat/PronType.html reciprocal pronoun
prontype.art
https://universaldependencies.org/u/feat/PronType.html Article is a special case of determiner that bears the feature of definiteness
prontype.int
https://universaldependencies.org/u/feat/PronType.html interrogative pronoun, determiner, numeral or adverb
prontype.rel
https://universaldependencies.org/u/feat/PronType.html relative pronoun, determiner, numeral or adverb
prontype.ind
https://universaldependencies.org/u/feat/PronType.html indefinite pronoun, determiner, numeral or adverb
prontype.emp
https://universaldependencies.org/u/feat/PronType.html Emphatic pro-adjectives (determiners) emphasize the nominal they depend on.
prontype.exc
https://universaldependencies.org/u/feat/PronType.html exclamative determiner
prontype.dem
https://universaldependencies.org/u/feat/PronType.html Demonstrative pronouns are often parallel to interrogatives.
case.nom
https://universaldependencies.org/u/feat/Case.html nominative
case.acc
https://universaldependencies.org/u/feat/Case.html accusative
case.dat
https://universaldependencies.org/u/feat/Case.html dative
case.gen
https://universaldependencies.org/u/feat/Case.html genitive
case.ins
https://universaldependencies.org/u/feat/Case.html instrumental / instructive
case.par
https://universaldependencies.org/u/feat/Case.html partitive
tense.past
https://universaldependencies.org/u/feat/Tense.html past tense
tense.pres
https://universaldependencies.org/u/feat/Tense.html present tense
tense.fut
https://universaldependencies.org/u/feat/Tense.html future tense
voice.act
https://universaldependencies.org/u/feat/Voice.html The subject of the verb is the doer of the action (agent).
voice.pass
https://universaldependencies.org/u/feat/Voice.html The subject of the verb is affected by the action (patient).
number.sing
https://universaldependencies.org/u/feat/Number.html A singular noun denotes one person, animal or thing.
number.plur
https://universaldependencies.org/u/feat/Number.html A plural noun denotes several persons, animals or things.
number.ptan
https://universaldependencies.org/u/feat/Number.html Plurale tantum, some nouns appear only in the plural form even though they denote one thing.
number.coll
https://universaldependencies.org/u/feat/Number.html Collective or mass or singulare tantum applies to words that use grammatical singular to describe sets of objects.
person.1
https://universaldependencies.org/u/feat/Person.html The first person refers just to the speaker / author and in plural one or more additional persons.
person.2
https://universaldependencies.org/u/feat/Person.html The second person refers to the addressee(s).
person.3
https://universaldependencies.org/u/feat/Person.html The third person refers to one or more persons that are neither speakers nor addressees.
verbtype.mod
https://universaldependencies.org/u/feat/VerbType.html Verbs that take infinitive of another verb as argument and add various modes of possibility, necessity etc.
verbtype.tense
https://universaldependencies.org/u/feat/VerbType.html Verb used to create periphrastic verb forms (tenses, passives etc.).
verbform.inf
https://universaldependencies.org/u/feat/VerbForm.html Infinitive is the citation form of verbs in many languages.
verbform.part
https://universaldependencies.org/u/feat/VerbForm.html Participle is a non-finite verb form that shares properties of verbs and adjectives.
verbform.ger
https://universaldependencies.org/u/feat/VerbForm.html Gerund is a non-finite verb form that shares properties of verbs and nouns.
verbform.conv
https://universaldependencies.org/u/feat/VerbForm.html The converb, also called adverbial participle or transgressive, is a non-finite verb form that shares properties of verbs and adverbs.
polite.infm
https://universaldependencies.org/u/feat/Polite.html usually meant for communication with family members and close friends.
polite.form
https://universaldependencies.org/u/feat/Polite.html usually meant for communication with strangers and people of higher social status.
numtype.ord
https://universaldependencies.org/u/feat/NumType.html ordinal number (first, second,..)
numtype.card
https://universaldependencies.org/u/feat/NumType.html cardinal number (one, two, many,....)
degree.cmp
https://universaldependencies.org/u/feat/Degree.html comparative, second degree
degree.sup
https://universaldependencies.org/u/feat/Degree.html superlative, third degree
degree.dim
https://universaldependencies.org/u/feat/Degree.html Added to features in universaldependencies. Diminutive.
mood.imp
https://universaldependencies.org/u/feat/Mood.html The speaker uses imperative to order or ask the addressee to do the action of the verb.
mood.sub
https://universaldependencies.org/u/feat/Mood.html The subjunctive mood is used under certain circumstances in subordinate clauses, typically for actions that are subjective or otherwise uncertain.
mood.ind
https://universaldependencies.org/u/feat/Mood.html A verb in indicative merely states that something happens, has happened or will happen.
gender.masc
https://universaldependencies.org/u/feat/Gender.html masculine gender
gender.fem
https://universaldependencies.org/u/feat/Gender.html feminine gender
gender.neut
https://universaldependencies.org/u/feat/Gender.html neuter gender
gender.com
https://universaldependencies.org/u/feat/Gender.html Some languages do not distinguish masculine/feminine but they do distinguish neuter vs. non-neuter. The non-neuter is called common gender.
hyph.yes
https://universaldependencies.org/u/feat/all.html#hyph-hyphenated-compound-or-part-of-it Is this part of a hyphenated compound? Depending on tokenization, the compound may be one token or be split to several tokens; then the tokens need tags.
prodrop.yes
Added for Frisian to MISC in universaldependencies. pronoun drop, omission of pronouns because they can be inferred
clitic.yes
Added for Frisian to features in universaldependencies. Most personal pronouns have a clitic form, which is the result of either vowel deletion, vowel reduction, monophthongization or schwa deletion, while there are also cases of suppletion.
inflection.infl
Not in universaldependencies. inflected
inflection.uninf
Not in universaldependencies. uninflected
suffix.yes
Not in universaldependencies Boolean feature, Is this a suffix word in a compound, that usually cannot stand on its own?
valency.1
An intransitive verb takes one argument (no object)
valency.2
A monotransitive verb takes two arguments (of which one object)
valency.3
A ditransitive verb takes three arguments (of which a direct and an indirect object)
convertedfrom.adj
Not in universaldependencies. adjective used as another category
convertedfrom.adv
Not in universaldependencies. adverb used as another category
convertedfrom.ver
Not in universaldependencies. verb used as another category
convertedfrom.num
Not in universaldependencies. numeral used as another category
convertedfrom.pro
Not in universaldependencies. pronomen used as another category
convertedfrom.part
Not in universaldependencies. verbform part used as another category
predicate.yes
Not in universaldependencies. statement about the subject
construction.attr
Not in universaldependencies. attributive
pos.adj
https://universaldependencies.org/u/pos/index.html Adjectives are words that typically modify nouns and specify their properties or attributes.
pos.adp
https://universaldependencies.org/u/pos/index.html Adposition is a cover term for prepositions and postpositions.
pos.adv
https://universaldependencies.org/u/pos/index.html Adverbs are words that typically modify verbs for such categories as time, place, direction or manner.
pos.aux
https://universaldependencies.org/u/pos/index.html An auxiliary is a function word that accompanies the lexical verb of a verb phrase and expresses grammatical distinctions not carried by the lexical verb, such as person, number, tense, mood, aspect, voice or evidentiality.
pos.cconj
https://universaldependencies.org/u/pos/index.html A coordinating conjunction is a word that links words or larger constituents without syntactically subordinating one to the other and expresses a semantic relationship between them.
pos.det
https://universaldependencies.org/u/pos/index.html Determiners are words that modify nouns or noun phrases and express the reference of the noun phrase in context.
pos.intj
https://universaldependencies.org/u/pos/index.html An interjection is a word that is used most often as an exclamation or part of an exclamation.
pos.noun
https://universaldependencies.org/u/pos/index.html Nouns are a part of speech typically denoting a person, place, thing, animal or idea.
pos.num
https://universaldependencies.org/u/pos/index.html A numeral is a word, functioning most typically as a determiner, adjective or pronoun, that expresses a number and a relation to the number, such as quantity, sequence, frequency or fraction.
pos.part
https://universaldependencies.org/u/pos/index.html Particles are function words that must be associated with another word or phrase to impart meaning and that do not satisfy definitions of other universal parts of speech.
pos.pron
https://universaldependencies.org/u/pos/index.html Pronouns are words that substitute for nouns or noun phrases, whose meaning is recoverable from the linguistic or extralinguistic context.
pos.propn
https://universaldependencies.org/u/pos/index.html A proper noun is a noun (or nominal content word) that is the name (or part of the name) of a specific individual, place, or object.
pos.punct
https://universaldependencies.org/u/pos/index.html Punctuation marks are non-alphabetical characters and character groups used in many languages to delimit linguistic units in printed text.
pos.sconj
https://universaldependencies.org/u/pos/index.html A subordinating conjunction is a conjunction that links constructions by making one of them a constituent of the other.
pos.sym
https://universaldependencies.org/u/pos/index.html A symbol is a word-like entity that differs from ordinary words by form, function, or both.
pos.verb
https://universaldependencies.org/u/pos/index.html A verb is a member of the syntactic class of words that typically signal events and actions.
pos.x
https://universaldependencies.org/u/pos/index.html The tag X is used for words that for some reason cannot be assigned a real part-of-speech category.

<gramGrp>

<gramGrp> Groups together gram elements
Contained by
May contain
Content model
<content>
 <elementRef key="gramminOccurs="0"
  maxOccurs="unbounded"/>

</content>
Schema Declaration
element gramGrp { gram* }

<msIdentifier>

<msIdentifier>
Module msdescription
Contained by
May contain Empty element
Schematron

<note>

<note>
Module core
Contained by
dictionaries: def
May contain
character data
Content model
<content>
 <alternate minOccurs="0"
  maxOccurs="unbounded">

  <elementRef key="etym"/>
  <macroRef key="macro.textLike"/>
 </alternate>
</content>
Schema Declaration
element note { ( etym | macro.textLike )* }

<purpose>

<purpose> Purpose is used to indicate the capabilities of a dictionary in its @type attribute, when a certain capability is present this implies it is viable to query for data belonging to the capability. Clients are free to either or not act upon the registered capabilities.
Attributes
type The type attirbute denotes a capability
Status Required
Datatype teidata.enumerated
Legal values are:
formtranslation
The dictionary can be used for form translations
textsearch
The dictionary can be used for searching text
synonyms
The dictionary can be used for synonyms
variants
The dictionary can be used for variants
compounds
The dictionary can be used for compounds
pronunciation
The dictionary can be used for pronunciation
hyphenation
The dictionary can be used for hyphenation
usage
The dictionary can be used for usage information for forms or text
stress
The dictionary can be used for word stress
definition
The dictionary can be used for definitions of forms or text
grammar
The dictionary can be used for grammar among which part of speech
paradigm
The dictionary can be used for paradigm
examples
The dictionary can be used for examples
collocations
The dictionary can be used for collocations
proverbs
The dictionary can be used for proverbs
Contained by
May contain Empty element

<quote>

<quote> A quote contains text (with hi, q and ref) and optionally etymological or bibliographical information or a note.
Contained by
May contain
core: note
character data
Content model
<content>
 <macroRef key="macro.textLike"/>
 <elementRef key="etymminOccurs="0"/>
 <elementRef key="biblminOccurs="0"/>
 <elementRef key="noteminOccurs="0"/>
</content>
Schema Declaration
element quote { macro.textLike, etym?, bibl?, note? }

<sense>

<sense> A sense contains zero or more ref, def, note and translations, examples, proverbs or collocations in cit elements, followed by an optional form
Contained by
May contain
core: note
dictionaries: def
Content model
<content>
 <sequence>
  <alternate minOccurs="0"
   maxOccurs="unbounded">

   <elementRef key="ref"/>
   <elementRef key="def"/>
   <elementRef key="cit"/>
   <elementRef key="note"/>
  </alternate>
  <elementRef key="formminOccurs="0"/>
 </sequence>
</content>
Schema Declaration
element sense { ( ref | def | cit | note )*, form? }

<superEntry>

<superEntry> A superEntry contains entries that are grouped together, for example for homonyms
Contained by
May contain
Content model
<content>
 <elementRef key="entry"
  maxOccurs="unbounded"/>

</content>
Schema Declaration
element superEntry { entry+ }

<TEI>

<TEI> Each article is a TEI document with a teiHeader holding at least an msIdentifier/idno for the article. The contents of the article can be found under body/text. msIdentifier/collection/@ref points to metainfo of the dictionary.
Attributes
linguisticsversion
Status Required
Datatype teidata.enumerated
Legal values are:
1
2
Contained by
May contain
Schematron

<s:assert test="tei:teiHeader/tei:fileDesc/tei:sourceDesc/tei:msDesc/tei:msIdentifier/tei:idno">
<s:value-of select="tei:teiHeader/tei:fileDesc/tei:titleStmt/tei:title"/>: missing
teiHeader/fileDesc/sourceDesc/msDesc/msIdentifier/idno.
</s:assert>
Content model
<content>
 <sequence>
  <elementRef key="teiHeader"/>
  <elementRef key="text"/>
 </sequence>
</content>
Schema Declaration
element TEI
{
   attribute linguisticsversion { "1" | "2" },
   ( teiHeader, text )
}

<teiCorpus>

<teiCorpus> A teiCorpus document contains a teiHeader with meta information about the dictionary. It may contain the articles in nested TEI elements, but the default application expects articles in separate TEI documents where collection/@ref points to the file containing dictionary metainfo in teiCorpus/teiHeader.
Contained by
May contain
Content model
<content>
 <elementRef key="teiHeader"/>
 <elementRef key="TEI"
  maxOccurs="unboundedminOccurs="0"/>

</content>
Schema Declaration
element teiCorpus { teiHeader, TEI* }

<text>

<text> A text container contains a body
Contained by
May contain
Content model
<content>
 <elementRef key="body"/>
</content>
Schema Declaration
element text { body }

<textDesc>

<textDesc> Use textDesc/purpose to indicate the capabilities of a dictionary, so clients can know what it can be used for
Contained by
May contain Empty element
Schematron

<s:assert test="ancestor::tei:teiHeader[parent::tei:teiCorpus]">
<s:value-of select="ancestor::tei:TEI/tei:teiHeader/tei:fileDesc/tei:sourceDesc/tei:msDesc/tei:msIdentifier/tei:idno"/>
: textDesc/purpose should be in teiCorpus/teiHeader
not in TEI/teiHeader
</s:assert>

<usg>

<usg> usage information for forms or text
Attributes
type
Status Required
Suggested values include:
lang
used in a certain language (variant)
time
used in a certain time, for example old, 18th century, etc.
freq
frequency information, for example rare, often etc.
connotation
(secondary) meaning or association
hint
a hint for usage
style
style (figurative, literal, etc.)
geo
used in a certain geographical area
medium
used in a certain medium, for example speech or movie
domain
used in a certain domain, for example medical
Contained by
May contain Empty element

Schema tei_dictionaries: Attribute classes

att.namekinds

att.namekinds 
Module namesdates
Members form
Attributes
namekind A namekind attribute can be used on form elements to indicate what kind of name it is, for example a place or a person.
Status Optional
Datatype teidata.enumerated
Legal values are:
countryName
placeName
see description for placeName element
geoName
see description for geoName element
orgName
see description for orgName element
personName
see description for personName element
animalName
plantName
birdName

Schema tei_dictionaries: Macros

macro.textLike

macro.textLike Several elements (i.e. def, quote and note) can contain text with simple styling and references. In note one may want to explain forms, for this several elements for characteristics of forms can be used.
Used by
Content model
<content>
 <alternate minOccurs="0"
  maxOccurs="unbounded">

  <textNode/>
  <elementRef key="q"/>
  <elementRef key="hi"/>
  <elementRef key="ref"/>
  <elementRef key="pron"/>
  <elementRef key="stress"/>
  <elementRef key="gram"/>
  <elementRef key="article"/>
 </alternate>
</content>
Declaration
macro.textLike = ( text | q | hi | ref | pron | stress | gram | article )*
Date: 2023-05