
This page pertains to UD version 2.

Treebank Statistics: UD_Low_Saxon-LSDC: POS Tags: DET

There are 39 DET lemmas (1%), 115 DET types (2%) and 2205 DET tokens (10%). Out of 16 observed tags, DET ranks 8th in number of lemmas, 8th in number of types and 6th in number of tokens.

The 10 most frequent DET lemmas: de, en, syn, myn, et, dat, ear, disse, keyn, uns

The 10 most frequent DET types: de, en, den, dat, et, myn, dee, dem, syn, syne

The 10 most frequent ambiguous lemmas: de (DET 1259, PRON 8, X 1), en (DET 363, CCONJ 6, ADJ 1, ADP 1, PART 1, PRON 1), myn (DET 80, PRON 4), et (PRON 205, DET 59), dat (PRON 194, SCONJ 194, DET 34, CCONJ 1, NOUN 1), disse (DET 33, PRON 4), keyn (DET 27, PRON 3), uns (DET 24, PRON 1), geyn (DET 21, PRON 3), dyn (DET 20, PRON 1)

The 10 most frequent ambiguous types: en (DET 272, CCONJ 170, PRON 11, ADJ 1, PART 1), den (DET 209, PRON 19), dat (SCONJ 174, PRON 148, DET 116, CCONJ 1), et (PRON 189, DET 78), myn (DET 55, PRON 9), dee (PRON 138, DET 47), dem (DET 54, PRON 3), syn (DET 51, AUX 29, VERB 1), der (ADV 71, DET 39, PRON 2, X 2), ne (DET 37, PRON 6, PART 2)

Morphology

The form / lemma ratio of DET is 2.948718 (the average of all parts of speech is 1.410753).
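The reported ratio is consistent with dividing the number of DET types by the number of DET lemmas given in the summary at the top of this page. A minimal sketch of that arithmetic follows; the variable names are illustrative, and reading "forms" as "types" is an assumption that the numbers happen to support.

```python
# Counts taken from the summary at the top of this page.
det_types = 115   # distinct DET word forms (assumed to be the "forms" in the ratio)
det_lemmas = 39   # distinct DET lemmas

# Average number of distinct forms per lemma.
ratio = det_types / det_lemmas
print(f"{ratio:.6f}")  # prints 2.948718
```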

The highest number of forms (24) was observed with the lemma “de”: ‘m, ‘n, ‘r, ‘t, dat, de, deam, dean, dear, deas, dee, dem, den, der, des, det, en, er, et, m, me, myn, n, t.

The 2nd highest number of forms (11) was observed with the lemma “dat”: ‘m, ‘n, ‘t, dat, de, deam, dee, dem, den, et, me.

The 3rd highest number of forms (11) was observed with the lemma “en”: ‘ne, den, e, en, eyn, eyne, eynem, eynen, eyner, ne, nen.

DET occurs with 9 features: Case (2190; 99% of instances), Number (2186; 99% of instances), PronType (2180; 99% of instances), Gender (2145; 97% of instances), Definite (1722; 78% of instances), Poss (309; 14% of instances), Person[psor] (35; 2% of instances), Number[psor] (34; 2% of instances), Gender[psor] (17; 1% of instances)

DET occurs with 31 feature-value pairs: Case=Acc, Case=Acc,Dat, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Masc,Neut, Gender=Fem,Neut, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person[psor]=1, Person[psor]=2, Person[psor]=3, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, PronType=Ind,Neg,Tot, PronType=Neg, PronType=Prs, PronType=Tot

DET occurs with 224 feature combinations. The most frequent feature combination is Case=Nom|Definite=Def|Gender=Masc|Number=Sing|PronType=Art (185 tokens). Examples: de, dee, den, der, Gyn, ne
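A feature combination such as the one above is written in the CoNLL-U FEATS notation: pairs are separated by `|`, a feature is separated from its value by `=`, and multiple values of one feature (as in Case=Acc,Dat) are separated by commas. The sketch below shows one way to split such a string; the function name `parse_feats` is hypothetical and is not part of any UD tool.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: [values]} dict."""
    if feats == "_":  # CoNLL-U writes "_" when a token has no features
        return {}
    parsed = {}
    for pair in feats.split("|"):
        name, _, values = pair.partition("=")
        parsed[name] = values.split(",")  # a feature may carry several values
    return parsed

# The most frequent DET feature combination reported above.
combo = "Case=Nom|Definite=Def|Gender=Masc|Number=Sing|PronType=Art"
feats = parse_feats(combo)
print(feats["PronType"])  # prints ['Art']
```

Applied to a multi-valued pair, `parse_feats("Case=Acc,Dat")` yields both values under the single key `Case`.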

Relations

DET nodes are attached to their parents using 7 different relations: det (2173; 99% of instances), det:poss (24; 1% of instances), nsubj (3; 0% of instances), amod (2; 0% of instances), advmod (1; 0% of instances), fixed (1; 0% of instances), obj (1; 0% of instances)
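The percentages are shares of all 2205 DET tokens, rounded to whole numbers, which is why relations with only a handful of instances show as 0%. A small sketch that recomputes them from the counts listed above follows; the dictionary and variable names are illustrative, and rounding to the nearest integer is an assumption that matches the reported figures.

```python
# Relation counts for DET nodes, copied from the list above.
relation_counts = {
    "det": 2173,
    "det:poss": 24,
    "nsubj": 3,
    "amod": 2,
    "advmod": 1,
    "fixed": 1,
    "obj": 1,
}

total = sum(relation_counts.values())  # should equal the 2205 DET tokens

# Share of each relation, rounded to a whole percent.
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, pct in shares.items():
    print(f"{rel}: {pct}%")
```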

Parents of DET nodes belong to 7 different parts of speech: NOUN (2059; 93% of instances), ADJ (68; 3% of instances), PROPN (42; 2% of instances), PRON (21; 1% of instances), VERB (10; 0% of instances), NUM (4; 0% of instances), X (1; 0% of instances)

2155 (98%) DET nodes are leaves.

50 (2%) DET nodes have one child.

The highest child degree of a DET node is 1.
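The leaf and child-degree figures above can be derived by counting, for each DET node, how many other nodes name it as their head. The sketch below illustrates this on a toy sentence; the tuple layout is a simplified stand-in for the ID, FORM, UPOS and HEAD columns of a CoNLL-U file, the function name is hypothetical, and the English sentence is invented for illustration and does not come from the treebank.

```python
from collections import Counter

def child_degree_histogram(sentence, upos="DET"):
    """Map child count -> number of nodes with the given UPOS having that many children.

    `sentence` is a list of (id, form, upos, head) tuples.
    """
    # How many times each node ID is used as a head within the sentence.
    children = Counter(head for _, _, _, head in sentence)
    return Counter(
        children[node_id] for node_id, _, pos, _ in sentence if pos == upos
    )

# Toy sentence (invented, not treebank data): "the dog sees a house".
toy = [
    (1, "the", "DET", 2),
    (2, "dog", "NOUN", 3),
    (3, "sees", "VERB", 0),
    (4, "a", "DET", 5),
    (5, "house", "NOUN", 3),
]

hist = child_degree_histogram(toy)
print(hist[0])    # number of DET leaves in the toy sentence: 2
print(max(hist))  # highest child degree of a DET node here: 0
```

Summing such histograms over all sentences of a treebank would give the leaf count, the one-child count and the highest child degree in one pass.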

Children of DET nodes are attached using 6 different relations: advmod (23; 46% of instances), nmod:poss (12; 24% of instances), punct (12; 24% of instances), acl (1; 2% of instances), case (1; 2% of instances), nmod (1; 2% of instances)

Children of DET nodes belong to 6 different parts of speech: ADV (23; 46% of instances), PUNCT (12; 24% of instances), NOUN (9; 18% of instances), PROPN (4; 8% of instances), ADP (1; 2% of instances), VERB (1; 2% of instances)