
This page pertains to UD version 2.

Treebank Statistics: UD_Khoekhoe-KDT: POS Tags: PRON

There are 19 PRON lemmas (1%), 143 PRON types (4%) and 3269 PRON tokens (11%). Out of 16 observed tags, the rank of PRON is: 11 in number of lemmas, 6 in number of types and 5 in number of tokens.

The 10 most frequent PRON lemmas: _, ǁî, ti, sa, tare, ǁnā, nē, si, hoa, tari

The 10 most frequent PRON types: ta, ts, tita, da, i, b, s, te, ǁîb, sats

The 10 most frequent ambiguous lemmas: _ (PRON 2154, X 44, CCONJ 6, AUX 1), ǁî (PRON 269, NOUN 9), ti (PRON 211, PART 136, DET 135, ADV 32), sa (PRON 209, DET 106), tare (PRON 112, DET 10, ADV 1, NOUN 1), ǁnā (DET 157, PRON 91, ADV 16, NOUN 4, VERB 4), (DET 238, PRON 77, ADV 17, NOUN 5), hoa (DET 72, PRON 30), tari (PRON 27, ADJ 1, ADV 1), nau (DET 26, ADV 7, PRON 7)

The 10 most frequent ambiguous types: ta (PRON 583, AUX 52), i (AUX 192, PRON 153), s (SCONJ 164, PRON 138, PART 30), n (PRON 76, X 1), ǁnās (PRON 18, NOUN 1), sa (SCONJ 136, DET 65, PRON 25), hoan (DET 6, PRON 3), go (AUX 358, PRON 12), ra (AUX 678, PRON 11), tare (PRON 10, DET 3, ADV 1)

Morphology

The form / lemma ratio of PRON is 7.526316 (the average of all parts of speech is 1.375985).
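The ratio above is consistent with dividing the 143 PRON types by the 19 PRON lemmas reported at the top of this page. The short sketch below reproduces that arithmetic; the function name form_lemma_ratio is hypothetical, and the assumption that the statistic is simply types divided by lemmas is inferred from the numbers, not confirmed by the page.

```python
# Counts copied from the summary paragraph of this page.
PRON_TYPES = 143   # distinct PRON word forms
PRON_LEMMAS = 19   # distinct PRON lemmas


def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Average number of distinct forms per lemma (assumed definition)."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_forms / n_lemmas


ratio = form_lemma_ratio(PRON_TYPES, PRON_LEMMAS)
print(f"{ratio:.6f}")  # prints 7.526316
```

Most of the spread comes from the placeholder lemma “_”, which alone accounts for 48 of the 143 forms.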

The 1st highest number of forms (48) was observed with the lemma “_”: -e, -i, [ti]ta, am, an, b, ba, bi, da, de, di, du, e, ga, ge, go, gu, gā-aisib, i, ib, kha, kho, khom, m, mi, n, na, nî, ra, ro, s, sa, sakho, se, si, sikhom, so, ta, te, ti, tita, ts, tsa, tse, tsi, în, ûib, ǁaegu.

The 2nd highest number of forms (13) was observed with the lemma “nē”: nē, nē-e, nē-i, nēb, nēba, nēde, nēga, nēn, nēna, nēro, nēro-e, nēs, nēsa.

The 3rd highest number of forms (13) was observed with the lemma “ǁî”: ǁî, ǁîb, ǁîba, ǁîde, ǁîdi, ǁîga, ǁîgu, ǁîm, ǁîn, ǁîna, ǁîra, ǁîs, ǁîsa.

PRON occurs with 12 features: PronType (3269; 100% instances), Number (3236; 99% instances), Person (3225; 99% instances), Case (2705; 83% instances), Gender (2320; 71% instances), Deixis (175; 5% instances), Clusivity (135; 4% instances), Typo (5; 0% instances), Poss (3; 0% instances), Degree (2; 0% instances), Foreign (2; 0% instances), Assoc (1; 0% instances)

PRON occurs with 30 feature-value pairs: Assoc=Yes, Case=Acc, Case=Nom, Case=Voc, Clusivity=Ex, Clusivity=In, Degree=Dim, Deixis=Contr, Deixis=Prox, Deixis=Remt, Foreign=Yes, Gender=Fem, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=2,3, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel, PronType=Tot, Typo=Yes

PRON occurs with 137 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|Person=1|PronType=Prs (582 tokens). Examples: ta, Ti, te
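A feature combination such as the one above follows the CoNLL-U FEATS convention: pairs are joined with "|", and each pair has the shape Name=Value. The following is a minimal sketch of splitting such a string; parse_feats is a hypothetical helper, not part of any UD tool. Multi-valued entries such as Gender=Fem,Neut are kept as one string here, which matches how this page counts them as a single feature-value pair.

```python
def parse_feats(feats: str) -> dict[str, str]:
    """Split a CoNLL-U FEATS string into a {feature: value} mapping.

    An underscore (or an empty string) means "no features".
    """
    if not feats or feats == "_":
        return {}
    parsed = {}
    for pair in feats.split("|"):
        # partition keeps everything after the first "=" as the value
        name, _, value = pair.partition("=")
        parsed[name] = value
    return parsed


most_frequent = parse_feats("Case=Nom|Number=Sing|Person=1|PronType=Prs")
print(most_frequent)
```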

Relations

PRON nodes are attached to their parents using 24 different relations: nsubj (2039; 62% instances), obj (289; 9% instances), expl (219; 7% instances), obl (137; 4% instances), nmod:poss (120; 4% instances), root (111; 3% instances), nsubj:pass (89; 3% instances), iobj (51; 2% instances), iobj:appl (44; 1% instances), conj (34; 1% instances), appos (26; 1% instances), fixed (25; 1% instances), expl:impers (24; 1% instances), ccomp (13; 0% instances), nsubj:outer (12; 0% instances), parataxis (11; 0% instances), reparandum (8; 0% instances), advcl (5; 0% instances), obj:appl (4; 0% instances), det (3; 0% instances), obl:agent (2; 0% instances), amod (1; 0% instances), nmod (1; 0% instances), vocative (1; 0% instances)

Parents of PRON nodes belong to 14 different parts of speech (counting the empty category of root nodes, which have no parent): VERB (2557; 78% instances), NOUN (297; 9% instances), ADJ (150; 5% instances), [no parent: root] (111; 3% instances), PRON (44; 1% instances), ADV (31; 1% instances), PROPN (26; 1% instances), PART (25; 1% instances), AUX (7; 0% instances), NUM (7; 0% instances), INTJ (6; 0% instances), DET (5; 0% instances), ADP (2; 0% instances), X (1; 0% instances)
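Counts like the relation and parent-POS tallies above can be derived from the HEAD and DEPREL columns of a CoNLL-U file. The sketch below is one possible way to do it, not the script that generated this page. The sample sentence is a toy English fragment invented for illustration (it is not taken from UD_Khoekhoe-KDT), and pron_attachment_stats is a hypothetical name.

```python
from collections import Counter

# Toy CoNLL-U fragment (hypothetical data, NOT from UD_Khoekhoe-KDT).
SAMPLE = """\
1\tshe\tshe\tPRON\t_\tCase=Nom|Number=Sing|Person=3|PronType=Prs\t2\tnsubj\t_\t_
2\tsees\tsee\tVERB\t_\t_\t0\troot\t_\t_
3\tthem\tthey\tPRON\t_\tCase=Acc|Number=Plur|Person=3|PronType=Prs\t2\tobj\t_\t_
"""


def pron_attachment_stats(conllu: str) -> tuple[Counter, Counter]:
    """Count DEPREL labels and parent UPOS tags of PRON nodes."""
    relations: Counter = Counter()
    parent_pos: Counter = Counter()
    # Sentences are separated by blank lines.
    for block in conllu.strip().split("\n\n"):
        rows = [
            line.split("\t")
            for line in block.splitlines()
            if line and not line.startswith("#")  # skip comment lines
        ]
        # Keep plain word lines only (drop ranges like 1-2 and empty nodes like 1.1).
        rows = [cols for cols in rows if cols[0].isdigit()]
        upos_by_id = {cols[0]: cols[3] for cols in rows}
        for cols in rows:
            if cols[3] != "PRON":
                continue
            relations[cols[7]] += 1
            # HEAD "0" has no entry, so root nodes fall into their own bucket.
            parent_pos[upos_by_id.get(cols[6], "(root)")] += 1
    return relations, parent_pos


relations, parent_pos = pron_attachment_stats(SAMPLE)
print(relations.most_common())
print(parent_pos.most_common())
```

Giving root nodes their own bucket mirrors the entry with 111 instances in the parent list, which equals the number of PRON nodes attached as root.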

2814 (86%) PRON nodes are leaves.

318 (10%) PRON nodes have one child.

57 (2%) PRON nodes have two children.

80 (2%) PRON nodes have three or more children.

The highest child degree of a PRON node is 9.
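As a quick consistency check, the four child-degree counts above should add up to the 3269 PRON tokens, and the rounded shares should match the percentages shown. The sketch below assumes the percentages are rounded to the nearest integer; the variable names are illustrative only.

```python
TOTAL_PRON_TOKENS = 3269  # from the summary paragraph of this page

# Child-degree counts copied from the statements above.
child_degree_counts = {
    "leaves": 2814,
    "one child": 318,
    "two children": 57,
    "three or more children": 80,
}

# The buckets partition all PRON nodes, so they must sum to the total.
assert sum(child_degree_counts.values()) == TOTAL_PRON_TOKENS

shares = {
    label: round(100 * count / TOTAL_PRON_TOKENS)
    for label, count in child_degree_counts.items()
}
for label, pct in shares.items():
    print(f"{label}: {pct}%")
```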

Children of PRON nodes are attached using 27 different relations: case (181; 24% instances), punct (122; 16% instances), aux (73; 10% instances), cc (54; 7% instances), nsubj (42; 6% instances), advmod (39; 5% instances), advmod:emph (35; 5% instances), nmod:poss (34; 5% instances), acl:relcl (22; 3% instances), discourse (22; 3% instances), conj (19; 3% instances), mark (17; 2% instances), cop (16; 2% instances), parataxis (9; 1% instances), reparandum (9; 1% instances), advcl (8; 1% instances), amod (8; 1% instances), det (8; 1% instances), vocative (6; 1% instances), obl (5; 1% instances), expl (4; 1% instances), nsubj:outer (2; 0% instances), acl (1; 0% instances), appos (1; 0% instances), goeswith (1; 0% instances), iobj (1; 0% instances), obj (1; 0% instances)

Children of PRON nodes belong to 15 different parts of speech: ADP (183; 25% instances), PUNCT (122; 16% instances), AUX (90; 12% instances), ADV (76; 10% instances), NOUN (64; 9% instances), CCONJ (54; 7% instances), PRON (44; 6% instances), VERB (32; 4% instances), DET (21; 3% instances), INTJ (21; 3% instances), SCONJ (13; 2% instances), ADJ (9; 1% instances), PROPN (5; 1% instances), PART (4; 1% instances), X (2; 0% instances)