Treebank Statistics: UD_Cappadocian-AMGiC: POS Tags: PRON
There are 20 PRON lemmas (6%), 45 PRON types (10%) and 94 PRON tokens (11%).
Out of 16 observed tags, the rank of PRON is: 4 in number of lemmas, 4 in number of types and 4 in number of tokens.
The 10 most frequent PRON lemmas: (e)γó, to, (e)tútus, emís, o, ro, Ešís, _, cínus, do
The 10 most frequent PRON types: to, du, su, tu, da, mu, či, do, ta, tútus
The 10 most frequent ambiguous lemmas: o (DET 29, PRON 2), ro (ADV 2, PRON 2), _ (X 2, NOUN 1, PRON 1), ne (CCONJ 2, PRON 1), óči (SCONJ 7, PRON 1)
The 10 most frequent ambiguous types: to (DET 16, PRON 10), tu (DET 11, PRON 5, SCONJ 1), da (PRON 4, DET 1), či (PRON 4, DET 2, PART 1), ta (DET 6, PRON 3), m (PRON 2, AUX 1), ro (ADV 2, PRON 2), se (AUX 5, PRON 2), m’ (AUX 2, PRON 1), ne (AUX 4, CCONJ 2, PRON 1)
- to
- tu
- da
- či
- ta
- m
- ro
- se
- m’
- ne
Morphology
The form / lemma ratio of PRON is 2.250000 (the average of all parts of speech is 1.244253).
The 1st highest number of forms (27) was observed with the lemma “(e)γó”: da, do, du, m, m’, mas, me, mu, s’, sas, se, su, séna, ta, to, tu, tun, tus, zin, zis, či, čis, ši, ǰi, ǰis, γo, δa.
The 2nd highest number of forms (3) was observed with the lemma “(e)tútus”: tútunu, tútus, Τúta.
The 3rd highest number of forms (2) was observed with the lemma “emís”: emís, más.
PRON occurs with 7 features: PronType (94; 100% instances), Number (86; 91% instances), Case (85; 90% instances), Person (85; 90% instances), Gender (50; 53% instances), Clitic (47; 50% instances), Poss (27; 29% instances)
PRON occurs with 18 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Clitic=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel
PRON occurs with 42 feature combinations.
The most frequent feature combination is Case=Acc|Clitic=Yes|Gender=Neut|Number=Sing|Person=3|PronType=Prs (11 tokens).
Examples: da, do, ta, to, δa
Relations
PRON nodes are attached to their parents using 10 different relations: nmod (29; 31% instances), obj (26; 28% instances), nsubj (16; 17% instances), expl (9; 10% instances), iobj (6; 6% instances), det (3; 3% instances), obl (2; 2% instances), advcl (1; 1% instances), ccomp (1; 1% instances), det:poss (1; 1% instances)
Parents of PRON nodes belong to 6 different parts of speech: VERB (59; 63% instances), NOUN (29; 31% instances), ADJ (2; 2% instances), ADV (2; 2% instances), ADP (1; 1% instances), PROPN (1; 1% instances)
87 (93%) PRON nodes are leaves.
5 (5%) PRON nodes have one child.
1 (1%) PRON nodes have two children.
1 (1%) PRON nodes have three or more children.
The highest child degree of a PRON node is 3.
Children of PRON nodes are attached using 8 different relations: cc (2; 20% instances), mark (2; 20% instances), acl (1; 10% instances), advmod (1; 10% instances), advmod:emph (1; 10% instances), aux (1; 10% instances), cop (1; 10% instances), punct (1; 10% instances)
Children of PRON nodes belong to 8 different parts of speech: AUX (2; 20% instances), CCONJ (2; 20% instances), ADP (1; 10% instances), ADV (1; 10% instances), PART (1; 10% instances), PUNCT (1; 10% instances), SCONJ (1; 10% instances), VERB (1; 10% instances)