home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Karelian-KKPP: POS Tags: PRON

There are 24 PRON lemmas (3%), 87 PRON types (6%) and 289 PRON tokens (9%). Out of 14 observed tags, the rank of PRON is: 8 in number of lemmas, 6 in number of types and 4 in number of tokens.

The 10 most frequent PRON lemmas: še, mie, myö, mi, hyö, hiän, kaikki, tämä, kumpaine, ne

The 10 most frequent PRON types: hyö, mie, hiän, myö, meijän, mitä, še, šiitä, miun, kaikki

The 10 most frequent ambiguous lemmas: toini (ADJ 14, PRON 2), työ (NOUN 2, PRON 2)

The 10 most frequent ambiguous types: toisen (ADJ 2, PRON 1)

Morphology

The form / lemma ratio of PRON is 3.625000 (the average of all parts of speech is 1.495298).

The 1st highest number of forms (7) was observed with the lemma “kumpaine”: kumpaista, kumpani, kumpasen, kumpaset, kumpasešta, kumpasien, kumpasista.

The 2nd highest number of forms (7) was observed with the lemma “mi”: mi, mih, min, missä, mistä, mit, mitä.

The 3rd highest number of forms (7) was observed with the lemma “tämä”: täh, tähä, tällä, tämän, tänä, tätä, täššä.

PRON occurs with 6 features: Case (288; 100% instances), Number (288; 100% instances), PronType (243; 84% instances), Person (127; 44% instances), Reflex (9; 3% instances), Person[psor] (2; 1% instances)

PRON occurs with 22 feature-value pairs: Case=Abl, Case=Acc, Case=Ade, Case=Com, Case=Ela, Case=Ess, Case=Gen, Case=Ill, Case=Ine, Case=Ins, Case=Nom, Case=Par, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Person[psor]=3, PronType=Dem, PronType=Ind, PronType=Prs, Reflex=Yes

PRON occurs with 64 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|Person=1|PronType=Prs (21 tokens). Examples: mie, Myö

Relations

PRON nodes are attached to their parents using 15 different relations: nsubj (98; 34% instances), obl (49; 17% instances), det (37; 13% instances), obj (37; 13% instances), nmod:poss (29; 10% instances), nsubj:cop (12; 4% instances), nmod (9; 3% instances), conj (4; 1% instances), nmod:gsubj (4; 1% instances), amod (3; 1% instances), fixed (2; 1% instances), root (2; 1% instances), acl:relcl (1; 0% instances), appos (1; 0% instances), parataxis (1; 0% instances)

Parents of PRON nodes belong to 8 different parts of speech: VERB (170; 59% instances), NOUN (86; 30% instances), AUX (12; 4% instances), ADJ (8; 3% instances), PRON (6; 2% instances), ADP (3; 1% instances), ADV (2; 1% instances), (2; 1% instances)

241 (83%) PRON nodes are leaves.

38 (13%) PRON nodes have one child.

6 (2%) PRON nodes have two children.

4 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 6.

Children of PRON nodes are attached using 17 different relations: cc (22; 33% instances), case (8; 12% instances), punct (6; 9% instances), nsubj:cop (5; 7% instances), conj (4; 6% instances), cop:own (3; 4% instances), det (3; 4% instances), obl (3; 4% instances), acl:relcl (2; 3% instances), appos (2; 3% instances), cop (2; 3% instances), parataxis (2; 3% instances), advmod (1; 1% instances), amod (1; 1% instances), ccomp (1; 1% instances), mark (1; 1% instances), obj (1; 1% instances)

Children of PRON nodes belong to 11 different parts of speech: CCONJ (22; 33% instances), NOUN (16; 24% instances), PRON (6; 9% instances), PUNCT (6; 9% instances), ADP (5; 7% instances), AUX (5; 7% instances), ADJ (2; 3% instances), VERB (2; 3% instances), ADV (1; 1% instances), PROPN (1; 1% instances), SCONJ (1; 1% instances)