home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Ukrainian-IU: POS Tags: PRON

There are 36 PRON lemmas (0%), 101 PRON types (0%) and 5046 PRON tokens (4%). Out of 17 observed tags, the rank of PRON is: 15 in number of lemmas, 11 in number of types and 7 in number of tokens.

The 10 most frequent PRON lemmas: він, я, це, вони, ми, вона, те, що, ви, себе

The 10 most frequent PRON types: він, це, я, ми, вона, його, вони, що, їх, мене

The 10 most frequent ambiguous lemmas: це (PRON 486, PART 9, ADV 1), що (SCONJ 1072, PRON 237, PART 19, ADV 2), себе (PRON 172, PART 1, X 1), все (PRON 94, ADV 18, PART 15), щось (PRON 65, ADV 3), то (PART 167, PRON 25, CCONJ 24), усе (PRON 22, ADV 5, PART 1), оце (PART 4, PRON 4), т. (ADV 10, DET 5, PRON 4, NOUN 3, ADJ 1), дещо (ADV 10, PRON 2)

The 10 most frequent ambiguous types: це (PRON 308, DET 21, PART 7, ADV 1), його (PRON 200, DET 178, NOUN 1), що (SCONJ 1064, PRON 136, PART 17, ADV 2), їх (PRON 129, DET 31), те (PRON 105, DET 10), її (PRON 107, DET 87), того (PRON 93, DET 47, ADV 3), все (PRON 75, DET 26, ADV 18, PART 13), собі (PRON 75, PART 2), себе (PRON 69, X 1)

Morphology

The form / lemma ratio of PRON is 2.805556 (the average of all parts of speech is 1.738445).

The 1st highest number of forms (7) was observed with the lemma “він”: він, його, йому, ним, нього, ньому, нім.

The 2nd highest number of forms (6) was observed with the lemma “вона”: вона, нею, неї, ній, їй, її.

The 3rd highest number of forms (6) was observed with the lemma “що”: віщо, чим, чого, чому, чім, що.

PRON occurs with 9 features: Case (5046; 100% instances), PronType (5046; 100% instances), Number (4722; 94% instances), Person (3350; 66% instances), Animacy (3014; 60% instances), Gender (2754; 55% instances), Reflex (174; 3% instances), Abbr (4; 0% instances), Uninflect (4; 0% instances)

PRON occurs with 27 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Uninflect=Yes

PRON occurs with 111 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs (430 tokens). Examples: він

Relations

PRON nodes are attached to their parents using 27 different relations: nsubj (2278; 45% instances), obj (1138; 23% instances), obl (873; 17% instances), nmod (213; 4% instances), iobj (196; 4% instances), expl (114; 2% instances), root (62; 1% instances), conj (39; 1% instances), orphan (25; 0% instances), parataxis (22; 0% instances), parataxis:discourse (17; 0% instances), appos (10; 0% instances), advcl (9; 0% instances), ccomp (9; 0% instances), fixed (9; 0% instances), flat:abs (6; 0% instances), xcomp:sp (5; 0% instances), acl (4; 0% instances), advmod (4; 0% instances), discourse (3; 0% instances), flat:repeat (2; 0% instances), flat:sibl (2; 0% instances), xcomp (2; 0% instances), case (1; 0% instances), csubj (1; 0% instances), reparandum (1; 0% instances), vocative (1; 0% instances)

Parents of PRON nodes belong to 14 different parts of speech: VERB (3905; 77% instances), NOUN (543; 11% instances), ADJ (225; 4% instances), ADV (163; 3% instances), PRON (66; 1% instances), (62; 1% instances), DET (49; 1% instances), PROPN (12; 0% instances), ADP (5; 0% instances), NUM (5; 0% instances), X (5; 0% instances), PART (3; 0% instances), INTJ (2; 0% instances), SYM (1; 0% instances)

3593 (71%) PRON nodes are leaves.

1057 (21%) PRON nodes have one child.

259 (5%) PRON nodes have two children.

137 (3%) PRON nodes have three or more children.

The highest child degree of a PRON node is 8.

Children of PRON nodes are attached using 33 different relations: case (1010; 48% instances), punct (208; 10% instances), acl (159; 8% instances), discourse (114; 5% instances), det (81; 4% instances), nmod (77; 4% instances), advmod (73; 3% instances), acl:relcl (61; 3% instances), nsubj (53; 3% instances), cc (52; 2% instances), conj (44; 2% instances), appos (31; 1% instances), amod (29; 1% instances), orphan (21; 1% instances), parataxis (17; 1% instances), cop (13; 1% instances), mark (12; 1% instances), obl (8; 0% instances), expl (6; 0% instances), fixed (6; 0% instances), advcl (4; 0% instances), nummod (4; 0% instances), vocative (4; 0% instances), csubj (3; 0% instances), obj (3; 0% instances), acl:adv (2; 0% instances), det:numgov (2; 0% instances), flat:repeat (2; 0% instances), ccomp (1; 0% instances), det:nummod (1; 0% instances), flat:sibl (1; 0% instances), parataxis:discourse (1; 0% instances), reparandum (1; 0% instances)

Children of PRON nodes belong to 15 different parts of speech: ADP (1010; 48% instances), PUNCT (208; 10% instances), VERB (205; 10% instances), NOUN (155; 7% instances), PART (139; 7% instances), DET (91; 4% instances), ADV (75; 4% instances), PRON (66; 3% instances), CCONJ (51; 2% instances), ADJ (46; 2% instances), PROPN (27; 1% instances), AUX (13; 1% instances), SCONJ (12; 1% instances), NUM (4; 0% instances), INTJ (2; 0% instances)