home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Belarusian-HSE: POS Tags: PRON

There are 67 PRON lemmas (0%), 150 PRON types (0%) and 10323 PRON tokens (3%). Out of 17 observed tags, the rank of PRON is: 12 in number of lemmas, 10 in number of types and 9 in number of tokens.

The 10 most frequent PRON lemmas: які, мы, ён, гэта, што, я, яны, вы, яна, хто

The 10 most frequent PRON types: мы, гэта, што, я, ён, якія, які, хто, яны, вы

The 10 most frequent ambiguous lemmas: які (PRON 1631, DET 217), ён (PRON 1009, DET 3), гэта (PRON 843, PART 211, DET 11), што (SCONJ 1314, PRON 822, PART 9, ADV 5, DET 1), я (PRON 805, X 4), яны (PRON 725, NOUN 1), яна (PRON 464, DET 1), тое (PRON 401, DET 6, PART 1), ты (PRON 245, SCONJ 1, X 1), усё (PRON 203, ADV 61, PART 14, DET 3)

The 10 most frequent ambiguous types: гэта (PRON 465, PART 195, DET 13), што (SCONJ 1307, PRON 457, PART 8, DET 1), я (PRON 311, X 3, VERB 1), якія (PRON 444, DET 36), які (PRON 408, DET 20), яго (PRON 278, DET 191), іх (PRON 268, DET 114), якая (PRON 199, DET 16), тое (PRON 189, DET 9, PART 1), нам (PRON 147, DET 1)

Morphology

The form / lemma ratio of PRON is 2.238806 (the average of all parts of speech is 1.756638).

The 1st highest number of forms (16) was observed with the lemma “які”: якi, якiм, якiх, якiя, якая, якога, якое, якой, якому, якою, якую, які, якім, якімі, якіх, якія.

The 2nd highest number of forms (9) was observed with the lemma “усе”: Все, усiх, усе, усім, усімі, усіх, ўсе, ўсім, ўсіх.

The 3rd highest number of forms (8) was observed with the lemma “усё”: Усе, усяго, усё, усім, ўсе, ўсяго, ўсё, ўсім.

PRON occurs with 10 features: Case (10311; 100% instances), PronType (10301; 100% instances), Number (10092; 98% instances), Gender (5555; 54% instances), Person (5241; 51% instances), Animacy (3173; 31% instances), Reflex (215; 2% instances), Abbr (3; 0% instances), Degree (1; 0% instances), Typo (1; 0% instances)

PRON occurs with 27 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Pos, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes

PRON occurs with 165 feature combinations. The most frequent feature combination is Case=Nom|Number=Plur|Person=1|PronType=Prs (786 tokens). Examples: мы, ⚡️Мы, ✨Мы, 🎄Мы, 💬Мы

Relations

PRON nodes are attached to their parents using 30 different relations: nsubj (5155; 50% instances), obl (1623; 16% instances), obj (1579; 15% instances), iobj (843; 8% instances), nmod (350; 3% instances), root (264; 3% instances), conj (129; 1% instances), nsubj:pass (124; 1% instances), parataxis (44; 0% instances), ccomp (41; 0% instances), fixed (27; 0% instances), acl:relcl (23; 0% instances), appos (23; 0% instances), acl (14; 0% instances), orphan (13; 0% instances), xcomp (13; 0% instances), obl:agent (12; 0% instances), advmod (10; 0% instances), advcl (8; 0% instances), det (8; 0% instances), mark (7; 0% instances), csubj (3; 0% instances), case (2; 0% instances), list (2; 0% instances), amod (1; 0% instances), dislocated (1; 0% instances), expl (1; 0% instances), flat:foreign (1; 0% instances), nsubj:outer (1; 0% instances), reparandum (1; 0% instances)

Parents of PRON nodes belong to 17 different parts of speech: VERB (8072; 78% instances), NOUN (987; 10% instances), ADJ (452; 4% instances), (264; 3% instances), ADV (218; 2% instances), PRON (123; 1% instances), DET (72; 1% instances), PROPN (49; 0% instances), NUM (25; 0% instances), AUX (17; 0% instances), PART (13; 0% instances), SYM (10; 0% instances), X (10; 0% instances), CCONJ (4; 0% instances), ADP (3; 0% instances), INTJ (3; 0% instances), SCONJ (1; 0% instances)

7349 (71%) PRON nodes are leaves.

2038 (20%) PRON nodes have one child.

492 (5%) PRON nodes have two children.

444 (4%) PRON nodes have three or more children.

The highest child degree of a PRON node is 7.

Children of PRON nodes are attached using 31 different relations: case (1981; 42% instances), punct (674; 14% instances), advmod (333; 7% instances), acl (315; 7% instances), nsubj (250; 5% instances), nmod (187; 4% instances), conj (146; 3% instances), cc (132; 3% instances), det (116; 2% instances), acl:relcl (87; 2% instances), amod (76; 2% instances), cop (72; 2% instances), parataxis (70; 1% instances), fixed (59; 1% instances), appos (55; 1% instances), mark (41; 1% instances), obl (27; 1% instances), advcl (20; 0% instances), orphan (16; 0% instances), discourse (14; 0% instances), iobj (13; 0% instances), dep (11; 0% instances), csubj (9; 0% instances), expl (9; 0% instances), vocative (9; 0% instances), nummod:gov (6; 0% instances), dislocated (5; 0% instances), nummod (5; 0% instances), list (2; 0% instances), aux (1; 0% instances), reparandum (1; 0% instances)

Children of PRON nodes belong to 17 different parts of speech: ADP (1975; 42% instances), PUNCT (674; 14% instances), NOUN (451; 10% instances), VERB (432; 9% instances), ADV (208; 4% instances), PART (201; 4% instances), ADJ (140; 3% instances), CCONJ (132; 3% instances), DET (132; 3% instances), PRON (123; 3% instances), AUX (76; 2% instances), PROPN (72; 2% instances), SCONJ (59; 1% instances), X (26; 1% instances), SYM (21; 0% instances), NUM (16; 0% instances), INTJ (4; 0% instances)