
This page pertains to UD version 2.

Treebank Statistics: UD_Belarusian-HSE: POS Tags: PRON

There are 73 PRON lemmas (0%), 153 PRON types (0%) and 10327 PRON tokens (3%). Out of 17 observed tags, the rank of PRON is: 11 in number of lemmas, 10 in number of types and 9 in number of tokens.

The 10 most frequent PRON lemmas: які, мы, ён, гэта, што, я, яны, вы, яна, хто

The 10 most frequent PRON types: мы, гэта, што, я, ён, якія, які, хто, яны, вы

The 10 most frequent ambiguous lemmas: які (PRON 1630, DET 218), ён (PRON 1009, DET 3), гэта (PRON 843, PART 211, DET 11), што (SCONJ 1315, PRON 819, PART 9, ADV 5, DET 2), я (PRON 805, X 4), яны (PRON 725, NOUN 1), яна (PRON 464, DET 1), тое (PRON 400, DET 6, PART 1), ты (PRON 245, SCONJ 1, X 1), усё (PRON 203, ADV 61, PART 14, DET 3)

The 10 most frequent ambiguous types: гэта (PRON 465, PART 195, DET 13), што (SCONJ 1308, PRON 455, PART 8, DET 2), я (PRON 311, X 3, VERB 1), якія (PRON 444, DET 36), які (PRON 408, DET 20), яго (PRON 278, DET 191), іх (PRON 269, DET 113), якая (PRON 199, DET 16), тое (PRON 189, DET 9, PART 1), нам (PRON 147, DET 1)

Morphology

The form / lemma ratio of PRON is 2.095890 (the average of all parts of speech is 1.754875).
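The ratio above appears to be the number of distinct PRON types (word forms) divided by the number of distinct PRON lemmas. A minimal sketch of that arithmetic, assuming this definition and using the counts reported on this page (the function name `form_lemma_ratio` is illustrative, not part of any UD tool):

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas

# Counts taken from this page: 153 PRON types, 73 PRON lemmas.
pron_ratio = form_lemma_ratio(153, 73)
print(f"{pron_ratio:.6f}")  # 2.095890
```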

The highest number of forms (16) was observed with the lemma “які”: якi, якiм, якiх, якiя, якая, якога, якое, якой, якому, якою, якую, які, якім, якімі, якіх, якія.

The 2nd highest number of forms (9) was observed with the lemma “усе”: Все, усiх, усе, усім, усімі, усіх, ўсе, ўсім, ўсіх.

The 3rd highest number of forms (8) was observed with the lemma “усё”: Усе, усяго, усё, усім, ўсе, ўсяго, ўсё, ўсім.

PRON occurs with 11 features: Case (10312; 100% instances), PronType (10302; 100% instances), Number (10093; 98% instances), Gender (5554; 54% instances), Person (5242; 51% instances), Animacy (3174; 31% instances), Reflex (215; 2% instances), Abbr (3; 0% instances), Degree (2; 0% instances), Poss (2; 0% instances), Typo (1; 0% instances)

PRON occurs with 28 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Pos, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes

PRON occurs with 173 feature combinations. The most frequent feature combination is Case=Nom|Number=Plur|Person=1|PronType=Prs (786 tokens). Examples: мы, ⚡️Мы, ✨Мы, 🎄Мы, 💬Мы
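Feature counts like the ones above can be tallied from the FEATS column of a CoNLL-U file. The following is a rough sketch rather than the script that generated this page: the toy sentence is invented, and the names `TOY_ROWS`, `TOY_CONLLU` and `feature_counts` are hypothetical.

```python
from collections import Counter

# Invented toy sentence in CoNLL-U column order (not taken from the treebank).
TOY_ROWS = [
    ["1", "мы", "мы", "PRON", "_", "Case=Nom|Number=Plur|Person=1|PronType=Prs", "2", "nsubj", "_", "_"],
    ["2", "бачым", "бачыць", "VERB", "_", "Number=Plur|Person=1", "0", "root", "_", "_"],
    ["3", "гэта", "гэта", "PRON", "_", "Case=Acc|Number=Sing|PronType=Dem", "2", "obj", "_", "_"],
]
TOY_CONLLU = "\n".join("\t".join(row) for row in TOY_ROWS)

def feature_counts(conllu_text, upos="PRON"):
    """Return (number of tokens with the given UPOS, Counter of feature names)."""
    total = 0
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword token ranges and empty nodes
        if cols[3] != upos:
            continue
        total += 1
        if cols[5] != "_":
            for pair in cols[5].split("|"):
                name, _, _value = pair.partition("=")
                counts[name] += 1
    return total, counts

total, counts = feature_counts(TOY_CONLLU)
for name, n in counts.most_common():
    # Same layout as the listing above: count, then share of PRON tokens.
    print(f"{name} ({n}; {round(100 * n / total)}% instances)")
```

Counting full FEATS strings instead of individual feature names would give the feature combinations reported above.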

Relations

PRON nodes are attached to their parents using 29 different relations: nsubj (5158; 50% instances), obl (1624; 16% instances), obj (1577; 15% instances), iobj (844; 8% instances), nmod (350; 3% instances), root (264; 3% instances), conj (129; 1% instances), nsubj:pass (124; 1% instances), parataxis (44; 0% instances), ccomp (41; 0% instances), fixed (27; 0% instances), acl:relcl (23; 0% instances), appos (23; 0% instances), acl (14; 0% instances), orphan (13; 0% instances), xcomp (13; 0% instances), obl:agent (12; 0% instances), advmod (10; 0% instances), det (10; 0% instances), advcl (8; 0% instances), mark (7; 0% instances), csubj (3; 0% instances), case (2; 0% instances), list (2; 0% instances), amod (1; 0% instances), dislocated (1; 0% instances), expl (1; 0% instances), flat:foreign (1; 0% instances), reparandum (1; 0% instances)

Parents of PRON nodes belong to 17 different parts of speech: VERB (8072; 78% instances), NOUN (988; 10% instances), ADJ (455; 4% instances), [root, no parent] (264; 3% instances), ADV (218; 2% instances), PRON (122; 1% instances), DET (72; 1% instances), PROPN (50; 0% instances), NUM (25; 0% instances), AUX (17; 0% instances), PART (13; 0% instances), SYM (10; 0% instances), X (10; 0% instances), CCONJ (4; 0% instances), ADP (3; 0% instances), INTJ (3; 0% instances), SCONJ (1; 0% instances)
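The relation and parent statistics above can be derived from the HEAD and DEPREL columns of each sentence. The sketch below is an illustration under stated assumptions, not the generator's actual code: the rows are an invented toy sentence, `attachment_stats` is a hypothetical helper, and a node whose HEAD is 0 is recorded with an empty parent label because the root has no parent part of speech.

```python
from collections import Counter

# (ID, FORM, UPOS, HEAD, DEPREL) for one invented toy sentence.
ROWS = [
    (1, "мы", "PRON", 2, "nsubj"),
    (2, "бачым", "VERB", 0, "root"),
    (3, "гэта", "PRON", 2, "obj"),
]

def attachment_stats(rows, upos="PRON"):
    """Tally incoming relations and parent UPOS tags for nodes with the given UPOS."""
    upos_by_id = {row[0]: row[2] for row in rows}
    relations = Counter()
    parent_pos = Counter()
    for _node_id, _form, pos, head, deprel in rows:
        if pos != upos:
            continue
        relations[deprel] += 1
        # HEAD 0 is the artificial root, so it maps to an empty label.
        parent_pos[upos_by_id.get(head, "")] += 1
    return relations, parent_pos

relations, parent_pos = attachment_stats(ROWS)
print(relations)
print(parent_pos)
```

Because every node has exactly one incoming relation, the relation counts and the parent counts each sum to the number of PRON tokens.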

7353 (71%) PRON nodes are leaves.

2038 (20%) PRON nodes have one child.

493 (5%) PRON nodes have two children.

443 (4%) PRON nodes have three or more children.

The highest child degree of a PRON node is 7.
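As a consistency check, the four child-degree classes above should partition all PRON tokens. A short sketch using only the counts reported on this page (the variable names are illustrative):

```python
# Child-degree counts for PRON nodes, copied from the statements above.
degree_counts = {
    "leaves": 7353,
    "one child": 2038,
    "two children": 493,
    "three or more children": 443,
}

total = sum(degree_counts.values())
print(total)  # 10327, the number of PRON tokens reported for this treebank

# Share of each class, rounded to whole percent as on this page.
shares = {label: round(100 * n / total) for label, n in degree_counts.items()}
for label, pct in shares.items():
    print(f"{label}: {pct}%")
```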

Children of PRON nodes are attached using 31 different relations: case (1981; 42% instances), punct (674; 14% instances), advmod (333; 7% instances), acl (315; 7% instances), nsubj (249; 5% instances), nmod (187; 4% instances), conj (147; 3% instances), cc (133; 3% instances), det (116; 2% instances), acl:relcl (88; 2% instances), amod (76; 2% instances), cop (72; 2% instances), parataxis (70; 1% instances), fixed (58; 1% instances), appos (55; 1% instances), mark (41; 1% instances), obl (26; 1% instances), advcl (20; 0% instances), orphan (16; 0% instances), discourse (14; 0% instances), iobj (13; 0% instances), dep (11; 0% instances), csubj (9; 0% instances), expl (9; 0% instances), vocative (9; 0% instances), nummod:gov (6; 0% instances), nummod (5; 0% instances), dislocated (4; 0% instances), list (2; 0% instances), aux (1; 0% instances), reparandum (1; 0% instances)

Children of PRON nodes belong to 17 different parts of speech: ADP (1974; 42% instances), PUNCT (674; 14% instances), NOUN (450; 9% instances), VERB (432; 9% instances), ADV (208; 4% instances), PART (201; 4% instances), ADJ (140; 3% instances), CCONJ (133; 3% instances), DET (132; 3% instances), PRON (122; 3% instances), AUX (76; 2% instances), PROPN (73; 2% instances), SCONJ (59; 1% instances), X (26; 1% instances), SYM (21; 0% instances), NUM (16; 0% instances), INTJ (4; 0% instances)