home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Taiga: POS Tags: PART

There are 102 PART lemmas (0%), 120 PART types (0%) and 7327 PART tokens (4%). Out of 17 observed tags, the rank of PART is: 9 in number of lemmas, 11 in number of types and 9 in number of tokens.

The 10 most frequent PART lemmas: не, и, только, же, ли, вот, просто, даже, ну, тоже

The 10 most frequent PART types: не, и, только, же, ли, вот, просто, даже, ну, тоже

The 10 most frequent ambiguous lemmas: не (PART 3331, CCONJ 2, X 1), и (CCONJ 4902, PART 547, X 4, NOUN 2), только (PART 396, SCONJ 9, CCONJ 7), же (PART 338, CCONJ 1, X 1), просто (PART 230, ADV 5), ну (PART 156, INTJ 10), тоже (PART 140, ADV 10, PRON 2, X 1), это (PRON 961, PART 138, DET 2), да (CCONJ 106, PART 105), ни (PART 104, CCONJ 90, VERB 1)

The 10 most frequent ambiguous types: не (PART 3026, ADV 3, ADJ 2, CCONJ 2, VERB 2, NUM 1, PRON 1, X 1), и (CCONJ 4395, PART 544, X 4, ADP 2, NOUN 2), только (PART 346, CCONJ 7, SCONJ 7), же (PART 337, X 9, CCONJ 1), ли (PART 330, NOUN 1), просто (PART 198, ADV 3, ADJ 2), ну (PART 68, INTJ 2), тоже (PART 138, ADV 8, PRON 3, X 1), это (PRON 579, PART 133, DET 81), ни (PART 96, CCONJ 80, PRON 2, ADV 1, DET 1, VERB 1)

Morphology

The form / lemma ratio of PART is 1.176471 (the average of all parts of speech is 1.879397).

The 1st highest number of forms (7) was observed with the lemma “не”: Неееее, е, на, не, неее, ни, нп.

The 2nd highest number of forms (6) was observed with the lemma “пожалуйста”: Пожалуйстаааа, пж, пжж, подалуйста, пожалуйста, пожалуйстааа.

The 3rd highest number of forms (3) was observed with the lemma “все-таки”: все, все-таки, всё-таки.

PART occurs with 4 features: Polarity (3506; 48% instances), Typo (15; 0% instances), Abbr (6; 0% instances), Foreign (4; 0% instances)

PART occurs with 4 feature-value pairs: Abbr=Yes, Foreign=Yes, Polarity=Neg, Typo=Yes

PART occurs with 7 feature combinations. The most frequent feature combination is _ (3801 tokens). Examples: и, только, же, ли, вот, просто, даже, ну, тоже, это

Relations

PART nodes are attached to their parents using 24 different relations: advmod (6276; 86% instances), fixed (299; 4% instances), discourse (210; 3% instances), parataxis (174; 2% instances), expl (138; 2% instances), root (119; 2% instances), conj (42; 1% instances), cc (34; 0% instances), orphan (5; 0% instances), case (4; 0% instances), flat:name (4; 0% instances), advcl (3; 0% instances), dislocated (3; 0% instances), appos (2; 0% instances), dep (2; 0% instances), flat (2; 0% instances), list (2; 0% instances), obl (2; 0% instances), ccomp (1; 0% instances), flat:foreign (1; 0% instances), mark (1; 0% instances), nmod (1; 0% instances), nsubj (1; 0% instances), obj (1; 0% instances)

Parents of PART nodes belong to 17 different parts of speech: VERB (3538; 48% instances), NOUN (1163; 16% instances), ADV (701; 10% instances), ADJ (687; 9% instances), PRON (344; 5% instances), DET (271; 4% instances), PART (171; 2% instances), (119; 2% instances), PROPN (85; 1% instances), NUM (81; 1% instances), CCONJ (68; 1% instances), SCONJ (47; 1% instances), AUX (27; 0% instances), INTJ (16; 0% instances), X (5; 0% instances), ADP (2; 0% instances), SYM (2; 0% instances)

6682 (91%) PART nodes are leaves.

499 (7%) PART nodes have one child.

79 (1%) PART nodes have two children.

67 (1%) PART nodes have three or more children.

The highest child degree of a PART node is 6.

Children of PART nodes are attached using 28 different relations: punct (348; 39% instances), advmod (186; 21% instances), fixed (126; 14% instances), cc (44; 5% instances), parataxis (34; 4% instances), nsubj (28; 3% instances), conj (21; 2% instances), vocative (21; 2% instances), discourse (20; 2% instances), mark (11; 1% instances), goeswith (8; 1% instances), csubj (7; 1% instances), advcl (5; 1% instances), iobj (5; 1% instances), det (4; 0% instances), obl (4; 0% instances), orphan (3; 0% instances), appos (2; 0% instances), case (2; 0% instances), flat (2; 0% instances), flat:name (2; 0% instances), acl:relcl (1; 0% instances), aux (1; 0% instances), ccomp (1; 0% instances), cop (1; 0% instances), expl (1; 0% instances), nmod (1; 0% instances), nummod:gov (1; 0% instances)

Children of PART nodes belong to 17 different parts of speech: PUNCT (348; 39% instances), PART (171; 19% instances), ADV (106; 12% instances), CCONJ (59; 7% instances), NOUN (53; 6% instances), VERB (41; 5% instances), PRON (24; 3% instances), PROPN (19; 2% instances), SCONJ (14; 2% instances), SYM (12; 1% instances), X (12; 1% instances), ADJ (8; 1% instances), AUX (8; 1% instances), ADP (5; 1% instances), DET (5; 1% instances), INTJ (4; 0% instances), NUM (1; 0% instances)