home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-RNC: POS Tags: X

There are 152 X lemmas (3%), 184 X types (2%) and 205 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 9 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: _, sic, </i>, , {л._219}, </em>, , etc., {3reg:10.}, {4_reg:_11.}

The 10 most frequent X types: л., [sic], </i>, , {л._219}, {л._300}, {л._301_об.}, {л._302_об.}, {л._302}, 1</em>

The 10 most frequent ambiguous lemmas: _ (X 48, NOUN 1)

The 10 most frequent ambiguous types: л. (X 12, NOUN 11), 1 (ADJ 7, NUM 6, X 1), 10 (NUM 45, ADJ 7, X 1), 11 (NUM 14, ADJ 8, X 1), 12 (NUM 25, ADJ 9, X 1), 13 (NUM 6, ADJ 5, X 1), 14 (ADJ 5, NUM 3, X 1), 15 (NUM 12, ADJ 3, X 1), 3 (NUM 157, ADJ 5, ADV 1, X 1), 6 (NUM 43, ADJ 4, X 1)

Morphology

The form / lemma ratio of X is 1.210526 (the average of all parts of speech is 1.988362).

The 1st highest number of forms (37) was observed with the lemma “_”: 1, 10, 11, 12, 13, 14, 15, 3, 6, 7, 8, 9, {л._1_об.}, {л._254_об.}, {л._255_об.}, {л._255}, {л._256}, {л._300_об.}, {л._300}, {л._301_об.}, {л._302_об.}, {л._302}, {л._303}, {л._304_об.}, {л._304}, {л._305}, {л._306_об.}, {л._3}, {л._41}, {л._4}, {л._5}, {л._602_об.}, {л._602}, Помета, ж, л., об..

The 2nd highest number of forms (1) was observed with the lemma “</em>”: </em>.

The 3rd highest number of forms (1) was observed with the lemma “</i>”: </i>.

X occurs with 1 features: Foreign (1; 0% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is _ (204 tokens). Examples: л., [sic], </i>, , {л._219}, {л._300}, {л._301_об.}, {л._302_об.}, {л._302}, 1</em>

Relations

X nodes are attached to their parents using 5 different relations: dep (187; 91% instances), parataxis (14; 7% instances), nmod (2; 1% instances), goeswith (1; 0% instances), root (1; 0% instances)

Parents of X nodes belong to 10 different parts of speech: NOUN (93; 45% instances), VERB (66; 32% instances), PROPN (14; 7% instances), X (14; 7% instances), ADJ (6; 3% instances), PRON (5; 2% instances), DET (4; 2% instances), ADV (1; 0% instances), NUM (1; 0% instances), (1; 0% instances)

190 (93%) X nodes are leaves.

2 (1%) X nodes have one child.

0 (0%) X nodes have two children.

13 (6%) X nodes have three or more children.

The highest child degree of a X node is 6.

Children of X nodes are attached using 5 different relations: punct (29; 64% instances), dep (12; 27% instances), parataxis (2; 4% instances), case (1; 2% instances), nmod (1; 2% instances)

Children of X nodes belong to 4 different parts of speech: PUNCT (29; 64% instances), X (14; 31% instances), ADP (1; 2% instances), VERB (1; 2% instances)