home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Ruthenian: POS Tags: X

There are 65 X lemmas (1%), 133 X types (1%) and 152 X tokens (0%). Out of 17 observed tags, the rank of X is: 9 in number of lemmas, 12 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: _, etc., idem, аd, alie, z, 1, 2v, 3v, 4v

The 10 most frequent X types: etc., ad, іdem, Alіe, [2_зв.], [4_зв.], z, {76}, с, Confіrmatіo

The 10 most frequent ambiguous lemmas: _ (X 73, ADJ 1, VERB 1), 1 (ADJ 13, NUM 2, X 1), за (ADP 521, ADV 1, X 1), съ (ADP 938, X 1), я (PRON 439, X 1)

The 10 most frequent ambiguous types: с (ADP 620, X 2), [10] (ADJ 7, X 1), [1] (ADJ 1, X 1), [2] (ADJ 2, X 1), [4] (ADJ 7, NUM 1, X 1), [5] (ADJ 1, NUM 1, X 1), [6] (ADJ 1, NUM 1, X 1), [7] (ADJ 4, X 1), [8] (ADJ 5, NUM 1, X 1), [9] (ADJ 2, X 1)

Morphology

The form / lemma ratio of X is 2.046154 (the average of all parts of speech is 2.698737).

The 1st highest number of forms (72) was observed with the lemma “_”: [10], [1_зв.], [2], [2_зв.], [3_зв.], [4], [4_зв.], [5], [5_зв.], [6], [6_зв.], [7], [7_зв.], [8], [8_зв.], [9], [9_зв.], {11_зв.}, {22_зв.}, {23_зв.}, {24_зв.}, {24}, {25_зв.}, {25}, {26_зв.}, {26}, {27_зв.}, {27}, {28_зв.}, {28}, {29_зв.}, {29}, {30_зв.}, {30}, {31_зв.}, {32_зв.}, {32}, {33_зв.}, {33}, {34_зв.}, {34}, {35_зв.}, {35}, {36_зв.}, {36}, {37_зв.}, {37}, {38_зв.}, {38}, {39}, {55}, {56}, {57}, {58}, {59}, {61}, {62}, {63}, {66}, {67}, {68}, {69}, {70}, {74}, {76}, {78}, {79}, бьчии, и(з)налъ, к, с, сла.

The 2nd highest number of forms (1) was observed with the lemma “1”: [1].

The 3rd highest number of forms (1) was observed with the lemma “2v”: [2v].

X occurs with 2 features: Foreign (36; 24% instances), Typo (1; 1% instances)

X occurs with 2 feature-value pairs: Foreign=Yes, Typo=Yes

X occurs with 3 feature combinations. The most frequent feature combination is _ (115 tokens). Examples: [2_зв.], [4_зв.], {76}, [10], [1], [1_зв.], [2], [2v], [3_зв.], [3v]

Relations

X nodes are attached to their parents using 10 different relations: dep (113; 74% instances), nmod (9; 6% instances), case (7; 5% instances), root (6; 4% instances), conj (4; 3% instances), goeswith (4; 3% instances), amod (3; 2% instances), appos (3; 2% instances), flat:foreign (2; 1% instances), obl (1; 1% instances)

Parents of X nodes belong to 10 different parts of speech: NOUN (55; 36% instances), VERB (47; 31% instances), X (20; 13% instances), ADJ (8; 5% instances), PROPN (7; 5% instances), (6; 4% instances), PRON (4; 3% instances), ADV (2; 1% instances), DET (2; 1% instances), ADP (1; 1% instances)

133 (88%) X nodes are leaves.

10 (7%) X nodes have one child.

4 (3%) X nodes have two children.

5 (3%) X nodes have three or more children.

The highest child degree of a X node is 7.

Children of X nodes are attached using 14 different relations: nmod (8; 21% instances), punct (6; 16% instances), case (5; 13% instances), amod (4; 11% instances), cc (2; 5% instances), dep (2; 5% instances), flat:foreign (2; 5% instances), nsubj (2; 5% instances), obj (2; 5% instances), advmod (1; 3% instances), det (1; 3% instances), iobj (1; 3% instances), obl (1; 3% instances), orphan (1; 3% instances)

Children of X nodes belong to 9 different parts of speech: X (20; 53% instances), PUNCT (6; 16% instances), PRON (3; 8% instances), CCONJ (2; 5% instances), DET (2; 5% instances), NOUN (2; 5% instances), ADJ (1; 3% instances), PART (1; 3% instances), PROPN (1; 3% instances)