home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Ruthenian: POS Tags: X

There are 34 X lemmas (0%), 53 X types (0%) and 71 X tokens (0%). Out of 17 observed tags, the rank of X is: 13 in number of lemmas, 14 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: _, etc., idem, аd, alie, z, 1, 2v, 3v, 4v

The 10 most frequent X types: etc., ad, іdem, Alіe, [2_зв.], [4_зв.], z, с, (м), Confіrmatіo

The 10 most frequent ambiguous lemmas: _ (X 23, ADJ 1), 1 (ADJ 11, NUM 2, X 1), за (ADP 371, ADV 1, X 1), съ (ADP 863, X 1)

The 10 most frequent ambiguous types: с (ADP 547, X 2), (м) (AUX 2, X 1), [10] (ADJ 2, X 1), [4] (ADJ 1, NUM 1, X 1), [5] (NUM 1, X 1), [7] (ADJ 1, X 1), [8] (ADJ 1, X 1), за (ADP 367, ADV 1, X 1), к (ADP 421, X 1), ти (PRON 1, X 1)

Morphology

The form / lemma ratio of X is 1.558824 (the average of all parts of speech is 2.589846).

The 1st highest number of forms (23) was observed with the lemma “_”: (м), [10], [1_зв.], [2], [2_зв.], [3_зв.], [4], [4_зв.], [5], [5_зв.], [6], [6_зв.], [7], [7_зв.], [8], [8_зв.], [9], [9_зв.], бьчии, и(з)налъ, к, с, сла.

The 2nd highest number of forms (1) was observed with the lemma “1”: [1].

The 3rd highest number of forms (1) was observed with the lemma “2v”: [2v].

X occurs with 2 features: Foreign (36; 51% instances), Typo (1; 1% instances)

X occurs with 2 feature-value pairs: Foreign=Yes, Typo=Yes

X occurs with 3 feature combinations. The most frequent feature combination is Foreign=Yes (36 tokens). Examples: etc., ad, іdem, Alіe, z, Confіrmatіo, Kopіa, N, Połockіe(g)o, Sіgіsmundus

Relations

X nodes are attached to their parents using 11 different relations: dep (32; 45% instances), nmod (9; 13% instances), case (7; 10% instances), root (6; 8% instances), goeswith (4; 6% instances), amod (3; 4% instances), appos (3; 4% instances), conj (3; 4% instances), flat:foreign (2; 3% instances), iobj (1; 1% instances), obl (1; 1% instances)

Parents of X nodes belong to 9 different parts of speech: NOUN (21; 30% instances), X (20; 28% instances), VERB (15; 21% instances), (6; 8% instances), ADJ (4; 6% instances), ADV (2; 3% instances), ADP (1; 1% instances), DET (1; 1% instances), PRON (1; 1% instances)

53 (75%) X nodes are leaves.

10 (14%) X nodes have one child.

3 (4%) X nodes have two children.

5 (7%) X nodes have three or more children.

The highest child degree of a X node is 7.

Children of X nodes are attached using 14 different relations: nmod (8; 22% instances), case (5; 14% instances), punct (5; 14% instances), amod (4; 11% instances), cc (2; 6% instances), dep (2; 6% instances), flat:foreign (2; 6% instances), nsubj (2; 6% instances), advmod (1; 3% instances), det (1; 3% instances), iobj (1; 3% instances), obj (1; 3% instances), obl (1; 3% instances), orphan (1; 3% instances)

Children of X nodes belong to 9 different parts of speech: X (20; 56% instances), PUNCT (5; 14% instances), PRON (3; 8% instances), CCONJ (2; 6% instances), DET (2; 6% instances), ADJ (1; 3% instances), NOUN (1; 3% instances), PART (1; 3% instances), PROPN (1; 3% instances)