
This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Birchbark: POS Tags: X

There are 884 X lemmas (18%), 912 X types (8%) and 1581 X tokens (6%). Out of 17 observed tags, the rank of X is: 3 in number of lemmas, 4 in number of types and 7 in number of tokens.

The 10 most frequent X lemmas: а, з, д, и, е, -, в, г, —-, ——-

The 10 most frequent X types: а, з, -, д, в, е, г, —-, ——-, и

The 10 most frequent ambiguous lemmas: а (CCONJ 990, X 21), и (CCONJ 724, PRON 149, X 20, DET 7, PART 3), —- (X 15, SYM 1), ——- (X 15, PROPN 1), о (ADP 46, X 13), —– (X 11, PUNCT 1), …а (X 10, PROPN 1), у (ADP 803, X 9, ADV 1), ка (X 7, PART 3), на (ADP 451, X 7)

The 10 most frequent ambiguous types: а (CCONJ 927, X 21, NUM 1), з (X 21, ADP 12, NUM 2), д (X 20, NUM 3), в (ADP 96, X 17, NUM 3), е (X 17, AUX 3, PRON 2, DET 1, NUM 1, VERB 1), г (X 16, NUM 2), —- (X 15, SYM 1), ——- (X 15, PROPN 1), и (CCONJ 558, X 15, ADP 6, PART 3, PRON 2, NUM 1), ж (X 14, DET 1, PART 1)
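As a rough illustration of how an "ambiguous types" list like the one above could be derived, the sketch below counts, for each word form, how often it occurs with each UPOS tag, and keeps the forms that occur with X and at least one other tag. It assumes plain CoNLL-U input; the function name `ambiguous_forms`, the helper `row`, and the toy data are hypothetical, and the case handling and tie-breaking of the actual statistics script are not known from this page.

```python
from collections import Counter, defaultdict

def row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U line (hypothetical helper for toy data)."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])

def ambiguous_forms(conllu_text, upos="X", top=10):
    """Return up to `top` (form, Counter of tags) pairs for forms tagged
    `upos` and at least one other tag, sorted by their `upos` count."""
    tags_by_form = defaultdict(Counter)
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep basic tokens only: skip multiword ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        tags_by_form[cols[1]][cols[3]] += 1
    hits = [(form, tags) for form, tags in tags_by_form.items()
            if upos in tags and len(tags) > 1]
    hits.sort(key=lambda item: -item[1][upos])  # assumption: rank by X count
    return hits[:top]

# Toy data, not taken from the treebank.
SAMPLE = "\n".join([
    row(1, "а", "а", "CCONJ", 2, "cc"),
    row(2, "з", "з", "X", 0, "root"),
    row(3, "а", "а", "CCONJ", 4, "cc"),
    row(4, "з", "з", "X", 2, "conj"),
    row(5, "а", "а", "X", 2, "dep"),
    row(6, "з", "з", "ADP", 7, "case"),
    row(7, "д", "д", "X", 2, "dep"),
])

for form, tags in ambiguous_forms(SAMPLE):
    print(form, dict(tags))
```

In the toy data, "д" is only ever tagged X, so it is not reported as ambiguous.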

Morphology

The form / lemma ratio of X is 1.031674 (the average of all parts of speech is 2.410435).
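The reported ratio is consistent with dividing the number of X types by the number of X lemmas given earlier on this page (912 / 884 ≈ 1.031674). A minimal sketch of that computation over CoNLL-U text follows; the function name `form_lemma_ratio`, the helper `row`, and the toy data are hypothetical, and treating forms case-sensitively is an assumption rather than something this page states.

```python
def row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U line (hypothetical helper for toy data)."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])

def form_lemma_ratio(conllu_text, upos="X"):
    """Return (distinct forms, distinct lemmas, forms / lemmas) for one UPOS tag."""
    forms, lemmas = set(), set()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep basic tokens only: skip multiword ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            forms.add(cols[1])   # assumption: forms compared case-sensitively
            lemmas.add(cols[2])
    ratio = len(forms) / len(lemmas) if lemmas else 0.0
    return len(forms), len(lemmas), ratio

# Toy data, not taken from the treebank.
SAMPLE = "\n".join([
    row(1, "на", "на", "X", 0, "root"),
    row(2, "…на", "на", "X", 1, "conj"),
    row(3, "и", "и", "X", 1, "conj"),
    row(4, "на", "на", "ADP", 1, "dep"),
])

print(form_lemma_ratio(SAMPLE))   # counts for the toy data
print(round(912 / 884, 6))        # the figures reported on this page
```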

The highest number of forms (5) was observed with the lemma “_”: (е), ес[о]-…, же, ьзано, ѡ.

The 2nd highest number of forms (5) was observed with the lemma “и”: [и, {и}, и, ӏ, …и.

The 3rd highest number of forms (4) was observed with the lemma “на”: [н]а, на, …на, …на.

X does not occur with any features.

Relations

X nodes are attached to their parents using 28 different relations: dep (671; 42% instances), conj (538; 34% instances), root (242; 15% instances), obl (18; 1% instances), nmod (14; 1% instances), flat (13; 1% instances), orphan (13; 1% instances), advcl (11; 1% instances), nsubj (9; 1% instances), reparandum (8; 1% instances), flat:name (7; 0% instances), obj (6; 0% instances), list (5; 0% instances), mark (4; 0% instances), parataxis (4; 0% instances), appos (2; 0% instances), cc (2; 0% instances), dislocated (2; 0% instances), goeswith (2; 0% instances), iobj (2; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), amod (1; 0% instances), case (1; 0% instances), ccomp (1; 0% instances), nsubj:pass (1; 0% instances), vocative (1; 0% instances), xcomp (1; 0% instances)

Parents of X nodes belong to 13 different parts of speech (counting root nodes, which have no parent, as a separate category): X (619; 39% instances), VERB (332; 21% instances), no parent, i.e. root (242; 15% instances), NOUN (198; 13% instances), PROPN (104; 7% instances), PRON (25; 2% instances), NUM (23; 1% instances), ADJ (14; 1% instances), DET (8; 1% instances), ADP (6; 0% instances), ADV (5; 0% instances), PART (3; 0% instances), CCONJ (2; 0% instances)
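The relation and parent-tag counts above could be tallied from CoNLL-U data along the following lines. This is a sketch under stated assumptions, not the script that produced this page: `read_sentences`, `attachment_stats`, `row`, and the toy data are hypothetical names and values, and nodes with head 0 are counted under a made-up "(root)" label.

```python
from collections import Counter

def row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U line (hypothetical helper for toy data)."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])

def read_sentences(conllu_text):
    """Yield each sentence as a list of 10-column rows (basic tokens only)."""
    rows = []
    for line in conllu_text.splitlines():
        if not line.strip():          # a blank line ends the sentence
            if rows:
                yield rows
                rows = []
            continue
        if line.startswith("#"):      # sentence-level comment
            continue
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            rows.append(cols)
    if rows:
        yield rows

def attachment_stats(conllu_text, upos="X"):
    """Count incoming relations and parent UPOS tags for nodes tagged `upos`."""
    relations, parent_tags = Counter(), Counter()
    for rows in read_sentences(conllu_text):
        tag_by_id = {r[0]: r[3] for r in rows}
        for r in rows:
            if r[3] != upos:
                continue
            relations[r[7]] += 1
            if r[6] == "0":
                parent_tags["(root)"] += 1   # no parent token
            else:
                parent_tags[tag_by_id.get(r[6], "(unknown)")] += 1
    return relations, parent_tags

# Toy data, not taken from the treebank.
SAMPLE = "\n".join([
    row(1, "а", "а", "X", 0, "root"),
    row(2, "з", "з", "X", 1, "conj"),
    row(3, "д", "д", "X", 1, "conj"),
    row(4, ".", ".", "PUNCT", 1, "punct"),
    "",
    row(1, "дай", "дати", "VERB", 0, "root"),
    row(2, "на", "на", "X", 1, "dep"),
])

relations, parent_tags = attachment_stats(SAMPLE)
print(relations.most_common())
print(parent_tags.most_common())
```

Looking up the parent tag per sentence matters because token IDs restart at 1 in every sentence.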

1136 (72%) X nodes are leaves.

226 (14%) X nodes have one child.

77 (5%) X nodes have two children.

142 (9%) X nodes have three or more children.

The highest child degree of an X node is 76.
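The leaf and child-degree figures above can be reproduced in principle by counting, per sentence, how many tokens name each X node as their head. The sketch below assumes plain CoNLL-U input; `child_degree_stats`, `read_sentences`, `row`, the bucket labels, and the toy data are hypothetical.

```python
from collections import Counter

def row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U line (hypothetical helper for toy data)."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])

def read_sentences(conllu_text):
    """Yield each sentence as a list of 10-column rows (basic tokens only)."""
    rows = []
    for line in conllu_text.splitlines():
        if not line.strip():
            if rows:
                yield rows
                rows = []
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            rows.append(cols)
    if rows:
        yield rows

def child_degree_stats(conllu_text, upos="X"):
    """Return (Counter over buckets "0", "1", "2", "3+", maximum child degree)."""
    buckets, max_degree = Counter(), 0
    for rows in read_sentences(conllu_text):
        n_children = Counter(r[6] for r in rows)   # head ID -> number of children
        for r in rows:
            if r[3] != upos:
                continue
            degree = n_children[r[0]]
            buckets[str(degree) if degree < 3 else "3+"] += 1
            max_degree = max(max_degree, degree)
    return buckets, max_degree

# Toy data, not taken from the treebank.
SAMPLE = "\n".join([
    row(1, "а", "а", "X", 0, "root"),
    row(2, "з", "з", "X", 1, "conj"),
    row(3, "д", "д", "X", 1, "conj"),
    row(4, ".", ".", "PUNCT", 1, "punct"),
    "",
    row(1, "дай", "дати", "VERB", 0, "root"),
    row(2, "на", "на", "X", 1, "dep"),
])

buckets, max_degree = child_degree_stats(SAMPLE)
print(dict(buckets), max_degree)
```

As a consistency check on the reported figures, the four buckets on this page (1136 + 226 + 77 + 142) add up to the 1581 X tokens.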

Children of X nodes are attached using 31 different relations: conj (553; 40% instances), punct (258; 19% instances), dep (158; 11% instances), case (63; 5% instances), cc (61; 4% instances), nsubj (47; 3% instances), obl (30; 2% instances), obj (26; 2% instances), advmod (25; 2% instances), iobj (23; 2% instances), nmod (17; 1% instances), mark (16; 1% instances), nummod:gov (14; 1% instances), orphan (12; 1% instances), advcl (11; 1% instances), flat (11; 1% instances), det (9; 1% instances), vocative (8; 1% instances), aux (7; 1% instances), parataxis (6; 0% instances), dislocated (5; 0% instances), flat:name (5; 0% instances), amod (4; 0% instances), appos (3; 0% instances), cop (3; 0% instances), ccomp (2; 0% instances), acl:relcl (1; 0% instances), csubj (1; 0% instances), expl (1; 0% instances), nsubj:pass (1; 0% instances), nummod (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: X (619; 45% instances), PUNCT (258; 19% instances), NOUN (118; 9% instances), ADP (69; 5% instances), PROPN (67; 5% instances), CCONJ (61; 4% instances), VERB (41; 3% instances), PRON (39; 3% instances), DET (22; 2% instances), NUM (22; 2% instances), PART (22; 2% instances), SCONJ (16; 1% instances), AUX (11; 1% instances), ADV (9; 1% instances), ADJ (5; 0% instances), SYM (3; 0% instances)