home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Birchbark: POS Tags: X

There are 884 X lemmas (18%), 919 X types (8%) and 1602 X tokens (6%). Out of 17 observed tags, the rank of X is: 3 in number of lemmas, 4 in number of types and 7 in number of tokens.

The 10 most frequent X lemmas: _, а, з, д, и, е, -, в, г, —-

The 10 most frequent X types: а, д, з, -, в, —-, г, е, и, ——-

The 10 most frequent ambiguous lemmas: _ (X 26, NUM 3, VERB 3, PROPN 2, PUNCT 2, DET 1), а (CCONJ 1009, X 21), и (CCONJ 743, PRON 153, X 20, DET 7, PART 3), —- (X 15, SYM 1), ——- (X 15, PROPN 1), о (ADP 49, X 13), —– (X 11, PUNCT 1), …а (X 10, PROPN 1), у (ADP 822, X 9, ADV 1), ка (X 7, PART 3)

The 10 most frequent ambiguous types: а (CCONJ 944, X 22, NUM 1), д (X 21, NUM 3), з (X 21, ADP 13, NUM 2), в (ADP 99, X 18, NUM 3), —- (X 17, SYM 1), г (X 17, NUM 2), е (X 17, AUX 3, PRON 2, DET 1, NUM 1, VERB 1), и (CCONJ 571, X 16, ADP 6, PART 3, PRON 2, NUM 1), ——- (X 15, PROPN 1), ж (X 14, DET 1, PART 1)

Morphology

The form / lemma ratio of X is 1.039593 (the average of all parts of speech is 2.421872).

The 1st highest number of forms (24) was observed with the lemma “_”: (е), —-, а, б, в, в…, г, гу…, д, ес[о]-…, же, и, ро…, си…, ьзано, ѡ, ѡ…, …не, …[ѡмъсл]…, …[ѹха]…, …и, …мене, …о, ꙗ….

The 2nd highest number of forms (5) was observed with the lemma “и”: [и, {и}, и, ӏ, …и.

The 3rd highest number of forms (4) was observed with the lemma “на”: [н]а, на, …на, …на.

X does not occur with any features.

Relations

X nodes are attached to their parents using 28 different relations: dep (675; 42% instances), conj (542; 34% instances), root (247; 15% instances), obl (18; 1% instances), orphan (15; 1% instances), nmod (14; 1% instances), flat (13; 1% instances), advcl (12; 1% instances), nsubj (10; 1% instances), reparandum (8; 0% instances), flat:name (7; 0% instances), obj (7; 0% instances), parataxis (6; 0% instances), list (5; 0% instances), mark (4; 0% instances), cc (3; 0% instances), appos (2; 0% instances), dislocated (2; 0% instances), goeswith (2; 0% instances), iobj (2; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), amod (1; 0% instances), case (1; 0% instances), ccomp (1; 0% instances), nsubj:pass (1; 0% instances), vocative (1; 0% instances), xcomp (1; 0% instances)

Parents of X nodes belong to 13 different parts of speech: X (627; 39% instances), VERB (337; 21% instances), (247; 15% instances), NOUN (201; 13% instances), PROPN (103; 6% instances), PRON (26; 2% instances), NUM (23; 1% instances), ADJ (14; 1% instances), DET (8; 0% instances), ADP (6; 0% instances), ADV (5; 0% instances), PART (3; 0% instances), CCONJ (2; 0% instances)

1149 (72%) X nodes are leaves.

228 (14%) X nodes have one child.

78 (5%) X nodes have two children.

147 (9%) X nodes have three or more children.

The highest child degree of a X node is 76.

Children of X nodes are attached using 31 different relations: conj (564; 40% instances), punct (260; 18% instances), dep (159; 11% instances), case (64; 5% instances), cc (63; 4% instances), nsubj (45; 3% instances), obl (31; 2% instances), obj (26; 2% instances), advmod (25; 2% instances), iobj (22; 2% instances), orphan (17; 1% instances), mark (16; 1% instances), nmod (16; 1% instances), nummod:gov (15; 1% instances), advcl (12; 1% instances), flat (11; 1% instances), det (9; 1% instances), vocative (8; 1% instances), aux (7; 0% instances), parataxis (7; 0% instances), dislocated (5; 0% instances), flat:name (5; 0% instances), amod (4; 0% instances), appos (4; 0% instances), ccomp (3; 0% instances), cop (3; 0% instances), acl:relcl (1; 0% instances), csubj (1; 0% instances), expl (1; 0% instances), nsubj:pass (1; 0% instances), nummod (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: X (627; 45% instances), PUNCT (260; 18% instances), NOUN (121; 9% instances), ADP (70; 5% instances), PROPN (69; 5% instances), CCONJ (63; 4% instances), VERB (45; 3% instances), PRON (40; 3% instances), NUM (23; 2% instances), DET (22; 2% instances), PART (22; 2% instances), SCONJ (16; 1% instances), AUX (11; 1% instances), ADV (9; 1% instances), ADJ (5; 0% instances), SYM (3; 0% instances)