Treebank Statistics: UD_Old_East_Slavic-Birchbark: POS Tags: X
There are 884 X lemmas (18%), 919 X types (8%) and 1602 X tokens (6%).
Out of 17 observed tags, the rank of X is: 3 in number of lemmas, 4 in number of types and 7 in number of tokens.
The 10 most frequent X lemmas: _, а, з, д, и, е, -, в, г, —-
The 10 most frequent X types: а, д, з, -, в, —-, г, е, и, ——-
The 10 most frequent ambiguous lemmas: _ (X 26, NUM 3, VERB 3, PROPN 2, PUNCT 2, DET 1), а (CCONJ 1009, X 21), и (CCONJ 743, PRON 153, X 20, DET 7, PART 3), —- (X 15, SYM 1), ——- (X 15, PROPN 1), о (ADP 49, X 13), —– (X 11, PUNCT 1), …а (X 10, PROPN 1), у (ADP 822, X 9, ADV 1), ка (X 7, PART 3)
The 10 most frequent ambiguous types: а (CCONJ 944, X 22, NUM 1), д (X 21, NUM 3), з (X 21, ADP 13, NUM 2), в (ADP 99, X 18, NUM 3), —- (X 17, SYM 1), г (X 17, NUM 2), е (X 17, AUX 3, PRON 2, DET 1, NUM 1, VERB 1), и (CCONJ 571, X 16, ADP 6, PART 3, PRON 2, NUM 1), ——- (X 15, PROPN 1), ж (X 14, DET 1, PART 1)
- а
- д
- з
- в
- —-
- г
- е
- X 17: а б в г д е ж ꙅ з и ї к л м н о п р с т уо ѳ х ѡ ц ч ш щ ъ ѣ ѫ ю ѧ
- AUX 3: оце е тобе н[е] годена а попрова
ди ко моне сестрѹ - PRON 2: иже е уклъдеть да пр
оклѧтъ бѹдеѹть - DET 1: любо же · присли кунꙑ · любо же · а смолови с кꙑмо [·] любо е · седе тите приеха возмете :
- NUM 1: ди а в г д е ѕ
- VERB 1: е гн…
- и
- CCONJ 571: …вич и брат ѥго к ти
(мофѣю) - X 16: а б в г д е ж ꙅ з и ї к л м н о п р с т уо ѳ х ѡ ц ч ш щ ъ ѣ ѫ ю ѧ
- ADP 6: … худо буде а на то у … (дѣ)теи и своихъ хоромо
- PART 3: … (м)[ол]и воньзда шюрина и моега оти вꙑволоци доскь и …
- PRON 2: ѹ хотъсла:ва ми бꙑло гривн възѧти : а творѧть и пеставивъше
- NUM 1: … … [: и ѧ]з[ъ] крале бебрꙑ про дан[ь] и : грн҃ве въ беб[ръ]хъ
- CCONJ 571: …вич и брат ѥго к ти
- ——-
- ж
Morphology
The form / lemma ratio of X is 1.039593 (the average of all parts of speech is 2.421872).
The 1st highest number of forms (24) was observed with the lemma “_”: (е), —-, а, б, в, в…, г, гу…, д, ес[о]-
The 2nd highest number of forms (5) was observed with the lemma “и”: [и, {и}, и, ӏ, …
The 3rd highest number of forms (4) was observed with the lemma “на”: [н]а, на, …
X does not occur with any features.
Relations
X nodes are attached to their parents using 28 different relations: dep (675; 42% instances), conj (542; 34% instances), root (247; 15% instances), obl (18; 1% instances), orphan (15; 1% instances), nmod (14; 1% instances), flat (13; 1% instances), advcl (12; 1% instances), nsubj (10; 1% instances), reparandum (8; 0% instances), flat:name (7; 0% instances), obj (7; 0% instances), parataxis (6; 0% instances), list (5; 0% instances), mark (4; 0% instances), cc (3; 0% instances), appos (2; 0% instances), dislocated (2; 0% instances), goeswith (2; 0% instances), iobj (2; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), amod (1; 0% instances), case (1; 0% instances), ccomp (1; 0% instances), nsubj:pass (1; 0% instances), vocative (1; 0% instances), xcomp (1; 0% instances)
Parents of X nodes belong to 13 different parts of speech: X (627; 39% instances), VERB (337; 21% instances), (247; 15% instances), NOUN (201; 13% instances), PROPN (103; 6% instances), PRON (26; 2% instances), NUM (23; 1% instances), ADJ (14; 1% instances), DET (8; 0% instances), ADP (6; 0% instances), ADV (5; 0% instances), PART (3; 0% instances), CCONJ (2; 0% instances)
1149 (72%) X nodes are leaves.
228 (14%) X nodes have one child.
78 (5%) X nodes have two children.
147 (9%) X nodes have three or more children.
The highest child degree of a X node is 76.
Children of X nodes are attached using 31 different relations: conj (564; 40% instances), punct (260; 18% instances), dep (159; 11% instances), case (64; 5% instances), cc (63; 4% instances), nsubj (45; 3% instances), obl (31; 2% instances), obj (26; 2% instances), advmod (25; 2% instances), iobj (22; 2% instances), orphan (17; 1% instances), mark (16; 1% instances), nmod (16; 1% instances), nummod:gov (15; 1% instances), advcl (12; 1% instances), flat (11; 1% instances), det (9; 1% instances), vocative (8; 1% instances), aux (7; 0% instances), parataxis (7; 0% instances), dislocated (5; 0% instances), flat:name (5; 0% instances), amod (4; 0% instances), appos (4; 0% instances), ccomp (3; 0% instances), cop (3; 0% instances), acl:relcl (1; 0% instances), csubj (1; 0% instances), expl (1; 0% instances), nsubj:pass (1; 0% instances), nummod (1; 0% instances)
Children of X nodes belong to 16 different parts of speech: X (627; 45% instances), PUNCT (260; 18% instances), NOUN (121; 9% instances), ADP (70; 5% instances), PROPN (69; 5% instances), CCONJ (63; 4% instances), VERB (45; 3% instances), PRON (40; 3% instances), NUM (23; 2% instances), DET (22; 2% instances), PART (22; 2% instances), SCONJ (16; 1% instances), AUX (11; 1% instances), ADV (9; 1% instances), ADJ (5; 0% instances), SYM (3; 0% instances)