Treebank Statistics: UD_Ukrainian-ParlaMint: POS Tags: X
There are 32 X
lemmas (1%), 58 X
types (1%) and 81 X
tokens (0%).
Out of 16 observed tags, the rank of X
is: 10 in number of lemmas, 9 in number of types and 15 in number of tokens.
The 10 most frequent X
lemmas: -, –, COVID, hoc, ad, from, you, бацька, поехать, .
The 10 most frequent X
types: COVID, hoc, на, русском, ad, from, you, бацька, будете, вы
The 10 most frequent ambiguous lemmas: - (PUNCT 118, X 23), – (PUNCT 166, X 14), . (PUNCT 3022, X 1)
The 10 most frequent ambiguous types: на (ADP 587, X 3), будете (AUX 7, X 2), за (ADP 272, X 2, ADV 1), я (PRON 267, X 2), . (PUNCT 3022, X 1), не (PART 519, X 1)
- на
- будете
- за
- я
- .
- не
Morphology
The form / lemma ratio of X
is 1.812500 (the average of all parts of speech is 1.786380).
The 1st highest number of forms (15) was observed with the lemma “-”: будете, вы, единую, за, извиняюсь, молчать, на, не, разговаривать, разговариваю, русском, страну, что, я, языке.
The 2nd highest number of forms (14) was observed with the lemma “–”: Бацька, И, Лукашэнка, дапамогу, дзякуй, за, косами, мертвые, прэзідэнт, с, сапраўдны, стоят, тишина, только.
The 3rd highest number of forms (2) was observed with the lemma “бацька”: бацька, бацькаю.
X
occurs with 1 features: Foreign (80; 99% instances)
X
occurs with 1 feature-value pairs: Foreign=Yes
X
occurs with 2 feature combinations.
The most frequent feature combination is Foreign=Yes
(80 tokens).
Examples: COVID, hoc, на, русском, ad, from, you, бацька, будете, вы
Relations
X
nodes are attached to their parents using 12 different relations: flat:foreign (46; 57% instances), conj (7; 9% instances), nmod (7; 9% instances), appos (6; 7% instances), parataxis (4; 5% instances), amod (3; 4% instances), ccomp (2; 2% instances), obl (2; 2% instances), acl (1; 1% instances), advmod (1; 1% instances), fixed (1; 1% instances), obj (1; 1% instances)
Parents of X
nodes belong to 5 different parts of speech: X (54; 67% instances), NOUN (13; 16% instances), VERB (11; 14% instances), PROPN (2; 2% instances), PRON (1; 1% instances)
58 (72%) X
nodes are leaves.
11 (14%) X
nodes have one child.
0 (0%) X
nodes have two children.
12 (15%) X
nodes have three or more children.
The highest child degree of a X
node is 21.
Children of X
nodes are attached using 15 different relations: flat:foreign (46; 44% instances), punct (29; 28% instances), conj (8; 8% instances), nsubj (4; 4% instances), case (3; 3% instances), parataxis (3; 3% instances), advmod:neg (2; 2% instances), mark (2; 2% instances), acl:relcl (1; 1% instances), advcl (1; 1% instances), appos (1; 1% instances), cc (1; 1% instances), det (1; 1% instances), discourse (1; 1% instances), fixed (1; 1% instances)
Children of X
nodes belong to 13 different parts of speech: X (54; 52% instances), PUNCT (29; 28% instances), ADP (3; 3% instances), NOUN (3; 3% instances), PART (3; 3% instances), ADV (2; 2% instances), NUM (2; 2% instances), PRON (2; 2% instances), SCONJ (2; 2% instances), ADJ (1; 1% instances), CCONJ (1; 1% instances), DET (1; 1% instances), VERB (1; 1% instances)