Treebank Statistics: UD_Belarusian: POS Tags: X
There are 18 X
lemmas (1%), 18 X
types (1%) and 45 X
tokens (1%).
Out of 16 observed tags, the rank of X
is: 12 in number of lemmas, 12 in number of types and 15 in number of tokens.
The 10 most frequent X
lemmas: °с, aju, daeryook, firm, internatonal, law, telegraf.by, litesound, afp, are
The 10 most frequent X
types: °С, Aju, Daeryook, Firm, Internatonal, Law, Telegraf.by, Litesound, AFP, Are
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of X
is 1.000000 (the average of all parts of speech is 1.397401).
The 1st highest number of forms (1) was observed with the lemma “afp”: AFP.
The 2nd highest number of forms (1) was observed with the lemma “aju”: Aju.
The 3rd highest number of forms (1) was observed with the lemma “are”: Are.
X
occurs with 4 features: Animacy (9; 20% instances), Case (9; 20% instances), Gender (9; 20% instances), Number (9; 20% instances)
X
occurs with 4 feature-value pairs: Animacy=Anim
, Case=Gen
, Gender=Masc
, Number=Sing
X
occurs with 2 feature combinations.
The most frequent feature combination is _
(36 tokens).
Examples: Aju, Daeryook, Firm, Internatonal, Law, Telegraf.by, Litesound, AFP, Are, Daily
Relations
X
nodes are attached to their parents using 9 different relations: flat (25; 56% instances), root (5; 11% instances), appos (3; 7% instances), obj (3; 7% instances), parataxis (3; 7% instances), conj (2; 4% instances), nsubj (2; 4% instances), nmod (1; 2% instances), obl (1; 2% instances)
Parents of X
nodes belong to 6 different parts of speech: X (22; 49% instances), NOUN (10; 22% instances), VERB (6; 13% instances), (5; 11% instances), ADJ (1; 2% instances), PROPN (1; 2% instances)
24 (53%) X
nodes are leaves.
2 (4%) X
nodes have one child.
9 (20%) X
nodes have two children.
10 (22%) X
nodes have three or more children.
The highest child degree of a X
node is 6.
Children of X
nodes are attached using 12 different relations: flat (25; 38% instances), punct (17; 26% instances), nummod:gov (6; 9% instances), parataxis (5; 8% instances), case (3; 5% instances), nummod (3; 5% instances), cc (2; 3% instances), advmod (1; 2% instances), advmod:discourse (1; 2% instances), conj (1; 2% instances), cop (1; 2% instances), nsubj (1; 2% instances)
Children of X
nodes belong to 11 different parts of speech: X (22; 33% instances), PUNCT (17; 26% instances), NUM (9; 14% instances), SYM (4; 6% instances), VERB (4; 6% instances), ADP (3; 5% instances), CCONJ (2; 3% instances), NOUN (2; 3% instances), ADV (1; 2% instances), AUX (1; 2% instances), PART (1; 2% instances)