home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Belarusian: POS Tags: X

There are 18 X lemmas (1%), 18 X types (1%) and 45 X tokens (1%). Out of 16 observed tags, the rank of X is: 12 in number of lemmas, 12 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: °с, aju, daeryook, firm, internatonal, law, telegraf.by, litesound, afp, are

The 10 most frequent X types: °С, Aju, Daeryook, Firm, Internatonal, Law, Telegraf.by, Litesound, AFP, Are

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.397401).

The 1st highest number of forms (1) was observed with the lemma “afp”: AFP.

The 2nd highest number of forms (1) was observed with the lemma “aju”: Aju.

The 3rd highest number of forms (1) was observed with the lemma “are”: Are.

X occurs with 4 features: Animacy (9; 20% instances), Case (9; 20% instances), Gender (9; 20% instances), Number (9; 20% instances)

X occurs with 4 feature-value pairs: Animacy=Anim, Case=Gen, Gender=Masc, Number=Sing

X occurs with 2 feature combinations. The most frequent feature combination is _ (36 tokens). Examples: Aju, Daeryook, Firm, Internatonal, Law, Telegraf.by, Litesound, AFP, Are, Daily

Relations

X nodes are attached to their parents using 9 different relations: flat (25; 56% instances), root (5; 11% instances), appos (3; 7% instances), obj (3; 7% instances), parataxis (3; 7% instances), conj (2; 4% instances), nsubj (2; 4% instances), nmod (1; 2% instances), obl (1; 2% instances)

Parents of X nodes belong to 6 different parts of speech: X (22; 49% instances), NOUN (10; 22% instances), VERB (6; 13% instances), (5; 11% instances), ADJ (1; 2% instances), PROPN (1; 2% instances)

24 (53%) X nodes are leaves.

2 (4%) X nodes have one child.

9 (20%) X nodes have two children.

10 (22%) X nodes have three or more children.

The highest child degree of a X node is 6.

Children of X nodes are attached using 12 different relations: flat (25; 38% instances), punct (17; 26% instances), nummod:gov (6; 9% instances), parataxis (5; 8% instances), case (3; 5% instances), nummod (3; 5% instances), cc (2; 3% instances), advmod (1; 2% instances), advmod:discourse (1; 2% instances), conj (1; 2% instances), cop (1; 2% instances), nsubj (1; 2% instances)

Children of X nodes belong to 11 different parts of speech: X (22; 33% instances), PUNCT (17; 26% instances), NUM (9; 14% instances), SYM (4; 6% instances), VERB (4; 6% instances), ADP (3; 5% instances), CCONJ (2; 3% instances), NOUN (2; 3% instances), ADV (1; 2% instances), AUX (1; 2% instances), PART (1; 2% instances)