
This page pertains to UD version 2.

Treebank Statistics: UD_Estonian-EDT: POS Tags: X

There are 222 X lemmas (1%), 223 X types (0%) and 349 X tokens (0%). Out of 16 observed tags, the rank of X is: 7 in number of lemmas, 9 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: of, in, drive, key, the, International, pruritus, tõ, you, World

The 10 most frequent X types: of, in, drive, key, the, International, pruritus, tõ, you, World

The 10 most frequent ambiguous lemmas: the (NOUN 6, X 3), International (PROPN 5, X 4), pruritus (X 4, NOUN 1), World (PROPN 5, X 3), and (NOUN 6, X 3, CCONJ 1), de (PROPN 23, X 3), et (SCONJ 3569, X 3), for (NOUN 4, X 3), i (NOUN 3, SYM 3, X 2), Geophysical (PROPN 2, X 2)

The 10 most frequent ambiguous types: the (NOUN 6, X 3), International (PROPN 4, X 4), World (PROPN 3, X 3), and (NOUN 6, X 3, CCONJ 1), de (PROPN 23, X 3), et (SCONJ 3418, X 3), for (NOUN 4, X 3), ‘i (X 2, SYM 1), Geophysical (PROPN 2, X 2), System (X 2, PROPN 1)

Morphology

The form / lemma ratio of X is 1.004505 (the average of all parts of speech is 1.912184).
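The reported ratio can be reproduced from the counts given earlier on this page. The sketch below assumes the ratio is simply the number of distinct X types (223) divided by the number of distinct X lemmas (222). That reading matches the published figure, but the exact counting method used to generate the page is not stated here. The function name is illustrative.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma (assumed definition)."""
    return n_types / n_lemmas

# Counts for the X tag as reported on this page.
ratio = form_lemma_ratio(n_types=223, n_lemmas=222)
print(f"{ratio:.6f}")  # 1.004505
```

A ratio this close to 1 means that almost every X lemma appears in a single form, which is expected for foreign words and abbreviations that are not inflected.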

The 1st highest number of forms (2) was observed with the lemma “al”: al, al..

The 2nd highest number of forms (2) was observed with the lemma “is”: ‘is, is.

The 3rd highest number of forms (1) was observed with the lemma “9qh+”: 9qh+.

X occurs with 4 features: Abbr (71; 20% instances), Foreign (64; 18% instances), Case (11; 3% instances), Number (11; 3% instances)

X occurs with 5 feature-value pairs: Abbr=Yes, Case=Gen, Case=Nom, Foreign=Yes, Number=Sing

X occurs with 5 feature combinations. The most frequent feature combination is _ (203 tokens). Examples: drive, key, the, International, World, data, de, drug, in, packet

Relations

X nodes are attached to their parents using 13 different relations: flat:foreign (141; 40% instances), flat (83; 24% instances), appos (39; 11% instances), conj (23; 7% instances), root (21; 6% instances), parataxis (11; 3% instances), obl (9; 3% instances), nmod (8; 2% instances), goeswith (5; 1% instances), advcl (3; 1% instances), advmod (2; 1% instances), nsubj (2; 1% instances), nsubj:cop (2; 1% instances)
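As a consistency check, the relation counts above can be summed and converted back into percentages. The sketch below assumes that each percentage is the count divided by the total number of X tokens, rounded to the nearest whole percent. The variable names are illustrative.

```python
# Incoming relation counts for X nodes, copied from the list above.
relation_counts = {
    "flat:foreign": 141, "flat": 83, "appos": 39, "conj": 23, "root": 21,
    "parataxis": 11, "obl": 9, "nmod": 8, "goeswith": 5, "advcl": 3,
    "advmod": 2, "nsubj": 2, "nsubj:cop": 2,
}

# Every X token has exactly one incoming relation, so this is the X token count.
total = sum(relation_counts.values())

# Share of each relation, rounded to a whole percent (assumed rounding rule).
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

print(total)                   # 349
print(shares["flat:foreign"])  # 40
print(shares["flat"])          # 24
```

The total of 349 agrees with the X token count reported at the top of the page.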

Parents of X nodes belong to 7 different parts of speech: X (156; 45% instances), NOUN (82; 23% instances), PROPN (64; 18% instances), (21; 6% instances), VERB (19; 5% instances), INTJ (4; 1% instances), NUM (3; 1% instances)

The entry without a part-of-speech label (21; 6% instances) matches the 21 X nodes attached with the root relation, which have no parent word.

233 (67%) X nodes are leaves.

21 (6%) X nodes have one child.

25 (7%) X nodes have two children.

70 (20%) X nodes have three or more children.

The highest child degree of an X node is 9.
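The child-degree figures above (leaves, one child, two children, three or more) could be derived from a CoNLL-U file roughly as follows. This is a minimal sketch, not the script that generated this page: the function name, the bucket labels, and the toy sentence are hypothetical. It relies only on the standard CoNLL-U column order, where column 1 is ID, column 4 is UPOS, and column 7 is HEAD.

```python
from collections import Counter

def child_degree_buckets(conllu_text, upos="X"):
    """Bucket nodes tagged `upos` by their number of direct dependents."""
    buckets = Counter()
    for block in conllu_text.strip().split("\n\n"):  # one block per sentence
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep plain word lines; skip multiword ranges ("1-2") and empty nodes ("1.1").
        rows = [r for r in rows if r[0].isdigit()]
        # Index 6 is HEAD, so counting its values gives dependents per node ID.
        dependents = Counter(r[6] for r in rows)
        for r in rows:
            if r[3] != upos:  # index 3 is UPOS
                continue
            degree = dependents[r[0]]
            if degree == 0:
                buckets["leaf"] += 1
            elif degree == 1:
                buckets["one"] += 1
            elif degree == 2:
                buckets["two"] += 1
            else:
                buckets["three_or_more"] += 1
    return buckets

# Hypothetical toy sentence for illustration; not taken from the treebank.
toy_rows = [
    ("1", "World", "World", "X", "_", "_", "0", "root", "_", "_"),
    ("2", "Health", "Health", "X", "_", "_", "1", "flat:foreign", "_", "_"),
    ("3", "Organization", "Organization", "X", "_", "_", "1", "flat:foreign", "_", "_"),
    ("4", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"),
]
toy = "\n".join("\t".join(row) for row in toy_rows)

buckets = child_degree_buckets(toy)
print(dict(buckets))
```

In the toy sentence the first X node heads two flat:foreign dependents and the punctuation mark, so it falls into the "three or more" bucket, while the other two X nodes are leaves. On this page the four buckets (233, 21, 25 and 70) add up to the 349 X tokens.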

Children of X nodes are attached using 15 different relations: punct (189; 49% instances), flat:foreign (129; 34% instances), flat (35; 9% instances), conj (8; 2% instances), cc (5; 1% instances), nummod (4; 1% instances), appos (3; 1% instances), advmod (2; 1% instances), cop (2; 1% instances), acl (1; 0% instances), advcl (1; 0% instances), cc:preconj (1; 0% instances), csubj:cop (1; 0% instances), nmod (1; 0% instances), nsubj:cop (1; 0% instances)

Children of X nodes belong to 10 different parts of speech: PUNCT (189; 49% instances), X (156; 41% instances), NOUN (13; 3% instances), ADV (5; 1% instances), CCONJ (5; 1% instances), PROPN (5; 1% instances), NUM (4; 1% instances), VERB (3; 1% instances), AUX (2; 1% instances), ADJ (1; 0% instances)