home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Upper_Sorbian-UFAL: POS Tags: X

There are 166 X lemmas (5%), 166 X types (4%) and 199 X tokens (2%). Out of 16 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 13 in number of tokens.

The 10 most frequent X lemmas: a, i, vitis, al, backen, o, apg, au, b, c

The 10 most frequent X types: a, i, Vitis, al, backen, o, APG, H, au, b

The 10 most frequent ambiguous lemmas: a (CCONJ 337, X 5), c (X 2, PROPN 1), dr (X 2, NOUN 1), mj (VERB 2, X 2), und (X 2, PROPN 1), center (NOUN 1, X 1), d (ADJ 1, X 1), et (CCONJ 2, X 1), institut (NOUN 16, X 1), k (ADP 48, X 1)

The 10 most frequent ambiguous types: a (CCONJ 337, X 4), H (X 2, PROPN 1), dr (X 2, NOUN 1), mj (VERB 2, X 2, ADJ 1), n (DET 37, ADP 2, X 2), und (X 2, PROPN 1), INSTITUT (NOUN 1, X 1), et (CCONJ 2, X 1), k (ADP 41, X 1), m (NOUN 2, X 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.418889).

The 1st highest number of forms (1) was observed with the lemma “100px”: 100px.

The 2nd highest number of forms (1) was observed with the lemma “100x200px”: 100x200px.

The 3rd highest number of forms (1) was observed with the lemma “a”: a.

X occurs with 1 features: Abbr (8; 4% instances)

X occurs with 1 feature-value pairs: Abbr=Yes

X occurs with 2 feature combinations. The most frequent feature combination is _ (191 tokens). Examples: a, i, Vitis, al, backen, o, H, au, b, c

Relations

X nodes are attached to their parents using 17 different relations: conj (48; 24% instances), flat (46; 23% instances), nmod (44; 22% instances), appos (22; 11% instances), nsubj (7; 4% instances), parataxis (6; 3% instances), dep (4; 2% instances), list (4; 2% instances), compound (3; 2% instances), fixed (3; 2% instances), obl (3; 2% instances), advmod:emph (2; 1% instances), flat:foreign (2; 1% instances), obj (2; 1% instances), amod (1; 1% instances), dep:alt (1; 1% instances), root (1; 1% instances)

Parents of X nodes belong to 7 different parts of speech: X (94; 47% instances), NOUN (75; 38% instances), VERB (15; 8% instances), PROPN (9; 5% instances), ADV (3; 2% instances), ADJ (2; 1% instances), (1; 1% instances)

74 (37%) X nodes are leaves.

61 (31%) X nodes have one child.

26 (13%) X nodes have two children.

38 (19%) X nodes have three or more children.

The highest child degree of a X node is 15.

Children of X nodes are attached using 14 different relations: punct (122; 42% instances), flat (46; 16% instances), conj (42; 14% instances), appos (23; 8% instances), advmod (13; 4% instances), cc (13; 4% instances), case (8; 3% instances), amod (5; 2% instances), advmod:emph (4; 1% instances), nummod (4; 1% instances), fixed (3; 1% instances), list (3; 1% instances), nmod (3; 1% instances), dep (1; 0% instances)

Children of X nodes belong to 11 different parts of speech: PUNCT (122; 42% instances), X (94; 32% instances), ADV (16; 6% instances), CCONJ (13; 4% instances), NOUN (13; 4% instances), ADP (11; 4% instances), ADJ (7; 2% instances), NUM (5; 2% instances), PROPN (5; 2% instances), VERB (3; 1% instances), SCONJ (1; 0% instances)