home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_German-LIT: POS Tags: X

There are 40 X lemmas (1%), 42 X types (1%) and 57 X tokens (0%). Out of 15 observed tags, the rank of X is: 7 in number of lemmas, 10 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: –, usw, z.b., –, 3), d, d.h., «, 1), 10)

The 10 most frequent X types: –, usw, 3), d, d.h., z.B., «, 1), 10), 2)

The 10 most frequent ambiguous lemmas: (X 9, PUNCT 8), z.b. (X 2, ADV 1), (ADP 12, PROPN 2, X 2, NOUN 1), « (PUNCT 36, NUM 2, X 2, ADP 1, PROPN 1), 1) (ADP 1, X 1), 2) (ADP 1, X 1), Jahr (NOUN 2, X 1), Mittel (NOUN 18, X 1), Recht (NOUN 14, ADV 2, X 1), Sinn (NOUN 91, PROPN 1, X 1)

The 10 most frequent ambiguous types: (X 9, PUNCT 8), z.B. (X 2, ADV 1, CCONJ 1), « (PUNCT 36, NUM 2, X 2, ADP 1, PROPN 1), 1) (ADP 1, X 1), 2) (ADP 1, X 1), Ehe (NOUN 6, X 1), Mittel (NOUN 16, X 1), Sinn (NOUN 65, PROPN 1, X 1), andre (DET 34, NOUN 1, X 1), daran (ADV 5, X 1)

Morphology

The form / lemma ratio of X is 1.050000 (the average of all parts of speech is 1.310429).

The 1st highest number of forms (2) was observed with the lemma “–”: en, h.

The 2nd highest number of forms (1) was observed with the lemma “1)”: 1).

The 3rd highest number of forms (1) was observed with the lemma “10)”: 10).

X does not occur with any features.

Relations

X nodes are attached to their parents using 14 different relations: dep (24; 42% instances), case (7; 12% instances), flat (4; 7% instances), cc (3; 5% instances), conj (3; 5% instances), nmod (3; 5% instances), nsubj (3; 5% instances), amod (2; 4% instances), obj (2; 4% instances), root (2; 4% instances), acl (1; 2% instances), appos (1; 2% instances), orphan (1; 2% instances), xcomp (1; 2% instances)

Parents of X nodes belong to 8 different parts of speech: VERB (24; 42% instances), NOUN (20; 35% instances), X (4; 7% instances), ADJ (2; 4% instances), ADV (2; 4% instances), AUX (2; 4% instances), (2; 4% instances), PRON (1; 2% instances)

44 (77%) X nodes are leaves.

5 (9%) X nodes have one child.

4 (7%) X nodes have two children.

4 (7%) X nodes have three or more children.

The highest child degree of a X node is 7.

Children of X nodes are attached using 12 different relations: punct (6; 20% instances), advmod (5; 17% instances), det (3; 10% instances), flat (3; 10% instances), cc (2; 7% instances), compound (2; 7% instances), nmod (2; 7% instances), nsubj (2; 7% instances), obl (2; 7% instances), aux (1; 3% instances), conj (1; 3% instances), cop (1; 3% instances)

Children of X nodes belong to 7 different parts of speech: NOUN (7; 23% instances), PUNCT (6; 20% instances), ADV (4; 13% instances), DET (4; 13% instances), X (4; 13% instances), CCONJ (3; 10% instances), AUX (2; 7% instances)