home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_German-PUD: POS Tags: PUNCT

There are 1 PUNCT lemmas (5%), 14 PUNCT types (0%) and 2767 PUNCT tokens (13%). Out of 16 observed tags, the rank of PUNCT is: 12 in number of lemmas, 14 in number of types and 3 in number of tokens.

The 10 most frequent PUNCT lemmas: _

The 10 most frequent PUNCT types: ,, ., -, „, “, (, ), :, –, ;

The 10 most frequent ambiguous lemmas: _ (NOUN 4261, PUNCT 2767, DET 2515, VERB 1913, ADP 1715, ADJ 1387, PROPN 1219, PRON 1185, ADV 1139, AUX 950, CCONJ 743, NUM 352, SCONJ 326, PART 144, X 31, SYM 22)

The 10 most frequent ambiguous types: - (PUNCT 178, NUM 2, CCONJ 1)

Morphology

The form / lemma ratio of PUNCT is 14.000000 (the average of all parts of speech is 307.454545).

The 1st highest number of forms (14) was observed with the lemma “_”: ’, (, ), ,, -, ., /, :, ;, ?, –, “, „, ….

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 1 different relations: punct (2767; 100% instances)

Parents of PUNCT nodes belong to 14 different parts of speech: VERB (1675; 61% instances), NOUN (586; 21% instances), ADJ (220; 8% instances), PROPN (177; 6% instances), NUM (26; 1% instances), X (24; 1% instances), ADV (19; 1% instances), AUX (13; 0% instances), DET (8; 0% instances), PRON (8; 0% instances), ADP (5; 0% instances), CCONJ (2; 0% instances), SCONJ (2; 0% instances), SYM (2; 0% instances)

2767 (100%) PUNCT nodes are leaves.

The highest child degree of a PUNCT node is 0.