Treebank Statistics: UD_German-HDT: POS Tags: PUNCT
There are 17 PUNCT lemmas (0%), 17 PUNCT types (0%) and 396270 PUNCT tokens (11%).
Out of 16 observed tags, the rank of PUNCT is: 13 in number of lemmas, 14 in number of types and 3 in number of tokens.
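The lemma, type and token counts above can be reproduced from a CoNLL-U file with a short script. The following is a minimal sketch, not the script that generated this page. The function name `pos_counts` and the hand-made sample sentence are illustrative assumptions, and it assumes that "types" means distinct word forms compared case-sensitively.

```python
def pos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, forms, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence breaks) and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword token ranges ("1-2") and empty nodes ("1.1").
        if not cols[0].isdigit():
            continue
        # CoNLL-U columns: ID, FORM, LEMMA, UPOS, ...
        if cols[3] == upos:
            tokens += 1
            forms.add(cols[1])
            lemmas.add(cols[2])
    return len(lemmas), len(forms), tokens


# Hand-made sample sentence (not taken from the treebank).
SAMPLE = "\n".join([
    "# text = Ja, ja.",
    "1\tJa\tja\tINTJ\t_\t_\t0\troot\t_\t_",
    "2\t,\t,\tPUNCT\t_\t_\t3\tpunct\t_\t_",
    "3\tja\tja\tINTJ\t_\t_\t1\tconj\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_",
])

print(pos_counts(SAMPLE, "PUNCT"))
```

Ranking the tags then amounts to computing these three numbers for every observed tag and sorting by each of them in turn.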
The 10 most frequent PUNCT lemmas: ., ,, “, :, (, ), -, ;, ?, !
The 10 most frequent PUNCT types: ., ,, “, :, (, ), -, ;, ?, !
The 10 most frequent ambiguous lemmas: . (PUNCT 156600, X 1), ” (PUNCT 45969, X 2), - (PUNCT 7124, VERB 689), ? (PUNCT 1286, X 1), ! (PUNCT 278, X 1)
The 10 most frequent ambiguous types: . (PUNCT 156600, X 1), ” (PUNCT 45969, X 2), ? (PUNCT 1286, X 1), ! (PUNCT 278, X 1)
Morphology
The form / lemma ratio of PUNCT is 1.000000 (the average of all parts of speech is 2.529726).
The 1st highest number of forms (1) was observed with the lemma “!”: !.
The 2nd highest number of forms (1) was observed with the lemma “!!”: !!.
The 3rd highest number of forms (1) was observed with the lemma “!!!”: !!!.
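As a rough illustration of the morphology figures, the sketch below groups word forms by lemma. It assumes the form / lemma ratio is the number of distinct forms divided by the number of distinct lemmas, which is consistent with the 17 types and 17 lemmas reported for PUNCT (17 / 17 = 1.0). The function name `form_lemma_stats` and the sample pairs are hypothetical, and ties are broken alphabetically by lemma as an assumption.

```python
from collections import defaultdict


def form_lemma_stats(pairs):
    """pairs: iterable of (form, lemma). Returns (ratio, lemmas ranked by number of forms)."""
    pairs = list(pairs)
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    distinct_forms = {form for form, _ in pairs}
    ratio = len(distinct_forms) / len(forms_by_lemma)
    # Most forms first; ties broken alphabetically by lemma (an assumption).
    ranked = sorted(forms_by_lemma.items(), key=lambda kv: (-len(kv[1]), kv[0]))
    return ratio, ranked


# Punctuation: every lemma has exactly one form, so the ratio is 1.0.
punct_ratio, punct_ranked = form_lemma_stats(
    [("!", "!"), ("!!", "!!"), ("!", "!"), (".", ".")]
)
print(punct_ratio, [lemma for lemma, _ in punct_ranked])

# An inflecting word class has more forms than lemmas.
verb_ratio, _ = form_lemma_stats(
    [("geht", "gehen"), ("ging", "gehen"), ("Haus", "Haus")]
)
print(verb_ratio)
```

A ratio of exactly 1 is what one would expect for punctuation, since punctuation marks do not inflect.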
PUNCT occurs with 2 features: PunctType (396256; 100% instances), Foreign (3; 0% instances)
PUNCT occurs with 4 feature-value pairs: Foreign=Yes, PunctType=Brck, PunctType=Comm, PunctType=Peri
PUNCT occurs with 5 feature combinations.
The most frequent feature combination is PunctType=Peri (177075 tokens).
Examples: ., :, ;, ?, !, …, !!!, !!, !!!!, ..
Relations
PUNCT nodes are attached to their parents using 1 relation: punct (396270; 100% instances)
Parents of PUNCT nodes belong to 15 different parts of speech: VERB (257877; 65% instances), NOUN (61889; 16% instances), PROPN (27458; 7% instances), ADJ (20668; 5% instances), AUX (10810; 3% instances), X (10555; 3% instances), ADV (2612; 1% instances), DET (2100; 1% instances), NUM (1446; 0% instances), PRON (603; 0% instances), PART (102; 0% instances), INTJ (65; 0% instances), ADP (46; 0% instances), CCONJ (29; 0% instances), SCONJ (10; 0% instances)
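The percentages in the parent distribution follow from the raw counts. The sketch below copies the counts from the list above, checks that they add up to the 396270 PUNCT tokens, and recomputes each share. The variable names are illustrative, and rounding to the nearest whole percent is an assumption that matches the figures shown.

```python
# Parent part-of-speech counts for PUNCT nodes, copied from the list above.
parent_counts = {
    "VERB": 257877, "NOUN": 61889, "PROPN": 27458, "ADJ": 20668,
    "AUX": 10810, "X": 10555, "ADV": 2612, "DET": 2100, "NUM": 1446,
    "PRON": 603, "PART": 102, "INTJ": 65, "ADP": 46, "CCONJ": 29,
    "SCONJ": 10,
}

total = sum(parent_counts.values())

# Share of each parent tag, rounded to the nearest whole percent.
percent = {tag: round(100 * n / total) for tag, n in parent_counts.items()}

for tag, n in parent_counts.items():
    print(f"{tag}: {n} ({percent[tag]}%)")
```

Shares shown as 0% are small but nonzero; for example, NUM accounts for 1446 of the 396270 attachments.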
396270 (100%) PUNCT nodes are leaves.
The highest child degree of a PUNCT node is 0.
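The child degree of a node is the number of nodes that name it as their head, and a leaf is a node with child degree 0. A minimal sketch of that check for a single sentence follows. The function name `child_degrees` and the hand-made sentence are illustrative assumptions, with each node given as an (ID, UPOS, HEAD) triple.

```python
from collections import Counter


def child_degrees(sentence, upos):
    """Return the number of children of every node tagged `upos` in one sentence."""
    # Count how often each node ID appears as the head of another node.
    n_children = Counter(head for _, _, head in sentence)
    return [n_children[node_id] for node_id, tag, _ in sentence if tag == upos]


# Hand-made sentence "Ja, ja." as (ID, UPOS, HEAD) triples; HEAD 0 is the root.
sentence = [(1, "INTJ", 0), (2, "PUNCT", 3), (3, "INTJ", 1), (4, "PUNCT", 1)]

print(child_degrees(sentence, "PUNCT"))
print(child_degrees(sentence, "INTJ"))
```

If every PUNCT node in every sentence has degree 0, then all PUNCT nodes are leaves and the highest child degree is 0, as reported here.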