Treebank Statistics: UD_German: POS Tags: PUNCT
There are 39 PUNCT lemmas (0%), 39 PUNCT types (0%) and 38292 PUNCT tokens (13%).
Out of 15 observed tags, the rank of PUNCT is: 13 in number of lemmas, 14 in number of types and 2 in number of tokens.
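The counts above can be reproduced from a CoNLL-U file along the following lines. This is a minimal sketch, not the script that generated this page: the sample data, the function names `read_words`, `tag_counts` and `token_rank`, and the assumption that "types" means distinct case-sensitive word forms are all hypothetical.

```python
from collections import Counter

# Hypothetical CoNLL-U sample, not taken from UD_German.
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = "\n".join([
    "# text = Hallo, Welt!",
    "1\tHallo\thallo\tINTJ\t_\t_\t0\troot\t_\t_",
    "2\t,\t,\tPUNCT\t_\t_\t1\tpunct\t_\t_",
    "3\tWelt\tWelt\tNOUN\t_\t_\t1\tvocative\t_\t_",
    "4\t!\t!\tPUNCT\t_\t_\t1\tpunct\t_\t_",
    "",
    "# text = Gut, danke!",
    "1\tGut\tgut\tADJ\t_\t_\t0\troot\t_\t_",
    "2\t,\t,\tPUNCT\t_\t_\t1\tpunct\t_\t_",
    "3\tdanke\tdanke\tINTJ\t_\t_\t1\tdiscourse\t_\t_",
    "4\t!\t!\tPUNCT\t_\t_\t1\tpunct\t_\t_",
    "",
])


def read_words(conllu_text):
    """Yield (form, lemma, upos) per syntactic word.

    Comment lines, multiword-token ranges (IDs like 1-2) and empty
    nodes (IDs like 1.1) are skipped.
    """
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def tag_counts(conllu_text, tag):
    """Return (lemmas, types, tokens, token share) for one UPOS tag."""
    words = list(read_words(conllu_text))
    tagged = [(form, lemma) for form, lemma, upos in words if upos == tag]
    lemmas = {lemma for _, lemma in tagged}
    types = {form for form, _ in tagged}
    share = len(tagged) / len(words) if words else 0.0
    return len(lemmas), len(types), len(tagged), share


def token_rank(conllu_text, tag):
    """Rank of a tag by token count (1 = most frequent; ties share a rank)."""
    counts = Counter(upos for _, _, upos in read_words(conllu_text))
    return 1 + sum(1 for c in counts.values() if c > counts[tag])


print(tag_counts(SAMPLE, "PUNCT"))
print(token_rank(SAMPLE, "PUNCT"))
```

Ranking by lemmas or types would follow the same pattern, with the per-tag set sizes in place of the token counts.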
The 10 most frequent PUNCT lemmas: ., ,, -, “, ), (, –, !, :, ‘’
The 10 most frequent PUNCT types: ., ,, -, “, ), (, –, !, :, ‘’
The 10 most frequent ambiguous lemmas: . (PUNCT 14751, NOUN 1), ” (PUNCT 1894, NOUN 3), ) (PUNCT 1858, X 1), ( (PUNCT 1847, X 1), : (PUNCT 442, X 1), / (PUNCT 263, ADP 5, X 5, PROPN 2), ? (PUNCT 103, PROPN 3), ’ (PUNCT 68, NOUN 6), .. (PUNCT 15, PROPN 1), = (X 16, PUNCT 10)
The 10 most frequent ambiguous types: . (PUNCT 14751, NOUN 1), ” (PUNCT 1894, NOUN 3), ) (PUNCT 1858, X 1), ( (PUNCT 1847, X 1), : (PUNCT 442, X 1), / (PUNCT 263, ADP 5, X 5, PROPN 2), ? (PUNCT 103, PROPN 3), ’ (PUNCT 68, NOUN 6), .. (PUNCT 15, PROPN 1), = (X 16, PUNCT 10)
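An "ambiguous" lemma or type here is one that occurs with PUNCT and with at least one other tag. The sketch below shows one way such a list could be built. The observations are invented, the names `ambiguous_items` and `format_entry` are hypothetical, and the ordering by the PUNCT count is only inferred from the lists above (where "=" with 10 PUNCT occurrences comes last despite a higher total), not taken from documented behaviour.

```python
from collections import Counter, defaultdict

# Hypothetical (item, UPOS) observations; the counts are invented.
OBSERVATIONS = (
    [(".", "PUNCT")] * 5
    + [("/", "PUNCT")] * 3
    + [("/", "ADP")]
    + [("=", "X")] * 2
    + [("=", "PUNCT")]
)


def ambiguous_items(observations, tag):
    """List items seen with `tag` and at least one other tag.

    Items are ordered by how often they carry `tag`, most frequent first;
    each item's tags are ordered by their own frequency.
    """
    by_item = defaultdict(Counter)
    for item, upos in observations:
        by_item[item][upos] += 1
    ambiguous = [
        (item, tags)
        for item, tags in by_item.items()
        if tag in tags and len(tags) > 1
    ]
    ambiguous.sort(key=lambda pair: -pair[1][tag])
    return [(item, tags.most_common()) for item, tags in ambiguous]


def format_entry(item, tag_counts):
    """Render one entry in the style used above, e.g. '/ (PUNCT 3, ADP 1)'."""
    inner = ", ".join(f"{upos} {count}" for upos, count in tag_counts)
    return f"{item} ({inner})"


for item, tag_counts in ambiguous_items(OBSERVATIONS, "PUNCT"):
    print(format_entry(item, tag_counts))
```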
Morphology
The form / lemma ratio of PUNCT is 1.000000 (the average of all parts of speech is 1.186689).
The 1st highest number of forms (1) was observed with the lemma “!”: !.
The 2nd highest number of forms (1) was observed with the lemma “””: ”.
The 3rd highest number of forms (1) was observed with the lemma “’”: ’.
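As an illustration of the form / lemma ratio, the sketch below assumes it is the average number of distinct word forms per lemma; the page does not state the exact definition, so this reading is an assumption, although it is consistent with 39 types over 39 lemmas giving 1.000000. The sample pairs and the names `forms_by_lemma`, `form_lemma_ratio` and `richest_lemmas` are hypothetical.

```python
from collections import defaultdict

# Hypothetical (form, lemma) pairs; not taken from UD_German.
PAIRS = [
    ("geht", "gehen"), ("ging", "gehen"), ("gehen", "gehen"),
    ("Haus", "Haus"), ("Hauses", "Haus"),
    (".", "."), (".", "."),
]


def forms_by_lemma(pairs):
    """Map each lemma to the set of distinct forms observed with it."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms


def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma (assumed definition)."""
    forms = forms_by_lemma(pairs)
    if not forms:
        return 0.0
    return sum(len(fs) for fs in forms.values()) / len(forms)


def richest_lemmas(pairs, n=3):
    """The n lemmas with the most distinct forms, with their sorted forms."""
    forms = forms_by_lemma(pairs)
    ranked = sorted(forms.items(), key=lambda kv: -len(kv[1]))
    return [(lemma, sorted(fs)) for lemma, fs in ranked[:n]]


print(f"{form_lemma_ratio(PAIRS):.6f}")
print(richest_lemmas(PAIRS))
```

For a tag such as PUNCT, where every lemma has exactly one form, every entry in the ranking has a form count of 1 and the ratio is 1.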
PUNCT does not occur with any features.
Relations
PUNCT nodes are attached to their parents using 7 different relations: punct (38251; 100% instances), case (22; 0% instances), dep (14; 0% instances), appos (2; 0% instances), compound (1; 0% instances), conj (1; 0% instances), cop (1; 0% instances)
Parents of PUNCT nodes belong to 15 different parts of speech: VERB (17292; 45% instances), NOUN (9325; 24% instances), PROPN (7328; 19% instances), ADJ (2990; 8% instances), NUM (803; 2% instances), PRON (185; 0% instances), ADV (125; 0% instances), X (125; 0% instances), ADP (45; 0% instances), CCONJ (20; 0% instances), AUX (18; 0% instances), PART (17; 0% instances), PUNCT (13; 0% instances), DET (4; 0% instances), SCONJ (2; 0% instances)
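The two distributions above (incoming relation and parent part of speech) could be collected from the HEAD and DEPREL columns roughly as follows. This is a sketch under stated assumptions: the sample sentences and the names `read_sentences` and `attachment_stats` are hypothetical, relation subtypes are kept as written, and nodes attached to the artificial root (HEAD 0) are counted for the relation but have no parent tag.

```python
from collections import Counter

# Hypothetical CoNLL-U sample, not taken from UD_German.
SAMPLE = "\n".join([
    "1\tHallo\thallo\tINTJ\t_\t_\t0\troot\t_\t_",
    "2\t,\t,\tPUNCT\t_\t_\t1\tpunct\t_\t_",
    "3\tWelt\tWelt\tNOUN\t_\t_\t1\tvocative\t_\t_",
    "4\t!\t!\tPUNCT\t_\t_\t1\tpunct\t_\t_",
    "",
    "1\tEr\ter\tPRON\t_\t_\t2\tnsubj\t_\t_",
    "2\tkommt\tkommen\tVERB\t_\t_\t0\troot\t_\t_",
    "3\t(\t(\tPUNCT\t_\t_\t4\tpunct\t_\t_",
    "4\theute\theute\tADV\t_\t_\t2\tadvmod\t_\t_",
    "5\t)\t)\tPUNCT\t_\t_\t4\tpunct\t_\t_",
    "6\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
    "",
])


def read_sentences(conllu_text):
    """Yield one dict per sentence, mapping word ID to its UPOS, HEAD, DEPREL."""
    sentence = {}
    for line in conllu_text.splitlines():
        if not line.strip():
            if sentence:
                yield sentence
                sentence = {}
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if cols[0].isdigit():  # skip multiword ranges and empty nodes
            sentence[int(cols[0])] = {
                "upos": cols[3], "head": int(cols[6]), "deprel": cols[7],
            }
    if sentence:
        yield sentence


def attachment_stats(conllu_text, tag):
    """Count incoming relations and parent UPOS tags for nodes with `tag`."""
    relations, parents = Counter(), Counter()
    for sentence in read_sentences(conllu_text):
        for node in sentence.values():
            if node["upos"] != tag:
                continue
            relations[node["deprel"]] += 1
            if node["head"] != 0:
                parents[sentence[node["head"]]["upos"]] += 1
    return relations, parents


relations, parents = attachment_stats(SAMPLE, "PUNCT")
total = sum(relations.values())
for name, count in relations.most_common():
    print(f"{name} ({count}; {round(100 * count / total)}% instances)")
```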
38279 (100%) PUNCT nodes are leaves.
6 (0%) PUNCT nodes have one child.
6 (0%) PUNCT nodes have two children.
1 (0%) PUNCT node has three or more children.
The highest child degree of a PUNCT node is 4.
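The child-degree figures above amount to a histogram of how many dependents each PUNCT node has. A minimal sketch, assuming sentences have already been reduced to (ID, UPOS, HEAD) triples; the sample data and the name `child_degree_histogram` are hypothetical.

```python
from collections import Counter

# Hypothetical sentences as (ID, UPOS, HEAD) triples; HEAD 0 is the root.
SENTENCES = [
    [(1, "VERB", 0), (2, "NOUN", 1), (3, "PUNCT", 1)],
    [(1, "PUNCT", 0), (2, "NOUN", 1), (3, "PUNCT", 1)],
]


def child_degree_histogram(sentences, tag):
    """Map child count -> number of `tag` nodes with that many children."""
    histogram = Counter()
    for sentence in sentences:
        # Number of dependents per head ID within this sentence.
        n_children = Counter(head for _, _, head in sentence if head != 0)
        for node_id, upos, _ in sentence:
            if upos == tag:
                histogram[n_children[node_id]] += 1
    return histogram


histogram = child_degree_histogram(SENTENCES, "PUNCT")
print("leaves:", histogram[0])
print("one child:", histogram[1])
print("two children:", histogram[2])
print("three or more:", sum(n for degree, n in histogram.items() if degree >= 3))
print("highest child degree:", max(histogram))
```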
Children of PUNCT nodes are attached using 6 different relations: punct (13; 59% instances), nmod (5; 23% instances), conj (1; 5% instances), mark (1; 5% instances), nsubj (1; 5% instances), nummod (1; 5% instances)
Children of PUNCT nodes belong to 6 different parts of speech: PUNCT (13; 59% instances), NOUN (3; 14% instances), PRON (2; 9% instances), PROPN (2; 9% instances), NUM (1; 5% instances), SCONJ (1; 5% instances)