home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_German-GSD: POS Tags: PUNCT

There are 39 PUNCT lemmas (0%), 39 PUNCT types (0%) and 38497 PUNCT tokens (13%). Out of 17 observed tags, the rank of PUNCT is: 13 in number of lemmas, 14 in number of types and 2 in number of tokens.

The 10 most frequent PUNCT lemmas: ., ,, -, “, ), (, !, :, ``, ‘’

The 10 most frequent PUNCT types: ., ,, -, “, ), (, !, :, ``, ‘’

The 10 most frequent ambiguous lemmas: - (PUNCT 3684, SYM 2), (PUNCT 1886, SYM 2, NOUN 1), ) (PUNCT 1858, X 1), ( (PUNCT 1849, X 1), : (PUNCT 442, X 1), / (PUNCT 263, SYM 6, ADP 2, PROPN 2, X 2), ? (PUNCT 105, PROPN 1), (PUNCT 66, NOUN 6, SYM 1), .. (PUNCT 15, PROPN 1), = (SYM 15, PUNCT 10, X 1)

The 10 most frequent ambiguous types: - (PUNCT 3684, SYM 2, X 1), (PUNCT 1881, SYM 2, NOUN 1), ) (PUNCT 1858, X 1), ( (PUNCT 1849, X 1), : (PUNCT 442, X 1), / (PUNCT 263, SYM 6, ADP 2, PROPN 2, X 2), ? (PUNCT 105, PROPN 1), (PUNCT 66, NOUN 6, SYM 1), .. (PUNCT 15, PROPN 1), = (SYM 15, PUNCT 10, X 1)

Morphology

The form / lemma ratio of PUNCT is 1.000000 (the average of all parts of speech is 1.187208).

The 1st highest number of forms (2) was observed with the lemma “””: ”, ``.

The 2nd highest number of forms (1) was observed with the lemma “!”: !.

The 3rd highest number of forms (1) was observed with the lemma “’”: .

PUNCT occurs with 6 features: Mood (1; 0% instances), Number (1; 0% instances), Person (1; 0% instances), Tense (1; 0% instances), VerbForm (1; 0% instances), Voice (1; 0% instances)

PUNCT occurs with 6 feature-value pairs: Mood=Ind, Number=Sing, Person=3, Tense=Past, VerbForm=Fin, Voice=Pass

PUNCT occurs with 2 feature combinations. The most frequent feature combination is _ (38496 tokens). Examples: ., ,, -, “, ), (, !, :, ``, ‘’

Relations

PUNCT nodes are attached to their parents using 2 different relations: punct (38496; 100% instances), root (1; 0% instances)

Parents of PUNCT nodes belong to 18 different parts of speech: VERB (18587; 48% instances), NOUN (8783; 23% instances), PROPN (6226; 16% instances), ADJ (3006; 8% instances), NUM (885; 2% instances), PRON (348; 1% instances), ADV (240; 1% instances), X (151; 0% instances), DET (130; 0% instances), ADP (40; 0% instances), PART (34; 0% instances), CCONJ (31; 0% instances), AUX (18; 0% instances), SYM (9; 0% instances), INTJ (4; 0% instances), PUNCT (2; 0% instances), SCONJ (2; 0% instances), (1; 0% instances)

38496 (100%) PUNCT nodes are leaves.

0 (0%) PUNCT nodes have one child.

0 (0%) PUNCT nodes have two children.

1 (0%) PUNCT nodes have three or more children.

The highest child degree of a PUNCT node is 3.

Children of PUNCT nodes are attached using 2 different relations: punct (2; 67% instances), appos (1; 33% instances)

Children of PUNCT nodes belong to 2 different parts of speech: PUNCT (2; 67% instances), VERB (1; 33% instances)