home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Bavarian-MaiBaam: POS Tags: PUNCT

There are 1 PUNCT lemmas (6%), 20 PUNCT types (0%) and 2105 PUNCT tokens (14%). Out of 17 observed tags, the rank of PUNCT is: 13 in number of lemmas, 14 in number of types and 2 in number of tokens.

The 10 most frequent PUNCT lemmas: _

The 10 most frequent PUNCT types: ., ,, “, ?, (, ), :, !, „, “

The 10 most frequent ambiguous lemmas: _ (NOUN 2272, PUNCT 2105, DET 1959, VERB 1458, ADP 1417, ADV 1203, PRON 1133, AUX 926, ADJ 798, PROPN 545, CCONJ 380, SCONJ 340, NUM 240, PART 160, X 64, INTJ 23, SYM 7)

The 10 most frequent ambiguous types: ? (PUNCT 113, SYM 1), (PUNCT 5, SYM 1), (PUNCT 3, ADP 1)

Morphology

The form / lemma ratio of PUNCT is 20.000000 (the average of all parts of speech is 265.444444).

The 1st highest number of forms (20) was observed with the lemma “_”: !, “, (, ), ,, -, ., …, /, :, ;, ?, [, ], «, », –, “, ”, „.

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 1 different relations: punct (2105; 100% instances)

Parents of PUNCT nodes belong to 16 different parts of speech: VERB (1211; 58% instances), NOUN (404; 19% instances), ADJ (162; 8% instances), PROPN (122; 6% instances), ADV (77; 4% instances), X (52; 2% instances), INTJ (15; 1% instances), DET (13; 1% instances), ADP (11; 1% instances), PRON (9; 0% instances), AUX (7; 0% instances), NUM (7; 0% instances), CCONJ (5; 0% instances), SYM (5; 0% instances), PART (4; 0% instances), SCONJ (1; 0% instances)

2105 (100%) PUNCT nodes are leaves.

The highest child degree of a PUNCT node is 0.