This page pertains to UD version 2.

Treebank Statistics: UD_Beja-NSC: POS Tags: PUNCT

There is 1 PUNCT lemma (6%), 6 PUNCT types (0%) and 1126 PUNCT tokens (19%). Out of 16 observed tags, PUNCT ranks 13th in number of lemmas, 16th in number of types and 1st in number of tokens.
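Counts of this kind can be derived by collecting distinct lemmas, distinct word forms (types) and token occurrences per UPOS tag. The following is a minimal sketch in Python on hypothetical toy triples, not the script that generated this page. The names `tag_counts` and `rank`, the placeholder forms `w1` to `w3`, and the way ties are broken are all assumptions.

```python
from collections import defaultdict

# Hypothetical toy data: (form, lemma, upos) triples, NOT taken from UD_Beja-NSC.
TOKENS = [
    ("/", "_", "PUNCT"),
    ("//", "_", "PUNCT"),
    ("/", "_", "PUNCT"),
    ("w1", "w1", "NOUN"),
    ("w2", "w2", "NOUN"),
    ("w3", "w3", "VERB"),
]


def tag_counts(tokens):
    """Return {upos: (n_lemmas, n_types, n_tokens)}."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    n_tokens = defaultdict(int)
    for form, lemma, upos in tokens:
        lemmas[upos].add(lemma)
        types[upos].add(form)
        n_tokens[upos] += 1
    return {t: (len(lemmas[t]), len(types[t]), n_tokens[t]) for t in n_tokens}


def rank(counts, tag, index):
    """1-based rank of `tag` by the count at `index` (0 lemmas, 1 types, 2 tokens).

    Ties keep first-seen order here; the real statistics may break them differently.
    """
    ordered = sorted(counts, key=lambda t: counts[t][index], reverse=True)
    return ordered.index(tag) + 1


counts = tag_counts(TOKENS)
print(counts["PUNCT"])            # (lemmas, types, tokens) for PUNCT
print(rank(counts, "PUNCT", 2))   # rank of PUNCT by token count
```

In the toy data PUNCT has a single lemma and few types but the most tokens, which mirrors the pattern reported above: a low rank by lemmas and types, and the top rank by tokens.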

The 10 most frequent PUNCT lemmas: _

The 10 most frequent PUNCT types: /, //, ##, ?, #, {noise}

The 10 most frequent ambiguous lemmas: _ (PUNCT 1126, VERB 1097, DET 933, NOUN 894, ADP 408, PRON 395, SCONJ 298, PART 167, CCONJ 160, AUX 125, ADV 104, ADJ 77, PROPN 32, INTJ 28, NUM 26, X 18)

The 10 most frequent ambiguous types: / (PUNCT 773, X 2)

Morphology

The form / lemma ratio of PUNCT is 6.000000 (the average of all parts of speech is 76.500000).

The highest number of forms (6) was observed with the lemma “_”: #, ##, /, //, ?, {noise}.
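The form / lemma ratio reported for PUNCT follows directly from these figures: 6 distinct forms divided by 1 lemma. A small Python sketch of that arithmetic is given below. The function name `form_lemma_ratio` is hypothetical, and counting distinct forms across all lemmas of the tag is an assumption about how the ratio is defined; with a single lemma the choice makes no difference.

```python
# Forms observed with the PUNCT lemma "_", as listed on this page.
punct_forms = {"_": ["#", "##", "/", "//", "?", "{noise}"]}


def form_lemma_ratio(forms_by_lemma):
    """Distinct forms divided by distinct lemmas (assumed definition)."""
    distinct_forms = {f for forms in forms_by_lemma.values() for f in forms}
    return len(distinct_forms) / len(forms_by_lemma)


ratio = form_lemma_ratio(punct_forms)
print(f"{ratio:.6f}")  # prints 6.000000
```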

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 1 relation: punct (1126; 100% instances)

Parents of PUNCT nodes belong to 14 different parts of speech: VERB (563; 50% instances), NOUN (213; 19% instances), SCONJ (121; 11% instances), ADP (66; 6% instances), ADV (39; 3% instances), PRON (37; 3% instances), PART (26; 2% instances), INTJ (16; 1% instances), DET (11; 1% instances), ADJ (9; 1% instances), PROPN (9; 1% instances), X (9; 1% instances), NUM (5; 0% instances), AUX (2; 0% instances)
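As a sanity check, the percentages in this list can be recomputed from the raw counts. The sketch below copies the counts from the list above and rounds each share to the nearest whole percent; that rounding rule is an assumption, although it reproduces every figure shown.

```python
# Parent UPOS counts for PUNCT nodes, copied from the list above.
parent_counts = {
    "VERB": 563, "NOUN": 213, "SCONJ": 121, "ADP": 66, "ADV": 39,
    "PRON": 37, "PART": 26, "INTJ": 16, "DET": 11, "ADJ": 9,
    "PROPN": 9, "X": 9, "NUM": 5, "AUX": 2,
}

total = sum(parent_counts.values())  # should equal the 1126 PUNCT tokens

# Share of each parent tag, rounded to a whole percent (assumed rounding rule).
percentages = {upos: round(100 * n / total) for upos, n in parent_counts.items()}

for upos, n in parent_counts.items():
    print(f"{upos} ({n}; {percentages[upos]}% instances)")
```

The counts sum to 1126, so every PUNCT token is accounted for. NUM and AUX show 0% only because their shares fall below half a percent.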

1126 (100%) PUNCT nodes are leaves.

The highest child degree of a PUNCT node is 0.
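The leaf and child-degree figures can be checked by counting, for each node, how many other nodes name it as their head. The following Python sketch does this for one hypothetical toy sentence. The tuple layout `(id, upos, head)` is a simplification of the CoNLL-U columns, and the names `child_degrees` and `punct_leaf_stats` are assumptions made for illustration.

```python
from collections import Counter

# Hypothetical toy sentence as (id, upos, head); head 0 marks the root.
SENTENCE = [
    (1, "NOUN", 2),
    (2, "VERB", 0),
    (3, "PUNCT", 2),
]


def child_degrees(sentence):
    """Map each node id to its number of direct dependents."""
    degree = Counter({node_id: 0 for node_id, _, _ in sentence})
    for _, _, head in sentence:
        if head != 0:
            degree[head] += 1
    return degree


def punct_leaf_stats(sentence):
    """Return (number of leaf PUNCT nodes, highest child degree of a PUNCT node)."""
    degree = child_degrees(sentence)
    punct_ids = [node_id for node_id, upos, _ in sentence if upos == "PUNCT"]
    leaves = sum(1 for node_id in punct_ids if degree[node_id] == 0)
    max_degree = max((degree[node_id] for node_id in punct_ids), default=0)
    return leaves, max_degree


leaves, max_degree = punct_leaf_stats(SENTENCE)
print(leaves, max_degree)
```

A highest child degree of 0 means that no node in the treebank is attached to a PUNCT node, which is equivalent to all PUNCT nodes being leaves.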