home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Beja-NSC: POS Tags: PUNCT

There are 1 PUNCT lemmas (6%), 6 PUNCT types (1%) and 241 PUNCT tokens (20%). Out of 16 observed tags, the rank of PUNCT is: 13 in number of lemmas, 12 in number of types and 2 in number of tokens.

The 10 most frequent PUNCT lemmas: _

The 10 most frequent PUNCT types: /, //, ?, ##, <, >

The 10 most frequent ambiguous lemmas: _ (VERB 242, PUNCT 241, DET 176, NOUN 168, PRON 106, SCONJ 68, ADP 39, AUX 38, CCONJ 34, PART 28, ADV 18, ADJ 17, NUM 12, INTJ 8, X 7, PROPN 4)

The 10 most frequent ambiguous types: / (PUNCT 152, X 2)

Morphology

The form / lemma ratio of PUNCT is 6.000000 (the average of all parts of speech is 26.312500).

The 1st highest number of forms (6) was observed with the lemma “_”: ##, /, //, <, >, ?.

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 1 different relations: punct (241; 100% instances)

Parents of PUNCT nodes belong to 14 different parts of speech: VERB (132; 55% instances), NOUN (58; 24% instances), PART (10; 4% instances), PRON (9; 4% instances), SCONJ (8; 3% instances), ADP (6; 2% instances), ADJ (4; 2% instances), ADV (4; 2% instances), INTJ (3; 1% instances), AUX (2; 1% instances), DET (2; 1% instances), NUM (1; 0% instances), PROPN (1; 0% instances), X (1; 0% instances)

241 (100%) PUNCT nodes are leaves.

The highest child degree of a PUNCT node is 0.