home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-ESL: POS Tags: PUNCT

There are 1 PUNCT lemmas (6%), 1 PUNCT types (6%) and 8624 PUNCT tokens (10%). Out of 17 observed tags, the rank of PUNCT is: 13 in number of lemmas, 13 in number of types and 5 in number of tokens.

The 10 most frequent PUNCT lemmas: _

The 10 most frequent PUNCT types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 14135, VERB 13583, PRON 9575, DET 9068, PUNCT 8624, ADP 7769, ADJ 5278, ADV 5121, AUX 4111, PART 3169, CONJ 2865, SCONJ 2278, PROPN 1574, NUM 776, INTJ 67, X 60, SYM 37)

The 10 most frequent ambiguous types: _ (NOUN 14135, VERB 13583, PRON 9575, DET 9068, PUNCT 8624, ADP 7769, ADJ 5278, ADV 5121, AUX 4111, PART 3169, CONJ 2865, SCONJ 2278, PROPN 1574, NUM 776, INTJ 67, X 60, SYM 37)

Morphology

The form / lemma ratio of PUNCT is 1.000000 (the average of all parts of speech is 1.000000).

The 1st highest number of forms (1) was observed with the lemma “_”: _.

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 1 different relations: punct (8624; 100% instances)

Parents of PUNCT nodes belong to 15 different parts of speech: VERB (5604; 65% instances), NOUN (1432; 17% instances), ADJ (1153; 13% instances), PROPN (220; 3% instances), ADV (74; 1% instances), NUM (51; 1% instances), PRON (41; 0% instances), SYM (10; 0% instances), X (10; 0% instances), DET (9; 0% instances), AUX (8; 0% instances), INTJ (7; 0% instances), ADP (2; 0% instances), CONJ (2; 0% instances), PART (1; 0% instances)

8623 (100%) PUNCT nodes are leaves.

1 (0%) PUNCT nodes have one child.

The highest child degree of a PUNCT node is 1.

Children of PUNCT nodes are attached using 1 different relations: nummod (1; 100% instances)

Children of PUNCT nodes belong to 1 different parts of speech: NUM (1; 100% instances)