home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Uyghur: POS Tags: PUNCT

There are 1 PUNCT lemmas (7%), 15 PUNCT types (0%) and 2934 PUNCT tokens (19%). Out of 14 observed tags, the rank of PUNCT is: 12 in number of lemmas, 13 in number of types and 3 in number of tokens.

The 10 most frequent PUNCT lemmas: _

The 10 most frequent PUNCT types: .، ،، -، _، !، ؟، :، «، »، “

The 10 most frequent ambiguous lemmas: _ (NOUN 4872, VERB 3255, PUNCT 2934, PRON 1262, ADJ 1123, AUX 481, NUM 449, ADV 405, CCONJ 305, ADP 200, PART 123, DET 54, INTJ 41, X 5)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of PUNCT is 15.000000 (the average of all parts of speech is 429.142857).

The 1st highest number of forms (15) was observed with the lemma “_”: !, “, (, ), -, ., :, _, «, », ،, ؛, ؟, –, !.

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 1 different relations: punct (2934; 100% instances)

Parents of PUNCT nodes belong to 12 different parts of speech: VERB (1589; 54% instances), NOUN (465; 16% instances), AUX (406; 14% instances), ADJ (173; 6% instances), PART (73; 2% instances), PRON (61; 2% instances), CCONJ (54; 2% instances), INTJ (35; 1% instances), ADV (30; 1% instances), NUM (30; 1% instances), ADP (16; 1% instances), DET (2; 0% instances)

2934 (100%) PUNCT nodes are leaves.

The highest child degree of a PUNCT node is 0.