home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Urdu-UDTB: POS Tags: PUNCT

There are 17 PUNCT lemmas (0%), 17 PUNCT types (0%) and 6911 PUNCT tokens (5%). Out of 16 observed tags, the rank of PUNCT is: 13 in number of lemmas, 13 in number of types and 7 in number of tokens.

The 10 most frequent PUNCT lemmas: ۔، ,، ،، -، ‘‘، )، (، “، ‘، ؟

The 10 most frequent PUNCT types: ۔، ,، ،، -، ‘‘، )، (، “، ‘، ؟

The 10 most frequent ambiguous lemmas: ۔ (PUNCT 5004, SCONJ 1), , (PUNCT 656, SCONJ 27), ، (PUNCT 420, SCONJ 3, PROPN 2), - (PUNCT 217, PROPN 3, CCONJ 1, NOUN 1), ‘’ (PUNCT 181, NUM 1), ) (PUNCT 149, PROPN 1), ( (PUNCT 147, PROPN 1), ! (PUNCT 16, NOUN 6), / (PUNCT 2, PROPN 1), ? (PROPN 1, PUNCT 1)

The 10 most frequent ambiguous types: ۔ (PUNCT 5005, AUX 2), , (PUNCT 656, SCONJ 27), ، (PUNCT 420, SCONJ 8, PROPN 3), - (PUNCT 216, PROPN 3, NOUN 1), ‘’ (PUNCT 181, NUM 1), ) (PUNCT 149, PROPN 1), ( (PUNCT 147, PROPN 1), ؟ (PUNCT 22, PROPN 1, VERB 1), ! (PUNCT 16, NOUN 1), / (PUNCT 2, PROPN 1)

Morphology

The form / lemma ratio of PUNCT is 1.000000 (the average of all parts of speech is 1.103404).

The 1st highest number of forms (2) was observed with the lemma “-”: -, ۔.

The 2nd highest number of forms (1) was observed with the lemma “!”: !.

The 3rd highest number of forms (1) was observed with the lemma “””: “.

PUNCT occurs with 1 features: Voice (1; 0% instances)

PUNCT occurs with 1 feature-value pairs: Voice=Act

PUNCT occurs with 2 feature combinations. The most frequent feature combination is _ (6910 tokens). Examples: ۔، ,، ،، -، ‘‘، )، (، “، ‘، ؟

Relations

PUNCT nodes are attached to their parents using 1 different relations: punct (6911; 100% instances)

Parents of PUNCT nodes belong to 14 different parts of speech: VERB (5208; 75% instances), PROPN (741; 11% instances), NOUN (632; 9% instances), ADJ (186; 3% instances), NUM (65; 1% instances), PRON (33; 0% instances), AUX (13; 0% instances), PART (13; 0% instances), PUNCT (6; 0% instances), ADV (4; 0% instances), DET (4; 0% instances), X (3; 0% instances), ADP (2; 0% instances), INTJ (1; 0% instances)

6905 (100%) PUNCT nodes are leaves.

6 (0%) PUNCT nodes have one child.

The highest child degree of a PUNCT node is 1.

Children of PUNCT nodes are attached using 1 different relations: punct (6; 100% instances)

Children of PUNCT nodes belong to 1 different parts of speech: PUNCT (6; 100% instances)