home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Urdu-UDTB: POS Tags: PUNCT

There are 19 PUNCT lemmas (0%), 19 PUNCT types (0%) and 6913 PUNCT tokens (5%). Out of 16 observed tags, the rank of PUNCT is: 14 in number of lemmas, 13 in number of types and 7 in number of tokens.

The 10 most frequent PUNCT lemmas: ۔، ,، ،، -، ‘‘، )، (، “، ‘، ؟

The 10 most frequent PUNCT types: ۔، ,، ،، -، ‘‘، )، (، “، ‘، ؟

The 10 most frequent ambiguous lemmas: ۔ (PUNCT 5004, AUX 1, SCONJ 1), , (PUNCT 656, SCONJ 27), ، (PUNCT 420, SCONJ 3, PROPN 2), - (PUNCT 217, PROPN 3, CCONJ 1, NOUN 1), ‘’ (PUNCT 181, NUM 1), ) (PUNCT 149, PROPN 1), ( (PUNCT 147, PROPN 1), ؟ (PUNCT 22, VERB 1), ! (PUNCT 16, NOUN 6), / (PUNCT 2, PROPN 1)

The 10 most frequent ambiguous types: ۔ (PUNCT 5005, AUX 2), , (PUNCT 656, SCONJ 27), ، (PUNCT 420, SCONJ 8, PROPN 3), - (PUNCT 216, PROPN 3, NOUN 1), ‘’ (PUNCT 181, NUM 1), ) (PUNCT 149, PROPN 1), ( (PUNCT 147, PROPN 1), ؟ (PUNCT 22, PROPN 1, VERB 1), ! (PUNCT 16, NOUN 1), / (PUNCT 2, PROPN 1)

Morphology

The form / lemma ratio of PUNCT is 1.000000 (the average of all parts of speech is 1.101903).

The 1st highest number of forms (2) was observed with the lemma “-”: -, ۔.

The 2nd highest number of forms (1) was observed with the lemma “!”: !.

The 3rd highest number of forms (1) was observed with the lemma “””: “.

PUNCT occurs with 1 features: Voice (1; 0% instances)

PUNCT occurs with 1 feature-value pairs: Voice=Act

PUNCT occurs with 2 feature combinations. The most frequent feature combination is _ (6912 tokens). Examples: ۔، ,، ،، -، ‘‘، )، (، “، ‘، ؟

Relations

PUNCT nodes are attached to their parents using 15 different relations: punct (6836; 99% instances), compound (14; 0% instances), conj (13; 0% instances), obj (13; 0% instances), advcl (10; 0% instances), acl (6; 0% instances), nmod (5; 0% instances), case (4; 0% instances), nsubj (3; 0% instances), acl:relcl (2; 0% instances), advmod (2; 0% instances), mark (2; 0% instances), amod (1; 0% instances), aux (1; 0% instances), dep (1; 0% instances)

Parents of PUNCT nodes belong to 13 different parts of speech: VERB (5209; 75% instances), PROPN (739; 11% instances), NOUN (634; 9% instances), ADJ (187; 3% instances), NUM (65; 1% instances), PRON (33; 0% instances), AUX (13; 0% instances), PART (13; 0% instances), PUNCT (9; 0% instances), ADV (4; 0% instances), DET (4; 0% instances), ADP (2; 0% instances), X (1; 0% instances)

6901 (100%) PUNCT nodes are leaves.

10 (0%) PUNCT nodes have one child.

0 (0%) PUNCT nodes have two children.

2 (0%) PUNCT nodes have three or more children.

The highest child degree of a PUNCT node is 5.

Children of PUNCT nodes are attached using 8 different relations: punct (9; 47% instances), compound (2; 11% instances), mark (2; 11% instances), nmod (2; 11% instances), advmod (1; 5% instances), iobj (1; 5% instances), nsubj (1; 5% instances), obl (1; 5% instances)

Children of PUNCT nodes belong to 6 different parts of speech: PUNCT (9; 47% instances), NOUN (4; 21% instances), PRON (2; 11% instances), SCONJ (2; 11% instances), ADP (1; 5% instances), PART (1; 5% instances)