home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Urdu-UDTB: POS Tags: PART

There are 103 PART lemmas (1%), 96 PART types (1%) and 3040 PART tokens (2%). Out of 16 observed tags, the rank of PART is: 9 in number of lemmas, 10 in number of types and 10 in number of tokens.

The 10 most frequent PART lemmas: بھی، نہیں، مسٹر، ہی، نہ، صرف، جناب، کہ، تو، بغیر

The 10 most frequent PART types: بھی، نہیں، مسٹر، ہی، نہ، صرف، جناب، کہ، تو، بغیر

The 10 most frequent ambiguous lemmas: بھی (PART 876, ADP 1), مسٹر (PART 341, NOUN 14, PROPN 3), ہی (PART 243, ADP 1), نہ (PART 197, X 1), صرف (PART 118, ADV 5, NOUN 4, ADJ 2, ADP 1), کہ (SCONJ 1961, PART 42, VERB 15, PRON 4, ADP 2, NOUN 1), تو (SCONJ 196, PART 39, PRON 38, PROPN 1), بغیر (PART 37, ADJ 1), و (CCONJ 314, PART 35, PROPN 10, NOUN 4, ADV 1), سے (ADP 2504, ADV 35, PART 31, PROPN 2, ADJ 1)

The 10 most frequent ambiguous types: بھی (PART 877, ADP 1), مسٹر (PART 341, NOUN 14, PROPN 2), ہی (PART 243, ADP 1), نہ (PART 198, X 1), صرف (PART 119, ADV 5, NOUN 4, ADJ 2, ADP 1), کہ (SCONJ 1970, PART 50, PRON 4, ADP 2, NOUN 1), تو (SCONJ 197, PART 39, PRON 24, CCONJ 1, PROPN 1), بغیر (PART 38, ADJ 1), و (CCONJ 318, PART 35, PROPN 10, NOUN 3, ADV 1), سے (ADP 2510, ADV 36, PART 31, PROPN 2, ADJ 1)

Morphology

The form / lemma ratio of PART is 0.932039 (the average of all parts of speech is 1.101903).

The 1st highest number of forms (2) was observed with the lemma “اور”: اور, بغیر.

The 2nd highest number of forms (2) was observed with the lemma “بھلا”: بھلا, بھلے.

The 3rd highest number of forms (2) was observed with the lemma “بھی”: بھی, ہی.

PART occurs with 10 features: Polarity (818; 27% instances), PronType (807; 27% instances), Case (55; 2% instances), Gender (47; 2% instances), Number (46; 2% instances), Person (38; 1% instances), Voice (3; 0% instances), Echo (2; 0% instances), Aspect (1; 0% instances), VerbForm (1; 0% instances)

PART occurs with 14 feature-value pairs: Aspect=Perf, Case=Acc, Case=Nom, Echo=Rdp, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=3, Polarity=Neg, PronType=Ind, PronType=Neg, VerbForm=Part, Voice=Act

PART occurs with 18 feature combinations. The most frequent feature combination is _ (2162 tokens). Examples: بھی، مسٹر، ہی، صرف، جناب، کہ، تو، و، سے، بھر

Relations

PART nodes are attached to their parents using 17 different relations: dep (2164; 71% instances), advmod (760; 25% instances), obl (47; 2% instances), nmod (12; 0% instances), case (11; 0% instances), nsubj (11; 0% instances), dislocated (10; 0% instances), mark (7; 0% instances), obj (4; 0% instances), advcl (3; 0% instances), compound (3; 0% instances), amod (2; 0% instances), conj (2; 0% instances), acl:relcl (1; 0% instances), iobj (1; 0% instances), punct (1; 0% instances), root (1; 0% instances)

Parents of PART nodes belong to 13 different parts of speech: NOUN (1043; 34% instances), VERB (808; 27% instances), PROPN (592; 19% instances), PRON (235; 8% instances), NUM (129; 4% instances), ADJ (114; 4% instances), ADV (51; 2% instances), PART (32; 1% instances), DET (27; 1% instances), ADP (6; 0% instances), PUNCT (1; 0% instances), (1; 0% instances), SCONJ (1; 0% instances)

2987 (98%) PART nodes are leaves.

43 (1%) PART nodes have one child.

7 (0%) PART nodes have two children.

3 (0%) PART nodes have three or more children.

The highest child degree of a PART node is 5.

Children of PART nodes are attached using 14 different relations: dep (28; 41% instances), punct (12; 17% instances), case (6; 9% instances), nmod (6; 9% instances), nsubj (4; 6% instances), mark (3; 4% instances), compound (2; 3% instances), obj (2; 3% instances), acl (1; 1% instances), advcl (1; 1% instances), amod (1; 1% instances), aux (1; 1% instances), cc (1; 1% instances), conj (1; 1% instances)

Children of PART nodes belong to 12 different parts of speech: PART (32; 46% instances), PUNCT (13; 19% instances), NOUN (5; 7% instances), DET (4; 6% instances), ADJ (3; 4% instances), ADP (3; 4% instances), VERB (3; 4% instances), SCONJ (2; 3% instances), ADV (1; 1% instances), CCONJ (1; 1% instances), PRON (1; 1% instances), PROPN (1; 1% instances)