home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Pashto-Sikaram: POS Tags: PROPN

There are 68 PROPN lemmas (6%), 69 PROPN types (5%) and 217 PROPN tokens (4%). Out of 16 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 9 in number of tokens.

The 10 most frequent PROPN lemmas: پښتو, پښتون, اردو, افغانستان, پاړسي, کابل, احمد, بابا, وحید, پیتر

The 10 most frequent PROPN types: پښتو, پښتانه, اردو, افغانستان, پاړسي, پښتنو, کابل, احمد, بابا, وحید

The 10 most frequent ambiguous lemmas: پښتو (PROPN 67, ADJ 2), پاړسي (PROPN 7, ADJ 1), کابل (PROPN 7, NOUN 1), _ (NOUN 21, ADJ 14, VERB 9, X 8, PROPN 3, ADP 2, NUM 2, PART 1, PRON 1, SYM 1), دري (PROPN 3, ADJ 1), کتاب (NOUN 25, PROPN 3), افغان (PROPN 2, ADJ 1, X 1), کوټه (PROPN 2, NOUN 1), انګرېزي (ADJ 1, PROPN 1), ايرانی (ADJ 3, PROPN 1)

The 10 most frequent ambiguous types: پښتو (PROPN 67, ADJ 2), پاړسي (PROPN 7, ADJ 1), کابل (PROPN 7, NOUN 1), دري (PROPN 3, ADJ 2), کتاب (NOUN 4, PROPN 3), افغان (PROPN 2, X 1), انګرېزي (ADJ 2, PROPN 1), روسي (ADJ 1, PROPN 1), قدوري (NOUN 1, PROPN 1), هرات (NOUN 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.014706 (the average of all parts of speech is 1.318390).

The 1st highest number of forms (3) was observed with the lemma “_”: خوشحالخان, عربي, پښتونخوا.

The 2nd highest number of forms (2) was observed with the lemma “پښتون”: پښتانه, پښتنو.

The 3rd highest number of forms (1) was observed with the lemma “آيینه”: ايینې.

PROPN occurs with 3 features: Case (217; 100% instances), Gender (217; 100% instances), Number (217; 100% instances)

PROPN occurs with 10 feature-value pairs: Case=Abl, Case=Acc, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Number=Coll, Number=Plur, Number=Sing

PROPN occurs with 11 feature combinations. The most frequent feature combination is Case=Nom|Gender=Fem|Number=Sing (40 tokens). Examples: پښتو, عربي, دري, پاړسي, اردو, انګرېزي, براون, خلاصه, روسي, قدوري

Relations

PROPN nodes are attached to their parents using 12 different relations: nmod (74; 34% instances), obl (44; 20% instances), nsubj (29; 13% instances), conj (20; 9% instances), obj (20; 9% instances), flat:name (17; 8% instances), nsubj:pass (4; 2% instances), appos (3; 1% instances), acl:relcl (2; 1% instances), root (2; 1% instances), orphan:objobl (1; 0% instances), vocative (1; 0% instances)

Parents of PROPN nodes belong to 6 different parts of speech: VERB (89; 41% instances), NOUN (87; 40% instances), PROPN (33; 15% instances), ADJ (5; 2% instances), (2; 1% instances), ADV (1; 0% instances)

77 (35%) PROPN nodes are leaves.

84 (39%) PROPN nodes have one child.

38 (18%) PROPN nodes have two children.

18 (8%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 7.

Children of PROPN nodes are attached using 21 different relations: case (121; 53% instances), punct (21; 9% instances), conj (20; 9% instances), flat:name (16; 7% instances), cc (14; 6% instances), advmod (5; 2% instances), cop (5; 2% instances), amod (4; 2% instances), parataxis (4; 2% instances), acl:relcl (3; 1% instances), nsubj (3; 1% instances), appos (2; 1% instances), mark (2; 1% instances), nmod (2; 1% instances), orphan:nsubjobj (2; 1% instances), advcl (1; 0% instances), aux:hab (1; 0% instances), det (1; 0% instances), nummod (1; 0% instances), obl (1; 0% instances), xcomp (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: ADP (121; 53% instances), PROPN (33; 14% instances), PUNCT (21; 9% instances), CCONJ (14; 6% instances), NOUN (9; 4% instances), AUX (6; 3% instances), VERB (6; 3% instances), ADJ (5; 2% instances), NUM (5; 2% instances), PART (3; 1% instances), ADV (2; 1% instances), SCONJ (2; 1% instances), DET (1; 0% instances), PRON (1; 0% instances), SYM (1; 0% instances)