
This page pertains to UD version 2.

Treebank Statistics: UD_Pashto-Sikaram: POS Tags: PROPN

There are 19 PROPN lemmas (5%), 19 PROPN types (4%) and 28 PROPN tokens (3%). Out of 14 observed tags, the rank of PROPN is: 5 in number of lemmas, 6 in number of types and 11 in number of tokens.

The 10 most frequent PROPN lemmas: پیتر, اردو, مریم, ټیګور, افغان, امريکا, انګرېزۍ, ایرانی, ایګوازو, براون

The 10 most frequent PROPN types: پیتر, اردو, مریم, ټیګور, افغان, امريکا, انګرېزۍ, ايرانیانو, ایګوازو, براون

The 10 most frequent ambiguous lemmas: none observed.

The 10 most frequent ambiguous types: none observed.

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.198413).
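As a minimal sketch of how such a ratio can be computed, the snippet below assumes the form / lemma ratio is the average number of distinct word forms per lemma. That reading is consistent with the 19 types and 19 lemmas reported above, but the exact definition used to generate this page is not stated here. The function name `form_lemma_ratio` and the sample data are hypothetical.

```python
from collections import defaultdict

def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma.

    `pairs` is an iterable of (form, lemma) tuples for one POS tag.
    This definition is an assumption, not taken from the page itself.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return 0.0
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)

# Hypothetical toy data: every lemma has exactly one form, so the ratio is 1.0.
one_form_each = [("Peter", "Peter"), ("Urdu", "Urdu"), ("Peter", "Peter")]
print(form_lemma_ratio(one_form_each))
```

A ratio of exactly 1 means that no lemma was seen with more than one form under this definition.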

The 1st highest number of forms (1) was observed with the lemma “اردو”: اردو.

The 2nd highest number of forms (1) was observed with the lemma “افغان”: افغان.

The 3rd highest number of forms (1) was observed with the lemma “امريکا”: امريکا.

PROPN occurs with 3 features: Case (28; 100% instances), Gender (28; 100% instances), Number (28; 100% instances)

PROPN occurs with 8 feature-value pairs: Case=Acc, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing

PROPN occurs with 8 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing (8 tokens). Examples: افغان, ایګوازو, حبیبي, سمیس, طلوع, ټیګور, پیتر, ګیتانجلي

Relations

PROPN nodes are attached to their parents using 8 different relations: nmod (10; 36% instances), nsubj (5; 18% instances), flat (4; 14% instances), conj (3; 11% instances), obl (3; 11% instances), nsubj:pass (1; 4% instances), root (1; 4% instances), vocative (1; 4% instances)
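The percentages above can be checked against the raw counts. The sketch below uses the eight relation counts listed in this section and assumes each percentage is the count divided by the 28 PROPN tokens, rounded to the nearest integer. The helper name `percent_of_total` is hypothetical.

```python
def percent_of_total(counts):
    """Map each key to its share of the total, as a rounded integer percentage."""
    total = sum(counts.values())
    return {key: round(100 * value / total) for key, value in counts.items()}

# Counts copied from the relation list above.
parent_relations = {
    "nmod": 10, "nsubj": 5, "flat": 4, "conj": 3,
    "obl": 3, "nsubj:pass": 1, "root": 1, "vocative": 1,
}

print(sum(parent_relations.values()))      # 28, the number of PROPN tokens
print(percent_of_total(parent_relations))
```

Because each value is rounded independently, the percentages need not add up to exactly 100.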

Parents of PROPN nodes belong to 4 different parts of speech: NOUN (13; 46% instances), VERB (8; 29% instances), PROPN (6; 21% instances), no part of speech, i.e. the PROPN node is the root (1; 4% instances)

10 (36%) PROPN nodes are leaves.

11 (39%) PROPN nodes have one child.

3 (11%) PROPN nodes have two children.

4 (14%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 4.
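The child-degree figures above could be reproduced along the following lines. This is a sketch, assuming the treebank is available as CoNLL-U text with the standard ten tab-separated columns (ID in column 1, UPOS in column 4, HEAD in column 7). The function name `child_degree_histogram` and the sample sentence are hypothetical and are not taken from the treebank.

```python
from collections import Counter

def child_degree_histogram(conllu_text, upos="PROPN"):
    """Count how many nodes with the given UPOS have 0, 1, 2, ... children."""
    histogram = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep basic tokens only: skip multiword ranges (1-2) and empty nodes (1.1).
        rows = [row for row in rows if row[0].isdigit()]
        # Number of children per node ID, derived from the HEAD column.
        children = Counter(row[6] for row in rows)
        for row in rows:
            if row[3] == upos:
                histogram[children[row[0]]] += 1
    return histogram

def _row(*fields):
    return "\t".join(fields)

# Hypothetical sentence: "Peter and Mary visit London"
SAMPLE = "\n".join([
    "# text = Peter and Mary visit London",
    _row("1", "Peter", "Peter", "PROPN", "_", "_", "4", "nsubj", "_", "_"),
    _row("2", "and", "and", "CCONJ", "_", "_", "3", "cc", "_", "_"),
    _row("3", "Mary", "Mary", "PROPN", "_", "_", "1", "conj", "_", "_"),
    _row("4", "visit", "visit", "VERB", "_", "_", "0", "root", "_", "_"),
    _row("5", "London", "London", "PROPN", "_", "_", "4", "obj", "_", "_"),
])

histogram = child_degree_histogram(SAMPLE)
print(dict(histogram))
```

In the sample, "Peter" and "Mary" each have one child and "London" is a leaf, so the histogram records two nodes of degree 1 and one of degree 0.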

Children of PROPN nodes are attached using 8 different relations: case (12; 40% instances), conj (4; 13% instances), cc (3; 10% instances), flat (3; 10% instances), punct (3; 10% instances), advmod (2; 7% instances), orphan:nsubjobj (2; 7% instances), appos (1; 3% instances)

Children of PROPN nodes belong to 7 different parts of speech: ADP (12; 40% instances), PROPN (6; 20% instances), CCONJ (3; 10% instances), NOUN (3; 10% instances), PUNCT (3; 10% instances), PART (2; 7% instances), VERB (1; 3% instances)