
This page pertains to UD version 2.

Treebank Statistics: UD_Ruuli-RDT: POS Tags: PROPN

There are 63 PROPN lemmas (5%), 71 PROPN types (3%) and 96 PROPN tokens (2%). Out of 16 observed tags, the rank of PROPN is: 4 in number of lemmas, 6 in number of types and 14 in number of tokens.

The 10 most frequent PROPN lemmas: Kampala, Nakasongola, Namwera, Bunyala, DISO, Saito, Uganda, munankore, Buduuli, Buganda

The 10 most frequent PROPN types: Kampala, Nakasongola, Namwera, Uganda, Buduuli, Buganda, Bunyala, Cobb, Mal, Museveni

The 10 most frequent ambiguous lemmas: Museveni (PROPN 2, NOUN 1), Muganda (NOUN 1, PROPN 1), kanca (NOUN 5, PROPN 1)

The 10 most frequent ambiguous types: Okanca (NOUN 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.126984 (the average of all parts of speech is 2.036596).
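
The reported ratio is consistent with dividing the number of PROPN types (71) by the number of PROPN lemmas (63) given above. A minimal sketch of that arithmetic, assuming this is how the figure is derived; the function name `form_lemma_ratio` is illustrative and not part of any UD tool:

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


# Counts taken from the summary at the top of this page.
propn_types = 71
propn_lemmas = 63

print(f"{form_lemma_ratio(propn_types, propn_lemmas):.6f}")  # prints 1.126984
```

A value close to 1 means that most PROPN lemmas appear in a single form, which agrees with the maximum of 2 forms per lemma reported below.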

The 1st highest number of forms (2) was observed with the lemma “Bunyala”: Bunyala, oBunyala.

The 2nd highest number of forms (2) was observed with the lemma “DISO”: ODISO, oDISO.

The 3rd highest number of forms (2) was observed with the lemma “Kyoga”: OKyoga, oKyoga.

PROPN occurs with 3 features: NounClass (96; 100% instances), Referent (35; 36% instances), Abbr (5; 5% instances)

PROPN occurs with 8 feature-value pairs: Abbr=Yes, NounClass=Bantu1, NounClass=Bantu11, NounClass=Bantu14, NounClass=Bantu2, NounClass=Bantu3, NounClass=Bantu9, Referent=Yes

PROPN occurs with 12 feature combinations. The most frequent feature combination is NounClass=Bantu1 (38 tokens). Examples: Kampala, Nakasongola, Namwera, Cobb, Mal, Museveni, Saito, Uganda, tereka, Amin

Relations

PROPN nodes are attached to their parents using 14 different relations: nsubj (19; 20% instances), root (13; 14% instances), nmod (12; 13% instances), obj (12; 13% instances), obl (12; 13% instances), flat:name (6; 6% instances), nmod:poss (6; 6% instances), vocative (5; 5% instances), appos (4; 4% instances), flat (3; 3% instances), conj (1; 1% instances), discourse (1; 1% instances), dislocated (1; 1% instances), parataxis (1; 1% instances)
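
The percentages in this list can be reproduced from the raw counts. The sketch below assumes that each share is the count divided by the total number of PROPN tokens, rounded half up to a whole percent. The page does not state its rounding rule, so this is an assumption that happens to match every listed value; `percent_half_up` is a hypothetical helper name.

```python
def percent_half_up(count: int, total: int) -> int:
    """Integer percentage of count in total, rounding halves upward."""
    return (200 * count + total) // (2 * total)


# Relation counts copied from the list above.
relation_counts = {
    "nsubj": 19, "root": 13, "nmod": 12, "obj": 12, "obl": 12,
    "flat:name": 6, "nmod:poss": 6, "vocative": 5, "appos": 4, "flat": 3,
    "conj": 1, "discourse": 1, "dislocated": 1, "parataxis": 1,
}
total = sum(relation_counts.values())  # 96, the number of PROPN tokens

for relation, count in relation_counts.items():
    print(f"{relation} ({count}; {percent_half_up(count, total)}% instances)")
```

Integer arithmetic is used because 12 out of 96 is exactly 12.5%, and Python's built-in `round` would turn that into 12 (it rounds halves to the nearest even number), whereas the list reports 13%.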

Parents of PROPN nodes belong to 6 different parts of speech: VERB (44; 46% instances), NOUN (28; 29% instances), no part of speech, i.e. the artificial root (13; 14% instances), AUX (4; 4% instances), PROPN (4; 4% instances), PRON (3; 3% instances)

38 (40%) PROPN nodes are leaves.

38 (40%) PROPN nodes have one child.

11 (11%) PROPN nodes have two children.

9 (9%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 5.
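
The child-degree figures above (leaves, one child, two children, and so on) could be recomputed from a CoNLL-U file along the following lines. This is a minimal sketch rather than the script that generated this page. It relies only on the standard 10-column CoNLL-U layout (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC). The function name `child_degrees` and the toy sentence with placeholder forms are invented for illustration and are not taken from UD_Ruuli-RDT.

```python
from collections import Counter


def child_degrees(conllu_text: str, upos: str = "PROPN") -> Counter:
    """Map number of children -> number of nodes with the given UPOS tag."""
    degrees = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep basic tree nodes only: skip multiword ranges (1-2)
        # and empty nodes (1.1), whose IDs are not plain integers.
        rows = [r for r in rows if r[0].isdigit()]
        n_children = Counter(r[6] for r in rows)  # column 7 is HEAD
        for r in rows:
            if r[3] == upos:  # column 4 is UPOS
                degrees[n_children[r[0]]] += 1
    return degrees


# Toy sentence with placeholder forms (hypothetical data).
toy_rows = [
    ("1", "w1", "w1", "ADP", "_", "_", "2", "case", "_", "_"),
    ("2", "w2", "w2", "PROPN", "_", "_", "3", "obl", "_", "_"),
    ("3", "w3", "w3", "VERB", "_", "_", "0", "root", "_", "_"),
    ("4", "w4", "w4", "PROPN", "_", "_", "3", "nsubj", "_", "_"),
]
SAMPLE = "# text = toy\n" + "\n".join("\t".join(r) for r in toy_rows) + "\n\n"

print(sorted(child_degrees(SAMPLE).items()))  # prints [(0, 1), (1, 1)]
```

In the toy sentence one PROPN node is a leaf and the other has a single `case` child, so the result has one node at degree 0 and one at degree 1. As a consistency check on the page itself, the four degree classes above sum to 38 + 38 + 11 + 9 = 96, the total number of PROPN tokens.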

Children of PROPN nodes are attached using 21 different relations: case (38; 41% instances), punct (13; 14% instances), cop (6; 6% instances), flat:name (5; 5% instances), acl:relcl (4; 4% instances), det (4; 4% instances), nmod:desc (4; 4% instances), nsubj (4; 4% instances), advmod (2; 2% instances), nmod:poss (2; 2% instances), acl (1; 1% instances), advcl (1; 1% instances), advmod:emph (1; 1% instances), amod (1; 1% instances), cc (1; 1% instances), conj (1; 1% instances), csubj (1; 1% instances), discourse (1; 1% instances), flat (1; 1% instances), mark (1; 1% instances), parataxis (1; 1% instances)

Children of PROPN nodes belong to 14 different parts of speech: PART (22; 24% instances), ADP (17; 18% instances), PUNCT (13; 14% instances), NOUN (11; 12% instances), AUX (6; 6% instances), VERB (5; 5% instances), ADV (4; 4% instances), DET (4; 4% instances), PROPN (4; 4% instances), PRON (3; 3% instances), ADJ (1; 1% instances), CCONJ (1; 1% instances), INTJ (1; 1% instances), SCONJ (1; 1% instances)