home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hausa-SouthernAutogramm: POS Tags: PROPN

There are 112 PROPN lemmas (8%), 122 PROPN types (7%) and 301 PROPN tokens (2%). Out of 16 observed tags, the rank of PROPN is: 3 in number of lemmas, 3 in number of types and 12 in number of tokens.

The 10 most frequent PROPN lemmas: Allàː, Fulàːniː, Basaːwaː, Basaːwa, Bàːsa, Ùngwan, Dattì, Mângajàː, Gwànjoː, Maːlàmiː

The 10 most frequent PROPN types: Allàː, Basaːwaː, Fulàːniː, Basaːwa, Ùngwan, Bàːsa, Dattì, Gwànjoː, Mângajàː, Bàːsân

The 10 most frequent ambiguous lemmas: Allàː (PROPN 34, X 1), Ùngwan (PROPN 9, X 1), Riːmiː (PROPN 1, X 1)

The 10 most frequent ambiguous types: Allàː (PROPN 33, X 1), Ùngwan (PROPN 11, X 1), Muːsaː (PROPN 3, NOUN 1), Riːmiː (PROPN 1, X 1), Ɗan (NOUN 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.089286 (the average of all parts of speech is 1.303635).

The 1st highest number of forms (3) was observed with the lemma “Fulàːniː”: Filàːniː, Fulàːniː, Fulàːnîn.

The 2nd highest number of forms (2) was observed with the lemma “Allàː”: Allàn, Allàː.

The 3rd highest number of forms (2) was observed with the lemma “Bàtuːr̃èː”: Bàtuːr̃èn, Bàtuːr̃èː.

PROPN occurs with 4 features: Definite (29; 10% instances), Number (11; 4% instances), ExtPos (2; 1% instances), Foreign (2; 1% instances)

PROPN occurs with 5 feature-value pairs: Definite=Cons, Definite=Def, ExtPos=NOUN, Foreign=Yes, Number=Plur

PROPN occurs with 8 feature combinations. The most frequent feature combination is _ (261 tokens). Examples: Allàː, Basaːwa, Basaːwaː, Bàːsa, Dattì, Fulàːniː, Gwànjoː, Mângajàː, Maːlàmiː, Mài

Relations

PROPN nodes are attached to their parents using 19 different relations: flat:name (45; 15% instances), nmod (39; 13% instances), nsubj (35; 12% instances), root (34; 11% instances), obl:arg (26; 9% instances), appos (24; 8% instances), compound (23; 8% instances), dislocated (17; 6% instances), xcomp (14; 5% instances), obj (10; 3% instances), conj (9; 3% instances), discourse (7; 2% instances), obl (5; 2% instances), vocative (5; 2% instances), acl:relcl (2; 1% instances), dep (2; 1% instances), reparandum (2; 1% instances), ccomp (1; 0% instances), parataxis (1; 0% instances)

Parents of PROPN nodes belong to 10 different parts of speech: VERB (79; 26% instances), NOUN (76; 25% instances), PROPN (62; 21% instances), (34; 11% instances), PART (23; 8% instances), PRON (11; 4% instances), ADV (7; 2% instances), AUX (7; 2% instances), INTJ (1; 0% instances), X (1; 0% instances)

147 (49%) PROPN nodes are leaves.

68 (23%) PROPN nodes have one child.

58 (19%) PROPN nodes have two children.

28 (9%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 5.

Children of PROPN nodes are attached using 20 different relations: punct (70; 25% instances), case (54; 19% instances), flat:name (43; 15% instances), det (21; 7% instances), discourse (18; 6% instances), advmod (12; 4% instances), nmod (11; 4% instances), appos (9; 3% instances), conj (8; 3% instances), nsubj (8; 3% instances), dislocated (7; 2% instances), acl:relcl (5; 2% instances), cc (5; 2% instances), reparandum (4; 1% instances), dep (3; 1% instances), advcl:cleft (2; 1% instances), cop (2; 1% instances), amod (1; 0% instances), cc:preconj (1; 0% instances), compound (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: PUNCT (70; 25% instances), PROPN (62; 22% instances), PART (41; 14% instances), ADP (34; 12% instances), DET (21; 7% instances), NOUN (19; 7% instances), CCONJ (7; 2% instances), PRON (7; 2% instances), INTJ (6; 2% instances), VERB (6; 2% instances), ADV (4; 1% instances), ADJ (2; 1% instances), AUX (2; 1% instances), SCONJ (2; 1% instances), X (2; 1% instances)