home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hausa-SouthernAutogramm: POS Tags: PROPN

There are 112 PROPN lemmas (8%), 122 PROPN types (6%) and 304 PROPN tokens (2%). Out of 16 observed tags, the rank of PROPN is: 3 in number of lemmas, 3 in number of types and 12 in number of tokens.

The 10 most frequent PROPN lemmas: Allàː, Fulàːniː, Basaːwaː, Basaːwa, Bàːsa, Ùngwan, Dattì, Mângajàː, Gwànjoː, Maːlàmiː

The 10 most frequent PROPN types: Allàː, Basaːwaː, Fulàːniː, Basaːwa, Ùngwan, Bàːsa, Dattì, Gwànjoː, Mângajàː, Bàːsân

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: Muːsaː (PROPN 3, NOUN 1), Ɗan (NOUN 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.089286 (the average of all parts of speech is 1.352436).

The 1st highest number of forms (3) was observed with the lemma “Fulàːniː”: Filàːniː, Fulàːniː, Fulàːnîn.

The 2nd highest number of forms (2) was observed with the lemma “Allàː”: Allàn, Allàː.

The 3rd highest number of forms (2) was observed with the lemma “Bàtuːr̃èː”: Bàtuːr̃èn, Bàtuːr̃èː.

PROPN occurs with 3 features: Definite (29; 10% instances), Number (11; 4% instances), Foreign (2; 1% instances)

PROPN occurs with 4 feature-value pairs: Definite=Cons, Definite=Def, Foreign=Yes, Number=Plur

PROPN occurs with 7 feature combinations. The most frequent feature combination is _ (266 tokens). Examples: Allàː, Basaːwaː, Basaːwa, Bàːsa, Dattì, Fulàːniː, Gwànjoː, Mângajàː, Maːlàmiː, Mài

Relations

PROPN nodes are attached to their parents using 21 different relations: flat:name (46; 15% instances), root (43; 14% instances), nsubj (37; 12% instances), nmod (36; 12% instances), appos (23; 8% instances), compound (23; 8% instances), dislocated (19; 6% instances), obl:arg (17; 6% instances), obj (12; 4% instances), conj (9; 3% instances), obl (9; 3% instances), xcomp (7; 2% instances), obl:mod (6; 2% instances), vocative (5; 2% instances), acl:relcl (2; 1% instances), dep (2; 1% instances), discourse (2; 1% instances), nmod:poss (2; 1% instances), reparandum (2; 1% instances), ccomp (1; 0% instances), parataxis (1; 0% instances)

Parents of PROPN nodes belong to 11 different parts of speech: VERB (81; 27% instances), NOUN (76; 25% instances), PROPN (63; 21% instances), (43; 14% instances), PRON (15; 5% instances), PART (11; 4% instances), ADV (8; 3% instances), AUX (4; 1% instances), ADJ (1; 0% instances), INTJ (1; 0% instances), X (1; 0% instances)

141 (46%) PROPN nodes are leaves.

68 (22%) PROPN nodes have one child.

60 (20%) PROPN nodes have two children.

35 (12%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 6.

Children of PROPN nodes are attached using 21 different relations: punct (80; 25% instances), case (54; 17% instances), flat:name (44; 14% instances), discourse (26; 8% instances), det (21; 7% instances), advmod (12; 4% instances), dislocated (11; 3% instances), appos (10; 3% instances), conj (10; 3% instances), nmod (8; 2% instances), cc (7; 2% instances), nsubj (7; 2% instances), advcl:cleft (6; 2% instances), cop (5; 2% instances), acl:relcl (4; 1% instances), aux (4; 1% instances), reparandum (4; 1% instances), dep (3; 1% instances), cc:preconj (2; 1% instances), flat:foreign (2; 1% instances), amod (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: PUNCT (80; 25% instances), PROPN (63; 20% instances), PART (46; 14% instances), ADP (33; 10% instances), DET (21; 7% instances), NOUN (18; 6% instances), PRON (14; 4% instances), CCONJ (11; 3% instances), AUX (9; 3% instances), INTJ (7; 2% instances), ADV (5; 2% instances), VERB (5; 2% instances), X (5; 2% instances), ADJ (2; 1% instances), SCONJ (2; 1% instances)