home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hausa-SouthernAutogramm: POS Tags: PROPN

There are 112 PROPN lemmas (8%), 122 PROPN types (6%) and 304 PROPN tokens (2%). Out of 16 observed tags, the rank of PROPN is: 3 in number of lemmas, 3 in number of types and 12 in number of tokens.

The 10 most frequent PROPN lemmas: Allàː, Fulàːniː, Basaːwaː, Basaːwa, Bàːsa, Ùngwan, Dattì, Mângajàː, Gwànjoː, Maːlàmiː

The 10 most frequent PROPN types: Allàː, Basaːwaː, Fulàːniː, Basaːwa, Ùngwan, Bàːsa, Dattì, Gwànjoː, Mângajàː, Bàːsân

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: Muːsaː (PROPN 3, NOUN 1), Ɗan (NOUN 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.089286 (the average of all parts of speech is 1.357040).

The 1st highest number of forms (3) was observed with the lemma “Fulàːniː”: Filàːniː, Fulàːniː, Fulàːnîn.

The 2nd highest number of forms (2) was observed with the lemma “Allàː”: Allàn, Allàː.

The 3rd highest number of forms (2) was observed with the lemma “Bàtuːr̃èː”: Bàtuːr̃èn, Bàtuːr̃èː.

PROPN occurs with 4 features: Definite (30; 10% instances), Number (11; 4% instances), ExtPos (2; 1% instances), Foreign (2; 1% instances)

PROPN occurs with 5 feature-value pairs: Definite=Cons, Definite=Def, ExtPos=NOUN, Foreign=Yes, Number=Plur

PROPN occurs with 8 feature combinations. The most frequent feature combination is _ (263 tokens). Examples: Allàː, Basaːwa, Basaːwaː, Bàːsa, Dattì, Fulàːniː, Gwànjoː, Mângajàː, Maːlàmiː, Mài

Relations

PROPN nodes are attached to their parents using 19 different relations: flat:name (46; 15% instances), root (43; 14% instances), nmod (38; 13% instances), nsubj (37; 12% instances), appos (23; 8% instances), compound (23; 8% instances), dislocated (19; 6% instances), obl:arg (17; 6% instances), obl (15; 5% instances), obj (12; 4% instances), conj (9; 3% instances), xcomp (7; 2% instances), vocative (5; 2% instances), acl:relcl (2; 1% instances), dep (2; 1% instances), discourse (2; 1% instances), reparandum (2; 1% instances), ccomp (1; 0% instances), parataxis (1; 0% instances)

Parents of PROPN nodes belong to 11 different parts of speech: VERB (81; 27% instances), NOUN (77; 25% instances), PROPN (63; 21% instances), (43; 14% instances), PRON (15; 5% instances), PART (10; 3% instances), ADV (8; 3% instances), AUX (4; 1% instances), ADJ (1; 0% instances), INTJ (1; 0% instances), X (1; 0% instances)

141 (46%) PROPN nodes are leaves.

67 (22%) PROPN nodes have one child.

61 (20%) PROPN nodes have two children.

35 (12%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 6.

Children of PROPN nodes are attached using 21 different relations: punct (80; 25% instances), case (54; 17% instances), flat:name (44; 14% instances), discourse (26; 8% instances), det (21; 7% instances), dislocated (11; 3% instances), conj (10; 3% instances), nmod (10; 3% instances), advmod (9; 3% instances), appos (9; 3% instances), cop (9; 3% instances), cc (7; 2% instances), nsubj (7; 2% instances), advcl:cleft (6; 2% instances), acl:relcl (5; 2% instances), dep (4; 1% instances), reparandum (4; 1% instances), cc:preconj (2; 1% instances), compound (2; 1% instances), flat:foreign (2; 1% instances), amod (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: PUNCT (80; 25% instances), PROPN (63; 20% instances), ADP (57; 18% instances), NOUN (22; 7% instances), DET (21; 7% instances), PART (19; 6% instances), PRON (13; 4% instances), CCONJ (11; 3% instances), AUX (9; 3% instances), INTJ (7; 2% instances), ADV (6; 2% instances), VERB (6; 2% instances), X (5; 2% instances), ADJ (2; 1% instances), SCONJ (2; 1% instances)