
This page pertains to UD version 2.

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: PROPN

There are 131 PROPN lemmas (8%), 156 PROPN types (6%) and 319 PROPN tokens (2%). Out of 16 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 12 in number of tokens.

The 10 most frequent PROPN lemmas: Zaːr, Tʃokn, Kímsə, Malâːr, Bàːbá, Daːvàrì, Ngas, Tʃôkn, Zəgì, Púʤì

The 10 most frequent PROPN types: Zaːr, Tʃokn, Kímsə, Malâːr, Bàːbá, Ngas, Daːvàrì, Púʤì, Tʃôkn, Gòːdiya

The 10 most frequent ambiguous lemmas: Zaːr (NOUN 29, PROPN 20), Tʃokn (PROPN 16, NOUN 1), Kímsə (PROPN 15, NOUN 2), Daːvàrì (PROPN 8, NOUN 1), Púʤì (PROPN 6, X 1), Súle (PROPN 4, NOUN 1), Kúru (PROPN 2, NOUN 1), Lìːmƙása (PROPN 2, X 1), Gàmbár (NOUN 1, PROPN 1), Lim (X 2, PROPN 1)

The 10 most frequent ambiguous types: Zaːr (PROPN 18, NOUN 16), Tʃokn (PROPN 14, NOUN 1), Kímsə (PROPN 12, NOUN 2), Daːvàrì (PROPN 7, NOUN 1), Púʤì (PROPN 6, X 1), Kúru (PROPN 2, NOUN 1), Lìːmƙása (PROPN 2, X 1), Zaːri (PROPN 2, NOUN 1), Gàmbár (NOUN 1, PROPN 1), Lim (X 2, PROPN 1)
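As an illustration of how such an ambiguity list could be derived, the sketch below groups tokens by lemma and keeps the lemmas that occur with PROPN and at least one other tag. The function name `ambiguous_lemmas` and the toy data are assumptions made for this example; they are not taken from the treebank or from the script that generated this page.

```python
from collections import Counter, defaultdict

def ambiguous_lemmas(tokens, target="PROPN"):
    """Return lemmas seen with `target` and at least one other tag.

    `tokens` is an iterable of (lemma, upos) pairs. The result maps each
    ambiguous lemma to its (tag, count) pairs, most frequent tag first,
    with lemmas ordered by total frequency.
    """
    tags_by_lemma = defaultdict(Counter)
    for lemma, upos in tokens:
        tags_by_lemma[lemma][upos] += 1
    found = {
        lemma: counts.most_common()
        for lemma, counts in tags_by_lemma.items()
        if target in counts and len(counts) > 1
    }
    # Order lemmas by their total number of occurrences, descending.
    return dict(sorted(found.items(), key=lambda kv: -sum(n for _, n in kv[1])))

# Hypothetical toy data, not the real treebank counts.
toy = (
    [("Zaːr", "NOUN")] * 3
    + [("Zaːr", "PROPN")] * 2
    + [("Ngas", "PROPN")] * 2
    + [("Lim", "X"), ("Lim", "X"), ("Lim", "PROPN")]
)
ambiguous = ambiguous_lemmas(toy)
```

The same function would yield the type-level list if it were fed (word form, tag) pairs instead of (lemma, tag) pairs.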

Morphology

The form / lemma ratio of PROPN is 1.190840 (the average of all parts of speech is 1.611418).
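The reported ratio is consistent with dividing the number of PROPN types by the number of PROPN lemmas given above (156 and 131). The following minimal check assumes that this is how the figure is computed; the variable names are illustrative.

```python
# Counts taken from the summary at the top of this page.
propn_types = 156   # distinct PROPN word forms
propn_lemmas = 131  # distinct PROPN lemmas

# Assumption: form / lemma ratio = types / lemmas.
ratio = propn_types / propn_lemmas
print(f"{ratio:.6f}")  # prints 1.190840
```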

The 1st highest number of forms (4) was observed with the lemma “Kímsə”: Kíms, Kímsə, Kímsə́y, Kímsə̂y.

The 2nd highest number of forms (4) was observed with the lemma “Malâːr”: *Maláːri-íː, Maláːri, Maláːrês, Malâːr.

The 3rd highest number of forms (3) was observed with the lemma “Sónde”: Sónde, Sóndey, Sônde.

PROPN occurs with 3 features: Definite (12; 4% instances), Deixis (8; 3% instances), Number (3; 1% instances)

PROPN occurs with 6 feature-value pairs: Definite=Cons, Definite=Def, Definite=Ind, Deixis=Prox, Deixis=Remt, Number=Plur

PROPN occurs with 7 feature combinations. The most frequent feature combination is _ (296 tokens). Examples: Zaːr, Tʃokn, Kímsə, Bàːbá, Malâːr, Ngas, Daːvàrì, Púʤì, Tʃôkn, Gòːdiya

Relations

PROPN nodes are attached to their parents using 19 different relations: compound (54; 17% instances), obj (31; 10% instances), dislocated (27; 8% instances), nsubj (27; 8% instances), nmod (23; 7% instances), obl:arg (22; 7% instances), xcomp (22; 7% instances), root (16; 5% instances), dep (15; 5% instances), nmod:poss (15; 5% instances), obl (15; 5% instances), flat:name (14; 4% instances), conj (10; 3% instances), reparandum (10; 3% instances), vocative (9; 3% instances), appos (4; 1% instances), discourse (2; 1% instances), parataxis (2; 1% instances), advcl (1; 0% instances)

Parents of PROPN nodes belong to 12 different parts of speech: VERB (145; 45% instances), NOUN (99; 31% instances), PROPN (26; 8% instances), no part of speech, i.e. the artificial root (16; 5% instances), PART (15; 5% instances), X (5; 2% instances), PRON (4; 1% instances), INTJ (3; 1% instances), ADP (2; 1% instances), ADV (2; 1% instances), AUX (1; 0% instances), SCONJ (1; 0% instances)

188 (59%) PROPN nodes are leaves.

70 (22%) PROPN nodes have one child.

35 (11%) PROPN nodes have two children.

26 (8%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 6.
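The child-degree figures above can be reproduced in principle by counting, for each PROPN node, how many tokens name it as their head. The sketch below shows one way to do that on a hypothetical toy sentence and then checks that the four reported buckets add up to the 319 PROPN tokens. The function name `child_degree_histogram` and the (id, head, upos) tuple layout are assumptions made for this example.

```python
from collections import Counter

def child_degree_histogram(sentence, target="PROPN"):
    """Map number of children -> number of `target` nodes with that many.

    `sentence` is a list of (id, head, upos) tuples; head 0 is the root.
    """
    # How many tokens point at each id as their head.
    n_children = Counter(head for _, head, _ in sentence)
    return Counter(
        n_children[tok_id] for tok_id, _, upos in sentence if upos == target
    )

# Hypothetical toy sentence, not taken from the treebank.
toy_sentence = [
    (1, 2, "ADP"),
    (2, 3, "PROPN"),  # one child (token 1)
    (3, 0, "VERB"),
    (4, 3, "PROPN"),  # leaf
]
hist = child_degree_histogram(toy_sentence)

# Figures reported on this page.
reported = {"leaves": 188, "one child": 70, "two children": 35, "three or more": 26}
total = sum(reported.values())
shares = {name: round(100 * n / total) for name, n in reported.items()}
```

With the reported counts, `total` is 319 and `shares` gives 59, 22, 11 and 8 percent, matching the figures listed above.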

Children of PROPN nodes are attached using 19 different relations: case (58; 25% instances), punct (55; 24% instances), det (23; 10% instances), discourse (20; 9% instances), flat:name (15; 6% instances), reparandum (15; 6% instances), conj (12; 5% instances), advmod (11; 5% instances), cc (5; 2% instances), parataxis (4; 2% instances), ccomp (3; 1% instances), acl:relcl (2; 1% instances), appos (2; 1% instances), dep (2; 1% instances), nmod:poss (2; 1% instances), advcl (1; 0% instances), cc:preconj (1; 0% instances), dislocated (1; 0% instances), nmod (1; 0% instances)

Children of PROPN nodes belong to 13 different parts of speech: ADP (66; 28% instances), PUNCT (55; 24% instances), DET (26; 11% instances), PROPN (26; 11% instances), PART (14; 6% instances), INTJ (12; 5% instances), NOUN (10; 4% instances), ADV (6; 3% instances), VERB (6; 3% instances), X (5; 2% instances), CCONJ (3; 1% instances), PRON (2; 1% instances), SCONJ (2; 1% instances)