This page pertains to UD version 2.

Treebank Statistics: UD_Bororo-BDT: POS Tags: PROPN

There are 18 PROPN lemmas (4%), 28 PROPN types (4%) and 48 PROPN tokens (3%). Out of 16 observed tags, the rank of PROPN is: 7 in number of lemmas, 7 in number of types and 9 in number of tokens.

The 10 most frequent PROPN lemmas: _, João, Maria, Deu, Fabrício, Pedro, Satanae, Adão, Badojeba, Birimodo

The 10 most frequent PROPN types: Fabrício, João, Maria, Deu, Satanae, Birimodo, Cereu, Kogae, Pedro, Adão

The 10 most frequent ambiguous lemmas: _ (VERB 46, NOUN 41, ADV 24, PRON 18, PROPN 17, X 16, ADP 14, PUNCT 13, PART 3, DET 2), Deu (NOUN 14, PROPN 3)

The 10 most frequent ambiguous types: Deu (NOUN 14, PROPN 3), Birimodo (PROPN 2, PRON 1)

Morphology

The form / lemma ratio of PROPN is 1.555556 (the average of all parts of speech is 1.661638).
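The reported ratio is consistent with dividing the number of PROPN types (28) by the number of PROPN lemmas (18), both taken from the summary above. The sketch below reproduces that arithmetic; the function name `form_lemma_ratio` is hypothetical, and how the statistics generator actually computes the value is an assumption.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma (hypothetical helper)."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_types / n_lemmas


# Counts taken from the PROPN summary on this page: 28 types, 18 lemmas.
propn_ratio = form_lemma_ratio(28, 18)
print(f"{propn_ratio:.6f}")  # prints 1.555556
```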

The 1st highest number of forms (15) was observed with the lemma “_”: Akaruio, Apuie, Aroia, Birimodo, Buturegadu, Cereu, Fabrício, Kaboreu, Kogae, Kogebowu, Kuruguga, Manobaru, Meriribaru, Satanae, Tadugo.

The 2nd highest number of forms (2) was observed with the lemma “Pedro”: Pedro, Pedrore.

The 3rd highest number of forms (1) was observed with the lemma “Adão”: Adão.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 8 different relations: nsubj (22; 46% instances), conj (15; 31% instances), obl (3; 6% instances), parataxis (3; 6% instances), nmod (2; 4% instances), compound (1; 2% instances), flat (1; 2% instances), obj (1; 2% instances)
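The percentages here appear to be each relation's count divided by the 48 PROPN tokens, rounded to the nearest whole percent with halves rounded up. This is an assumption about the generator, inferred only from the figures on this page. A minimal sketch that reproduces the listed shares, using a hypothetical helper `percent_half_up` and integer arithmetic to avoid floating-point rounding surprises:

```python
def percent_half_up(count: int, total: int) -> int:
    """Whole-number percentage of count/total, rounding halves up.

    Equivalent to floor(100 * count / total + 0.5), done in integers.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    return (200 * count + total) // (2 * total)


# Relation counts for PROPN nodes, copied from the sentence above.
relations = {
    "nsubj": 22, "conj": 15, "obl": 3, "parataxis": 3,
    "nmod": 2, "compound": 1, "flat": 1, "obj": 1,
}
total = sum(relations.values())  # 48, the number of PROPN tokens

shares = {rel: percent_half_up(n, total) for rel, n in relations.items()}
for rel, pct in shares.items():
    print(f"{rel}: {relations[rel]} ({pct}%)")
```

Because each share is rounded independently, the printed percentages need not sum to exactly 100.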

Parents of PROPN nodes belong to 4 different parts of speech: VERB (23; 48% instances), PROPN (18; 38% instances), NOUN (6; 13% instances), X (1; 2% instances)

23 (48%) PROPN nodes are leaves.

19 (40%) PROPN nodes have one child.

5 (10%) PROPN nodes have two children.

1 (2%) PROPN node has three or more children.

The highest child degree of a PROPN node is 11.

Children of PROPN nodes are attached using 8 different relations: conj (15; 38% instances), punct (13; 33% instances), compound (4; 10% instances), case (3; 8% instances), nsubj (2; 5% instances), dep (1; 3% instances), flat (1; 3% instances), nmod (1; 3% instances)

Children of PROPN nodes belong to 5 different parts of speech: PROPN (18; 45% instances), PUNCT (13; 33% instances), X (4; 10% instances), ADP (3; 8% instances), NOUN (2; 5% instances)
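The child statistics above can be cross-checked against one another. Since only one PROPN node has three or more children and the highest child degree is 11, that node must have 11 children, so the degree distribution implies a total number of child attachments that should match the sums of the relation and part-of-speech counts. The sketch below performs that check; all variable names are illustrative, and the numbers are copied from this page.

```python
# Degree distribution of PROPN nodes, as reported above.
leaves = 23
one_child = 19
two_children = 5
three_or_more = 1
max_degree = 11  # the single node with three or more children

# Child attachments by relation and by part of speech, as reported above.
child_relations = {
    "conj": 15, "punct": 13, "compound": 4, "case": 3,
    "nsubj": 2, "dep": 1, "flat": 1, "nmod": 1,
}
child_pos = {"PROPN": 18, "PUNCT": 13, "X": 4, "ADP": 3, "NOUN": 2}

# Every PROPN token falls into exactly one degree class.
propn_nodes = leaves + one_child + two_children + three_or_more

# Total children implied by the degree distribution.
children_from_degrees = one_child * 1 + two_children * 2 + max_degree

print(propn_nodes, children_from_degrees)
print(sum(child_relations.values()), sum(child_pos.values()))
```

All three totals for child attachments agree, and the node count equals the 48 PROPN tokens reported in the summary.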