home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Breton-KEB: POS Tags: PROPN

There are 112 PROPN lemmas (6%), 120 PROPN types (5%) and 308 PROPN tokens (3%). Out of 16 observed tags, the rank of PROPN is: 6 in number of lemmas, 6 in number of types and 9 in number of tokens.

The 10 most frequent PROPN lemmas: Breizh, Yann, Yannig, Lenaig, Pariz, Frañs, Kemper, Kembre, Naoned, Brezhoneg

The 10 most frequent PROPN types: Breizh, Yann, Yannig, Lenaig, Pariz, Frañs, Kembre, Naoned, Brezhoneg, Divi

The 10 most frequent ambiguous lemmas: Ofis (X 5, PROPN 1)

The 10 most frequent ambiguous types: Brezhoneg (PROPN 5, NOUN 1), Bremañ (ADV 11, PROPN 2), Kemener (PROPN 2, NOUN 1), Ofis (X 5, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.071429 (the average of all parts of speech is 1.406011).

The 1st highest number of forms (3) was observed with the lemma “Pariz”: Bariz, Paris, Pariz.

The 2nd highest number of forms (2) was observed with the lemma “Breizh”: Breizh, Vreizh.

The 3rd highest number of forms (2) was observed with the lemma “Gwened”: Gwened, Wened.

PROPN occurs with 2 features: Number (308; 100% instances), Gender (107; 35% instances)

PROPN occurs with 4 feature-value pairs: Gender=Fem, Gender=Masc, Number=Plur, Number=Sing

PROPN occurs with 4 feature combinations. The most frequent feature combination is Number=Sing (200 tokens). Examples: Breizh, Yann, Pariz, Frañs, Kembre, Yannig, Lenaig, Naoned, Brezhoneg, Europa

Relations

PROPN nodes are attached to their parents using 15 different relations: nsubj (88; 29% instances), nmod:gen (67; 22% instances), obl (53; 17% instances), nmod (21; 7% instances), flat:name (19; 6% instances), conj (16; 5% instances), obl:agent (12; 4% instances), obj (10; 3% instances), root (8; 3% instances), appos (7; 2% instances), fixed:name (2; 1% instances), nmod:poss (2; 1% instances), dep (1; 0% instances), discourse (1; 0% instances), dislocated (1; 0% instances)

Parents of PROPN nodes belong to 8 different parts of speech: VERB (145; 47% instances), NOUN (113; 37% instances), PROPN (33; 11% instances), (8; 3% instances), ADJ (5; 2% instances), PRON (2; 1% instances), ADV (1; 0% instances), NUM (1; 0% instances)

161 (52%) PROPN nodes are leaves.

98 (32%) PROPN nodes have one child.

27 (9%) PROPN nodes have two children.

22 (7%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 10.

Children of PROPN nodes are attached using 19 different relations: case (95; 39% instances), punct (36; 15% instances), dep (18; 7% instances), conj (17; 7% instances), flat:name (17; 7% instances), det (12; 5% instances), appos (6; 2% instances), cc (6; 2% instances), cop (5; 2% instances), nmod:gen (5; 2% instances), advmod (4; 2% instances), aux (4; 2% instances), nsubj (4; 2% instances), nmod (3; 1% instances), obl (3; 1% instances), acl (2; 1% instances), amod (2; 1% instances), fixed:name (2; 1% instances), parataxis (1; 0% instances)

Children of PROPN nodes belong to 14 different parts of speech: ADP (95; 39% instances), PUNCT (36; 15% instances), PROPN (33; 14% instances), NOUN (20; 8% instances), X (18; 7% instances), DET (11; 5% instances), VERB (9; 4% instances), CCONJ (6; 2% instances), ADV (4; 2% instances), ADJ (3; 1% instances), PART (3; 1% instances), NUM (2; 1% instances), INTJ (1; 0% instances), PRON (1; 0% instances)