home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-GSD: POS Tags: PROPN

There are 15348 PROPN lemmas (43%), 15350 PROPN types (33%) and 27693 PROPN tokens (7%). Out of 16 observed tags, the rank of PROPN is: 1 in number of lemmas, 1 in number of types and 6 in number of tokens.

The 10 most frequent PROPN lemmas: France, Paris, Europe, États-Unis, de, Jean, Maroc, Espagne, New, York

The 10 most frequent PROPN types: France, Paris, Europe, États-Unis, de, Jean, Maroc, Espagne, New, York

The 10 most frequent ambiguous lemmas: Europe (PROPN 126, X 2), de (ADP 31302, PROPN 113, X 32), New (PROPN 63, X 5), York (PROPN 62, X 3), Canada (PROPN 54, X 1), San (PROPN 46, X 1), saint (NOUN 15, ADJ 10, PROPN 10, X 1), John (PROPN 45, X 1), The (X 53, PROPN 32), le (DET 43299, PROPN 12, X 2)

The 10 most frequent ambiguous types: Europe (PROPN 126, X 2), de (ADP 26313, DET 391, PROPN 113, X 32, ADV 1), New (PROPN 63, X 5), York (PROPN 62, X 3), Canada (PROPN 54, X 1), San (PROPN 46, X 1), saint (PROPN 10, NOUN 8, ADJ 4), John (PROPN 45, X 1), The (X 53, PROPN 32), le (DET 13756, PRON 281, PROPN 12, X 2)

Morphology

The form / lemma ratio of PROPN is 1.000130 (the average of all parts of speech is 1.309093).

The 1st highest number of forms (2) was observed with the lemma “Côte”: COTE, Côte.

The 2nd highest number of forms (2) was observed with the lemma “Gaule”: Gaule, Gaulle.

The 3rd highest number of forms (2) was observed with the lemma “Ivoire”: IVOIRE, Ivoire.

PROPN occurs with 4 features: Number (4979; 18% instances), Gender (3214; 12% instances), ExtPos (23; 0% instances), Typo (13; 0% instances)

PROPN occurs with 7 feature-value pairs: ExtPos=ADJ, ExtPos=PROPN, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Typo=Yes

PROPN occurs with 17 feature combinations. The most frequent feature combination is _ (22691 tokens). Examples: France, Paris, de, Jean, Europe, York, New, Pierre, Charles, Louis

Relations

PROPN nodes are attached to their parents using 24 different relations: nmod (7717; 28% instances), flat:name (6324; 23% instances), appos (3336; 12% instances), nsubj (3065; 11% instances), conj (2438; 9% instances), obl:mod (1618; 6% instances), obl:arg (1310; 5% instances), obj (617; 2% instances), obl:agent (508; 2% instances), nsubj:pass (275; 1% instances), xcomp (189; 1% instances), root (157; 1% instances), nsubj:caus (25; 0% instances), advcl (24; 0% instances), parataxis (24; 0% instances), orphan (22; 0% instances), acl:relcl (12; 0% instances), ccomp (7; 0% instances), dislocated (7; 0% instances), obl (6; 0% instances), obj:agent (5; 0% instances), vocative (5; 0% instances), acl (1; 0% instances), advcl:cleft (1; 0% instances)

Parents of PROPN nodes belong to 13 different parts of speech: NOUN (10923; 39% instances), PROPN (9608; 35% instances), VERB (6290; 23% instances), ADJ (278; 1% instances), (157; 1% instances), PRON (147; 1% instances), ADV (96; 0% instances), X (87; 0% instances), NUM (67; 0% instances), ADP (21; 0% instances), SYM (12; 0% instances), DET (4; 0% instances), INTJ (3; 0% instances)

9048 (33%) PROPN nodes are leaves.

8135 (29%) PROPN nodes have one child.

6416 (23%) PROPN nodes have two children.

4094 (15%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 34.

Children of PROPN nodes are attached using 28 different relations: case (11512; 32% instances), flat:name (6609; 18% instances), det (4804; 13% instances), punct (4446; 12% instances), conj (2493; 7% instances), nmod (2065; 6% instances), cc (1389; 4% instances), appos (1046; 3% instances), amod (424; 1% instances), acl (401; 1% instances), acl:relcl (375; 1% instances), advmod (205; 1% instances), cop (130; 0% instances), nsubj (100; 0% instances), nummod (39; 0% instances), orphan (31; 0% instances), advcl:cleft (26; 0% instances), expl:subj (26; 0% instances), mark (25; 0% instances), parataxis (23; 0% instances), obl:mod (21; 0% instances), advcl (3; 0% instances), dep (2; 0% instances), dislocated (2; 0% instances), aux:tense (1; 0% instances), discourse (1; 0% instances), nsubj:outer (1; 0% instances), parataxis:insert (1; 0% instances)

Children of PROPN nodes belong to 16 different parts of speech: ADP (11481; 32% instances), PROPN (9608; 27% instances), DET (4804; 13% instances), PUNCT (4446; 12% instances), NOUN (1620; 4% instances), CCONJ (1355; 4% instances), VERB (794; 2% instances), NUM (640; 2% instances), X (464; 1% instances), ADJ (453; 1% instances), ADV (184; 1% instances), AUX (132; 0% instances), PRON (106; 0% instances), SYM (59; 0% instances), SCONJ (54; 0% instances), INTJ (1; 0% instances)