This page pertains to UD version 2.

Treebank Statistics: UD_French-PUD: POS Tags: PROPN

There are 883 PROPN lemmas (18%), 883 PROPN types (14%) and 1272 PROPN tokens (5%). Out of 15 observed tags, the rank of PROPN is: 2 in number of lemmas, 4 in number of types and 7 in number of tokens.
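The lemma, type and token counts above can be reproduced from a CoNLL-U file along the following lines. This is a minimal sketch, not the script that generated this page: the `SAMPLE` fragment is hand-made for illustration (it is not taken from UD_French-PUD), `read_tokens` and `tag_summary` are hypothetical helper names, and it assumes that "types" means distinct, case-sensitive word forms.

```python
# Tiny hand-made CoNLL-U fragment (illustrative only, not from UD_French-PUD).
SAMPLE = """\
1\tLa\tle\tDET\t_\t_\t2\tdet\t_\t_
2\tChine\tChine\tPROPN\t_\t_\t3\tnsubj\t_\t_
3\tcommerce\tcommercer\tVERB\t_\t_\t0\troot\t_\t_
4\tavec\tavec\tADP\t_\t_\t5\tcase\t_\t_
5\tTrump\tTrump\tPROPN\t_\t_\t3\tobl\t_\t_
6\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_

1\tLa\tle\tDET\t_\t_\t2\tdet\t_\t_
2\tChine\tChine\tPROPN\t_\t_\t3\tnsubj\t_\t_
3\tgrandit\tgrandir\tVERB\t_\t_\t0\troot\t_\t_
"""


def read_tokens(text):
    """Yield (form, lemma, upos) for every basic token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        yield cols[1], cols[2], cols[3]


def tag_summary(text, tag):
    """Count distinct lemmas, distinct forms (types) and tokens for one tag."""
    tokens = list(read_tokens(text))
    tagged = [(form, lemma) for form, lemma, upos in tokens if upos == tag]
    return {
        "lemmas": len({lemma for _, lemma in tagged}),
        "types": len({form for form, _ in tagged}),
        "tokens": len(tagged),
        # Share of all tokens, rounded to a whole percent.
        "token_share_pct": round(100 * len(tagged) / len(tokens)),
    }


summary = tag_summary(SAMPLE, "PROPN")
print(summary)
```

In the sample, "Chine" occurs twice and "Trump" once, so the sketch reports 2 lemmas, 2 types and 3 tokens for PROPN.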

The 10 most frequent PROPN lemmas: Chine, Trump, J.-C., États-Unis, Amérique, Europe, Australie, France, Italie, Afrique

The 10 most frequent PROPN types: Chine, Trump, J.-C., États-Unis, Amérique, Europe, Australie, France, Italie, Afrique

The 10 most frequent ambiguous lemmas: BBC (PROPN 3, X 1), Danevirke (PROPN 3, NOUN 1), Disney (PROPN 3, X 1), Ontario (PROPN 3, X 1), Walt (PROPN 3, X 1), York (PROPN 3, X 1), Balkans (PROPN 2, NOUN 1), Saint (PROPN 2, ADJ 1), film (NOUN 17, PROPN 2), lune (NOUN 2, PROPN 2)

The 10 most frequent ambiguous types: BBC (PROPN 3, X 1), Danevirke (PROPN 3, NOUN 1), Disney (PROPN 3, X 1), Ontario (PROPN 3, X 1), Walt (PROPN 3, X 1), York (PROPN 3, X 1), Balkans (PROPN 2, NOUN 1), Saint (PROPN 2, ADJ 1), Terre (NOUN 2, PROPN 2), lune (NOUN 2, PROPN 2)

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.298345).
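One plausible reading of the form / lemma ratio is the number of distinct (lemma, form) pairs divided by the number of distinct lemmas. That reading is consistent with the 883 lemmas and 883 types reported for PROPN, but it is an assumption, and `form_lemma_ratio` is a hypothetical helper rather than the code behind this page.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma.

    `pairs` is an iterable of (form, lemma) tuples for one part of speech.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        raise ValueError("no tokens given")
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)


# Illustrative data only: proper nouns rarely inflect, verbs do.
propn_pairs = [("Chine", "Chine"), ("Trump", "Trump"), ("Chine", "Chine")]
verb_pairs = [("mange", "manger"), ("mangent", "manger"), ("va", "aller")]

propn_ratio = form_lemma_ratio(propn_pairs)
verb_ratio = form_lemma_ratio(verb_pairs)
print(f"{propn_ratio:.6f} {verb_ratio:.6f}")
```

A ratio of exactly 1 means that no lemma appears with more than one surface form, which is the situation reported for PROPN here.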

Because the ratio is exactly 1, every PROPN lemma occurs with a single form and the ranking by number of forms is a tie. The first three lemmas listed are “AKP” (form: AKP), “Abakumov” (form: Abakumov) and “Abbotsford” (form: Abbotsford).

PROPN occurs with 2 features: Number (1221; 96% instances), Gender (970; 76% instances)

PROPN occurs with 4 feature-value pairs: Gender=Fem, Gender=Masc, Number=Plur, Number=Sing

PROPN occurs with 7 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing (592 tokens). Examples: Trump, J.-C., Joseph, Donald, Gerry, Cameroun, Edgar, Mexique, Rafferty, Richard
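The feature counts above come from the FEATS column, where each token carries either `_` or a `|`-separated list of `Name=Value` pairs. The sketch below shows how features, feature-value pairs and feature combinations could be tallied. The input list is made up for illustration, `summarize_feats` is a hypothetical name, and counting the empty value `_` as a combination of its own is an assumption about how this page counts.

```python
from collections import Counter


def summarize_feats(values):
    """Tally features, feature-value pairs and whole FEATS combinations."""
    features = Counter()   # e.g. Number -> number of tokens carrying it
    pairs = set()          # e.g. {"Gender=Masc", "Number=Sing"}
    combos = Counter()     # whole FEATS strings, "_" included
    for value in values:
        combos[value] += 1
        if value == "_":
            continue  # token has no features
        for pair in value.split("|"):
            name, _, _ = pair.partition("=")
            features[name] += 1
            pairs.add(pair)
    return features, pairs, combos


# Made-up FEATS values for five PROPN tokens.
feats_column = [
    "Gender=Masc|Number=Sing",
    "Gender=Fem|Number=Sing",
    "Number=Sing",
    "_",
    "Gender=Masc|Number=Sing",
]

features, pairs, combos = summarize_feats(feats_column)
print(features, sorted(pairs), combos.most_common(1))
```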

Relations

PROPN nodes are attached to their parents using 13 different relations: nmod (352; 28% instances), flat:name (252; 20% instances), nsubj (222; 17% instances), obl (174; 14% instances), appos (127; 10% instances), conj (68; 5% instances), obj (49; 4% instances), nsubj:pass (20; 2% instances), root (3; 0% instances), xcomp (2; 0% instances), acl:relcl (1; 0% instances), advcl (1; 0% instances), ccomp (1; 0% instances)
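As a sanity check, the relation counts listed above can be summed and converted back to percentages. The sketch below uses only the numbers from the line above; rounding to the nearest whole percent is an assumption about how the page derives its percentages.

```python
# Relation counts for PROPN nodes, copied from the list above.
relations = {
    "nmod": 352, "flat:name": 252, "nsubj": 222, "obl": 174,
    "appos": 127, "conj": 68, "obj": 49, "nsubj:pass": 20,
    "root": 3, "xcomp": 2, "acl:relcl": 1, "advcl": 1, "ccomp": 1,
}

total = sum(relations.values())

# Percentage of PROPN instances per relation, rounded to a whole percent.
shares = {rel: round(100 * n / total) for rel, n in relations.items()}

print(total)
for rel, n in relations.items():
    print(f"{rel} ({n}; {shares[rel]}% instances)")
```

The total equals the 1272 PROPN tokens reported at the top of the page, since every node has exactly one incoming relation.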

Parents of PROPN nodes belong to 9 different parts of speech: NOUN (477; 38% instances), VERB (434; 34% instances), PROPN (318; 25% instances), ADJ (21; 2% instances), NUM (8; 1% instances), PRON (8; 1% instances), [artificial root, no part of speech] (3; 0% instances), X (2; 0% instances), ADV (1; 0% instances)

410 (32%) PROPN nodes are leaves.

433 (34%) PROPN nodes have one child.

300 (24%) PROPN nodes have two children.

129 (10%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 8.
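The child-degree figures above (leaves, one child, two children, three or more) can be derived from the HEAD column by counting how often each node ID is named as a head. The following is a minimal sketch over a single hand-made sentence, "La Chine commerce avec Donald Trump .", which is illustrative and not taken from the treebank; `child_degree_buckets` is a hypothetical helper name.

```python
from collections import Counter

# (ID, HEAD, UPOS) triples for one hand-made sentence.
SENTENCE = [
    (1, 2, "DET"),    # La      -> Chine
    (2, 3, "PROPN"),  # Chine   -> commerce
    (3, 0, "VERB"),   # commerce is the root
    (4, 5, "ADP"),    # avec    -> Donald
    (5, 3, "PROPN"),  # Donald  -> commerce
    (6, 5, "PROPN"),  # Trump   -> Donald (flat:name)
    (7, 3, "PUNCT"),  # .       -> commerce
]


def child_degree_buckets(sentence, tag):
    """Bucket nodes with the given tag by their number of children."""
    # How many times each node ID appears as somebody's head.
    n_children = Counter(head for _, head, _ in sentence)
    buckets = {"0": 0, "1": 0, "2": 0, "3+": 0}
    highest = 0
    for node_id, _, upos in sentence:
        if upos != tag:
            continue
        degree = n_children[node_id]  # Counter gives 0 for leaves
        highest = max(highest, degree)
        buckets[str(degree) if degree < 3 else "3+"] += 1
    return buckets, highest


buckets, highest = child_degree_buckets(SENTENCE, "PROPN")
print(buckets, highest)
```

For a whole treebank the same function would be applied sentence by sentence and the buckets summed. The four buckets always add up to the number of nodes with that tag, as they do on this page (410 + 433 + 300 + 129 = 1272).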

Children of PROPN nodes are attached using 21 different relations: case (535; 36% instances), det (255; 17% instances), flat:name (240; 16% instances), punct (140; 9% instances), conj (80; 5% instances), cc (51; 3% instances), appos (45; 3% instances), nmod (44; 3% instances), amod (30; 2% instances), nummod (18; 1% instances), acl:relcl (17; 1% instances), advmod (4; 0% instances), ccomp (4; 0% instances), cop (4; 0% instances), nsubj (4; 0% instances), mark (3; 0% instances), obl:mod (2; 0% instances), orphan (2; 0% instances), dep (1; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: ADP (531; 36% instances), PROPN (318; 21% instances), DET (255; 17% instances), PUNCT (140; 9% instances), NOUN (89; 6% instances), CCONJ (51; 3% instances), ADJ (32; 2% instances), NUM (22; 1% instances), VERB (22; 1% instances), PRON (5; 0% instances), ADV (4; 0% instances), AUX (4; 0% instances), X (4; 0% instances), SCONJ (2; 0% instances), SYM (2; 0% instances)