home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-FQB: POS Tags: PROPN

There are 1203 PROPN lemmas (32%), 1203 PROPN types (28%) and 2086 PROPN tokens (9%). Out of 16 observed tags, the rank of PROPN is: 2 in number of lemmas, 2 in number of types and 5 in number of tokens.

The 10 most frequent PROPN lemmas: États-Unis, New, Terre, Californie, Alaska, John, York, soleil, Charles, Kentucky

The 10 most frequent PROPN types: États-Unis, New, Terre, Californie, Alaska, John, York, soleil, Charles, Kentucky

The 10 most frequent ambiguous lemmas: New (PROPN 16, X 1), lune (PROPN 2, NOUN 1), ligue (PROPN 3, NOUN 1), King (PROPN 4, X 1), the (X 13, PROPN 1), Gateway (PROPN 2, X 1), Star (PROPN 2, X 1), World (PROPN 2, X 1), A (NOUN 1, PROPN 1, X 1), American (PROPN 1, X 1)

The 10 most frequent ambiguous types: New (PROPN 16, X 1), King (PROPN 4, X 1), the (X 13, PROPN 1), Gateway (PROPN 2, X 1), Pôle (NOUN 4, PROPN 2), Star (PROPN 2, X 1), World (PROPN 2, X 1), A (NOUN 1, PROPN 1, X 1), American (PROPN 1, X 1), Canyon (NOUN 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.164044).

The 1st highest number of forms (1) was observed with the lemma “’N”: ‘N.

The 2nd highest number of forms (1) was observed with the lemma “20th”: 20th.

The 3rd highest number of forms (1) was observed with the lemma “A”: A.

PROPN occurs with 2 features: Number (986; 47% instances), Gender (932; 45% instances)

PROPN occurs with 4 feature-value pairs: Gender=Fem, Gender=Masc, Number=Plur, Number=Sing

PROPN occurs with 7 feature combinations. The most frequent feature combination is _ (1099 tokens). Examples: États-Unis, New, Terre, Soleil, Nobel, Lune, Titanic, Angeles, Bowl, C.

Relations

PROPN nodes are attached to their parents using 14 different relations: nmod (707; 34% instances), flat:name (581; 28% instances), nsubj (353; 17% instances), obl:mod (139; 7% instances), obj (66; 3% instances), nsubj:pass (62; 3% instances), appos (54; 3% instances), obl:arg (34; 2% instances), conj (32; 2% instances), root (20; 1% instances), obl:agent (13; 1% instances), dislocated (12; 1% instances), xcomp (12; 1% instances), dep (1; 0% instances)

Parents of PROPN nodes belong to 7 different parts of speech: NOUN (747; 36% instances), PROPN (657; 31% instances), VERB (600; 29% instances), ADJ (43; 2% instances), (20; 1% instances), PRON (16; 1% instances), NUM (3; 0% instances)

819 (39%) PROPN nodes are leaves.

542 (26%) PROPN nodes have one child.

530 (25%) PROPN nodes have two children.

195 (9%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 10.

Children of PROPN nodes are attached using 16 different relations: case (730; 32% instances), flat:name (569; 25% instances), det (538; 23% instances), punct (176; 8% instances), nmod (91; 4% instances), dep (34; 1% instances), amod (33; 1% instances), cc (32; 1% instances), conj (30; 1% instances), cop (20; 1% instances), nsubj (20; 1% instances), mark (13; 1% instances), nummod (2; 0% instances), acl (1; 0% instances), advmod (1; 0% instances), orphan (1; 0% instances)

Children of PROPN nodes belong to 14 different parts of speech: ADP (728; 32% instances), PROPN (657; 29% instances), DET (538; 23% instances), PUNCT (176; 8% instances), X (34; 1% instances), ADJ (33; 1% instances), CCONJ (32; 1% instances), NOUN (31; 1% instances), PRON (21; 1% instances), AUX (20; 1% instances), SCONJ (13; 1% instances), ADV (4; 0% instances), NUM (2; 0% instances), VERB (2; 0% instances)