
This page pertains to UD version 2.

Treebank Statistics: UD_French-GSD: POS Tags: PRON

There are 43 PRON lemmas (0%), 118 PRON types (0%) and 18185 PRON tokens (5%). Out of 16 observed tags, the rank of PRON is: 10 in number of lemmas, 8 in number of types and 8 in number of tokens.
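The three counts above can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function name `pos_counts` and the tiny `SAMPLE` are illustrative, and it assumes that forms are compared as written (this page does not say whether they are case-folded).

```python
def pos_counts(conllu_text, upos="PRON"):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(types), tokens


# Hypothetical two-sentence sample, built with explicit tab separators.
SAMPLE = "\n".join([
    "# text = Il dort.",
    "\t".join(["1", "Il", "lui", "PRON", "_", "_", "2", "nsubj", "_", "_"]),
    "\t".join(["2", "dort", "dormir", "VERB", "_", "_", "0", "root", "_", "_"]),
    "\t".join(["3", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"]),
    "",
    "# text = Elle le voit",
    "\t".join(["1", "Elle", "lui", "PRON", "_", "_", "3", "nsubj", "_", "_"]),
    "\t".join(["2", "le", "lui", "PRON", "_", "_", "3", "obj", "_", "_"]),
    "\t".join(["3", "voit", "voir", "VERB", "_", "_", "0", "root", "_", "_"]),
])

print(pos_counts(SAMPLE))  # (1, 3, 3)
```

Here "types" means distinct word forms and "tokens" means individual occurrences, which is why one lemma can account for several types.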

The 10 most frequent PRON lemmas: lui, soi, qui, ce, eux, moi, on, y, que, où

The 10 most frequent PRON types: il, qui, se, s’, elle, c’, on, y, ils, lui

The 10 most frequent ambiguous lemmas: ce (DET 2214, PRON 975, X 1), on (PRON 630, X 5), y (PRON 536, X 2, PROPN 1, SYM 1), que (SCONJ 2260, PRON 483, ADV 245), un (DET 10060, PRON 319, NUM 122, X 1), en (ADP 5860, PRON 282), lequel (PRON 221, ADJ 1, DET 1), tout (ADJ 456, PRON 170, ADV 143, DET 136), autre (ADJ 386, PRON 159), certain (DET 131, ADJ 62, PRON 48)
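A lemma counts as ambiguous here when it occurs with PRON and at least one other tag. As a rough sketch of how such a list could be derived (the helper `ambiguous_lemmas` and its input format are assumptions, not part of the UD tooling), one can group tag counts by lemma and sort by the PRON count, which matches the ordering of the lemma list above:

```python
from collections import Counter, defaultdict


def ambiguous_lemmas(pairs, upos="PRON"):
    """pairs: iterable of (lemma, upos) tuples, one per token.

    Returns [(lemma, Counter of tags)] for lemmas seen with `upos` and
    at least one other tag, ordered by descending `upos` count.
    """
    by_lemma = defaultdict(Counter)
    for lemma, tag in pairs:
        by_lemma[lemma][tag] += 1
    hits = [(lemma, tags) for lemma, tags in by_lemma.items()
            if upos in tags and len(tags) > 1]
    return sorted(hits, key=lambda item: -item[1][upos])


# Hypothetical token stream; the real counts come from the treebank.
pairs = [
    ("ce", "DET"), ("ce", "PRON"),
    ("on", "PRON"),
    ("que", "SCONJ"), ("que", "PRON"), ("que", "PRON"),
]
for lemma, tags in ambiguous_lemmas(pairs):
    print(lemma, dict(tags))
```

The same grouping applied to word forms instead of lemmas would give the ambiguous types.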

The 10 most frequent ambiguous types: se (PRON 1341, DET 1, X 1), s’ (PRON 984, SCONJ 47), on (PRON 329, X 5, AUX 2), y (PRON 527, X 2, PROPN 1, SYM 1), ce (DET 541, PRON 328, X 1), le (DET 13771, PRON 281, X 3), en (ADP 5073, PRON 281), vous (PRON 249, X 1), qu’ (SCONJ 675, PRON 248, ADV 92), que (SCONJ 1573, PRON 221, ADV 151)

Morphology

The form / lemma ratio of PRON is 2.744186 (the average of all parts of speech is 1.307955).
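The reported ratio is consistent with dividing the number of PRON types by the number of PRON lemmas given at the top of this page. A quick check, assuming that is how the figure is defined:

```python
pron_types = 118   # distinct PRON word forms reported on this page
pron_lemmas = 43   # distinct PRON lemmas reported on this page

ratio = pron_types / pron_lemmas
print(round(ratio, 6))  # 2.744186
```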

The 1st highest number of forms (13) was observed with the lemma “lui”: -elle, -il, -le, -t-elle, -t-il, elle, il, l’, la, le, lui, t’il, t-il.

The 2nd highest number of forms (9) was observed with the lemma “eux”: -elles, -eux, -ils, elles, eux, ils, les, leur, leurs.

The 3rd highest number of forms (8) was observed with the lemma “moi”: -je, -moi, J, j’, je, m’, me, moi.

PRON occurs with 10 features: PronType (18185; 100% instances), Person (14413; 79% instances), Number (11255; 62% instances), Emph (10297; 57% instances), Gender (9552; 53% instances), Reflex (2490; 14% instances), ExtPos (51; 0% instances), Typo (39; 0% instances), Number[psor] (3; 0% instances), Person[psor] (3; 0% instances)

PRON occurs with 22 feature-value pairs: Emph=No, Emph=Yes, ExtPos=ADP, ExtPos=ADV, ExtPos=PROPN, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Number[psor]=Plur, Person=1, Person=2, Person=3, Person[psor]=1, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, Reflex=Yes, Typo=Yes

PRON occurs with 85 feature combinations. The most frequent feature combination is Emph=No|Gender=Masc|Number=Sing|Person=3|PronType=Prs (4263 tokens). Examples: il, le, lui, -t-il, -il
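The three levels reported above (features, feature-value pairs, full combinations) all come from the FEATS column, where pairs are joined with `|` and `_` marks an empty set. A minimal sketch of the counting, with illustrative helper names and a made-up input list:

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict; "_" means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def feature_stats(feats_column):
    """Count feature names, feature=value pairs and whole combinations."""
    names, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1
        for name, value in parse_feats(feats).items():
            names[name] += 1
            pairs[f"{name}={value}"] += 1
    return names, pairs, combos


# Hypothetical FEATS values for three PRON tokens.
feats_column = [
    "Emph=No|Gender=Masc|Number=Sing|Person=3|PronType=Prs",
    "PronType=Rel",
    "Emph=No|Gender=Masc|Number=Sing|Person=3|PronType=Prs",
]
names, pairs, combos = feature_stats(feats_column)
print(names["PronType"], len(pairs), combos.most_common(1)[0][1])  # 3 6 2
```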

Relations

PRON nodes are attached to their parents using 33 different relations: nsubj (8187; 45% instances), obj (2094; 12% instances), obl:mod (1150; 6% instances), nsubj:pass (1059; 6% instances), expl:pv (1017; 6% instances), expl:subj (931; 5% instances), iobj (896; 5% instances), nmod (691; 4% instances), expl:pass (686; 4% instances), obl:arg (319; 2% instances), expl:comp (211; 1% instances), root (176; 1% instances), conj (175; 1% instances), parataxis (121; 1% instances), appos (81; 0% instances), fixed (69; 0% instances), nsubj:caus (63; 0% instances), acl:relcl (41; 0% instances), case (36; 0% instances), obj:agent (31; 0% instances), xcomp (30; 0% instances), iobj:agent (24; 0% instances), dislocated (21; 0% instances), obl:agent (15; 0% instances), advmod (13; 0% instances), dep:comp (11; 0% instances), ccomp (9; 0% instances), nsubj:outer (9; 0% instances), orphan (7; 0% instances), advcl (6; 0% instances), dep (3; 0% instances), acl (2; 0% instances), parataxis:insert (1; 0% instances)
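The percentages in these lists appear to be each count divided by the 18185 PRON tokens, rounded to a whole number, so anything below 0.5% shows as 0%. The rounding rule is an assumption; the sketch below only checks that it reproduces a few of the figures above.

```python
def share(count, total):
    """Percentage of `total`, rounded to the nearest whole number."""
    return round(100 * count / total)


PRON_TOKENS = 18185  # total PRON tokens reported on this page

# A few relation counts copied from the list above.
relations = {"nsubj": 8187, "obj": 2094, "obl:mod": 1150, "appos": 81}
shares = {rel: share(n, PRON_TOKENS) for rel, n in relations.items()}
print(shares)  # {'nsubj': 45, 'obj': 12, 'obl:mod': 6, 'appos': 0}
```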

Parents of PRON nodes belong to 15 different parts of speech: VERB (15016; 83% instances), NOUN (1709; 9% instances), ADJ (651; 4% instances), PRON (242; 1% instances), [no parent part of speech; this count matches the 176 root attachments] (176; 1% instances), PROPN (136; 1% instances), ADV (95; 1% instances), ADP (59; 0% instances), NUM (53; 0% instances), SYM (13; 0% instances), X (13; 0% instances), DET (9; 0% instances), INTJ (5; 0% instances), AUX (4; 0% instances), SCONJ (4; 0% instances)

15996 (88%) PRON nodes are leaves.

987 (5%) PRON nodes have one child.

665 (4%) PRON nodes have two children.

537 (3%) PRON nodes have three or more children.

The highest child degree of a PRON node is 9.
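The child-degree figures above partition the PRON nodes: 15996 + 987 + 665 + 537 = 18185, the total number of PRON tokens. A minimal sketch of how such a distribution could be computed from the HEAD column, using a hypothetical helper and a made-up five-node tree:

```python
from collections import Counter


def child_degree_buckets(sentence, upos="PRON"):
    """sentence: list of (id, upos, head) tuples for one dependency tree.

    Returns a Counter over the buckets "0", "1", "2" and "3+" giving how
    many nodes tagged `upos` have that many children.
    """
    n_children = Counter(head for _, _, head in sentence)
    buckets = Counter()
    for node_id, tag, _ in sentence:
        if tag == upos:
            k = n_children[node_id]  # 0 if no node points to node_id
            buckets["3+" if k >= 3 else str(k)] += 1
    return buckets


# Hypothetical tree: node 4 (PRON) governs node 3 (ADP); node 1 is a leaf.
sentence = [
    (1, "PRON", 2),
    (2, "VERB", 0),
    (3, "ADP", 4),
    (4, "PRON", 2),
    (5, "PUNCT", 2),
]
print(dict(child_degree_buckets(sentence)))  # {'0': 1, '1': 1}

# Sanity check on the figures reported on this page.
page_buckets = [15996, 987, 665, 537]
print(sum(page_buckets))  # 18185
```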

Children of PRON nodes are attached using 31 different relations: case (1101; 25% instances), punct (729; 16% instances), nmod (659; 15% instances), acl:relcl (340; 8% instances), det (306; 7% instances), cop (224; 5% instances), nsubj (187; 4% instances), cc (158; 4% instances), advmod (153; 3% instances), conj (115; 3% instances), acl (113; 3% instances), amod (85; 2% instances), fixed (84; 2% instances), appos (38; 1% instances), obl:mod (28; 1% instances), expl:subj (26; 1% instances), advcl:cleft (25; 1% instances), orphan (17; 0% instances), mark (15; 0% instances), advcl (12; 0% instances), aux:tense (9; 0% instances), parataxis (8; 0% instances), nummod (6; 0% instances), dislocated (4; 0% instances), goeswith (3; 0% instances), discourse (2; 0% instances), obl:agent (2; 0% instances), aux:pass (1; 0% instances), dep (1; 0% instances), nsubj:pass (1; 0% instances), parataxis:insert (1; 0% instances)

Children of PRON nodes belong to 15 different parts of speech: ADP (1064; 24% instances), PUNCT (729; 16% instances), NOUN (649; 15% instances), VERB (508; 11% instances), DET (306; 7% instances), PRON (242; 5% instances), AUX (241; 5% instances), ADV (194; 4% instances), CCONJ (152; 3% instances), PROPN (147; 3% instances), ADJ (127; 3% instances), SCONJ (41; 1% instances), NUM (36; 1% instances), X (15; 0% instances), INTJ (2; 0% instances)