Treebank Statistics: UD_French-Rhapsodie: POS Tags: PRON
There are 41 PRON
lemmas (1%), 87 PRON
types (2%) and 5356 PRON
tokens (12%).
Out of 15 observed tags, the rank of PRON
is: 7 in number of lemmas, 6 in number of types and 2 in number of tokens.
The 10 most frequent PRON
lemmas: moi, ce, lui, qui, vous, on, ça, y, eux, soi
The 10 most frequent PRON
types: c’, je, il, qui, vous, on, ça, y, j’, ce
The 10 most frequent ambiguous lemmas: ce (PRON 772, DET 228, PROPN 1), que (SCONJ 559, PRON 156, ADV 20, CCONJ 1), où (PRON 59, ADV 1), en (ADP 362, PRON 50, ADV 1), tout (ADJ 119, ADV 73, PRON 27, NOUN 18, DET 10), un (DET 972, PRON 26, NUM 6), autre (ADJ 35, PRON 25, DET 1), quoi (INTJ 17, PRON 14), personne (NOUN 9, PRON 5), aucun (DET 14, PRON 3)
The 10 most frequent ambiguous types: ce (PRON 124, DET 113, PROPN 1), que (SCONJ 426, PRON 98, ADV 17, CCONJ 1), s’ (PRON 67, SCONJ 7), le (DET 976, PRON 66), où (PRON 59, ADV 1), qu’ (SCONJ 133, PRON 58, ADV 3), en (ADP 362, PRON 50, ADV 1), l’ (DET 439, PRON 41), les (DET 561, PRON 34), tout (ADV 74, ADJ 53, NOUN 18, PRON 18)
- ce
- que
- s’
- le
- où
- qu’
- en
- l’
- les
- tout
Morphology
The form / lemma ratio of PRON
is 2.121951 (the average of all parts of speech is 1.352795).
The 1st highest number of forms (8) was observed with the lemma “lui”: -il, -t-il, elle, il, l’, la, le, lui.
The 2nd highest number of forms (6) was observed with the lemma “eux”: -ils, elles, eux, ils, les, leur.
The 3rd highest number of forms (6) was observed with the lemma “moi”: -moi, j’, je, m’, me, moi.
PRON
occurs with 8 features: PronType (5338; 100% instances), Person (4617; 86% instances), Number (4170; 78% instances), Gender (2394; 45% instances), ExtPos (16; 0% instances), Reflex (7; 0% instances), Number[psor] (2; 0% instances), Person[psor] (2; 0% instances)
PRON
occurs with 20 feature-value pairs: ExtPos=ADP
, ExtPos=ADV
, ExtPos=VERB
, Gender=Fem
, Gender=Masc
, Number=Plur
, Number=Sing
, Number[psor]=Sing
, Person=1
, Person=2
, Person=3
, Person[psor]=1
, Person[psor]=3
, PronType=Dem
, PronType=Ind
, PronType=Int
, PronType=Neg
, PronType=Prs
, PronType=Rel
, Reflex=Yes
PRON
occurs with 43 feature combinations.
The most frequent feature combination is Gender=Masc|Number=Sing|Person=3|PronType=Dem
(1082 tokens).
Examples: c’, ça, ce, -ce, cela, celui, ceci
Relations
PRON
nodes are attached to their parents using 33 different relations: nsubj (2992; 56% instances), obj (537; 10% instances), expl:subj (425; 8% instances), expl:comp (290; 5% instances), iobj (180; 3% instances), obl:mod (145; 3% instances), reparandum (142; 3% instances), dislocated (131; 2% instances), root (100; 2% instances), nsubj:pass (94; 2% instances), nmod (66; 1% instances), obl:arg (60; 1% instances), expl:pass (33; 1% instances), dep (31; 1% instances), conj (22; 0% instances), dep:comp (19; 0% instances), fixed (16; 0% instances), acl:relcl (15; 0% instances), ccomp (14; 0% instances), nsubj:caus (12; 0% instances), discourse (8; 0% instances), case (6; 0% instances), advcl (4; 0% instances), acl (2; 0% instances), obl (2; 0% instances), parataxis:parenth (2; 0% instances), xcomp (2; 0% instances), advcl:cleft (1; 0% instances), advmod (1; 0% instances), appos (1; 0% instances), mark (1; 0% instances), obl:agent (1; 0% instances), parataxis (1; 0% instances)
Parents of PRON
nodes belong to 15 different parts of speech: VERB (4054; 76% instances), NOUN (403; 8% instances), PRON (283; 5% instances), ADJ (269; 5% instances), AUX (140; 3% instances), (100; 2% instances), ADV (40; 1% instances), PROPN (26; 0% instances), DET (10; 0% instances), ADP (9; 0% instances), X (9; 0% instances), CCONJ (4; 0% instances), INTJ (4; 0% instances), NUM (4; 0% instances), SCONJ (1; 0% instances)
4501 (84%) PRON
nodes are leaves.
471 (9%) PRON
nodes have one child.
211 (4%) PRON
nodes have two children.
173 (3%) PRON
nodes have three or more children.
The highest child degree of a PRON
node is 8.
Children of PRON
nodes are attached using 33 different relations: punct (458; 28% instances), case (210; 13% instances), reparandum (154; 9% instances), acl:relcl (123; 7% instances), discourse (104; 6% instances), cop (98; 6% instances), advmod (75; 5% instances), nsubj (62; 4% instances), cc (43; 3% instances), expl:subj (37; 2% instances), advcl:cleft (36; 2% instances), det (36; 2% instances), obl:arg (32; 2% instances), dep (27; 2% instances), fixed (26; 2% instances), nmod (25; 2% instances), conj (18; 1% instances), mark (18; 1% instances), amod (16; 1% instances), appos (10; 1% instances), dislocated (8; 0% instances), obl:mod (8; 0% instances), acl (3; 0% instances), dep:comp (3; 0% instances), xcomp (3; 0% instances), aux:tense (2; 0% instances), parataxis (2; 0% instances), parataxis:parenth (2; 0% instances), vocative (2; 0% instances), advcl (1; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances), obj (1; 0% instances)
Children of PRON
nodes belong to 15 different parts of speech: PUNCT (458; 28% instances), PRON (283; 17% instances), ADP (205; 12% instances), VERB (162; 10% instances), AUX (105; 6% instances), ADV (97; 6% instances), INTJ (82; 5% instances), NOUN (77; 5% instances), CCONJ (44; 3% instances), DET (40; 2% instances), ADJ (36; 2% instances), SCONJ (30; 2% instances), X (13; 1% instances), PROPN (11; 1% instances), NUM (2; 0% instances)