Treebank Statistics: UD_French-ParTUT: POS Tags: VERB
There are 562 VERB lemmas (19%), 1315 VERB types (30%) and 2736 VERB tokens (10%).
Out of 17 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 4 in number of tokens.
The 10 most frequent VERB lemmas: avoir, pouvoir, devoir, faire, dire, vouloir, concerner, être, établir, présenter
The 10 most frequent VERB types: a, peut, voudrais, fait, doit, est, faire, dite, devrait, concernant
The 10 most frequent ambiguous lemmas: avoir (AUX 239, VERB 114), pouvoir (VERB 113, NOUN 3), devoir (VERB 93, NOUN 2), faire (VERB 68, AUX 13, NOUN 1), dire (VERB 51, NOUN 2), être (AUX 595, VERB 36, NOUN 2), accepter (VERB 10, NOUN 1), ajouter (VERB 6, ADJ 1), intégrer (VERB 6, ADJ 1), souvenir (NOUN 1, VERB 1)
The 10 most frequent ambiguous types: a (AUX 114, VERB 56, ADP 1, X 1), fait (VERB 30, NOUN 15, AUX 2), est (AUX 224, VERB 23, NOUN 2), faire (VERB 22, AUX 7), ont (AUX 60, VERB 17), dire (VERB 11, NOUN 1), font (VERB 10, AUX 1), avoir (VERB 9, AUX 5), demande (NOUN 7, VERB 6), prise (VERB 5, NOUN 1)
- a
  - AUX 114: On a découvert cela pour la première fois en 1859 .
  - VERB 56: 1 . Tout individu a droit à une nationalité .
  - ADP 1: Paternité - Pas d’ utilisation commerciale - Partage de les conditions initiales a l’ identique .
  - X 1: Elle ne fait que prolonger les réglementations transitoires en repoussant les délais , supprime des dispositions qui ont cessé d’ être pertinentes et règle les procédures pour ce qui est a ) de les transports ad hoc de marchandises dangereuses et b ) de la promulgation de dispositions nationales moins strictes , en particulier pour le transport de quantités très réduites de marchandises dangereuses dans des zones strictement délimitées .
- fait
- est
- faire
- ont
- dire
- font
- avoir
- demande
- prise
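The ambiguity counts above can be reproduced by tallying, for each word form, how often it occurs with each tag. The following is a minimal sketch: the `tokens` list is a hypothetical hand-made sample, and the function names are illustrative rather than part of any UD tool. In practice the pairs would be read from the FORM and UPOS columns of the treebank's CoNLL-U files.

```python
from collections import Counter, defaultdict

# Hypothetical (form, UPOS) pairs, made up for illustration only.
tokens = [
    ("a", "AUX"), ("a", "VERB"), ("a", "AUX"),
    ("fait", "VERB"), ("fait", "NOUN"),
    ("droit", "NOUN"),
]

def tag_counts_by_form(pairs):
    """Map each form to a Counter of the tags it was seen with."""
    counts = defaultdict(Counter)
    for form, upos in pairs:
        counts[form][upos] += 1
    return counts

def ambiguous_forms(counts, tag="VERB"):
    """Keep forms seen with `tag` and with at least one other tag."""
    return {form: c for form, c in counts.items() if tag in c and len(c) > 1}

ambiguous = ambiguous_forms(tag_counts_by_form(tokens))
for form, c in ambiguous.items():
    # most_common() orders tags by descending frequency, as in the lists above.
    print(form, ", ".join(f"{t} {n}" for t, n in c.most_common()))
```

Whether forms should be case-folded before counting is a design choice; this sketch leaves them as they are.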
The form / lemma ratio of VERB is 2.339858 (the average of all parts of speech is 1.455030).
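The reported ratio is consistent with dividing the number of VERB types by the number of VERB lemmas given at the top of this page. A quick check, assuming that is how the figure is derived:

```python
# Counts taken from the VERB statistics on this page.
verb_types = 1315   # distinct word forms tagged VERB
verb_lemmas = 562   # distinct lemmas tagged VERB

# Assumption: the form / lemma ratio is types divided by lemmas.
form_lemma_ratio = verb_types / verb_lemmas
print(f"{form_lemma_ratio:.6f}")
```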
The 1st highest number of forms (18) was observed with the lemma “pouvoir”: peut, peuvent, peux, pourra, pourraient, pourrait, pourrez, pourrons, pourront, pouvait, pouvez, pouvoir, pouvons, pu, puis, puisse, puissent, puissions.
The 2nd highest number of forms (16) was observed with the lemma “avoir”: a, ai, ait, aura, aurai, aurait, aurez, auront, avaient, avait, avez, avoir, avons, ayant, eu, ont.
The 3rd highest number of forms (13) was observed with the lemma “devoir”: devait, devez, devions, devons, devra, devraient, devrait, devrions, devrons, devront, doit, doivent, dû.
VERB occurs with 6 features: VerbForm (2735; 100% instances), Tense (2045; 75% instances), Number (1929; 71% instances), Mood (1148; 42% instances), Person (1146; 42% instances), Gender (742; 27% instances)
VERB occurs with 18 feature-value pairs: Gender=Fem, Gender=Masc, Mood=Cnd, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Imp, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part
VERB occurs with 49 feature combinations.
The most frequent feature combination is VerbForm=Inf (688 tokens).
Examples: faire, améliorer, dire, abonner, assurer, savoir, avoir, compter, utiliser, communiquer
VERB nodes are attached to their parents using 15 different relations: root (812; 30% instances), acl (475; 17% instances), xcomp (362; 13% instances), conj (295; 11% instances), advcl (272; 10% instances), acl:relcl (258; 9% instances), ccomp (174; 6% instances), csubj (60; 2% instances), fixed (15; 1% instances), parataxis (5; 0% instances), appos (2; 0% instances), nsubj (2; 0% instances), obl (2; 0% instances), csubj:pass (1; 0% instances), dep (1; 0% instances)
Parents of VERB nodes belong to 9 different parts of speech: VERB (1051; 38% instances), [no parent: root] (812; 30% instances), NOUN (686; 25% instances), PRON (87; 3% instances), ADJ (80; 3% instances), PROPN (9; 0% instances), ADP (7; 0% instances), ADV (3; 0% instances), AUX (1; 0% instances)
72 (3%) VERB nodes are leaves.
365 (13%) VERB nodes have one child.
478 (17%) VERB nodes have two children.
1821 (67%) VERB nodes have three or more children.
The highest child degree of a VERB node is 10.
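The child-degree figures above can be derived from the HEAD column of a CoNLL-U file by counting how many tokens point at each VERB node. The sketch below works on a single toy sentence whose annotation is made up for illustration; the function name `verb_child_degrees` and the sample are assumptions, not part of the treebank or its tooling.

```python
from collections import Counter

# Toy CoNLL-U fragment (10 tab-separated columns per token).
# The annotation is hypothetical and only serves to exercise the code.
SAMPLE = """\
1\tTout\ttout\tDET\t_\t_\t2\tdet\t_\t_
2\tindividu\tindividu\tNOUN\t_\t_\t3\tnsubj\t_\t_
3\ta\tavoir\tVERB\t_\t_\t0\troot\t_\t_
4\tdroit\tdroit\tNOUN\t_\t_\t3\tobj\t_\t_
5\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_
"""

def verb_child_degrees(conllu_sentence):
    """Return {token id: number of children} for every VERB in one sentence."""
    rows = [
        line.split("\t")
        for line in conllu_sentence.splitlines()
        if line and not line.startswith("#")
    ]
    # Keep plain word lines only; skip multiword ranges (1-2) and empty nodes (1.1).
    rows = [r for r in rows if r[0].isdigit()]
    # Column 7 (index 6) is HEAD: count how often each id is used as a head.
    children = Counter(r[6] for r in rows)
    # Column 4 (index 3) is UPOS.
    return {r[0]: children[r[0]] for r in rows if r[3] == "VERB"}

degrees = verb_child_degrees(SAMPLE)
print(degrees)
```

Summing these per-sentence results over the whole treebank and bucketing them into 0, 1, 2 and 3 or more children would give the distribution reported above.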
Children of VERB nodes are attached using 36 different relations: obl (1310; 14% instances), punct (1284; 14% instances), nsubj (1172; 13% instances), obj (1093; 12% instances), mark (766; 8% instances), advmod (670; 7% instances), xcomp (460; 5% instances), conj (278; 3% instances), cc (276; 3% instances), aux (275; 3% instances), advcl (249; 3% instances), aux:pass (240; 3% instances), nsubj:pass (221; 2% instances), ccomp (212; 2% instances), expl (184; 2% instances), iobj (106; 1% instances), obl:agent (69; 1% instances), nummod (66; 1% instances), vocative (61; 1% instances), csubj (36; 0% instances), appos (13; 0% instances), aux:caus (13; 0% instances), amod (10; 0% instances), case (10; 0% instances), nmod (10; 0% instances), obj:agent (9; 0% instances), dislocated (7; 0% instances), cop (6; 0% instances), det (6; 0% instances), discourse (5; 0% instances), nsubj:caus (4; 0% instances), parataxis (4; 0% instances), acl:relcl (3; 0% instances), csubj:pass (1; 0% instances), dep (1; 0% instances), iobj:agent (1; 0% instances)
Children of VERB nodes belong to 16 different parts of speech: NOUN (2847; 31% instances), PRON (1344; 15% instances), PUNCT (1284; 14% instances), VERB (1051; 12% instances), AUX (534; 6% instances), ADV (528; 6% instances), ADP (470; 5% instances), SCONJ (296; 3% instances), CCONJ (276; 3% instances), PART (159; 2% instances), ADJ (152; 2% instances), NUM (92; 1% instances), PROPN (81; 1% instances), DET (10; 0% instances), SYM (5; 0% instances), X (2; 0% instances)