Treebank Statistics: UD_French-FTB: POS Tags: AUX
There are 7 AUX lemmas (0%), 23 AUX types (1%) and 12869 AUX tokens (2%).
Out of 16 observed tags, the rank of AUX is: 15 in number of lemmas, 10 in number of types and 11 in number of tokens.
The 10 most frequent AUX lemmas: _, avoir, pouvoir, être, aller, vouloir, devoir
The 10 most frequent AUX types: _, Peut, Ayant, Avez, Avoir, Est, Peuvent, Seront, Sont, A
The 10 most frequent ambiguous lemmas: _ (NOUN 115984, ADP 89082, DET 79465, PUNCT 73863, VERB 47092, ADJ 36213, ADV 22183, PROPN 21225, PRON 20877, NUM 17577, AUX 12831, CCONJ 11039, SCONJ 4969, X 2163, PART 239, INTJ 33), avoir (AUX 12, VERB 3), pouvoir (VERB 15, AUX 11, NOUN 1), être (VERB 26, AUX 10), devoir (AUX 1, VERB 1)
The 10 most frequent ambiguous types: _ (NOUN 115984, ADP 89082, DET 79465, PUNCT 73863, VERB 47092, ADJ 36213, ADV 22183, PROPN 21225, PRON 20877, NUM 17577, AUX 12831, CCONJ 11039, SCONJ 4969, X 2163, PART 239, INTJ 33), Peut (VERB 15, AUX 6), Ayant (AUX 5, VERB 1), Est (VERB 17, AUX 2), Sont (AUX 2, VERB 2), A (ADP 388, AUX 1, NOUN 1, X 1), Etant (AUX 1, VERB 1)
- _
- NOUN 115984: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADP 89082: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- DET 79465: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PUNCT 73863: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- VERB 47092: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADJ 36213: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADV 22183: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PROPN 21225: - _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PRON 20877: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- NUM 17577: Le _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- AUX 12831: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- CCONJ 11039: Nous _ _ _ _ _ _ _ _ _ _ _ _ _ _
- SCONJ 4969: Le _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- X 2163: In _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PART 239: L’ _ _ _ _ _ _ _ _ _ _ _
- INTJ 33: Le _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- Peut
- Ayant
- Est
- Sont
- A
- Etant
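The ambiguity lists above pair each form with the tags it was observed under. A minimal sketch of how such a list could be derived, assuming the input is a sequence of (form, UPOS) pairs read from the treebank; the function name `ambiguous_forms` and the toy data are hypothetical and are not part of the statistics tool:

```python
from collections import Counter, defaultdict

def ambiguous_forms(pairs, target="AUX"):
    """Return forms seen with `target` and at least one other tag.

    `pairs` is an iterable of (form, upos) tuples (an assumed input shape).
    The result maps each ambiguous form to a {tag: count} dictionary.
    """
    tags_by_form = defaultdict(Counter)
    for form, upos in pairs:
        tags_by_form[form][upos] += 1
    return {
        form: dict(tags)
        for form, tags in tags_by_form.items()
        if target in tags and len(tags) > 1
    }

# Toy data only; these counts are illustrative, not taken from the treebank.
toy = [("Ayant", "AUX"), ("Ayant", "AUX"), ("Ayant", "VERB"),
       ("Avez", "AUX"), ("Le", "DET")]
result = ambiguous_forms(toy)
print(result)
```

Forms seen only as AUX (here "Avez") or never as AUX (here "Le") are filtered out, which is why the lists above contain fewer entries than the full type inventory.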
Morphology
The form / lemma ratio of AUX is 3.285714 (the average of all parts of speech is 1.170225).
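The reported ratio is consistent with dividing the 23 AUX types by the 7 AUX lemmas given at the top of the page. A small sketch of that arithmetic, assuming the ratio is simply distinct forms over distinct lemmas; the helper `form_lemma_ratio` is a hypothetical name for illustration:

```python
def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas.

    `pairs` is an iterable of (form, lemma) tuples (an assumed input shape).
    """
    pairs = list(pairs)
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)

# Counts reported on this page for AUX: 23 types and 7 lemmas.
reported = 23 / 7
print(f"{reported:.6f}")  # 3.285714

# Toy pairs (illustrative only): three forms over two lemmas.
toy_ratio = form_lemma_ratio([("Est", "être"), ("Sont", "être"), ("A", "avoir")])
```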
The 1st highest number of forms (7) was observed with the lemma “être”: Est, Etant, Fût, Sera, Serions, Seront, Sont.
The 2nd highest number of forms (5) was observed with the lemma “avoir”: A, Avez, Avoir, Ayant, Ont.
The 3rd highest number of forms (5) was observed with the lemma “pouvoir”: Peut, Peuvent, Pourrait, Pourront, Pouvait.
AUX occurs with 6 features: VerbForm (12869; 100% instances), Tense (12054; 94% instances), Number (11882; 92% instances), Mood (10778; 84% instances), Person (10778; 84% instances), Gender (1104; 9% instances)
AUX occurs with 17 feature-value pairs: Gender=Fem, Gender=Masc, Mood=Cnd, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Imp, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part
AUX occurs with 32 feature combinations.
The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin (5363 tokens).
Examples: _, Peut, Est, A, Doit, Va, Veut
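Feature combinations are written in the CoNLL-U FEATS notation, with `Name=Value` pairs joined by `|` and `_` standing for an empty feature set. A rough sketch of how per-feature and per-combination counts like those above could be tallied, assuming the input is a list of FEATS strings for AUX tokens; the function names and the toy column are hypothetical:

```python
from collections import Counter

def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {name: value} dictionary."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def feature_stats(feats_column):
    """Count whole combinations and individual feature names."""
    combos = Counter(feats_column)
    features = Counter()
    for feats in feats_column:
        for name in parse_feats(feats):
            features[name] += 1
    return combos, features

# Toy column (illustrative only, not real treebank counts).
column = [
    "Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin",
    "Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin",
    "VerbForm=Inf",
]
combos, features = feature_stats(column)
print(combos.most_common(1), features["VerbForm"])
```

Counting whole strings works here because CoNLL-U requires features to be sorted alphabetically, so equal feature sets produce equal strings.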
Relations
AUX nodes are attached to their parents using 16 different relations: aux (7195; 56% instances), aux:pass (3331; 26% instances), root (1238; 10% instances), acl:relcl (364; 3% instances), conj (173; 1% instances), ccomp (159; 1% instances), advcl (136; 1% instances), parataxis (101; 1% instances), xcomp (99; 1% instances), acl (60; 0% instances), dep (7; 0% instances), orphan (2; 0% instances), fixed (1; 0% instances), iobj (1; 0% instances), nsubj (1; 0% instances), obl (1; 0% instances)
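The percentages in the relation list can be checked against the raw counts: the 16 counts sum to the 12869 AUX tokens, and each share appears to be the count divided by that total, rounded to a whole percent. A short sketch of that check, assuming simple rounding (the exact rounding rule used by the statistics tool is not stated on this page):

```python
# Relation counts copied from the list above.
relation_counts = {
    "aux": 7195, "aux:pass": 3331, "root": 1238, "acl:relcl": 364,
    "conj": 173, "ccomp": 159, "advcl": 136, "parataxis": 101,
    "xcomp": 99, "acl": 60, "dep": 7, "orphan": 2,
    "fixed": 1, "iobj": 1, "nsubj": 1, "obl": 1,
}

def percent_shares(counts):
    """Map each key to its share of the total, rounded to a whole percent."""
    total = sum(counts.values())
    return {key: round(100 * value / total) for key, value in counts.items()}

total_aux = sum(relation_counts.values())
shares = percent_shares(relation_counts)
print(total_aux, shares["aux"], shares["aux:pass"], shares["root"])
```

Rare relations such as `dep` or `orphan` round down to 0%, which explains the "0% instances" entries for non-zero counts.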
Parents of AUX nodes belong to 14 different parts of speech: VERB (10630; 83% instances), [root] (1238; 10% instances), NOUN (444; 3% instances), AUX (225; 2% instances), ADJ (203; 2% instances), PRON (91; 1% instances), PROPN (23; 0% instances), ADV (4; 0% instances), ADP (3; 0% instances), CCONJ (2; 0% instances), SCONJ (2; 0% instances), X (2; 0% instances), DET (1; 0% instances), NUM (1; 0% instances)
10521 (82%) AUX nodes are leaves.
60 (0%) AUX nodes have one child.
329 (3%) AUX nodes have two children.
1959 (15%) AUX nodes have three or more children.
The highest child degree of an AUX node is 12.
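The four child-degree groups above partition the AUX tokens (10521 + 60 + 329 + 1959 = 12869). A minimal sketch of how such a breakdown could be computed from the ID and HEAD columns, assuming each sentence is given as a list of (id, upos, head) tuples with integer IDs; the function name and the toy sentences are hypothetical:

```python
from collections import Counter

def child_degree_buckets(sentences, upos="AUX"):
    """Bucket nodes of the given tag by number of children: 0, 1, 2 or 3+."""
    buckets = Counter()
    max_degree = 0
    for sentence in sentences:
        # Number of children per node = how often its ID appears as a HEAD.
        degree = Counter(head for _, _, head in sentence)
        for token_id, tag, _ in sentence:
            if tag != upos:
                continue
            d = degree[token_id]
            max_degree = max(max_degree, d)
            buckets["3+" if d >= 3 else str(d)] += 1
    return buckets, max_degree

# Toy sentences (illustrative only): one AUX root with three children,
# and one AUX leaf attached to a verb.
toy = [
    [(1, "PRON", 2), (2, "AUX", 0), (3, "VERB", 2), (4, "PUNCT", 2)],
    [(1, "AUX", 2), (2, "VERB", 0)],
]
buckets, max_degree = child_degree_buckets(toy)
print(dict(buckets), max_degree)

# Sanity check on the counts reported above.
reported_total = 10521 + 60 + 329 + 1959
```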
Children of AUX nodes are attached using 20 different relations: xcomp (2341; 27% instances), nsubj (2161; 25% instances), punct (2119; 24% instances), advmod (580; 7% instances), mark (387; 4% instances), obl (346; 4% instances), cc (242; 3% instances), aux (202; 2% instances), advcl (128; 1% instances), obj (111; 1% instances), iobj (49; 1% instances), parataxis (15; 0% instances), nummod (12; 0% instances), expl (11; 0% instances), dep (9; 0% instances), conj (8; 0% instances), csubj (4; 0% instances), acl (1; 0% instances), amod (1; 0% instances), det (1; 0% instances)
Children of AUX nodes belong to 15 different parts of speech: VERB (2376; 27% instances), PUNCT (2119; 24% instances), NOUN (1375; 16% instances), PRON (1060; 12% instances), ADV (517; 6% instances), SCONJ (294; 3% instances), CCONJ (243; 3% instances), PROPN (238; 3% instances), AUX (225; 3% instances), ADP (147; 2% instances), ADJ (99; 1% instances), NUM (15; 0% instances), DET (11; 0% instances), INTJ (5; 0% instances), X (4; 0% instances)