Treebank Statistics: UD_French-Rhapsodie: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
12432 tokens (28%) have a non-empty value of Gender
.
2696 types (59%) occur at least once with a non-empty value of Gender
.
2219 lemmas (65%) occur at least once with a non-empty value of Gender
.
The feature is used with 7 part-of-speech tags: NOUN (5091; 12% instances), DET (2842; 6% instances), PRON (2394; 5% instances), ADJ (1277; 3% instances), VERB (715; 2% instances), PROPN (69; 0% instances), AUX (44; 0% instances).
NOUN
5091 NOUN tokens (97% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (3980; 78%).
NOUN
tokens may have the following values of Gender
:
Fem
(2263; 44% of non-emptyGender
): place, vie, fille, chose, gauche, rue, droite, fois, boule, chosesMasc
(2828; 56% of non-emptyGender
): fait, moment, ans, gens, temps, art, côté, monde, accord, casEMPTY
(139): peu, tout, bonjour, World, com, ca~, dis~, entour~, ex~, ma~
Paradigm fois | Masc | Fem |
---|---|---|
Number=Sing | fois | |
Number=Plur | fois | fois |
Gender
seems to be lexical feature of NOUN
. 99% lemmas (1527) occur only with one value of Gender
.
DET
2842 DET tokens (64% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Number=Sing (2828; 100%), PronType=Art (2522; 89%), Definite=Def (1756; 62%).
DET
tokens may have the following values of Gender
:
Fem
(1242; 44% of non-emptyGender
): la, une, cette, sa, ma, toute, aucune, quelle, certaines, taMasc
(1600; 56% of non-emptyGender
): le, un, ce, cet, du, aucun, quel, certains, tel, tousEMPTY
(1629): les, l’, des, mon, votre, son, notre, ces, ses, quelque
Paradigm le | Masc | Fem |
---|---|---|
le | la |
PRON
2394 PRON tokens (45% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Person=3 (2346; 98%), Number=Sing (2209; 92%).
PRON
tokens may have the following values of Gender
:
Fem
(132; 6% of non-emptyGender
): elle, elles, une, la, celle, laquelle, celles, chacune, aucune, elle-mêmeMasc
(2262; 94% of non-emptyGender
): c’, il, on, ça, ils, ce, le, -ce, lui, rienEMPTY
(2962): je, qui, vous, y, j’, nous, se, tu, que, me
Paradigm lui | Masc | Fem |
---|---|---|
ExtPos=ADP | il | |
il, le, lui, -il, -t-il | elle, la |
ADJ
1277 ADJ tokens (81% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (944; 74%).
ADJ
tokens may have the following values of Gender
:
Fem
(511; 40% of non-emptyGender
): grande, petite, magique, autre, bonne, toutes, première, toute, seule, certaineMasc
(766; 60% of non-emptyGender
): petit, tous, tout, vrai, droit, français, premier, sûr, bon, grosEMPTY
(296): jeune, même, difficile, tout, propre, deuxième, facile, jeunes, grave, incroyable
Paradigm tout | Masc | Fem |
---|---|---|
Number=Sing | tout | toute |
Number=Plur | tous | toutes |
VERB
715 VERB tokens (17% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (715; 100%), Person=EMPTY (715; 100%), Tense=Past (714; 100%), VerbForm=Part (714; 100%), Number=Sing (612; 86%).
VERB
tokens may have the following values of Gender
:
Fem
(146; 20% of non-emptyGender
): arrivée, née, venue, rentrée, restée, étonnée, obligée, partie, renforcée, revenueMasc
(569; 80% of non-emptyGender
): dit, fait, eu, pu, travaillé, vu, arrivé, compris, vécu, étéEMPTY
(3500): a, est, va, dire, voilà, faut, allez, faire, ai, peut
Paradigm aller | Masc | Fem |
---|---|---|
allé | allée |
PROPN
69 PROPN tokens (7% of all PROPN
tokens) have a non-empty value of Gender
.
PROPN
tokens may have the following values of Gender
:
Fem
(31; 45% of non-emptyGender
): Nef, Beauce, Seine, CGC, France, Marne, Mort, Rolex, Shoah, VireMasc
(38; 55% of non-emptyGender
): Kenya, Maître, Gâtinais, Figaro, Beauceron, Argentins, Beaucerons, Chinois, Christ, ConseilEMPTY
(944): France, Paris, Gutiérrez, Chavant, Français, Messi, Rodriguez, Jésus, Europe, Notre-Dame
Gender
seems to be lexical feature of PROPN
. 100% lemmas (28) occur only with one value of Gender
.
AUX
44 AUX tokens (3% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Mood=EMPTY (44; 100%), Number=Sing (44; 100%), Person=EMPTY (44; 100%), Tense=Past (44; 100%), VerbForm=Part (44; 100%).
AUX
tokens may have the following values of Gender
:
Masc
(44; 100% of non-emptyGender
): été, faitEMPTY
(1598): est, a, ai, était, sont, suis, être, ont, avez, êtes
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (2354; 63%),
NOUN –[amod]–> ADJ (771; 82%),
NOUN –[conj]–> NOUN (150; 60%),
ADJ –[nsubj]–> PRON (97; 51%),
NOUN –[reparandum]–> NOUN (92; 69%),
DET –[reparandum]–> DET (83; 75%),
NOUN –[appos]–> NOUN (55; 79%),
ADJ –[nsubj]–> NOUN (37; 82%),
PRON –[reparandum]–> PRON (36; 90%),
VERB –[nsubj:pass]–> NOUN (36; 92%).