Treebank Statistics: UD_French-Rhapsodie: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
13316 tokens (30%) have a non-empty value of Gender.
2784 types (61%) occur at least once with a non-empty value of Gender.
2273 lemmas (67%) occur at least once with a non-empty value of Gender.
The feature is used with 7 part-of-speech tags: NOUN (5228; 12% instances), DET (3308; 7% instances), PRON (2423; 5% instances), ADJ (1561; 4% instances), VERB (704; 2% instances), PROPN (48; 0% instances), AUX (44; 0% instances).
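Per-tag counts like the ones above can be reproduced from a CoNLL-U file by checking the FEATS column of every token. The following is a minimal sketch, not the script that generated this page; the `SAMPLE` sentence and the function names `parse_feats` and `gender_by_upos` are illustrative assumptions.

```python
from collections import Counter

# Hypothetical one-sentence sample in CoNLL-U format (10 tab-separated columns).
SAMPLE = "\n".join([
    "# text = la vie est belle",
    "1\tla\tle\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_",
    "2\tvie\tvie\tNOUN\t_\tGender=Fem|Number=Sing\t4\tnsubj\t_\t_",
    "3\test\têtre\tAUX\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t4\tcop\t_\t_",
    "4\tbelle\tbeau\tADJ\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_",
])


def parse_feats(feats):
    """Turn 'A=x|B=y' into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def gender_by_upos(conllu_text):
    """Count tokens with a non-empty Gender per UPOS tag, plus all tokens."""
    counts = Counter()
    total = 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        total += 1
        if "Gender" in parse_feats(cols[5]):
            counts[cols[3]] += 1
    return counts, total


counts, total = gender_by_upos(SAMPLE)
for upos, n in counts.most_common():
    print(f"{upos} ({n}; {100 * n / total:.0f}% instances)")
```

The percentage printed here is relative to all tokens, which matches how the figures above appear to be computed (for example, 5228 NOUN tokens is roughly 12% of the treebank).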
NOUN
5228 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (4000; 77%).
NOUN tokens may have the following values of Gender:
Fem (2267; 43% of non-empty Gender): place, vie, fille, chose, gauche, rue, droite, fois, boule, choses
Masc (2961; 57% of non-empty Gender): peu, fait, moment, ans, français, gens, temps, art, côté, monde
| Paradigm fois | Masc | Fem |
|---|---|---|
| Number=Sing | fois | |
| Number=Plur | fois | fois |
Gender seems to be a lexical feature of NOUN: 99% of lemmas (1531) occur with only one value of Gender.
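The "lexical feature" test can be read as: group tokens by lemma and check how many lemmas are only ever seen with a single Gender value. Below is a minimal sketch under that assumption; the `TOKENS` list is toy data (not real treebank counts) and `single_valued_lemmas` is a hypothetical helper name.

```python
from collections import defaultdict

# Toy (lemma, upos, gender) triples; illustrative only.
TOKENS = [
    ("fois", "NOUN", "Fem"),
    ("fois", "NOUN", "Masc"),
    ("vie", "NOUN", "Fem"),
    ("vie", "NOUN", "Fem"),
    ("temps", "NOUN", "Masc"),
    ("petit", "ADJ", "Masc"),
    ("petit", "ADJ", "Fem"),
]


def single_valued_lemmas(tokens, upos):
    """Return (lemmas seen with exactly one Gender value, all lemmas with Gender)."""
    values = defaultdict(set)
    for lemma, tag, gender in tokens:
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    single = sum(1 for genders in values.values() if len(genders) == 1)
    return single, len(values)


single, total = single_valued_lemmas(TOKENS, "NOUN")
print(f"{single} of {total} NOUN lemmas occur with only one value of Gender")
```

A high share of single-valued lemmas suggests the value is a property of the lemma rather than of the inflected form, which is the contrast with ADJ or DET, where one lemma typically has both Masc and Fem forms.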
DET
3308 DET tokens (74% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (3294; 100%), PronType=Art (2901; 88%), Definite=Def (2135; 65%).
DET tokens may have the following values of Gender:
Fem (1426; 43% of non-empty Gender): la, une, l’, cette, sa, ma, son, toute, mon, aucune
Masc (1882; 57% of non-empty Gender): le, un, l’, ce, mon, son, cet, du, aucun, quel
EMPTY (1148): les, des, l’, votre, notre, ces, ses, quelque, vos, nos
| Paradigm le | Masc | Fem |
|---|---|---|
| | le, l' | la, l' |
PRON
2423 PRON tokens (46% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (2374; 98%), Number=Sing (2224; 92%), Case=EMPTY (1335; 55%), Emph=EMPTY (1299; 54%).
PRON tokens may have the following values of Gender:
Fem (132; 5% of non-empty Gender): elle, elles, une, la, laquelle, celle, celles, chacune, aucune, celle-là
Masc (2291; 95% of non-empty Gender): c’, il, on, ça, ce, ils, le, -ce, lui, rien
EMPTY (2857): je, qui, vous, y, j’, nous, se, tu, que, me
| Paradigm lui | Masc | Fem |
|---|---|---|
| Case=Acc\|Emph=No\|Number=Sing | le | la |
| Case=Nom\|Emph=No\|ExtPos=ADP\|Number=Sing | il | |
| Case=Nom\|Emph=No\|Number=Sing | il, -il | elle |
| Case=Nom\|Emph=No\|Number=Plur | ils | elles |
| Emph=No\|Number=Sing | le | |
| Emph=Yes\|Number=Sing | lui | elle |
| Emph=Yes\|Number=Plur | eux | |
| Number=Sing | -il, le, -t-il | |
| Number=Plur | -ils | |
ADJ
1561 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1243; 80%).
ADJ tokens may have the following values of Gender:
Fem (617; 40% of non-empty Gender): jeune, grande, petite, autre, magique, bonne, toutes, première, toute, même
Masc (944; 60% of non-empty Gender): tout, petit, tous, vrai, droit, premier, sûr, même, bon, grand
| Paradigm tout | Masc | Fem |
|---|---|---|
| Number=Sing | tout | toute |
| Number=Plur | tous | toutes |
VERB
704 VERB tokens (17% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (704; 100%), Person=EMPTY (704; 100%), Tense=EMPTY (704; 100%), VerbForm=Part (704; 100%), Number=Sing (603; 86%), Voice=Act (446; 63%).
VERB tokens may have the following values of Gender:
Fem (146; 21% of non-empty Gender): arrivée, née, venue, rentrée, restée, étonnée, obligée, partie, renforcée, revenue
Masc (558; 79% of non-empty Gender): dit, fait, eu, pu, travaillé, vu, arrivé, compris, vécu, été
EMPTY (3445): a, est, va, dire, voilà, faut, allez, faire, ai, peut
| Paradigm aller | Masc | Fem |
|---|---|---|
| Voice=Act | allée | |
| Voice=Pass | allé | |
PROPN
48 PROPN tokens (5% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Fem (28; 58% of non-empty Gender): Nef, Beauce, Seine, CGC, France, Marne, Rolex, Shoah, Vire
Masc (20; 42% of non-empty Gender): Kenya, Gâtinais, Figaro, Beauceron, Christ, Parisien
EMPTY (909): France, Paris, Gutiérrez, Chavant, Messi, Rodriguez, Jésus, Europe, Notre-Dame, Szymanoski
Gender seems to be a lexical feature of PROPN: 100% of lemmas (15) occur with only one value of Gender.
AUX
44 AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (44; 100%), Number=Sing (44; 100%), Person=EMPTY (44; 100%), Tense=Past (44; 100%), VerbForm=Part (44; 100%).
AUX tokens may have the following values of Gender:
Masc (44; 100% of non-empty Gender): été, fait
EMPTY (1622): est, a, ai, était, sont, suis, être, ont, avez, êtes
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (2819; 75%),
NOUN –[amod]–> ADJ (934; 100%),
NOUN –[conj]–> NOUN (153; 60%),
ADJ –[nsubj]–> PRON (147; 72%),
NOUN –[reparandum]–> NOUN (93; 79%),
DET –[reparandum]–> DET (88; 81%),
DET –[fixed]–> NOUN (81; 100%),
NOUN –[appos]–> NOUN (56; 79%),
ADJ –[nsubj]–> NOUN (45; 100%),
ADJ –[conj]–> ADJ (42; 100%).
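
Agreement counts of this kind can be gathered by walking every dependency edge and comparing the Gender of the parent and child nodes. The sketch below is illustrative only: the `SENTENCE` tuples are a toy example, `gender_agreement` is a hypothetical name, and treating the percentage as the agreeing share of all edges with the same pattern is an assumption, since the page does not state its denominator.

```python
from collections import Counter

# Toy sentence "la vie est belle": (id, upos, gender or None, head, deprel).
SENTENCE = [
    (1, "DET", "Fem", 2, "det"),
    (2, "NOUN", "Fem", 4, "nsubj"),
    (3, "AUX", None, 4, "cop"),
    (4, "ADJ", "Fem", 0, "root"),
]


def gender_agreement(sentence):
    """Count edges per (parent UPOS, deprel, child UPOS), and those agreeing in Gender."""
    by_id = {tok[0]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for _tok_id, upos, gender, head, deprel in sentence:
        parent = by_id.get(head)
        if parent is None:
            continue  # the root has no parent node
        pattern = (parent[1], deprel, upos)
        total[pattern] += 1
        # Agreement requires a non-empty Gender on both nodes with equal values.
        if gender is not None and gender == parent[2]:
            agree[pattern] += 1
    return agree, total


agree, total = gender_agreement(SENTENCE)
for (parent_upos, deprel, child_upos), n in agree.most_common():
    share = 100 * n / total[(parent_upos, deprel, child_upos)]
    print(f"{parent_upos} –[{deprel}]–> {child_upos} ({n}; {share:.0f}%)")
```

For a full treebank, the two counters would be accumulated over all sentences before printing the most frequent patterns.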