Treebank Statistics: UD_French-ParisStories: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
6634 tokens (16%) have a non-empty value of Gender.
278 types (9%) occur at least once with a non-empty value of Gender.
203 lemmas (9%) occur at least once with a non-empty value of Gender.
The feature is used with 10 part-of-speech tags: PRON (3175; 7% instances), DET (2415; 6% instances), ADJ (643; 2% instances), VERB (292; 1% instances), AUX (42; 0% instances), ADV (33; 0% instances), PROPN (16; 0% instances), X (10; 0% instances), NUM (6; 0% instances), NOUN (2; 0% instances).
PRON
3175 PRON tokens (49% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (3144; 99%), Number=Sing (3002; 95%), Emph=No (1823; 57%), Case=Nom (1744; 55%).
PRON tokens may have the following values of Gender:
Fem(309; 10% of non-emptyGender): elle, elles, la, une, lesquelles, toutes, certaines, elle-mêmeMasc(2866; 90% of non-emptyGender): on, c’, il, ça, ils, ce, le, lui, -ce, toutEMPTY(3254): je, j’, y, qui, tu, me, moi, s’, se, nous
| Paradigm lui | Masc | Fem |
|---|---|---|
| Case=Acc|Emph=No|Number=Sing|Person=3 | le | la |
| Case=Acc|Emph=No|Number=Sing | le | |
| Case=Dat|Emph=No|Number=Sing|Person=3 | lui | |
| Case=Nom|Emph=No|ExtPos=ADP|Number=Sing|Person=3 | il | |
| Case=Nom|Emph=No|ExtPos=VERB|Number=Sing|Person=3 | il | |
| Case=Nom|Emph=No|Number=Sing|Person=3 | il, elle | elle |
| Case=Nom|Emph=No|Number=Plur|Person=3 | ils | elles |
| Emph=No|Number=Sing|Person=3 | lui, le | |
| Emph=Yes|Number=Sing|Person=3 | lui | elle |
| Emph=Yes|Number=Plur|Person=3 | elles |
DET
2415 DET tokens (70% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2404; 100%), Number[psor]=EMPTY (2156; 89%), Person[psor]=EMPTY (2156; 89%), Poss=EMPTY (2156; 89%), PronType=Art (2047; 85%), Definite=Def (1355; 56%).
DET tokens may have the following values of Gender:
Fem(869; 36% of non-emptyGender): la, une, ma, cette, sa, ta, aucune, quelle, certaines, touteMasc(1546; 64% of non-emptyGender): le, un, mon, ce, son, du, ton, cet, des, lesEMPTY(1052): les, l’, des, mes, ses, nos, mon, notre, quelque, chaque
| Paradigm le | Masc | Fem |
|---|---|---|
| Definite=Def|Number=Sing | le | la |
| Definite=Def|Number=Plur | les | |
| Definite=Ind|Number=Sing | le |
ADJ
643 ADJ tokens (54% of all ADJ tokens) have a non-empty value of Gender.
ADJ tokens may have the following values of Gender:
Fem(255; 40% of non-emptyGender): petite, première, toute, toutes, bonne, contente, grande, petites, dernière, différentesMasc(388; 60% of non-emptyGender): tout, petit, tous, premier, gros, mignon, beau, petits, bon, longEMPTY(548): même, vrai, autre, sympa, bizarre, horrible, seule, cool, autres, drôle
| Paradigm tout | Masc | Fem |
|---|---|---|
| _ | tout, tous | toute, toutes |
| PronType=Ind | tout, tous | toute |
VERB
292 VERB tokens (7% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (242; 83%), Tense=EMPTY (242; 83%), VerbForm=Part (241; 83%), Person=EMPTY (237; 81%), Number=EMPTY (211; 72%), Voice=Act (170; 58%).
VERB tokens may have the following values of Gender:
Fem(15; 5% of non-emptyGender): mise, prise, assise, avance, morte, ouverte, soumiseMasc(277; 95% of non-emptyGender): fait, dit, pris, mis, pu, été, compris, écrit, découvert, dégoutéEMPTY(3941): avait, a, sais, voilà, faire, dit, va, aller, avais, vois
| Paradigm prendre | Masc | Fem |
|---|---|---|
| Voice=Act | pris | prise |
| Voice=Pass | pris | prise |
AUX
42 AUX tokens (2% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (42; 100%), Number=Sing (42; 100%), VerbForm=Part (42; 100%), Person=EMPTY (41; 98%), Tense=Past (37; 88%).
AUX tokens may have the following values of Gender:
Masc(42; 100% of non-emptyGender): été, fait, euEMPTY(2234): est, était, a, ai, suis, étais, avait, sont, avais, étaient
ADV
33 ADV tokens (1% of all ADV tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADV and Gender co-occurred: ExtPos=EMPTY (33; 100%), Polarity=EMPTY (33; 100%).
ADV tokens may have the following values of Gender:
Masc(33; 100% of non-emptyGender): mal, tout, plus, superEMPTY(3417): pas, donc, parce, enfin, plus, vraiment, là, très, même, après
PROPN
16 PROPN tokens (4% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Fem(8; 50% of non-emptyGender): Flora, Caraïbes, GoPro, Latine, TerresMasc(8; 50% of non-emptyGender): Anglais, PSG, Chevaliers, MEMPTY(402): Paris, CROUS, Z, Agen, Ecosse, CP, Sanga, Athis, France, Liège
X
10 X tokens (8% of all X tokens) have a non-empty value of Gender.
The most frequent other feature values with which X and Gender co-occurred: Number=Sing (10; 100%), ExtPos=NOUN (6; 60%).
X tokens may have the following values of Gender:
Fem(2; 20% of non-emptyGender): ju~, quest~Masc(8; 80% of non-emptyGender): re~, dispro~, fa~, frig~, fr~, hu~, mid~EMPTY(115): XXX, s~, d~, j~, a~, euh~, i~, m~, pl~, qu~
NUM
6 NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.
NUM tokens may have the following values of Gender:
Fem(1; 17% of non-emptyGender): uneMasc(5; 83% of non-emptyGender): neuf, unEMPTY(237): deux, trois, six, dix, cinq, mille, quatre, huit, quatorze, sept
| Paradigm un | Masc | Fem |
|---|---|---|
| un | une |
NOUN
2 NOUN tokens (0% of all NOUN tokens) have a non-empty value of Gender.
NOUN tokens may have the following values of Gender:
Masc(2; 100% of non-emptyGender): champignon, coocooningEMPTY(4421): coup, fait, peu, genre, temps, fois, ans, maison, moment, mère
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
PRON –[reparandum]–> PRON (65; 96%),
DET –[reparandum]–> DET (50; 77%),
PRON –[amod]–> ADJ (32; 94%),
DET –[fixed]–> ADJ (6; 100%),
PRON –[acl:relcl]–> ADJ (6; 67%),
PRON –[conj]–> PRON (4; 67%),
DET –[nsubj]–> PRON (3; 75%),
PRON –[appos]–> PRON (2; 100%),
ADJ –[appos]–> ADJ (1; 100%),
ADJ –[conj]–> NOUN (1; 100%).