Treebank Statistics: UD_French-ParTUT: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
11371 tokens (40%) have a non-empty value of Gender.
2603 types (63%) occur at least once with a non-empty value of Gender.
1915 lemmas (67%) occur at least once with a non-empty value of Gender.
The feature is used with 6 part-of-speech tags: NOUN (6052; 21% instances), DET (2788; 10% instances), ADJ (1272; 4% instances), VERB (742; 3% instances), PRON (447; 2% instances), AUX (70; 0% instances).
NOUN
6052 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (4312; 71%).
NOUN tokens may have the following values of Gender:
Fem(3169; 52% of non-emptyGender): commission, oeuvre, sécurité, directive, mesures, protection, exigences, décision, madame, matièreMasc(2883; 48% of non-emptyGender): parlement, programme, membres, droit, états, contrat, rapport, conseil, pays, monsieurEMPTY(32): commissaire, gens, collègue, adultes, jeunes, journalistes, politique, protagonistes, représentants, socialistes
| Paradigm œuvre | Masc | Fem |
|---|---|---|
| Number=Sing | oeuvre | oeuvre, œuvre |
| Number=Plur | oeuvres |
Gender seems to be lexical feature of NOUN. 95% lemmas (1190) occur only with one value of Gender.
DET
2788 DET tokens (58% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2439; 87%), PronType=Art (2096; 75%), Definite=Def (1487; 53%).
DET tokens may have the following values of Gender:
Fem(1454; 52% of non-emptyGender): la, une, cette, des, toute, sa, leur, aucune, toutes, votreMasc(1334; 48% of non-emptyGender): le, un, ce, des, son, tous, tout, votre, cet, monEMPTY(1992): les, l’, le, ces, des, chaque, d’, quelques, ce, plusieurs
| Paradigm le | Masc | Fem |
|---|---|---|
| Definite=Def|Number=Sing | le | la |
| Definite=Def|Number=Plur | les | |
| Number=Sing | le | la |
| Number=Plur | les |
ADJ
1272 ADJ tokens (70% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (803; 63%).
ADJ tokens may have the following values of Gender:
Fem(642; 50% of non-emptyGender): présente, grande, dangereuses, telle, sociale, dérivée, première, nouvelle, collective, publiquesMasc(630; 50% of non-emptyGender): présent, européen, structurels, faux, important, premier, public, seul, nouveau, diversEMPTY(542): technique, possible, communautaire, autres, nécessaires, applicables, même, économique, électronique, applicable
| Paradigm présent | Masc | Fem |
|---|---|---|
| présent | présente |
VERB
742 VERB tokens (27% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (742; 100%), Mood=EMPTY (740; 100%), VerbForm=Part (740; 100%), Tense=Past (739; 100%), Number=Sing (467; 63%).
VERB tokens may have the following values of Gender:
Fem(275; 37% of non-emptyGender): dite, dites, accordée, adoptées, harmonisées, prise, prévues, rendues, établie, appliquéesMasc(467; 63% of non-emptyGender): fait, tenu, compris, donné, mis, dit, soumis, nommés, demandé, proposéEMPTY(1994): a, peut, voudrais, doit, est, faire, devrait, concernant, convient, ont
| Paradigm dire | Masc | Fem |
|---|---|---|
| Number=Sing | dit | dite |
| Number=Plur | dites |
PRON
447 PRON tokens (28% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (350; 78%), Number=Sing (337; 75%), PronType=Prs (310; 69%).
PRON tokens may have the following values of Gender:
Fem(85; 19% of non-emptyGender): elle, elles, laquelle, celle, une, auxquelles, celle-ci, la, aucune, cellesMasc(362; 81% of non-emptyGender): il, on, ils, le, ceux, chacun, tous, l’on, -il, NulEMPTY(1171): qui, nous, je, vous, ce, s’, se, c’, que, y
| Paradigm le | Masc | Fem |
|---|---|---|
| le | la |
AUX
70 AUX tokens (8% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (70; 100%), Number=Sing (70; 100%), Person=EMPTY (70; 100%), Tense=Past (70; 100%), VerbForm=Part (70; 100%).
AUX tokens may have the following values of Gender:
Masc(70; 100% of non-emptyGender): été, faitEMPTY(777): est, a, sont, être, ont, sera, soit, soient, étaient, suis
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (2578; 57%),
NOUN –[nmod]–> NOUN (1033; 51%),
NOUN –[amod]–> ADJ (977; 69%),
NOUN –[conj]–> NOUN (281; 55%),
NOUN –[acl]–> VERB (233; 51%),
VERB –[nsubj:pass]–> NOUN (120; 76%),
NOUN –[compound]–> NOUN (60; 91%),
ADJ –[conj]–> ADJ (42; 55%),
ADJ –[nsubj]–> NOUN (36; 58%),
NOUN –[nsubj]–> NOUN (25; 53%).