Treebank Statistics: UD_French-Rhapsodie: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
6348 tokens (14%) have a non-empty value of Gender.
392 types (9%) occur at least once with a non-empty value of Gender.
273 lemmas (8%) occur at least once with a non-empty value of Gender.
The feature is used with 7 part-of-speech tags: DET (2908; 7% instances), PRON (2385; 5% instances), ADJ (788; 2% instances), VERB (154; 0% instances), PROPN (66; 0% instances), AUX (44; 0% instances), NOUN (3; 0% instances).
DET
2908 DET tokens (65% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2894; 100%), PronType=Art (2522; 87%), Definite=Def (1756; 60%).
DET tokens may have the following values of Gender:
Fem(1241; 43% of non-emptyGender): la, une, cette, sa, ma, toute, aucune, quelle, certaines, taMasc(1667; 57% of non-emptyGender): le, un, ce, mon, son, cet, du, aucun, quel, certainsEMPTY(1549): les, l’, des, votre, notre, ces, ses, quelque, vos, nos
| Paradigm le | Masc | Fem |
|---|---|---|
| le | la |
PRON
2385 PRON tokens (45% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (2337; 98%), Number=Sing (2200; 92%), Case=EMPTY (1297; 54%), Emph=EMPTY (1261; 53%).
PRON tokens may have the following values of Gender:
Fem(132; 6% of non-emptyGender): elle, elles, une, la, celle, laquelle, celles, chacune, aucune, elle-mêmeMasc(2253; 94% of non-emptyGender): c’, il, on, ça, ils, ce, le, -ce, lui, rienEMPTY(2953): je, qui, vous, y, j’, nous, se, tu, que, me
| Paradigm lui | Masc | Fem |
|---|---|---|
| Case=Nom|Emph=No|ExtPos=ADP|Number=Sing | il | |
| Case=Nom|Emph=No|Number=Sing | il, -il | elle |
| Case=Nom|Emph=No|Number=Plur | ils | elles |
| Emph=Yes|Number=Sing | lui | elle |
| Number=Sing | -il, le, -t-il |
ADJ
788 ADJ tokens (50% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=EMPTY (624; 79%).
ADJ tokens may have the following values of Gender:
Fem(291; 37% of non-emptyGender): bonne, petite, toutes, grande, toute, droite, première, certaine, chrétienne, différentesMasc(497; 63% of non-emptyGender): tout, tous, petit, droit, français, premier, bon, petits, gros, grandEMPTY(776): jeune, autre, vrai, même, sûr, magique, difficile, autres, jeunes, propre
| Paradigm tout | Masc | Fem |
|---|---|---|
| _ | tout, tous | toutes, toute |
| Number=Plur | tous |
VERB
154 VERB tokens (4% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (154; 100%), Person=EMPTY (154; 100%), Tense=EMPTY (154; 100%), VerbForm=Part (154; 100%), Number=EMPTY (127; 82%), Voice=Act (122; 79%).
VERB tokens may have the following values of Gender:
Fem(13; 8% of non-emptyGender): faite, mise, prise, prises, comprise, dites, faites, ouverte, produitesMasc(141; 92% of non-emptyGender): dit, fait, pu, compris, été, pris, mis, assis, fallu, mortEMPTY(3995): a, est, va, dire, voilà, faut, allez, faire, ai, peut
| Paradigm dire | Masc | Fem |
|---|---|---|
| dit | ||
| Voice=Act | dit | dites |
PROPN
66 PROPN tokens (7% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Fem(31; 47% of non-emptyGender): Nef, Beauce, Seine, CGC, France, Marne, Mort, Rolex, Shoah, VireMasc(35; 53% of non-emptyGender): Kenya, Maître, Gâtinais, Figaro, Beauceron, Chinois, Christ, Conseil, Général, HommesEMPTY(910): France, Paris, Gutiérrez, Chavant, Messi, Rodriguez, Jésus, Europe, Notre-Dame, Szymanoski
Gender seems to be lexical feature of PROPN. 100% lemmas (26) occur only with one value of Gender.
AUX
44 AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (44; 100%), Number=Sing (44; 100%), Person=EMPTY (44; 100%), Tense=Past (44; 100%), VerbForm=Part (44; 100%).
AUX tokens may have the following values of Gender:
Masc(44; 100% of non-emptyGender): été, faitEMPTY(1622): est, a, ai, était, sont, suis, être, ont, avez, êtes
NOUN
3 NOUN tokens (0% of all NOUN tokens) have a non-empty value of Gender.
NOUN tokens may have the following values of Gender:
Masc(3; 100% of non-emptyGender): Argentins, Beaucerons, FrançaisEMPTY(5209): peu, fait, moment, ans, place, gens, art, côté, monde, temps
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
DET –[reparandum]–> DET (85; 79%),
PRON –[reparandum]–> PRON (36; 92%),
PRON –[nsubj]–> PRON (25; 54%),
PRON –[amod]–> ADJ (14; 88%),
DET –[fixed]–> ADJ (11; 100%),
ADJ –[dislocated]–> PRON (6; 75%),
ADJ –[reparandum]–> ADJ (3; 100%),
DET –[conj]–> DET (3; 100%),
ADJ –[nsubj:outer]–> PRON (2; 100%),
ADJ –[nsubj]–> ADJ (2; 100%).