Treebank Statistics: UD_French-Sequoia: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
13528 tokens (19%) have a non-empty value of Gender.
2618 types (28%) occur at least once with a non-empty value of Gender.
1748 lemmas (26%) occur at least once with a non-empty value of Gender.
The feature is used with 6 part-of-speech tags: DET (5946; 8% instances), ADJ (3027; 4% instances), VERB (2206; 3% instances), PROPN (1429; 2% instances), PRON (910; 1% instances), AUX (10; 0% instances).
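Counts like the ones above can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function names (`parse_feats`, `iter_words`, `feature_counts`) and the toy sentence in `SAMPLE` are illustrative assumptions, and the sample is not taken from UD_French-Sequoia.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats in ("_", ""):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def iter_words(conllu_text):
    """Yield (form, lemma, upos, feats) for each syntactic word."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])

def feature_counts(conllu_text, feature="Gender"):
    """Return per-UPOS totals, per-UPOS counts with the feature, and value counts."""
    total, with_feature, values = Counter(), Counter(), Counter()
    for _form, _lemma, upos, feats in iter_words(conllu_text):
        total[upos] += 1
        if feature in feats:
            with_feature[upos] += 1
            values[feats[feature]] += 1
    return total, with_feature, values

# Toy sentence for illustration only (hypothetical annotation).
SAMPLE = "\n".join([
    "# text = La maison est grande",
    "\t".join(["1", "La", "le", "DET", "_",
               "Definite=Def|Gender=Fem|Number=Sing|PronType=Art", "2", "det", "_", "_"]),
    "\t".join(["2", "maison", "maison", "NOUN", "_", "Number=Sing", "4", "nsubj", "_", "_"]),
    "\t".join(["3", "est", "être", "AUX", "_",
               "Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin", "4", "cop", "_", "_"]),
    "\t".join(["4", "grande", "grand", "ADJ", "_", "Gender=Fem|Number=Sing", "0", "root", "_", "_"]),
])

total, with_gender, values = feature_counts(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], "of", total[upos], "tokens carry Gender")
```

Running the same function over the full treebank file should give figures comparable to those listed here, provided the page counts syntactic words the same way (an assumption).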
DET
5946 DET tokens (57% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (5898; 99%), PronType=Art (5365; 90%), Definite=Def (4063; 68%).
DET tokens may have the following values of Gender:
Fem (2761; 46% of non-empty Gender): la, une, cette, sa, aucune, certaines, toute, ma, quelles, toutes
Masc (3185; 54% of non-empty Gender): le, un, ce, son, cet, aucun, tout, du, certains, quel
EMPTY (4454): les, l’, des, ces, ses, votre, de, son, leur, d’
| Paradigm le | Masc | Fem |
|---|---|---|
| ExtPos=ADV | le | |
| ExtPos=PRON | le | |
| | le | la |
| Typo=Yes | le | |
ADJ
3027 ADJ tokens (69% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (2011; 66%).
ADJ tokens may have the following values of Gender:
Fem (1386; 46% of non-empty Gender): européenne, première, rénale, française, toutes, nouvelle, intraveineuse, nationale, seule, osseuse
Masc (1641; 54% of non-empty Gender): français, tous, ancien, osseux, zolédronique, premier, nombreux, dernier, faux, général
EMPTY (1360): autres, indésirables, autre, politique, même, clinique, politiques, cliniques, deuxième, jeune
| Paradigm tout | Masc | Fem |
|---|---|---|
| Number=Sing | tout | toute |
| Number=Plur | tous | toutes |
VERB
2206 VERB tokens (37% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2206; 100%), Person=EMPTY (2206; 100%), Tense=EMPTY (2206; 100%), VerbForm=Part (2206; 100%), Number=Sing (1536; 70%), Voice=Pass (1513; 69%).
VERB tokens may have the following values of Gender:
Fem (674; 31% of non-empty Gender): observée, recommandée, administrée, destinée, maintenue, menée, rapportées, traitées, versées, liée
Masc (1532; 69% of non-empty Gender): mis, eu, traités, utilisé, atteints, administré, reçu, pris, fait, présenté
EMPTY (3681): doit, voir, a, peut, doivent, faire, faut, est, peuvent, concernant
| Paradigm devoir | Masc | Fem |
|---|---|---|
| Number=Sing\|Voice=Act | dû, du | |
| Number=Plur\|Voice=Pass | | dues |
PROPN
1429 PROPN tokens (44% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1396; 98%).
PROPN tokens may have the following values of Gender:
Fem (381; 27% of non-empty Gender): France, Paget, Europe, Christine, Denise, Afrique, Chine, Jean, Blanche, Société
Masc (1048; 73% of non-empty Gender): paris, Jacques, Chirac, Taïwan, Michel, Hauts-de-Seine, Didier, Alain, Maupas, François
EMPTY (1829): Aclasta, Angiox, RPR, Halphen, Jean-Claude, Méry, Schuller, Thomson, Francis, Éric
| Paradigm Jean | Masc | Fem |
|---|---|---|
| | Jean | Jean |
Gender seems to be a lexical feature of PROPN: 100% of lemmas (434) occur with only one value of Gender.
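The lexical-feature check can be sketched as follows: group the observed Gender values by lemma and count how many lemmas have exactly one value. The helper name `single_valued_share` and the observation list are hypothetical and only illustrate the idea; they are not the page generator's code or data.

```python
from collections import defaultdict

def single_valued_share(pairs):
    """pairs: iterable of (lemma, gender_value) observations for one UPOS.

    Returns (number of lemmas seen with exactly one value, number of lemmas).
    """
    values_by_lemma = defaultdict(set)
    for lemma, gender in pairs:
        values_by_lemma[lemma].add(gender)
    single = sum(1 for vals in values_by_lemma.values() if len(vals) == 1)
    return single, len(values_by_lemma)

# Hypothetical observations: two lemmas with a fixed gender, one that inflects.
observations = [
    ("France", "Fem"), ("France", "Fem"),
    ("Jacques", "Masc"),
    ("tout", "Masc"), ("tout", "Fem"),
]
single, total = single_valued_share(observations)
print(f"{single} of {total} lemmas occur with only one value")
```

A share close to 100% suggests the feature is a fixed property of the lemma rather than an inflectional one.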
PRON
910 PRON tokens (33% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (904; 99%), Person=3 (861; 95%), Number=Sing (751; 83%), PronType=Prs (666; 73%), Emph=No (626; 69%), Case=Nom (555; 61%).
PRON tokens may have the following values of Gender:
Fem (183; 20% of non-empty Gender): elle, laquelle, elles, la, lesquelles, une, celle-ci, celles, celle, chacune
Masc (727; 80% of non-empty Gender): il, ce, ils, un, le, -il, lui, eux, ceux, lequel
EMPTY (1881): qui, nous, se, je, s’, vous, y, que, c’, j’
| Paradigm lui | Masc | Fem |
|---|---|---|
| Case=Acc\|Emph=No\|Number=Sing | | la |
| Case=Nom\|Emph=No\|ExtPos=ADP\|Number=Sing | il | |
| Case=Nom\|Emph=No\|ExtPos=ADV\|Number=Sing | il | |
| Case=Nom\|Emph=No\|Number=Sing | il, -il | elle, -elle |
| Case=Nom\|Emph=No\|Number=Plur | ils | elles |
| Case=Nom\|Emph=No\|Number=Plur\|Typo=Yes | elles | |
| Emph=No\|Number=Sing | -il, -t-il, le, lui | -t-elle, la |
| Emph=Yes\|Number=Sing | lui | elle |
AUX
10 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (10; 100%), Number=Sing (10; 100%), Person=EMPTY (10; 100%), Tense=EMPTY (10; 100%), VerbForm=Part (10; 100%).
AUX tokens may have the following values of Gender:
Masc (10; 100% of non-empty Gender): fait
EMPTY (2309): est, a, été, ont, être, sont, était, avait, avoir, sera
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
PROPN –[det]–> DET (225; 57%),
VERB –[conj]–> VERB (104; 52%),
PROPN –[conj]–> PROPN (79; 57%),
VERB –[nsubj:pass]–> PRON (58; 52%),
ADJ –[det]–> DET (31; 56%),
PRON –[amod]–> ADJ (9; 69%),
ADJ –[amod]–> ADJ (5; 71%),
ADJ –[obl:mod]–> ADJ (4; 57%),
PRON –[nmod]–> PRON (4; 100%),
ADJ –[parataxis]–> VERB (2; 67%).
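Agreement counts of this kind can be approximated by walking each dependency edge and comparing the feature on parent and child. The sketch below makes one assumption explicitly: the percentage is taken over edges where both nodes carry Gender, which may differ from the denominator used to produce this page. All function names and the two toy sentences are illustrative.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict ('_' means no features)."""
    if feats in ("_", ""):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def sentences(conllu_text):
    """Yield one dict per sentence: word ID -> (upos, feats, head, deprel)."""
    words = {}
    for line in conllu_text.splitlines() + [""]:
        if not line.strip():
            if words:
                yield words
                words = {}
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        words[cols[0]] = (cols[3], parse_feats(cols[5]), cols[6], cols[7])

def agreement_counts(conllu_text, feature="Gender"):
    """Count edges (parent UPOS, deprel, child UPOS) where both nodes have the feature."""
    agree, both = Counter(), Counter()
    for words in sentences(conllu_text):
        for upos, feats, head, deprel in words.values():
            parent = words.get(head)  # None for the root (HEAD = 0)
            if parent is None or feature not in feats or feature not in parent[1]:
                continue
            key = (parent[0], deprel, upos)
            both[key] += 1
            if feats[feature] == parent[1][feature]:
                agree[key] += 1
    return agree, both

# Two toy sentences (hypothetical annotation): one agreeing pair, one not.
SAMPLE = "\n".join([
    "\t".join(["1", "la", "le", "DET", "_", "Gender=Fem|Number=Sing", "2", "det", "_", "_"]),
    "\t".join(["2", "France", "France", "PROPN", "_", "Gender=Fem|Number=Sing", "0", "root", "_", "_"]),
    "",
    "\t".join(["1", "le", "le", "DET", "_", "Gender=Masc|Number=Sing", "2", "det", "_", "_"]),
    "\t".join(["2", "France", "France", "PROPN", "_", "Gender=Fem|Number=Sing", "0", "root", "_", "_"]),
])

agree, both = agreement_counts(SAMPLE)
for key, n in both.items():
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[key]} of {n} agree")
```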