Treebank Statistics: UD_French-Sequoia: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
30618 tokens (43%) have a non-empty value of Gender.
6700 types (71%) occur at least once with a non-empty value of Gender.
4801 lemmas (71%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags (count; share of all treebank tokens): NOUN (14575; 21%), DET (7102; 10%), ADJ (4387; 6%), VERB (2206; 3%), PROPN (1425; 2%), PRON (910; 1%), AUX (10; 0%), NUM (3; 0%).
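Counts like the ones above can be reproduced from a CoNLL-U file. The following is a minimal sketch, assuming the standard ten-column CoNLL-U layout (UPOS in column 4, FEATS in column 6); the function names `parse_feats` and `gender_coverage` and the three-token sample sentence are illustrative and not part of the treebank tooling.

```python
from collections import Counter

# Toy CoNLL-U fragment (hypothetical sentence, tab-separated columns).
SAMPLE = """\
# text = La patiente dort
1\tLa\tle\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tpatiente\tpatient\tNOUN\t_\tGender=Fem|Number=Sing\t3\tnsubj\t_\t_
3\tdort\tdormir\tVERB\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t0\troot\t_\t_
"""


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def gender_coverage(conllu_text):
    """Return (all tokens, tokens with Gender, per-UPOS counts of tokens with Gender)."""
    total = 0
    with_gender = 0
    by_upos = Counter()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        total += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender += 1
            by_upos[cols[3]] += 1
    return total, with_gender, by_upos


total, with_gender, by_upos = gender_coverage(SAMPLE)
print(total, with_gender, dict(by_upos))
```

Dividing `with_gender` and each per-tag count by `total` gives percentages of the kind reported above.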
NOUN
14575 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (10390; 71%).
NOUN tokens may have the following values of Gender:
Fem (6959; 48% of non-empty Gender): affaire, bivalirudine, commission, perfusion, administration, solution, dose, étude, fois, union
Masc (7616; 52% of non-empty Gender): patients, mg, ans, cas, traitement, président, effets, M., cours, ml
EMPTY (453): enfants, HLM, ICP, D, collègues, ACT, B, intermédiaires, A, ISBN
| Paradigm patient | Masc | Fem |
|---|---|---|
| Number=Sing | patient | patiente |
| Number=Sing\|Typo=Yes | patient | |
| Number=Plur | patients | patientes |
Gender seems to be a lexical feature of NOUN: 99% of lemmas (2775) occur with only one value of Gender.
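The "lexical feature" observation can be checked by collecting, for each lemma of a given part of speech, the set of Gender values it occurs with. The sketch below assumes the input has already been reduced to `(lemma, upos, gender)` tuples; the function name `single_gender_share` and the toy rows are hypothetical, and the exact criterion used by the statistics generator is not stated on this page.

```python
from collections import defaultdict

# Toy rows: (lemma, UPOS, Gender value or None when Gender is empty).
ROWS = [
    ("affaire", "NOUN", "Fem"),
    ("patient", "NOUN", "Masc"),
    ("patient", "NOUN", "Fem"),
    ("dose", "NOUN", "Fem"),
    ("traitement", "NOUN", "Masc"),
    ("enfant", "NOUN", None),   # empty Gender: ignored
    ("le", "DET", "Fem"),       # other part of speech: ignored
]


def single_gender_share(rows, upos):
    """Return (lemmas with exactly one Gender value, their share among lemmas with Gender)."""
    values = defaultdict(set)
    for lemma, tag, gender in rows:
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, single / len(values)


single, share = single_gender_share(ROWS, "NOUN")
print(single, share)
```

A share close to 1 suggests that Gender is fixed per lemma (lexical), while a lower share points to inflectional use, as with lemmas like "patient" that have both a Masc and a Fem form.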
DET
7102 DET tokens (68% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (7054; 99%), PronType=Art (6418; 90%), Definite=Def (5152; 73%).
DET tokens may have the following values of Gender:
Fem (3496; 49% of non-empty Gender): la, l’, une, cette, sa, son, aucune, certaines, toute, ma
Masc (3606; 51% of non-empty Gender): le, un, l’, ce, son, cet, aucun, tout, du, certains
EMPTY (3298): les, des, l’, ces, ses, votre, de, leur, d’, plusieurs
| Paradigm le | Masc | Fem |
|---|---|---|
| ExtPos=ADV | le | |
| ExtPos=PRON | le | |
| | le, l' | la, l' |
| Typo=Yes | le | |
ADJ
4387 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (2932; 67%).
ADJ tokens may have the following values of Gender:
Fem (2064; 47% of non-empty Gender): européenne, première, rénale, française, autres, toutes, clinique, nouvelle, politique, intraveineuse
Masc (2323; 53% of non-empty Gender): tous, ancien, autres, indésirables, zolédronique, premier, français, dernier, général, autre
| Paradigm autre | Masc | Fem |
|---|---|---|
| _ | AUTRE(S) | |
| Number=Sing | autre | autre |
| Number=Plur | autres | autres |
VERB
2206 VERB tokens (37% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2206; 100%), Person=EMPTY (2206; 100%), Tense=EMPTY (2206; 100%), VerbForm=Part (2206; 100%), Number=Sing (1536; 70%), Voice=Pass (1513; 69%).
VERB tokens may have the following values of Gender:
Fem (674; 31% of non-empty Gender): observée, recommandée, administrée, destinée, maintenue, menée, rapportées, traitées, versées, liée
Masc (1532; 69% of non-empty Gender): mis, eu, traités, utilisé, atteints, administré, reçu, pris, fait, présenté
EMPTY (3681): doit, voir, a, peut, doivent, faire, faut, est, peuvent, concernant
| Paradigm devoir | Masc | Fem |
|---|---|---|
| Number=Sing\|Voice=Act | dû, du | |
| Number=Plur\|Voice=Pass | | dues |
PROPN
1425 PROPN tokens (44% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1392; 98%).
PROPN tokens may have the following values of Gender:
Fem (377; 26% of non-empty Gender): France, Paget, Europe, Christine, Denise, Afrique, Chine, Jean, Blanche, Société
Masc (1048; 74% of non-empty Gender): paris, Jacques, Chirac, Taïwan, Michel, Hauts-de-Seine, Didier, Alain, Maupas, François
EMPTY (1825): Aclasta, Angiox, RPR, Halphen, Jean-Claude, Méry, Schuller, Thomson, Francis, Éric
| Paradigm Jean | Masc | Fem |
|---|---|---|
| | Jean | Jean |
Gender seems to be a lexical feature of PROPN: 100% of lemmas (434) occur with only one value of Gender.
PRON
910 PRON tokens (33% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (904; 99%), Person=3 (861; 95%), Number=Sing (751; 83%), PronType=Prs (666; 73%), Emph=No (626; 69%), Case=Nom (555; 61%).
PRON tokens may have the following values of Gender:
Fem (183; 20% of non-empty Gender): elle, laquelle, elles, la, lesquelles, une, celle-ci, celles, celle, chacune
Masc (727; 80% of non-empty Gender): il, ce, ils, un, le, -il, lui, eux, ceux, lequel
EMPTY (1831): qui, nous, se, je, s’, vous, y, que, c’, j’
| Paradigm lui | Masc | Fem |
|---|---|---|
| Case=Acc\|Emph=No\|Number=Sing | le | la |
| Case=Nom\|Emph=No\|ExtPos=ADP\|Number=Sing | il | |
| Case=Nom\|Emph=No\|ExtPos=ADV\|Number=Sing | il | |
| Case=Nom\|Emph=No\|Number=Sing | il, -il | elle, -elle |
| Case=Nom\|Emph=No\|Number=Plur | ils, -ils | elles |
| Case=Nom\|Emph=No\|Number=Plur\|Typo=Yes | elles | |
| Emph=No\|Number=Sing | -il, -t-il, le, lui | -t-elle, la |
| Emph=No\|Number=Plur | -ils | |
| Emph=Yes\|Number=Sing | lui | elle |
| Emph=Yes\|Number=Plur | eux | |
AUX
10 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (10; 100%), Number=Sing (10; 100%), Person=EMPTY (10; 100%), Tense=EMPTY (10; 100%), VerbForm=Part (10; 100%).
AUX tokens may have the following values of Gender:
Masc (10; 100% of non-empty Gender): fait
EMPTY (2309): est, a, été, ont, être, sont, était, avait, avoir, sera
NUM
3 NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (3; 100%), Number=Sing (3; 100%).
NUM tokens may have the following values of Gender:
Masc (3; 100% of non-empty Gender): 2006-08-07
EMPTY (1823): deux, 5, trois, 2, 2006, 10, 1, 30, 3, 4
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (6490; 69%),
NOUN –[amod]–> ADJ (3641; 99%),
NOUN –[acl]–> VERB (631; 63%),
NOUN –[conj]–> NOUN (572; 56%),
VERB –[nsubj:pass]–> NOUN (349; 86%),
PROPN –[det]–> DET (270; 69%),
NOUN –[appos]–> NOUN (139; 58%),
ADJ –[nsubj]–> NOUN (130; 100%),
ADJ –[conj]–> ADJ (118; 98%),
VERB –[conj]–> VERB (104; 52%).
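Agreement counts of this kind can be derived by walking every dependency edge and comparing the Gender value of the parent and the child. The sketch below assumes standard CoNLL-U input; the helper names (`gender_of`, `sentences`, `agreement_counts`) and the toy sentences are hypothetical. It also assumes that the percentage is the number of agreeing edges divided by all edges of the same (parent UPOS, relation, child UPOS) type, which this page does not state explicitly.

```python
from collections import Counter

# Two toy sentences (hypothetical), tab-separated CoNLL-U columns.
SAMPLE = (
    "1\tLa\tle\tDET\t_\tGender=Fem|Number=Sing\t2\tdet\t_\t_\n"
    "2\tsolution\tsolution\tNOUN\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_\n"
    "3\tadministrée\tadministrer\tVERB\t_\tGender=Fem|Number=Sing|VerbForm=Part\t2\tacl\t_\t_\n"
    "\n"
    "1\tLes\tle\tDET\t_\tNumber=Plur\t2\tdet\t_\t_\n"
    "2\tpatients\tpatient\tNOUN\t_\tGender=Masc|Number=Plur\t0\troot\t_\t_\n"
)


def gender_of(feats):
    """Return the Gender value from a FEATS string, or None if it is absent."""
    for item in feats.split("|"):
        if item.startswith("Gender="):
            return item.split("=", 1)[1]
    return None


def sentences(conllu_text):
    """Yield each sentence as a list of column lists (basic tokens only)."""
    sent = []
    for line in conllu_text.splitlines():
        if not line.strip():
            if sent:
                yield sent
                sent = []
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if cols[0].isdigit():  # skip multiword ranges and empty nodes
            sent.append(cols)
    if sent:
        yield sent


def agreement_counts(conllu_text):
    """Count edges per (parent UPOS, deprel, child UPOS): agreeing in Gender, and all."""
    agree, total = Counter(), Counter()
    for sent in sentences(conllu_text):
        by_id = {int(cols[0]): cols for cols in sent}
        for child in sent:
            head = int(child[6])
            if head == 0:
                continue  # the root has no parent token
            parent = by_id[head]
            key = (parent[3], child[7], child[3])
            total[key] += 1
            g_parent, g_child = gender_of(parent[5]), gender_of(child[5])
            if g_parent is not None and g_parent == g_child:
                agree[key] += 1
    return agree, total


agree, total = agreement_counts(SAMPLE)
for key, n in agree.most_common():
    print(key, n, f"{100 * n / total[key]:.0f}%")
```

In the toy data, the second determiner ("Les") has no Gender value, so it counts toward the total of `NOUN –[det]–> DET` edges but not toward the agreeing ones.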