Treebank Statistics: UD_French-Sequoia: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
27934 tokens (40%) have a non-empty value of Gender
.
6152 types (65%) occur at least once with a non-empty value of Gender
.
4403 lemmas (65%) occur at least once with a non-empty value of Gender
.
The feature is used with 8 part-of-speech tags: NOUN (14522; 21% instances), DET (5864; 8% instances), ADJ (2999; 4% instances), VERB (2195; 3% instances), PROPN (1433; 2% instances), PRON (910; 1% instances), AUX (10; 0% instances), NUM (1; 0% instances).
NOUN
14522 NOUN tokens (95% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (10300; 71%).
NOUN
tokens may have the following values of Gender
:
Fem
(6882; 47% of non-emptyGender
): affaire, bivalirudine, commission, perfusion, administration, solution, dose, étude, fois, unionMasc
(7640; 53% of non-emptyGender
): %, patients, mg, ans, cas, traitement, président, effets, M., coursEMPTY
(697): enfants, HLM, ICP, D, collègues, ACT, °C, B, intermédiaires, responsables
Paradigm patient | Masc | Fem |
---|---|---|
Number=Sing | patient | patiente |
Number=Sing|Typo=Yes | patient | |
Number=Plur | patients | patientes |
Gender
seems to be lexical feature of NOUN
. 99% lemmas (2731) occur only with one value of Gender
.
DET
5864 DET tokens (56% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Number=Sing (5820; 99%), PronType=Art (5358; 91%), Definite=Def (4064; 69%).
DET
tokens may have the following values of Gender
:
Fem
(2762; 47% of non-emptyGender
): la, une, cette, sa, aucune, certaines, toute, ma, quelles, toutesMasc
(3102; 53% of non-emptyGender
): le, un, ce, cet, aucun, tout, du, certains, quel, tousEMPTY
(4533): les, l’, des, son, ces, ses, votre, de, leur, d’
Paradigm le | Masc | Fem |
---|---|---|
Definite=Def|ExtPos=ADV|PronType=Art | le | |
Definite=Def|ExtPos=PRON|PronType=Art | le | |
Definite=Def|PronType=Art | le, l' | la, l' |
Definite=Def|PronType=Art|Typo=Yes | le | |
Le |
ADJ
2999 ADJ tokens (68% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (1963; 65%).
ADJ
tokens may have the following values of Gender
:
Fem
(1377; 46% of non-emptyGender
): européenne, première, rénale, française, toutes, nouvelle, intraveineuse, nationale, seule, osseuseMasc
(1622; 54% of non-emptyGender
): français, tous, ancien, osseux, zolédronique, premier, nombreux, dernier, compris, fauxEMPTY
(1403): autres, indésirables, autre, politique, même, clinique, politiques, cliniques, deuxième, jeune
Paradigm tout | Masc | Fem |
---|---|---|
Number=Sing | tout | toute |
Number=Plur | tous | toutes |
VERB
2195 VERB tokens (37% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (2195; 100%), Person=EMPTY (2195; 100%), Tense=Past (2195; 100%), VerbForm=Part (2195; 100%), Number=Sing (1536; 70%), Voice=Pass (1476; 67%).
VERB
tokens may have the following values of Gender
:
Fem
(674; 31% of non-emptyGender
): observée, recommandée, administrée, destinée, maintenue, menée, rapportées, traitées, versées, liéeMasc
(1521; 69% of non-emptyGender
): mis, eu, traités, utilisé, atteints, administré, reçu, pris, fait, présentéEMPTY
(3691): doit, voir, a, peut, doivent, faire, faut, est, peuvent, concernant
Paradigm devoir | Masc | Fem |
---|---|---|
Number=Sing | dû, du | |
Number=Plur|Voice=Pass | dues |
PROPN
1433 PROPN tokens (44% of all PROPN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PROPN
and Gender
co-occurred: Number=Sing (1399; 98%).
PROPN
tokens may have the following values of Gender
:
Fem
(383; 27% of non-emptyGender
): France, Paget, Europe, Christine, Denise, Afrique, Chine, Jean, Blanche, SociétéMasc
(1050; 73% of non-emptyGender
): paris, Jacques, Chirac, Taïwan, Michel, Hauts-de-Seine, Didier, Alain, Maupas, FrançoisEMPTY
(1828): Aclasta, Angiox, RPR, Halphen, Jean-Claude, Méry, Schuller, Thomson, Francis, Éric
Paradigm Jean | Masc | Fem |
---|---|---|
Jean | Jean |
Gender
seems to be lexical feature of PROPN
. 100% lemmas (436) occur only with one value of Gender
.
PRON
910 PRON tokens (32% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Reflex=EMPTY (904; 99%), Person=3 (861; 95%), Number=Sing (751; 83%), PronType=Prs (666; 73%).
PRON
tokens may have the following values of Gender
:
Fem
(183; 20% of non-emptyGender
): elle, laquelle, elles, la, lesquelles, une, celle-ci, celles, celle, chacuneMasc
(727; 80% of non-emptyGender
): il, ce, ils, un, le, -il, lui, eux, ceux, lequelEMPTY
(1898): qui, nous, se, je, s’, vous, y, que, c’, dont
Paradigm lui | Masc | Fem |
---|---|---|
ExtPos=ADP | il | |
ExtPos=ADV | il | |
il, le, -il, lui, -t-il | elle, la, -elle, -t-elle |
AUX
10 AUX tokens (0% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Mood=EMPTY (10; 100%), Number=Sing (10; 100%), Person=EMPTY (10; 100%), Tense=Past (10; 100%), VerbForm=Part (10; 100%).
AUX
tokens may have the following values of Gender
:
Masc
(10; 100% of non-emptyGender
): faitEMPTY
(2309): est, a, été, ont, être, sont, était, avait, avoir, sera
NUM
1 NUM tokens (0% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumType=Card (1; 100%).
NUM
tokens may have the following values of Gender
:
Masc
(1; 100% of non-emptyGender
): neufEMPTY
(1776): deux, 5, trois, 2, 2006, 10, 1, 30, 3, 4
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (5309; 57%),
NOUN –[amod]–> ADJ (2456; 67%),
NOUN –[acl]–> VERB (629; 63%),
NOUN –[conj]–> NOUN (569; 55%),
VERB –[nsubj:pass]–> NOUN (351; 86%),
PROPN –[det]–> DET (228; 58%),
NOUN –[appos]–> NOUN (132; 55%),
VERB –[conj]–> VERB (104; 52%),
ADJ –[nsubj]–> NOUN (81; 62%),
PROPN –[conj]–> PROPN (79; 56%).