Treebank Statistics: UD_French-Sequoia: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
27727 tokens (39%) have a non-empty value of Gender
.
6134 types (65%) occur at least once with a non-empty value of Gender
.
4396 lemmas (65%) occur at least once with a non-empty value of Gender
.
The feature is used with 8 part-of-speech tags: NOUN (14309; 20% instances), DET (5864; 8% instances), ADJ (3000; 4% instances), VERB (2195; 3% instances), PROPN (1438; 2% instances), PRON (910; 1% instances), AUX (10; 0% instances), NUM (1; 0% instances).
NOUN
14309 NOUN tokens (94% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (10110; 71%).
NOUN
tokens may have the following values of Gender
:
Fem
(6736; 47% of non-emptyGender
): affaire, bivalirudine, commission, perfusion, administration, solution, dose, étude, fois, guerreMasc
(7573; 53% of non-emptyGender
): %, patients, mg, ans, cas, traitement, président, effets, M., coursEMPTY
(878): h, enfants, kg, HLM, ICP, D, collègues, ACT, °C, aide
Paradigm patient | Masc | Fem |
---|---|---|
Number=Sing | patient | patiente |
Number=Sing|Typo=Yes | patient | |
Number=Plur | patients | patientes |
Gender
seems to be lexical feature of NOUN
. 99% lemmas (2721) occur only with one value of Gender
.
DET
5864 DET tokens (56% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Number=Sing (5820; 99%), PronType=Art (5358; 91%), Definite=Def (4064; 69%).
DET
tokens may have the following values of Gender
:
Fem
(2762; 47% of non-emptyGender
): la, une, cette, sa, aucune, certaines, toute, ma, quelles, toutesMasc
(3102; 53% of non-emptyGender
): le, un, ce, cet, aucun, tout, du, certains, quel, tousEMPTY
(4582): les, l’, des, son, ces, ses, votre, de, leur, d’
Paradigm le | Masc | Fem |
---|---|---|
Definite=Def|ExtPos=ADV|PronType=Art | le | |
Definite=Def|ExtPos=PRON|PronType=Art | le | |
Definite=Def|PronType=Art | le, l' | la, l' |
Definite=Def|PronType=Art|Typo=Yes | le | |
Le |
ADJ
3000 ADJ tokens (68% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (1963; 65%).
ADJ
tokens may have the following values of Gender
:
Fem
(1378; 46% of non-emptyGender
): européenne, première, rénale, française, toutes, nouvelle, intraveineuse, nationale, seule, osseuseMasc
(1622; 54% of non-emptyGender
): français, tous, ancien, osseux, zolédronique, premier, nombreux, dernier, compris, fauxEMPTY
(1402): autres, indésirables, autre, politique, même, clinique, politiques, cliniques, deuxième, jeune
Paradigm tout | Masc | Fem |
---|---|---|
Number=Sing | tout | toute |
Number=Plur | tous | toutes |
VERB
2195 VERB tokens (37% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (2195; 100%), Person=EMPTY (2195; 100%), Tense=Past (2195; 100%), VerbForm=Part (2195; 100%), Number=Sing (1536; 70%), Voice=Pass (1476; 67%).
VERB
tokens may have the following values of Gender
:
Fem
(674; 31% of non-emptyGender
): observée, recommandée, administrée, destinée, maintenue, menée, rapportées, traitées, versées, liéeMasc
(1521; 69% of non-emptyGender
): mis, eu, traités, utilisé, atteints, administré, reçu, pris, fait, présentéEMPTY
(3691): doit, voir, a, peut, doivent, faire, faut, est, peuvent, concernant
Paradigm devoir | Masc | Fem |
---|---|---|
Number=Sing | dû, du | |
Number=Plur|Voice=Pass | dues |
PROPN
1438 PROPN tokens (43% of all PROPN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PROPN
and Gender
co-occurred: Number=Sing (1404; 98%).
PROPN
tokens may have the following values of Gender
:
Fem
(386; 27% of non-emptyGender
): France, Paget, Europe, Christine, Denise, Afrique, Chine, Jean, Blanche, SociétéMasc
(1052; 73% of non-emptyGender
): paris, Jacques, Chirac, Taïwan, Michel, Hauts-de-Seine, Didier, Alain, Maupas, FrançoisEMPTY
(1880): Aclasta, Angiox, Union, RPR, Halphen, Jean-Claude, Méry, Schuller, Thomson, Francis
Paradigm Jean | Masc | Fem |
---|---|---|
Jean | Jean |
Gender
seems to be lexical feature of PROPN
. 100% lemmas (438) occur only with one value of Gender
.
PRON
910 PRON tokens (32% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Reflex=EMPTY (904; 99%), Person=3 (861; 95%), Number=Sing (751; 83%), PronType=Prs (666; 73%).
PRON
tokens may have the following values of Gender
:
Fem
(183; 20% of non-emptyGender
): elle, laquelle, elles, la, lesquelles, une, celle-ci, celles, celle, chacuneMasc
(727; 80% of non-emptyGender
): il, ce, ils, un, le, -il, lui, eux, ceux, lequelEMPTY
(1898): qui, nous, se, je, s’, vous, y, que, c’, dont
Paradigm lui | Masc | Fem |
---|---|---|
ExtPos=ADP | il | |
ExtPos=ADV | il | |
il, le, -il, lui, -t-il | elle, la, -elle, -t-elle |
AUX
10 AUX tokens (0% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Mood=EMPTY (10; 100%), Number=Sing (10; 100%), Person=EMPTY (10; 100%), Tense=Past (10; 100%), VerbForm=Part (10; 100%).
AUX
tokens may have the following values of Gender
:
Masc
(10; 100% of non-emptyGender
): faitEMPTY
(2309): est, a, été, ont, être, sont, était, avait, avoir, sera
NUM
1 NUM tokens (0% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumType=Card (1; 100%).
NUM
tokens may have the following values of Gender
:
Masc
(1; 100% of non-emptyGender
): neufEMPTY
(1724): deux, 5, trois, 2, 2006, 10, 1, 30, 3, 4
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (5296; 57%),
NOUN –[amod]–> ADJ (2407; 66%),
NOUN –[acl]–> VERB (625; 63%),
NOUN –[conj]–> NOUN (563; 55%),
VERB –[nsubj:pass]–> NOUN (348; 86%),
PROPN –[det]–> DET (230; 58%),
NOUN –[appos]–> NOUN (130; 54%),
VERB –[conj]–> VERB (104; 52%),
ADJ –[nsubj]–> NOUN (81; 62%),
PROPN –[conj]–> PROPN (79; 56%).