Treebank Statistics: UD_French-FQB: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
9830 tokens (41%) have a non-empty value of Gender.
2532 types (60%) occur at least once with a non-empty value of Gender.
2167 lemmas (60%) occur at least once with a non-empty value of Gender.
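The token, type, and lemma counts above can be reproduced in principle with a few lines of Python. The sketch below is illustrative only: the tokens are hand-written rather than taken from UD_French-FQB, the helper names `parse_feats` and `coverage` are hypothetical, and it assumes that a "type" is a case-sensitive word form.

```python
def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# (FORM, LEMMA, UPOS, FEATS): hand-written illustrative tokens, not treebank data.
TOKENS = [
    ("La", "le", "DET", "Definite=Def|Gender=Fem|Number=Sing|PronType=Art"),
    ("ville", "ville", "NOUN", "Gender=Fem|Number=Sing"),
    ("est", "être", "AUX", "Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin"),
    ("grande", "grand", "ADJ", "Gender=Fem|Number=Sing"),
    ("et", "et", "CCONJ", "_"),
    ("le", "le", "DET", "Definite=Def|Gender=Masc|Number=Sing|PronType=Art"),
    ("pays", "pays", "NOUN", "Gender=Masc|Number=Sing"),
    ("aussi", "aussi", "ADV", "_"),
]


def coverage(tokens, feature):
    """Count tokens, types and lemmas that occur at least once with `feature`.

    Each entry of the result is a pair (count with the feature, overall count).
    """
    marked = [t for t in tokens if feature in parse_feats(t[3])]
    return {
        "tokens": (len(marked), len(tokens)),
        "types": (len({t[0] for t in marked}), len({t[0] for t in tokens})),
        "lemmas": (len({t[1] for t in marked}), len({t[1] for t in tokens})),
    }


for name, (hit, total) in coverage(TOKENS, "Gender").items():
    print(f"{hit} {name} ({round(100 * hit / total)}%) have a non-empty value of Gender")
```

On a real treebank, the same function would be fed the FORM, LEMMA, UPOS, and FEATS columns read from the CoNLL-U files.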
The feature is used with 8 part-of-speech tags: NOUN (3674; 15% instances), DET (2801; 12% instances), ADJ (1238; 5% instances), PROPN (932; 4% instances), VERB (768; 3% instances), PRON (413; 2% instances), ADP (3; 0% instances), AUX (1; 0% instances).
NOUN
3674 NOUN tokens (91% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3036; 83%).
NOUN tokens may have the following values of Gender:
Fem (1546; 42% of non-empty Gender): année, ville, compagnie, population, capitale, guerre, date, taxe, université, équipe
Masc (2128; 58% of non-empty Gender): nom, pays, président, état, lieu, logement, film, prix, corps, temps
EMPTY (377): aide, espace, CNN, période, fin, livre, enfants, radio, CPR, tour
Paradigm président
| Masc | Fem |
|---|---|
| président | présidente |
Gender seems to be a lexical feature of NOUN. 99% of lemmas (1235) occur with only one value of Gender.
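A figure of this kind can presumably be obtained by collecting, for each lemma of a given part of speech, the set of Gender values it occurs with, and then counting the lemmas whose set has exactly one member. The following is a minimal sketch under that assumption; the function name `single_valued_lemmas` and the sample tokens are hypothetical and are not drawn from the treebank.

```python
from collections import defaultdict

# (LEMMA, UPOS, features): hand-written illustrative tokens, not treebank data.
TOKENS = [
    ("année", "NOUN", {"Gender": "Fem", "Number": "Sing"}),
    ("année", "NOUN", {"Gender": "Fem", "Number": "Plur"}),
    ("pays", "NOUN", {"Gender": "Masc", "Number": "Sing"}),
    ("président", "NOUN", {"Gender": "Masc", "Number": "Sing"}),
    ("président", "NOUN", {"Gender": "Fem", "Number": "Sing"}),  # form "présidente"
    ("fin", "NOUN", {"Number": "Sing"}),  # empty Gender, ignored below
    ("grand", "ADJ", {"Gender": "Fem", "Number": "Sing"}),  # other tag, ignored below
]


def single_valued_lemmas(tokens, upos, feature):
    """Return (lemmas with exactly one value of `feature`, lemmas with any value)."""
    values = defaultdict(set)
    for lemma, tag, feats in tokens:
        if tag != upos:
            continue
        value = feats.get(feature)
        if value is not None:
            values[lemma].add(value)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


single, total = single_valued_lemmas(TOKENS, "NOUN", "Gender")
print(f"{round(100 * single / total)}% of lemmas ({single}) occur with only one value of Gender")
```

A share close to 100% suggests the feature is lexical for that part of speech, while lemmas such as président, which have both a masculine and a feminine form, account for the remainder.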
DET
2801 DET tokens (73% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2784; 99%), PronType=Art (2234; 80%), Definite=Def (1919; 69%).
DET tokens may have the following values of Gender:
Fem (1194; 43% of non-empty Gender): la, quelle, une, sa, ma, quelles, certaines, cette
Masc (1607; 57% of non-empty Gender): le, quel, un, les, quels, ce, cet, du, tout
EMPTY (1033): l’, les, des, mon, mes, son, la, votre, de, ses
Paradigm le
| Masc | Fem |
|---|---|
| le, les | la |
ADJ
1238 ADJ tokens (82% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1078; 87%).
ADJ tokens may have the following values of Gender:
Fem (537; 43% of non-empty Gender): quelle, première, américaine, quelles, principale, grande, haute, dernière, foncière, télévisée
Masc (701; 57% of non-empty Gender): quel, premier, américain, grand, Quels, mondial, anglais, national, personnel, calleux
EMPTY (273): célèbre, autre, deuxième, islamique, monétaire, nucléaire, véritable, folique, même, originaire
Paradigm quel
| | Masc | Fem |
|---|---|---|
| Number=Sing | quel | quelle |
| Number=Plur | Quels | quelles |
PROPN
932 PROPN tokens (45% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (920; 99%).
PROPN tokens may have the following values of Gender:
Fem (258; 28% of non-empty Gender): Californie, Australie, Angleterre, Italie, Afrique, Amérique, Corée, Philippines, Berlin, Chine
Masc (674; 72% of non-empty Gender): Alaska, John, York, Charles, Kentucky, Jackson, Japon, Mississippi, Londres, Reims
EMPTY (1154): États-Unis, New, Terre, Soleil, Nobel, Logan, Lune, Titanic, Angeles, Bowl
Gender seems to be a lexical feature of PROPN. 100% of lemmas (478) occur with only one value of Gender.
VERB
768 VERB tokens (41% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (768; 100%), Person=EMPTY (768; 100%), Tense=Past (768; 100%), VerbForm=Part (768; 100%), Number=Sing (690; 90%), Voice=EMPTY (522; 68%).
VERB tokens may have the following values of Gender:
Fem (139; 18% of non-empty Gender): connue, située, devenue, construite, déroulée, intitulée, morte, fabriquée, faite, fondée
Masc (629; 82% of non-empty Gender): inventé, né, situé, écrit, mort, connu, joué, eu, fait, remporté
EMPTY (1122): trouve, est, a, signifie, Nommez, puis, eut, fait, dois, ai
Paradigm nommer
| | Masc | Fem |
|---|---|---|
| | nommé | nommée |
| Voice=Pass | nommé | |
PRON
413 PRON tokens (25% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: PronType=EMPTY (388; 94%), Person=3 (385; 93%), Number=Sing (374; 91%).
PRON tokens may have the following values of Gender:
Fem (118; 29% of non-empty Gender): -t-elle, -elle, laquelle, -elles, une, celle, celle-ci, elle, elles, lesquelles
Masc (295; 71% of non-empty Gender): -t-il, -il, -ils, il, lequel, le, un, quelqu’un, celui, celui-ci
EMPTY (1245): qui, qu’, -ce, se, que, -je, je, -t-on, s’, -on
Paradigm il
| | Masc | Fem |
|---|---|---|
| Number=Sing | -t-il, -il, il | -t-elle, -elle, elle |
| Number=Plur | -ils | -elles, elles |
ADP
3 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.
ADP tokens may have the following values of Gender:
Fem (3; 100% of non-empty Gender): de
EMPTY (2866): de, à, d’, en, dans, pour, sur, par, sous, comme
AUX
1 AUX token (0% of all AUX tokens) has a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Tense=Past (1; 100%), VerbForm=Part (1; 100%).
AUX tokens may have the following values of Gender:
Masc (1; 100% of non-empty Gender): fait
EMPTY (1699): est, a, était, fut, sont, été, ai, ont, suis, avoir
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (2209; 71%),
NOUN –[amod]–> ADJ (631; 71%),
ADJ –[nsubj]–> NOUN (499; 95%),
VERB –[expl:subj]–> PRON (203; 64%),
VERB –[nsubj:pass]–> NOUN (140; 95%),
NOUN –[acl]–> VERB (80; 60%),
ADJ –[det]–> DET (41; 77%),
NOUN –[conj]–> NOUN (21; 54%),
NOUN –[nsubj]–> PROPN (15; 56%),
NOUN –[flat:name]–> PROPN (7; 58%).
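
Agreement counts like those listed above can be sketched as a walk over dependency edges, comparing the Gender of each child with that of its parent. The example below is a rough illustration on hand-written sentences, not treebank data. The function name `agreement_counts` is hypothetical, and the code assumes that the percentage is the share of agreeing pairs among all parent-relation-child triples of the same kind, which the page itself does not state.

```python
from collections import Counter

# Each token is (ID, UPOS, Gender or None, HEAD, DEPREL); illustrative data only.
SENTENCES = [
    [  # "La ville est grande"
        (1, "DET", "Fem", 2, "det"),
        (2, "NOUN", "Fem", 4, "nsubj"),
        (3, "AUX", None, 4, "cop"),
        (4, "ADJ", "Fem", 0, "root"),
    ],
    [  # "l’ état": the elided determiner carries no Gender value
        (1, "DET", None, 2, "det"),
        (2, "NOUN", "Masc", 0, "root"),
    ],
]


def agreement_counts(sentences):
    """Count parent-child pairs per (parent UPOS, relation, child UPOS).

    Returns two Counters: pairs that agree in Gender, and all pairs.
    A pair agrees only if both nodes have the same non-empty Gender value.
    """
    agree, total = Counter(), Counter()
    for sentence in sentences:
        by_id = {tok[0]: tok for tok in sentence}
        for _tok_id, upos, gender, head, deprel in sentence:
            if head == 0:  # the root has no parent node
                continue
            parent = by_id[head]
            key = (parent[1], deprel, upos)
            total[key] += 1
            if gender is not None and gender == parent[2]:
                agree[key] += 1
    return agree, total


agree, total = agreement_counts(SENTENCES)
for (parent_pos, rel, child_pos), n in agree.most_common(10):
    share = round(100 * n / total[(parent_pos, rel, child_pos)])
    print(f"{parent_pos} –[{rel}]–> {child_pos} ({n}; {share}%)")
```

Pairs in which either node has an empty Gender value count toward the total but never toward agreement, which is one plausible reason the shares for relations such as det stay well below 100%.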