Treebank Statistics: UD_French-FQB: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
9738 tokens (41%) have a non-empty value of Gender.
2485 types (59%) occur at least once with a non-empty value of Gender.
2119 lemmas (58%) occur at least once with a non-empty value of Gender.
The feature is used with 7 part-of-speech tags (count; share of all tokens in the treebank): NOUN (3706; 16%), DET (2816; 12%), ADJ (1244; 5%), PROPN (787; 3%), VERB (769; 3%), PRON (415; 2%), AUX (1; 0%).
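The overview counts above can be reproduced from a CoNLL-U file along the following lines. This is a minimal sketch, not the script that generated this page: the inline `SAMPLE` sentence and the function names are hypothetical, and it assumes that a "type" is a case-sensitive word form and that multiword-token ranges and empty nodes are not counted as tokens.

```python
from collections import defaultdict

# Hypothetical one-sentence sample in CoNLL-U format (10 tab-separated columns).
SAMPLE = """\
# text = La ville est grande.
1\tLa\tle\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tville\tville\tNOUN\t_\tGender=Fem|Number=Sing\t4\tnsubj\t_\t_
3\test\têtre\tAUX\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t4\tcop\t_\t_
4\tgrande\tgrand\tADJ\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_
5\t.\t.\tPUNCT\t_\t_\t4\tpunct\t_\t_
"""


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(conllu_text):
    """Yield (form, lemma, upos, feats) for every regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():  # skip ranges like 1-2 and empty nodes like 1.1
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def feature_overview(conllu_text, feature="Gender"):
    """Count tokens, types, and lemmas that carry a non-empty value of `feature`."""
    total = with_feature = 0
    types, lemmas = set(), set()
    per_upos = defaultdict(int)
    for form, lemma, upos, feats in read_tokens(conllu_text):
        total += 1
        if feature in feats:
            with_feature += 1
            types.add(form)      # assumption: types are case-sensitive forms
            lemmas.add(lemma)
            per_upos[upos] += 1
    return {
        "tokens": total,
        "with_feature": with_feature,
        "types": len(types),
        "lemmas": len(lemmas),
        "per_upos": dict(per_upos),
    }


overview = feature_overview(SAMPLE)
share = 100 * overview["with_feature"] / overview["tokens"]
print(f"{overview['with_feature']} tokens ({share:.0f}%) have a non-empty value of Gender")
```

On the real treebank, the same function would be called on the contents of the treebank's `.conllu` files instead of `SAMPLE`.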
NOUN
3706 NOUN tokens (91% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3066; 83%).
NOUN tokens may have the following values of Gender:
Fem (1560; 42% of non-empty Gender): année, ville, compagnie, population, capitale, guerre, date, taxe, université, équipe
Masc (2146; 58% of non-empty Gender): nom, pays, président, état, lieu, logement, film, prix, corps, temps
EMPTY (348): aide, espace, CNN, période, livre, enfants, radio, CPR, fin, tour
| Paradigm président | Masc | Fem |
|---|---|---|
| | président | présidente |
Gender seems to be a lexical feature of NOUN: 99% of lemmas (1248) occur with only one value of Gender.
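The "lexical feature" observation rests on how many lemmas are seen with a single Gender value. A minimal sketch of that check follows; the function name and the toy observations are hypothetical, and it assumes the denominator is the set of lemmas that occur at least once with a non-empty Gender (the page does not state this explicitly).

```python
from collections import defaultdict


def single_value_lemmas(observations, upos):
    """Return (lemmas seen with exactly one Gender value, lemmas seen with any).

    `observations` is an iterable of (lemma, upos, gender) triples, where
    gender is None for tokens whose Gender is empty.
    """
    values_by_lemma = defaultdict(set)
    for lemma, tag, gender in observations:
        if tag == upos and gender is not None:
            values_by_lemma[lemma].add(gender)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)


# Toy data: "président" is attested with both values, "aide" only without Gender.
OBSERVATIONS = [
    ("ville", "NOUN", "Fem"),
    ("ville", "NOUN", "Fem"),
    ("pays", "NOUN", "Masc"),
    ("président", "NOUN", "Masc"),
    ("président", "NOUN", "Fem"),
    ("aide", "NOUN", None),
    ("grand", "ADJ", "Fem"),
]

single, total = single_value_lemmas(OBSERVATIONS, "NOUN")
print(f"{single} of {total} NOUN lemmas occur with only one value of Gender")
```

A share close to 100%, as reported for NOUN, suggests the value is fixed per lemma rather than chosen by agreement.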
DET
2816 DET tokens (73% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2798; 99%), PronType=Art (2249; 80%), Definite=Def (1932; 69%).
DET tokens may have the following values of Gender:
Fem (1207; 43% of non-empty Gender): la, quelle, une, sa, ma, quelles, certaines, cette
Masc (1609; 57% of non-empty Gender): le, quel, un, les, quels, ce, cet, d’, du, tout
EMPTY (1016): l’, les, des, mon, mes, son, votre, de, ses, vos
| Paradigm le | Masc | Fem |
|---|---|---|
| ExtPos=PRON | le | |
| | le, les | la |
ADJ
1244 ADJ tokens (82% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1083; 87%).
ADJ tokens may have the following values of Gender:
Fem (541; 43% of non-empty Gender): quelle, première, américaine, quelles, principale, grande, haute, dernière, foncière, télévisée
Masc (703; 57% of non-empty Gender): quel, premier, américain, grand, Quels, mondial, anglais, national, personnel, calleux
EMPTY (268): célèbre, autre, deuxième, islamique, monétaire, nucléaire, véritable, folique, même, originaire
| Paradigm quel | Masc | Fem |
|---|---|---|
| Number=Sing | quel | quelle |
| Number=Plur | Quels | quelles |
PROPN
787 PROPN tokens (38% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (775; 98%).
PROPN tokens may have the following values of Gender:
Fem (247; 31% of non-empty Gender): Californie, Australie, Angleterre, Italie, Afrique, Amérique, Corée, Philippines, Berlin, Chine
Masc (540; 69% of non-empty Gender): Alaska, John, Kentucky, Japon, Mississippi, Londres, Reims, Croix-Rouge, Bob, Canada
EMPTY (1299): États-Unis, New, Terre, York, Soleil, Nobel, Logan, Lune, Titanic, Angeles
Gender seems to be a lexical feature of PROPN: 100% of lemmas (416) occur with only one value of Gender.
VERB
769 VERB tokens (41% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (769; 100%), Person=EMPTY (769; 100%), Tense=Past (769; 100%), VerbForm=Part (769; 100%), Number=Sing (691; 90%), Voice=EMPTY (523; 68%).
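Entries such as `Mood=EMPTY` in the list above appear to count tokens that carry Gender but have no value for the other feature. The sketch below illustrates that counting under this assumption; the function name, the list of other features, and the toy tokens are hypothetical.

```python
from collections import Counter


def cooccurring_values(tokens, upos, feature="Gender",
                       others=("Mood", "Number", "Tense", "VerbForm")):
    """Count values of `others` on tokens of `upos` that carry `feature`.

    `tokens` is an iterable of (upos, feats) pairs, with feats as a dict.
    A missing feature is counted under the pseudo-value 'EMPTY'.
    """
    counts = Counter()
    n = 0
    for tag, feats in tokens:
        if tag != upos or feature not in feats:
            continue
        n += 1
        for other in others:
            counts[f"{other}={feats.get(other, 'EMPTY')}"] += 1
    return n, counts


# Toy data: three gendered past participles and one finite verb without Gender.
TOKENS = [
    ("VERB", {"Gender": "Masc", "Number": "Sing", "Tense": "Past", "VerbForm": "Part"}),
    ("VERB", {"Gender": "Fem", "Number": "Sing", "Tense": "Past", "VerbForm": "Part"}),
    ("VERB", {"Gender": "Masc", "Number": "Plur", "Tense": "Past", "VerbForm": "Part"}),
    ("VERB", {"Mood": "Ind", "Number": "Sing", "Person": "3", "Tense": "Pres", "VerbForm": "Fin"}),
]

n, counts = cooccurring_values(TOKENS, "VERB")
for value, count in counts.most_common():
    print(f"{value} ({count}; {100 * count / n:.0f}%)")
```

Read this way, the 100% figures for `Tense=Past` and `VerbForm=Part` say that every gendered VERB token in the treebank is a past participle.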
VERB tokens may have the following values of Gender:
Fem (139; 18% of non-empty Gender): connue, située, devenue, construite, déroulée, intitulée, morte, fabriquée, faite, fondée
Masc (630; 82% of non-empty Gender): inventé, né, situé, écrit, mort, connu, joué, eu, fait, remporté
EMPTY (1121): trouve, est, a, signifie, Nommez, puis, eut, dois, fait, ai
| Paradigm nommer | Masc | Fem |
|---|---|---|
| | nommé | nommée |
| Voice=Pass | nommé | |
PRON
415 PRON tokens (25% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (387; 93%), PronType=Prs (383; 92%), Number=Sing (376; 91%).
PRON tokens may have the following values of Gender:
Fem (118; 28% of non-empty Gender): -t-elle, -elle, laquelle, -elles, une, celle, celle-ci, elle, elles, lesquelles
Masc (297; 72% of non-empty Gender): -t-il, -il, -ils, il, lequel, le, un, l’on, quelqu’un, celui
EMPTY (1239): qui, qu’, -ce, se, que, -je, je, -t-on, s’, -on
| Paradigm lui | Masc | Fem |
|---|---|---|
| Number=Sing | -t-il, -il, il, le | -t-elle, -elle, elle |
| Number=Plur | -ils, eux | -elles, elles |
AUX
1 AUX token (0% of all AUX tokens) has a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Tense=Past (1; 100%), VerbForm=Part (1; 100%).
AUX tokens may have the following values of Gender:
Masc (1; 100% of non-empty Gender): fait
EMPTY (1699): est, a, était, fut, sont, été, ai, ont, suis, avoir
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (2224; 72%),
NOUN –[amod]–> ADJ (635; 71%),
ADJ –[nsubj]–> NOUN (499; 95%),
VERB –[expl:subj]–> PRON (203; 64%),
VERB –[nsubj:pass]–> NOUN (140; 95%),
NOUN –[acl]–> VERB (80; 60%),
ADJ –[det]–> DET (41; 77%),
NOUN –[conj]–> NOUN (21; 54%),
NOUN –[nsubj]–> PROPN (15; 56%),
NOUN –[flat:name]–> PROPN (7; 58%).
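The agreement counts above can be approximated by walking every dependency edge and comparing the Gender of parent and child. The following is a rough sketch under stated assumptions: the token dictionaries and function name are hypothetical, an edge counts as agreeing only when both nodes have a non-empty and identical Gender, and the percentage is taken over all edges of the same (parent tag, relation, child tag) type, which may differ from the denominator used for the figures on this page.

```python
from collections import Counter


def gender_agreement(sentences, feature="Gender"):
    """Count agreeing and total edges per (parent UPOS, relation, child UPOS)."""
    agree, total = Counter(), Counter()
    for sentence in sentences:
        by_id = {token["id"]: token for token in sentence}
        for child in sentence:
            parent = by_id.get(child["head"])
            if parent is None:  # head 0 is the artificial root
                continue
            key = (parent["upos"], child["deprel"], child["upos"])
            total[key] += 1
            parent_value = parent["feats"].get(feature)
            child_value = child["feats"].get(feature)
            if parent_value is not None and parent_value == child_value:
                agree[key] += 1
    return agree, total


# Toy sentence: "La ville est grande"
SENTENCES = [[
    {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "feats": {"Gender": "Fem"}},
    {"id": 2, "head": 4, "upos": "NOUN", "deprel": "nsubj", "feats": {"Gender": "Fem"}},
    {"id": 3, "head": 4, "upos": "AUX", "deprel": "cop", "feats": {}},
    {"id": 4, "head": 0, "upos": "ADJ", "deprel": "root", "feats": {"Gender": "Fem"}},
]]

agree, total = gender_agreement(SENTENCES)
for (parent, relation, child), count in agree.most_common():
    share = 100 * count / total[(parent, relation, child)]
    print(f"{parent} -[{relation}]-> {child} ({count}; {share:.0f}%)")
```

In the toy sentence the determiner agrees with its noun and the noun subject agrees with the predicative adjective, mirroring the `NOUN –[det]–> DET` and `ADJ –[nsubj]–> NOUN` rows of the list.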