Treebank Statistics: UD_French-FQB: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
9863 tokens (41%) have a non-empty value of Gender.
2537 types (60%) occur at least once with a non-empty value of Gender.
2172 lemmas (60%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags (token count; share of all tokens in the treebank): NOUN (3686; 15%), DET (2815; 12%), ADJ (1242; 5%), PROPN (932; 4%), VERB (769; 3%), PRON (415; 2%), ADP (3; less than 1%), AUX (1; less than 1%).
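Counts like the ones above can be reproduced from a CoNLL-U file with a few lines of code. The following is a minimal sketch, not the script that generated this page: it assumes the standard 10-column CoNLL-U layout (UPOS in column 4, FEATS in column 6), and the function names `parse_feats` and `gender_by_upos` as well as the two-token sample are hypothetical.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))

def gender_by_upos(conllu_text):
    """Count Gender values per UPOS tag; tokens without Gender count as 'EMPTY'."""
    counts = {}
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword token ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        value = parse_feats(cols[5]).get("Gender", "EMPTY")
        counts.setdefault(upos, Counter())[value] += 1
    return counts

# Hypothetical two-token sample, not copied from the treebank files.
SAMPLE = "\n".join([
    "# text = la ville",
    "1\tla\tle\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_",
    "2\tville\tville\tNOUN\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_",
])

for upos, values in gender_by_upos(SAMPLE).items():
    print(upos, dict(values))
```

Dividing each per-tag count by the total number of tokens would give shares comparable to the percentages listed above.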
NOUN
3686 NOUN tokens (91% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3048; 83%).
NOUN tokens may have the following values of Gender:
Fem (1557; 42% of non-empty Gender): année, ville, compagnie, population, capitale, guerre, date, taxe, université, équipe
Masc (2129; 58% of non-empty Gender): nom, pays, président, état, lieu, logement, film, prix, corps, temps
EMPTY (365): aide, espace, CNN, période, livre, enfants, radio, CPR, fin, tour
| Paradigm président | Masc | Fem |
|---|---|---|
| | président | présidente |
Gender seems to be a lexical feature of NOUN: 99% of lemmas (1240) occur with only one value of Gender.
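The "lexical feature" judgment rests on how many lemmas are seen with a single Gender value. A minimal sketch of that check follows; it assumes the input is a list of (lemma, Gender) pairs for tokens of one part of speech with a non-empty Gender. The function name `single_gender_share` and the sample pairs are hypothetical, and the threshold at which the page calls a feature lexical is not stated here.

```python
from collections import defaultdict

def single_gender_share(pairs):
    """Return (lemmas with exactly one Gender value, total lemmas, share).

    pairs: iterable of (lemma, gender) for tokens with a non-empty Gender.
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    total = len(values)
    share = single / total if total else 0.0
    return single, total, share

# Hypothetical sample: 'président' is observed with both values,
# 'ville' and 'nom' with one value each.
pairs = [
    ("ville", "Fem"), ("ville", "Fem"),
    ("nom", "Masc"),
    ("président", "Masc"), ("président", "Fem"),
]
single, total, share = single_gender_share(pairs)
print(f"{single} of {total} lemmas ({share:.0%}) occur with only one Gender value")
```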
DET
2815 DET tokens (73% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2798; 99%), PronType=Art (2249; 80%), Definite=Def (1932; 69%).
DET tokens may have the following values of Gender:
Fem (1207; 43% of non-empty Gender): la, quelle, une, sa, ma, quelles, certaines, cette
Masc (1608; 57% of non-empty Gender): le, quel, un, les, quels, ce, cet, du, tout
EMPTY (1016): l’, les, des, mon, mes, son, votre, de, ses, vos
| Paradigm le | Masc | Fem |
|---|---|---|
| ExtPos=PRON | le | |
| | le, les | la |
ADJ
1242 ADJ tokens (82% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1082; 87%).
ADJ tokens may have the following values of Gender:
Fem (541; 44% of non-empty Gender): quelle, première, américaine, quelles, principale, grande, haute, dernière, foncière, télévisée
Masc (701; 56% of non-empty Gender): quel, premier, américain, grand, Quels, mondial, anglais, national, personnel, calleux
EMPTY (269): célèbre, autre, deuxième, islamique, monétaire, nucléaire, véritable, folique, même, originaire
| Paradigm quel | Masc | Fem |
|---|---|---|
| Number=Sing | quel | quelle |
| Number=Plur | Quels | quelles |
PROPN
932 PROPN tokens (45% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (920; 99%).
PROPN tokens may have the following values of Gender:
Fem (258; 28% of non-empty Gender): Californie, Australie, Angleterre, Italie, Afrique, Amérique, Corée, Philippines, Berlin, Chine
Masc (674; 72% of non-empty Gender): Alaska, John, York, Charles, Kentucky, Jackson, Japon, Mississippi, Londres, Reims
EMPTY (1154): États-Unis, New, Terre, Soleil, Nobel, Logan, Lune, Titanic, Angeles, Bowl
Gender seems to be a lexical feature of PROPN: 100% of lemmas (478) occur with only one value of Gender.
VERB
769 VERB tokens (41% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (769; 100%), Person=EMPTY (769; 100%), Tense=Past (769; 100%), VerbForm=Part (769; 100%), Number=Sing (691; 90%), Voice=EMPTY (523; 68%).
VERB tokens may have the following values of Gender:
Fem (139; 18% of non-empty Gender): connue, située, devenue, construite, déroulée, intitulée, morte, fabriquée, faite, fondée
Masc (630; 82% of non-empty Gender): inventé, né, situé, écrit, mort, connu, joué, eu, fait, remporté
EMPTY (1121): trouve, est, a, signifie, Nommez, puis, eut, dois, fait, ai
| Paradigm nommer | Masc | Fem |
|---|---|---|
| | nommé | nommée |
| Voice=Pass | nommé | |
PRON
415 PRON tokens (25% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (387; 93%), PronType=Prs (383; 92%), Number=Sing (376; 91%).
PRON tokens may have the following values of Gender:
Fem (118; 28% of non-empty Gender): -t-elle, -elle, laquelle, -elles, une, celle, celle-ci, elle, elles, lesquelles
Masc (297; 72% of non-empty Gender): -t-il, -il, -ils, il, lequel, le, un, l’on, quelqu’un, celui
EMPTY (1243): qui, qu’, -ce, se, que, -je, je, -t-on, s’, -on
| Paradigm il | Masc | Fem |
|---|---|---|
| Number=Sing | -t-il, -il, il | -t-elle, -elle, elle |
| Number=Plur | -ils | -elles, elles |
ADP
3 ADP tokens (less than 1% of all ADP tokens) have a non-empty value of Gender.
ADP tokens may have the following values of Gender:
Fem (3; 100% of non-empty Gender): de
EMPTY (2866): de, à, d’, en, dans, pour, sur, par, sous, comme
AUX
1 AUX token (less than 1% of all AUX tokens) has a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Tense=Past (1; 100%), VerbForm=Part (1; 100%).
AUX tokens may have the following values of Gender:
Masc (1; 100% of non-empty Gender): fait
EMPTY (1699): est, a, était, fut, sont, été, ai, ont, suis, avoir
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (2214; 71%),
NOUN –[amod]–> ADJ (634; 71%),
ADJ –[nsubj]–> NOUN (499; 95%),
VERB –[expl:subj]–> PRON (203; 64%),
VERB –[nsubj:pass]–> NOUN (140; 95%),
NOUN –[acl]–> VERB (80; 60%),
ADJ –[det]–> DET (41; 77%),
NOUN –[conj]–> NOUN (21; 54%),
NOUN –[nsubj]–> PROPN (15; 56%),
NOUN –[flat:name]–> PROPN (7; 58%).
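Agreement tallies like those listed above can be approximated by comparing the Gender of each token with the Gender of its head. The sketch below is only an illustration under stated assumptions: it reads the token lines of a single CoNLL-U sentence (HEAD in column 7, DEPREL in column 8), and it counts every edge of a given type in the denominator, which may differ from how the percentages on this page were computed. The names `gender_of` and `agreement_by_relation` and the sample sentence are hypothetical.

```python
from collections import Counter

def gender_of(feats):
    """Return the Gender value from a CoNLL-U FEATS string, or None if absent."""
    for item in feats.split("|"):
        if item.startswith("Gender="):
            return item.split("=", 1)[1]
    return None

def agreement_by_relation(sentence_lines):
    """Tally parent/child Gender agreement per (parent UPOS, deprel, child UPOS).

    sentence_lines: the lines of ONE sentence in CoNLL-U format.
    Returns two Counters: agreeing edges and all edges of each type.
    """
    tokens = {}
    for line in sentence_lines:
        cols = line.split("\t")
        if cols[0].isdigit():  # ignores comments, multiword ranges, empty nodes
            tokens[cols[0]] = cols
    agree, total = Counter(), Counter()
    for cols in tokens.values():
        parent = tokens.get(cols[6])  # HEAD is "0" for the root: no parent
        if parent is None:
            continue
        key = (parent[3], cols[7], cols[3])
        total[key] += 1
        child_gender = gender_of(cols[5])
        if child_gender is not None and child_gender == gender_of(parent[5]):
            agree[key] += 1
    return agree, total

# Hypothetical sample sentence, not copied from the treebank files.
SAMPLE_SENTENCE = [
    "# text = la grande ville",
    "1\tla\tle\tDET\t_\tGender=Fem|Number=Sing\t3\tdet\t_\t_",
    "2\tgrande\tgrand\tADJ\t_\tGender=Fem|Number=Sing\t3\tamod\t_\t_",
    "3\tville\tville\tNOUN\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_",
]
agree, total = agreement_by_relation(SAMPLE_SENTENCE)
for (parent_upos, deprel, child_upos), n in total.items():
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[(parent_upos, deprel, child_upos)]} of {n} agree")
```

For a whole treebank, the two counters would be summed over all sentences before computing a share per relation.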