Treebank Statistics: UD_French: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
176106 tokens (44%) have a non-empty value of Gender.
21036 types (49%) occur at least once with a non-empty value of Gender.
13915 lemmas (42%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (74671; 19% instances), DET (58542; 15% instances), ADJ (22793; 6% instances), VERB (11223; 3% instances), PRON (7974; 2% instances), AUX (890; 0% instances), NUM (10; 0% instances), PROPN (3; 0% instances).
NOUN
74671 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (56091; 75%).
NOUN tokens may have the following values of Gender:
Fem(33049; 44% of non-emptyGender): ville, partie, région, fois, commune, années, famille, année, fin, placeMasc(41621; 56% of non-emptyGender): ans, pays, nom, monde, temps, groupe, siècle, état, cours, lieuNeut(1; 0% of non-emptyGender): MuseumEMPTY(507): A, Co., league, world, Association, Company, Mt, Panther, Trail, blackface
| Paradigm fois | Masc | Fem |
|---|---|---|
| Number=Sing | fois | |
| Number=Plur | fois | fois |
Gender seems to be lexical feature of NOUN. 98% lemmas (9229) occur only with one value of Gender.
DET
58542 DET tokens (95% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (50441; 86%), Number=Sing (45622; 78%), Definite=Def (40396; 69%).
DET tokens may have the following values of Gender:
Fem(26050; 44% of non-emptyGender): la, une, les, l’, sa, cette, des, ses, son, leurMasc(32492; 56% of non-emptyGender): le, les, un, l’, son, des, ce, ses, ces, deEMPTY(3197): les, l’, the, des, son, d’, de, ses, chaque, a
| Paradigm le | Masc | Fem |
|---|---|---|
| Number=Sing | la | |
| Number=Sing|PronType=Art | le, l', l | la, l', l, Les, là |
| Number=Plur|PronType=Art | les | les, L |
ADJ
22793 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (16529; 73%).
ADJ tokens may have the following values of Gender:
Fem(10741; 47% of non-emptyGender): première, française, grande, même, nouvelle, nombreuses, nationale, autres, seule, internationaleMasc(12051; 53% of non-emptyGender): premier, français, autres, grand, nouveau, même, dernier, nombreux, seul, ancienNeut(1; 0% of non-emptyGender): KoninklijkEMPTY(186): National, live, new, American, Blue, complete, Great, Last, 3e, Black
| Paradigm premier | Masc | Fem |
|---|---|---|
| Number=Sing | premier, 1er, Ier, 1e, 1 | première, 1re, 1ère |
| Number=Plur | premiers | premières |
VERB
11223 VERB tokens (35% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (11223; 100%), Person=EMPTY (11223; 100%), Tense=Past (11223; 100%), VerbForm=Part (11223; 100%), Number=Sing (9031; 80%).
VERB tokens may have the following values of Gender:
Fem(3251; 29% of non-emptyGender): située, née, créée, appelée, utilisée, connue, construite, mise, publiée, nomméeMasc(7972; 71% of non-emptyGender): né, situé, eu, fait, mort, connu, nommé, réalisé, utilisé, crééEMPTY(21121): a, peut, fait, est, faire, partir, trouve, devient, doit, ont
| Paradigm faire | Masc | Fem |
|---|---|---|
| Number=Sing | fait, fais | faite |
| Number=Plur | faits | faites |
PRON
7974 PRON tokens (44% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (6893; 86%), Person=3 (6646; 83%), PronType=Prs (5805; 73%).
PRON tokens may have the following values of Gender:
Fem(1741; 22% of non-emptyGender): elle, elles, une, la, celle, laquelle, celles, -elle, celle-ci, lesquellesMasc(6233; 78% of non-emptyGender): il, on, ils, le, un, -il, lequel, celui, tout, ceuxEMPTY(10055): qui, se, s’, c’, lui, ce, nous, dont, où, je
| Paradigm il | Masc | Fem |
|---|---|---|
| Number=Sing|Person=2 | -Tu | |
| Number=Sing|Person=3 | il, -il, t-il, Lui | -elle, elle |
| Number=Sing | Lui | |
| Number=Plur|Person=3 | ils, -ils | elles, -elles |
AUX
890 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (890; 100%), Number=Sing (890; 100%), Person=EMPTY (890; 100%), Tense=Past (890; 100%), VerbForm=Part (890; 100%).
AUX tokens may have the following values of Gender:
Fem(1; 0% of non-emptyGender): faiteMasc(889; 100% of non-emptyGender): été, faitEMPTY(11979): est, a, sont, ont, était, fut, être, avait, avoir, ai
| Paradigm faire | Masc | Fem |
|---|---|---|
| fait | faite |
NUM
10 NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.
NUM tokens may have the following values of Gender:
Fem(10; 100% of non-emptyGender): 00H30, 12H30, 14h25, 15H00, 18h, 18h40, 20h40, 22h, 23h, 48HEMPTY(10662): deux, trois, 2, 3, 5, quatre, 4, 2010, 2009, 2008
Gender seems to be lexical feature of NUM. 100% lemmas (10) occur only with one value of Gender.
PROPN
3 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Fem(1; 33% of non-emptyGender): ItalieMasc(2; 67% of non-emptyGender): Palais, mémoriqueEMPTY(30369): France, Paris, Europe, États-Unis, Jean, Maroc, Espagne, la, New, York
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (53959; 99%),
NOUN –[amod]–> ADJ (18532; 99%),
NOUN –[conj]–> NOUN (3391; 63%),
NOUN –[acl]–> VERB (2959; 70%),
VERB –[nsubj:pass]–> NOUN (1539; 96%),
ADJ –[conj]–> ADJ (905; 97%),
ADJ –[nsubj]–> NOUN (903; 97%),
NOUN –[appos]–> NOUN (888; 58%),
NOUN –[nsubj]–> NOUN (587; 61%),
ADJ –[obl]–> NOUN (572; 52%).