Treebank Statistics: UD_French-FTB: Features: Number
This feature is universal.
It occurs with 2 different values: Plur, Sing.
335363 tokens (58%) have a non-empty value of Number.
1469 types (79%) occur at least once with a non-empty value of Number.
1258 lemmas (80%) occur at least once with a non-empty value of Number.
The feature is used with 10 part-of-speech tags: NOUN (114997; 20% instances), DET (85470; 15% instances), ADJ (35898; 6% instances), VERB (34799; 6% instances), PRON (22336; 4% instances), PROPN (18612; 3% instances), AUX (11882; 2% instances), NUM (11279; 2% instances), ADP (89; 0% instances), X (1; 0% instances).
NOUN
114997 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Gender=Masc (65887; 57%).
NOUN tokens may have the following values of Number:
Plur(37460; 33% of non-emptyNumber): _, Abstentions, Inscrits, MM., OUVRIERS, Retraites, Editions, MM, ÉTATS, AgentsSing(77537; 67% of non-emptyNumber): _, M., Mr, DOC, face, Fin, Résultat, Article, Grâce, CôtéEMPTY(2090): _, Vis, Cahin, Congrès, Fils, REPRÉSENTANT, Éditions
Number seems to be lexical feature of NOUN. 94% lemmas (430) occur only with one value of Number.
DET
85470 DET tokens (100% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: PronType=Art (76207; 89%), Definite=Def (62616; 73%).
DET tokens may have the following values of Number:
Plur(23212; 27% of non-emptyNumber): _, les, ces, des, D’, Leur, De, Plusieurs, Quelques, CertainsSing(62258; 73% of non-emptyNumber): _, le, la, l’, Cette, un, une, Ce, Son, CetEMPTY(24): _, le
ADJ
35898 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Gender=Masc (18662; 52%).
ADJ tokens may have the following values of Number:
Plur(12237; 34% of non-emptyNumber): _, tous, Toutes, Seuls, Conscients, Pauvres, Seules, Nombreuses, Nouveaux, CapablesSing(23661; 66% of non-emptyNumber): _, Autre, Tout, Seul, Difficile, Seule, Premier, Deuxième, Dernier, PremièreEMPTY(664): _, FAUX, Quitte
VERB
34799 VERB tokens (73% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: Gender=EMPTY (20088; 58%), VerbForm=Fin (20087; 58%), Person=3 (19521; 56%), Mood=Ind (18832; 54%).
VERB tokens may have the following values of Number:
Plur(9019; 26% of non-emptyNumber): _, Exprimés, Notons, Réunis, Ajoutons, Supposons, Suivent, Ajoutez, Disparus, EmisesSing(25780; 74% of non-emptyNumber): _, Reste, Est, Peut, Voilà, Interrogé, faut, Né, Réuni, EntréEMPTY(12914): _, Lire, Donnant, Faisant, Mis, Moyennant, Dire, Estimant, Evoquant, Rappelant
PRON
22336 PRON tokens (97% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Person=3 (20830; 93%), Reflex=EMPTY (18657; 84%), Gender=Masc (16118; 72%), PronType=EMPTY (13281; 59%).
PRON tokens may have the following values of Number:
Plur(6102; 27% of non-emptyNumber): _, ils, nous, elles, Ceux, Certains, Celles, Tous, Vous, S’Sing(16234; 73% of non-emptyNumber): _, il, c’, On, Elle, ce, Cela, Je, Celui, ToutEMPTY(613): _, 30 000, Ce, Quarante, Y
PROPN
18612 PROPN tokens (85% of all PROPN tokens) have a non-empty value of Number.
The most frequent other feature values with which PROPN and Number co-occurred: Gender=Masc (12255; 66%).
PROPN tokens may have the following values of Number:
Plur(241; 1% of non-emptyNumber): _, Etats, Chargeurs, Ebauches, ETATS, Editions, Imprimeries, ReportersSing(18371; 99% of non-emptyNumber): _, Paris, Michel, France, FO, Jean, Air, FRANCFORT, Hachette, JacquesEMPTY(3248): _, CFE, Thomson, Elf, TF, ABOU, BCCI, Bouygues, CMB, EDF
Number seems to be lexical feature of PROPN. 100% lemmas (367) occur only with one value of Number.
AUX
11882 AUX tokens (92% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (10778; 91%), Person=3 (10569; 89%), Mood=Ind (9795; 82%), Tense=Pres (8812; 74%).
AUX tokens may have the following values of Number:
Plur(3221; 27% of non-emptyNumber): _, Avez, Peuvent, Seront, Sont, Allons, Ont, Pourront, SerionsSing(8661; 73% of non-emptyNumber): _, Peut, Est, A, Doit, Fût, Pourrait, Pouvait, Sera, VaEMPTY(987): _, Ayant, Avoir, Etant
NUM
11279 NUM tokens (63% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (11261; 100%), Gender=Masc (7165; 64%).
NUM tokens may have the following values of Number:
Plur(5863; 52% of non-emptyNumber): _, Deux, Trois, Cinq, Quatre, Dix, Huit, Sept, Trente, QuinzeSing(5416; 48% of non-emptyNumber): _, 1992, 4, 27, 19, 1993, 3, 12, 13, 17EMPTY(6518): _, Cent, Quarante, Vingt, Dix, Deux, Sept, Soixante, 1, 24
Number seems to be lexical feature of NUM. 92% lemmas (59) occur only with one value of Number.
ADP
89 ADP tokens (0% of all ADP tokens) have a non-empty value of Number.
ADP tokens may have the following values of Number:
Plur(50; 56% of non-emptyNumber): _Sing(39; 44% of non-emptyNumber): _, ÀEMPTY(92507): _, en, A, Pour, à, dans, de, d’, après, avec
X
1 X tokens (0% of all X tokens) have a non-empty value of Number.
X tokens may have the following values of Number:
Plur(1; 100% of non-emptyNumber): _EMPTY(2191): _, NEW, New, British, Grand, In, A, Altus, BUENOS, Body
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[det]–> DET (69236; 96%),
NOUN –[amod]–> ADJ (22797; 98%),
NOUN –[nmod]–> NOUN (19296; 58%),
VERB –[nsubj]–> NOUN (11192; 89%),
NOUN –[nummod]–> NUM (8306; 97%),
VERB –[nsubj]–> PRON (8097; 94%),
VERB –[aux]–> AUX (5070; 75%),
NOUN –[nmod]–> PROPN (4406; 67%),
NOUN –[conj]–> NOUN (4358; 75%),
PROPN –[det]–> DET (4228; 80%).