Treebank Statistics: UD_Spanish-PUD: Features: Number
This feature is universal.
It occurs with 2 different values: Plur, Sing.
This is a layered feature with the following layers: Number, Number[psor].
13884 tokens (60%) have a non-empty value of Number.
5376 types (91%) occur at least once with a non-empty value of Number.
3941 lemmas (88%) occur at least once with a non-empty value of Number.
The feature is used with 7 part-of-speech tags: NOUN (4804; 21% instances), DET (3332; 14% instances), VERB (1727; 7% instances), ADJ (1477; 6% instances), PROPN (1225; 5% instances), PRON (729; 3% instances), AUX (590; 3% instances).
NOUN
4804 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Gender=Masc (2709; 56%).
NOUN tokens may have the following values of Number:
Plur(1389; 29% of non-emptyNumber): años, millones, personas, estados, veces, ciudades, datos, elecciones, inversores, mesesSing(3415; 71% of non-emptyNumber): año, guerra, lugar, parte, gobierno, mar, ciudad, estado, vez, díaEMPTY(3): cápita, HFC
| Paradigm año | Sing | Plur |
|---|---|---|
| año | años |
DET
3332 DET tokens (100% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: PronType=Art (2983; 90%), Definite=Def (2528; 76%), Gender=Masc (1982; 59%).
DET tokens may have the following values of Number:
Plur(797; 24% of non-emptyNumber): los, las, muchos, estos, muchas, otros, todas, varios, esos, unosSing(2535; 76% of non-emptyNumber): el, la, un, una, este, esta, esto, cada, ese, esoEMPTY(6): The, un
| Paradigm el | Sing | Plur |
|---|---|---|
| Gender=Masc | el | los |
| Gender=Fem | la | las |
VERB
1727 VERB tokens (76% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: Gender=EMPTY (1330; 77%), VerbForm=Fin (1330; 77%), Person=3 (1267; 73%), Mood=Ind (1221; 71%), Tense=Past (917; 53%).
VERB tokens may have the following values of Number:
Plur(448; 26% of non-emptyNumber): tienen, están, incluyen, llegaron, tenían, tuvieron, afirman, dieron, empezaron, correspondenSing(1279; 74% of non-emptyNumber): dijo, tiene, es, hay, hace, está, debido, dice, hecho, afirmóEMPTY(543): hacer, tener, ver, establecer, ayudar, dejar, enviar, incluyendo, pesar, producir
| Paradigm tener | Sing | Plur |
|---|---|---|
| Gender=Masc|Tense=Past|VerbForm=Part | tenido | |
| Mood=Cnd|Person=3|VerbForm=Fin | tendría | tendrían |
| Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | tengo | tenemos |
| Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | tienes | |
| Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | tendrá | tendrán |
| Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | tenía, tenia | tenían |
| Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | tuvo | tuvieron |
| Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | tiene | tienen |
| Mood=Sub|Person=3|Tense=Imp|VerbForm=Fin | tuviese | |
| Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | tenga |
ADJ
1477 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Number.
ADJ tokens may have the following values of Number:
Plur(440; 30% of non-emptyNumber): grandes, últimos, nuevos, Unidos, diferentes, nuevas, primeros, importantes, Olímpicos, electrónicosSing(1037; 70% of non-emptyNumber): gran, primera, mayor, nueva, nacional, posible, siguiente, primer, británica, increíbleEMPTY(8): American, Stranger, Talking, austro, co, ex, franco, multi
| Paradigm nuevo | Sing | Plur |
|---|---|---|
| Gender=Masc | nuevo | nuevos, nuevo |
| Gender=Fem | nueva | nuevas |
PROPN
1225 PROPN tokens (98% of all PROPN tokens) have a non-empty value of Number.
PROPN tokens may have the following values of Number:
Plur(25; 2% of non-emptyNumber): EUA, Andes, Balcanes, Alpes, B-29, CBS, Caribs, GIFs, Indias, LovingSing(1200; 98% of non-emptyNumber): China, Europa, Italia, Australia, Pekín, Albania, Francia, Trump, Bretaña, C.EMPTY(27): Hong, Kong, Castelfranco, Conte, Fjögur, Giovanni, Humblebums, Pelucca, Politti, Puerto
| Paradigm India | Sing | Plur |
|---|---|---|
| _ | India | |
| Gender=Fem | India | Indias |
Number seems to be lexical feature of PROPN. 100% lemmas (848) occur only with one value of Number.
PRON
729 PRON tokens (70% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Reflex=EMPTY (724; 99%), PrepCase=EMPTY (606; 83%), Case=EMPTY (512; 70%), Poss=EMPTY (502; 69%), PronType=Prs (448; 61%), Person=3 (410; 56%), Gender=Masc (381; 52%).
PRON tokens may have the following values of Number:
Plur(175; 24% of non-emptyNumber): que, sus, ellos, les, cuales, nos, los, quienes, Cuáles, QuiénesSing(554; 76% of non-emptyNumber): su, que, lo, le, cual, me, él, ella, quien, laEMPTY(317): se, qué, You, cuanto, sí, consigo
| Paradigm él | Sing | Plur |
|---|---|---|
| Case=Acc,Nom|Gender=Masc|Person=3 | él, ello | ellos |
| Case=Acc,Nom|Gender=Fem|Person=3 | ella | |
| Case=Acc|Gender=Masc|Person=3|PrepCase=Npr | lo | los |
| Case=Acc|Gender=Fem|Person=3|PrepCase=Npr | la | las |
| Case=Dat|Person=3 | le | les |
| Gender=Masc|Person=3 | los | |
| Gender=Masc | los |
AUX
590 AUX tokens (93% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (575; 97%), Person=3 (548; 93%), Mood=Ind (520; 88%).
AUX tokens may have the following values of Number:
Plur(153; 26% of non-emptyNumber): son, fueron, han, habían, pueden, estaban, están, eran, hayan, debemosSing(437; 74% of non-emptyNumber): es, fue, ha, había, está, era, puede, estaba, podría, sidoEMPTY(44): ser, haber, siendo, estar, poder, Did, Do, Habiendo
| Paradigm ser | Sing | Plur |
|---|---|---|
| Gender=Masc|Tense=Past|VerbForm=Part | sido | |
| Mood=Cnd|Person=3|VerbForm=Fin | sería | serían |
| Mood=Ind|Person=1|Tense=Imp|VerbForm=Fin | era | |
| Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | somos | |
| Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | Tienes | |
| Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | será | serán |
| Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | era | eran |
| Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | fue | fueron |
| Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | es, considera | son, consideran |
| Mood=Sub|Person=1|Tense=Pres|VerbForm=Fin | seamos | |
| Mood=Sub|Person=3|Tense=Imp|VerbForm=Fin | fuera | fueran, pudieran |
| Mood=Sub|Person=3|Tense=Past|VerbForm=Fin | hubiera | |
| Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | sea | sean |
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[det]–> DET (2972; 100%),
NOUN –[amod]–> ADJ (1241; 99%),
NOUN –[nmod]–> NOUN (747; 58%),
VERB –[nsubj]–> NOUN (533; 88%),
NOUN –[nmod]–> PROPN (243; 71%),
NOUN –[det]–> PRON (239; 100%),
NOUN –[conj]–> NOUN (202; 77%),
VERB –[nsubj]–> PROPN (184; 91%),
NOUN –[acl:relcl]–> VERB (167; 80%),
VERB –[nsubj]–> PRON (165; 92%).