Treebank Statistics: UD_Spanish-PUD: Features: Number
This feature is universal.
It occurs with 2 different values: Plur, Sing.
This is a layered feature with the following layers: Number, Number[psor].
13879 tokens (60%) have a non-empty value of Number.
5377 types (91%) occur at least once with a non-empty value of Number.
3966 lemmas (88%) occur at least once with a non-empty value of Number.
The feature is used with 7 part-of-speech tags: NOUN (4814; 21% instances), DET (3332; 14% instances), VERB (1724; 7% instances), ADJ (1472; 6% instances), PROPN (1225; 5% instances), PRON (722; 3% instances), AUX (590; 3% instances).
NOUN
4814 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Gender=Masc (2719; 56%).
NOUN tokens may have the following values of Number:
Plur(1390; 29% of non-emptyNumber): años, millones, personas, estados, veces, ciudades, datos, elecciones, inversores, mesesSing(3424; 71% of non-emptyNumber): año, guerra, lugar, parte, gobierno, mar, ciudad, estado, vez, díaEMPTY(3): cápita, HFC
| Paradigm año | Sing | Plur |
|---|---|---|
| año | años |
DET
3332 DET tokens (100% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: PronType=Art (2983; 90%), Definite=Def (2528; 76%), Gender=Masc (1985; 60%).
DET tokens may have the following values of Number:
Plur(797; 24% of non-emptyNumber): los, las, muchos, estos, muchas, otros, todas, varios, esos, unosSing(2535; 76% of non-emptyNumber): el, la, un, una, este, esta, esto, cada, ese, esoEMPTY(6): The, un
| Paradigm el | Sing | Plur |
|---|---|---|
| Gender=Masc | el | los |
| Gender=Fem | la | las |
VERB
1724 VERB tokens (76% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (1341; 78%), Gender=EMPTY (1330; 77%), Person=3 (1267; 73%), Mood=Ind (1221; 71%), Tense=Past (917; 53%).
VERB tokens may have the following values of Number:
Plur(448; 26% of non-emptyNumber): tienen, están, incluyen, llegaron, tenían, tuvieron, afirman, dieron, empezaron, correspondenSing(1276; 74% of non-emptyNumber): dijo, tiene, es, hay, hace, está, debido, dice, hecho, afirmóEMPTY(541): hacer, tener, ver, establecer, ayudar, dejar, enviar, incluyendo, producir, asegurar
| Paradigm tener | Sing | Plur |
|---|---|---|
| Gender=Masc|Tense=Past|VerbForm=Part | tenido | |
| Mood=Cnd|Person=3|Tense=Imp|VerbForm=Fin | tendría | tendrían |
| Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | tengo | tenemos |
| Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | tienes | |
| Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | tendrá | tendrán |
| Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | tenía, tenia | tenían |
| Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | tuvo | tuvieron |
| Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | tiene | tienen |
| Mood=Sub|Person=3|Tense=Imp|VerbForm=Fin | tuviese | |
| Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | tenga |
ADJ
1472 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Gender=Masc (825; 56%).
ADJ tokens may have the following values of Number:
Plur(439; 30% of non-emptyNumber): grandes, últimos, nuevos, Unidos, diferentes, nuevas, primeros, importantes, Olímpicos, electrónicosSing(1033; 70% of non-emptyNumber): gran, primera, mayor, nueva, nacional, primer, británica, segunda, Unido, mismoEMPTY(8): American, Stranger, Talking, austro, co, ex, franco, multi
| Paradigm nuevo | Sing | Plur |
|---|---|---|
| Gender=Masc | nuevo | nuevos, nuevo |
| Gender=Fem | nueva | nuevas |
PROPN
1225 PROPN tokens (98% of all PROPN tokens) have a non-empty value of Number.
PROPN tokens may have the following values of Number:
Plur(25; 2% of non-emptyNumber): EUA, Andes, Balcanes, Alpes, B-29, CBS, Caribs, GIFs, Indias, LovingSing(1200; 98% of non-emptyNumber): China, Europa, Italia, Australia, Pekín, Albania, Francia, Trump, Bretaña, C.EMPTY(28): Hong, Kong, Castelfranco, Conte, Fjögur, Giovanni, Humblebums, Pelucca, Politti, Puerto
| Paradigm India | Sing | Plur |
|---|---|---|
| _ | India | |
| Gender=Fem | India | Indias |
Number seems to be lexical feature of PROPN. 100% lemmas (848) occur only with one value of Number.
PRON
722 PRON tokens (69% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Reflex=EMPTY (717; 99%), PrepCase=EMPTY (599; 83%), Case=EMPTY (505; 70%), Poss=EMPTY (495; 69%), PronType=Prs (448; 62%), Person=3 (410; 57%), Gender=Masc (375; 52%).
PRON tokens may have the following values of Number:
Plur(175; 24% of non-emptyNumber): que, sus, ellos, les, cuales, nos, los, quienes, Cuáles, QuiénesSing(547; 76% of non-emptyNumber): su, que, lo, le, cual, me, él, ella, quien, laEMPTY(317): se, qué, You, cuanto, sí, consigo
| Paradigm él | Sing | Plur |
|---|---|---|
| Case=Acc,Nom|Gender=Masc|Person=3 | él, ello | ellos |
| Case=Acc,Nom|Gender=Fem|Person=3 | ella | |
| Case=Acc|Gender=Masc|Person=3|PrepCase=Npr | lo | los |
| Case=Acc|Gender=Fem|Person=3|PrepCase=Npr | la | las |
| Case=Dat|Person=3 | le | les |
| Gender=Masc|Person=3 | los | |
| Gender=Masc | los |
AUX
590 AUX tokens (93% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (575; 97%), Person=3 (548; 93%), Mood=Ind (520; 88%).
AUX tokens may have the following values of Number:
Plur(153; 26% of non-emptyNumber): son, fueron, han, habían, pueden, estaban, están, eran, hayan, debemosSing(437; 74% of non-emptyNumber): es, fue, ha, había, está, era, puede, estaba, podría, sidoEMPTY(44): ser, haber, siendo, estar, poder, Did, Do, Habiendo
| Paradigm ser | Sing | Plur |
|---|---|---|
| Gender=Masc|Tense=Past|VerbForm=Part | sido | |
| Mood=Cnd|Person=3|Tense=Imp|VerbForm=Fin | sería | serían |
| Mood=Ind|Person=1|Tense=Imp|VerbForm=Fin | era | |
| Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | somos | |
| Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | Tienes | |
| Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | será | serán |
| Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | era | eran |
| Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | fue | fueron |
| Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | es, considera | son, consideran |
| Mood=Sub|Person=1|Tense=Pres|VerbForm=Fin | seamos | |
| Mood=Sub|Person=3|Tense=Imp|VerbForm=Fin | fuera | fueran, pudieran |
| Mood=Sub|Person=3|Tense=Past|VerbForm=Fin | hubiera | |
| Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | sea | sean |
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[det]–> DET (2972; 100%),
NOUN –[amod]–> ADJ (1241; 99%),
NOUN –[nmod]–> NOUN (746; 59%),
VERB –[nsubj]–> NOUN (534; 88%),
NOUN –[nmod]–> PROPN (243; 71%),
NOUN –[det]–> PRON (239; 100%),
NOUN –[conj]–> NOUN (202; 77%),
VERB –[nsubj]–> PROPN (183; 91%),
NOUN –[acl:relcl]–> VERB (168; 80%),
VERB –[nsubj]–> PRON (163; 91%).