Treebank Statistics: UD_Spanish-PUD: Features: Number
This feature is universal.
It occurs with 2 different values: Plur
, Sing
.
This is a layered feature with the following layers: Number, Number[psor].
13880 tokens (60%) have a non-empty value of Number
.
5377 types (91%) occur at least once with a non-empty value of Number
.
3966 lemmas (88%) occur at least once with a non-empty value of Number
.
The feature is used with 7 part-of-speech tags: NOUN (4814; 21% instances), DET (3333; 14% instances), VERB (1724; 7% instances), ADJ (1472; 6% instances), PROPN (1225; 5% instances), PRON (722; 3% instances), AUX (590; 3% instances).
NOUN
4814 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NOUN
and Number
co-occurred: Gender=Masc (2719; 56%).
NOUN
tokens may have the following values of Number
:
Plur
(1390; 29% of non-emptyNumber
): años, millones, personas, estados, veces, ciudades, datos, elecciones, inversores, mesesSing
(3424; 71% of non-emptyNumber
): año, guerra, lugar, parte, gobierno, mar, ciudad, estado, vez, díaEMPTY
(1): HFC
Paradigm año | Sing | Plur |
---|---|---|
año | años |
DET
3333 DET tokens (100% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: PronType=Art (2984; 90%), Definite=Def (2529; 76%), Gender=Masc (1986; 60%).
DET
tokens may have the following values of Number
:
Plur
(797; 24% of non-emptyNumber
): los, las, muchos, estos, muchas, otros, todas, varios, esos, unosSing
(2536; 76% of non-emptyNumber
): el, la, un, una, este, esta, esto, cada, ese, esoEMPTY
(6): The, un
Paradigm el | Sing | Plur |
---|---|---|
Gender=Masc | el | los |
Gender=Fem | la | las |
VERB
1724 VERB tokens (76% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: VerbForm=Fin (1341; 78%), Gender=EMPTY (1330; 77%), Person=3 (1267; 73%), Mood=Ind (1221; 71%), Tense=Past (917; 53%).
VERB
tokens may have the following values of Number
:
Plur
(448; 26% of non-emptyNumber
): tienen, están, incluyen, llegaron, tenían, tuvieron, afirman, dieron, empezaron, correspondenSing
(1276; 74% of non-emptyNumber
): dijo, tiene, es, hay, hace, está, debido, dice, hecho, afirmóEMPTY
(541): hacer, tener, ver, establecer, ayudar, dejar, enviar, incluyendo, producir, asegurar
Paradigm tener | Sing | Plur |
---|---|---|
Gender=Masc|Tense=Past|VerbForm=Part | tenido | |
Mood=Cnd|Person=3|Tense=Imp|VerbForm=Fin | tendría | tendrían |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | tengo | tenemos |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | tienes | |
Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | tendrá | tendrán |
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | tenía, tenia | tenían |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | tuvo | tuvieron |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | tiene | tienen |
Mood=Sub|Person=3|Tense=Imp|VerbForm=Fin | tuviese | |
Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | tenga |
ADJ
1472 ADJ tokens (99% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: Gender=Masc (825; 56%).
ADJ
tokens may have the following values of Number
:
Plur
(439; 30% of non-emptyNumber
): grandes, últimos, nuevos, Unidos, diferentes, nuevas, primeros, importantes, Olímpicos, electrónicosSing
(1033; 70% of non-emptyNumber
): gran, primera, mayor, nueva, nacional, primer, británica, segunda, Unido, mismoEMPTY
(8): American, Stranger, Talking, austro, co, ex, franco, multi
Paradigm nuevo | Sing | Plur |
---|---|---|
Gender=Masc | nuevo | nuevos, nuevo |
Gender=Fem | nueva | nuevas |
PROPN
1225 PROPN tokens (98% of all PROPN
tokens) have a non-empty value of Number
.
PROPN
tokens may have the following values of Number
:
Plur
(25; 2% of non-emptyNumber
): EUA, Andes, Balcanes, Alpes, B-29, CBS, Caribs, GIFs, Indias, LovingSing
(1200; 98% of non-emptyNumber
): China, Europa, Italia, Australia, Pekín, Albania, Francia, Trump, Bretaña, C.EMPTY
(28): Hong, Kong, Castelfranco, Conte, Fjögur, Giovanni, Humblebums, Pelucca, Politti, Puerto
Paradigm India | Sing | Plur |
---|---|---|
_ | India | |
Gender=Fem | India | Indias |
Number
seems to be lexical feature of PROPN
. 100% lemmas (848) occur only with one value of Number
.
PRON
722 PRON tokens (69% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: Reflex=EMPTY (717; 99%), PrepCase=EMPTY (599; 83%), Case=EMPTY (505; 70%), Poss=EMPTY (495; 69%), PronType=Prs (448; 62%), Person=3 (410; 57%), Gender=Masc (375; 52%).
PRON
tokens may have the following values of Number
:
Plur
(175; 24% of non-emptyNumber
): que, sus, ellos, les, cuales, nos, los, quienes, Cuáles, QuiénesSing
(547; 76% of non-emptyNumber
): su, que, lo, le, cual, me, él, ella, quien, laEMPTY
(317): se, qué, You, cuanto, sí, consigo
Paradigm él | Sing | Plur |
---|---|---|
Case=Acc,Nom|Gender=Masc|Person=3 | él, ello | ellos |
Case=Acc,Nom|Gender=Fem|Person=3 | ella | |
Case=Acc|Gender=Masc|Person=3|PrepCase=Npr | lo | los |
Case=Acc|Gender=Fem|Person=3|PrepCase=Npr | la | las |
Case=Dat|Person=3 | le | les |
Gender=Masc|Person=3 | los | |
Gender=Masc | los |
AUX
590 AUX tokens (93% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: VerbForm=Fin (575; 97%), Person=3 (548; 93%), Mood=Ind (520; 88%).
AUX
tokens may have the following values of Number
:
Plur
(153; 26% of non-emptyNumber
): son, fueron, han, habían, pueden, estaban, están, eran, hayan, debemosSing
(437; 74% of non-emptyNumber
): es, fue, ha, había, está, era, puede, estaba, podría, sidoEMPTY
(44): ser, haber, siendo, estar, poder, Did, Do, Habiendo
Paradigm ser | Sing | Plur |
---|---|---|
Gender=Masc|Tense=Past|VerbForm=Part | sido | |
Mood=Cnd|Person=3|Tense=Imp|VerbForm=Fin | sería | serían |
Mood=Ind|Person=1|Tense=Imp|VerbForm=Fin | era | |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | somos | |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | Tienes | |
Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | será | serán |
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | era | eran |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | fue | fueron |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | es, considera | son, consideran |
Mood=Sub|Person=1|Tense=Pres|VerbForm=Fin | seamos | |
Mood=Sub|Person=3|Tense=Imp|VerbForm=Fin | fuera | fueran, pudieran |
Mood=Sub|Person=3|Tense=Past|VerbForm=Fin | hubiera | |
Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | sea | sean |
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[det]–> DET (2972; 100%),
NOUN –[amod]–> ADJ (1241; 99%),
NOUN –[nmod]–> NOUN (746; 59%),
VERB –[nsubj]–> NOUN (534; 88%),
NOUN –[nmod]–> PROPN (243; 71%),
NOUN –[det]–> PRON (239; 100%),
NOUN –[conj]–> NOUN (202; 77%),
VERB –[nsubj]–> PROPN (183; 91%),
NOUN –[acl:relcl]–> VERB (168; 80%),
VERB –[nsubj]–> PRON (163; 91%).