Number
: number
The Number
feature for Basque follows the standard UD guidelines for nouns, adjectives, determiners and adverbs. However, finite verbs contain agreement features on number for the subject, object and indirect object, so the Basque treebank follows the UD description for language-specific features, defining Number[erg]=Sing,Plur, Number[abs]=Sing,Plur, and Number[dat]=Sing,Plur.
Treebank Statistics (UD_Basque)
This feature is universal.
It occurs with 2 different values: Plur
, Sing
.
This is a layered feature with the following layers: Number, Number[abs], Number[dat], Number[erg].
32262 tokens (27%) have a non-empty value of Number
.
14065 types (58%) occur at least once with a non-empty value of Number
.
6382 lemmas (58%) occur at least once with a non-empty value of Number
.
The feature is used with 11 part-of-speech tags: eu-pos/NOUN (17147; 14% instances), eu-pos/PROPN (6190; 5% instances), eu-pos/ADJ (3853; 3% instances), eu-pos/DET (2723; 2% instances), eu-pos/ADP (1423; 1% instances), eu-pos/VERB (713; 1% instances), eu-pos/AUX (153; 0% instances), eu-pos/PRON (27; 0% instances), eu-pos/ADV (18; 0% instances), eu-pos/SYM (14; 0% instances), eu-pos/NUM (1; 0% instances).
NOUN
17147 eu-pos/NOUN tokens (58% of all NOUN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NOUN
and Number
co-occurred: Definite=Def (17144; 100%), Animacy=Inan (9682; 56%).
NOUN
tokens may have the following values of Number
:
Plur
(4285; 25% of non-emptyNumber
): gauzak, urteetan, arazoak, egunetan, hauteskundeak, jokalariak, eskubideen, egunotan, jokalariek, urteotanSing
(12862; 75% of non-emptyNumber
): partidua, taldea, taldeak, ostean, gobernuak, aurretik, aukera, garaipena, herrian, igandeanEMPTY
(12630): behar, nahi, euskal, urte, talde, uste, jokalari, lan, ezin, egun
Paradigm talde | Sing | Plur |
---|---|---|
Animacy=Inan|Case=Abl | taldetik | taldeetatik |
Animacy=Inan|Case=Abs | taldea, taldekoa | taldeak, taldeok |
Animacy=Inan|Case=All | taldera | taldeetara |
Animacy=Inan|Case=Com | taldearekin | |
Animacy=Inan|Case=Dat | taldeari | taldeei |
Animacy=Inan|Case=Erg | taldeak | taldeek, taldeok, taldekoek |
Animacy=Inan|Case=Gen | taldearen | taldeen |
Animacy=Inan|Case=Ine | taldean, taldearengan | taldeetan |
Animacy=Inan|Case=Loc | taldeko, talderako | taldeetako |
Case=Abs | Taldea | Taldeak |
Case=Gen | Taldearen |
PROPN
6190 eu-pos/PROPN tokens (63% of all PROPN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PROPN
and Number
co-occurred: Definite=Def (6184; 100%).
PROPN
tokens may have the following values of Number
:
Plur
(85; 1% of non-emptyNumber
): EEBBetako, EEBBek, EEBBetan, EEBBak, EEBBetara, Filipinetako, Moluketan, Bahamak, Bahametako, EEBBenSing
(6105; 99% of non-emptyNumber
): Europako, Espainiako, Frantziako, Israelgo, Nafarroako, Jugoslaviako, Miarritzek, EAJk, Osasunak, RealakEMPTY
(3679): Jose, Juan, Euskal, Iñaki, Luis, Joseba, David, Mikel, Jean, Miguel
Paradigm EEBB | Sing | Plur |
---|---|---|
Case=Abs|Definite=Def | EEBBak, EEBB | |
Case=Abs|Definite=Ind | EEBBetarako | |
Case=All|Definite=Def | EEBBetara | |
Case=Erg|Definite=Def | EEBBek, EEBB-EK | |
Case=Gen|Definite=Def | EEBBen | |
Case=Ine|Definite=Def | EEBBetan | |
Case=Loc|Definite=Def | EEBBetako | EEBBetako, EEBBetarako |
Number
seems to be lexical feature of PROPN
. 100% lemmas (1901) occur only with one value of Number
.
ADJ
3853 eu-pos/ADJ tokens (65% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: Definite=Def (3852; 100%), Case=Abs (2454; 64%).
ADJ
tokens may have the following values of Number
:
Plur
(1042; 27% of non-emptyNumber
): nagusiak, handiak, onak, berriak, Batuetako, onenak, ezberdinak, bereziak, desberdinak, ezagunakSing
(2811; 73% of non-emptyNumber
): handia, ona, bakarra, nagusia, zaila, berria, onena, garrantzitsua, osoa, bereziaEMPTY
(2120): hurrengo, nagusi, berri, inolako, nazioarteko, politiko, txiki, bizi, handirik, olinpiar
Paradigm handi | Sing | Plur |
---|---|---|
Case=Abs | handia, haundia | handiak, handikoak |
Case=Abs|Degree=Cmp | handiagoa, haundiagoa | handiagoak, haundiagoak |
Case=Abs|Degree=Sup | handiena | handienak |
Case=Abs|Degree=Abs | handiegia | |
Case=All|Degree=Cmp | handiagora | |
Case=Cau | handiagatik | handiengatik |
Case=Com | handiarekin | handiekin |
Case=Com|Degree=Cmp | handiagoarekin | |
Case=Erg | handiek | |
Case=Erg|Degree=Sup | handienek | |
Case=Gen | handien | |
Case=Gen|Degree=Cmp | handiagoaren | |
Case=Ine | handian | handietan |
Case=Ine|Degree=Cmp | handiagoetan | |
Case=Ins | handienaz | |
Case=Loc | handiko | handietako |
Case=Loc|Degree=Cmp | handiagoko | |
Case=Loc|Degree=Sup | handieneko | handienetariko |
DET
2723 eu-pos/DET tokens (67% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: Definite=Def (2324; 85%).
DET
tokens may have the following values of Number
:
Plur
(807; 30% of non-emptyNumber
): batzuk, guztiak, horiek, hauek, beren, gehienak, batzuek, guztiek, hauetan, haienSing
(1916; 70% of non-emptyNumber
): bere, hori, hau, horretan, honetan, horrek, bera, berak, haren, horrenEMPTY
(1351): beste, zer, gehiago, hainbat, asko, zenbait, zein, ugari, horretarako, gutxi
Paradigm bera | Sing | Plur |
---|---|---|
Case=Abl | beretik | |
Case=Abs|Definite=Def | bera, berekoa, berea | bereak, berekoak |
Case=Abs|Definite=Ind | bere | |
Case=All|Definite=Def | berarengana | |
Case=Ben | beretzat | |
Case=Ben|Definite=Def | berarentzat | |
Case=Cau|Definite=Def | beragatik, berarengatik | |
Case=Com|Definite=Def | berarekin | |
Case=Dat|Definite=Def | berari | |
Case=Erg|Definite=Def | berak | |
Case=Gen | bere | |
Case=Gen|Definite=Def | beraren | |
Case=Ine | berean | |
Case=Ine|Definite=Def | berarengan | |
Case=Ins|Definite=Def | beraz |
ADP
1423 eu-pos/ADP tokens (76% of all ADP
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADP
and Number
co-occurred: Definite=Def (1356; 95%), Animacy=EMPTY (938; 66%).
ADP
tokens may have the following values of Number
:
Plur
(462; 32% of non-emptyNumber
): artean, arteko, arabera, inguruan, aurrean, bidez, aurka, bezala, buruz, aurkakoSing
(961; 68% of non-emptyNumber
): arabera, aurka, kontra, inguruan, aurrean, aurkako, zehar, arte, buruz, kontrakoEMPTY
(453): gabe, arte, gisa, bezala, arteko, artean, gainean, gabeko, inguru, aurrean
Paradigm arte | Sing | Plur |
---|---|---|
Animacy=Anim|Case=Ine|Definite=Def | artean | |
Animacy=Anim|Case=Loc|Definite=Def | arteko | arteko |
Animacy=Inan|Case=Abs|Definite=Def | arte | |
Animacy=Inan|Case=Ine|Definite=Def | artean | |
Animacy=Inan|Case=Loc|Definite=Def | arteko | arteko |
Case=Abs|Definite=Def | arte | arte |
Case=Ine | artean | |
Case=Ine|Definite=Def | artean | artean |
Case=Ine|Definite=Def|Degree=Sup | artean | |
Case=Ine|Definite=Def|Person=1 | artean | |
Case=Ine|Definite=Def|Person=2 | artean | |
Case=Ine|Definite=Def|Person=3 | artean | |
Case=Loc|Definite=Def | arteko | arteko |
Case=Loc|Definite=Def|Degree=Sup | arteko | |
Case=Loc|Definite=Def|Person=3 | arteko |
VERB
713 eu-pos/VERB tokens (3% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Number[abs]=EMPTY (622; 87%), Person[abs]=EMPTY (622; 87%), Mood=EMPTY (622; 87%), Aspect=EMPTY (620; 87%), VerbForm=Part (445; 62%), Case=Abs (410; 58%).
VERB
tokens may have the following values of Number
:
Plur
(184; 26% of non-emptyNumber
): eginak, atxilotuak, daudenak, daudenen, dituztenak, dutenak, eginez, gonbidatuak, ikasiak, jarriakSing
(529; 74% of non-emptyNumber
): izana, egina, hasia, litekeena, izateaz, egiteak, izateak, azpimarratzekoa, dagokionean, irekiaEMPTY
(20830): izan, da, egin, izango, dira, du, esan, egiten, eman, dago
Paradigm izan | Sing | Plur |
---|---|---|
Aspect=Prog|Case=Abl|Mood=Ind|Person[abs]=1 | ginenekotik | |
Aspect=Prog|Case=Abs|Mood=Ind|Person[abs]=3 | dena, direna | direnak, zirenak |
Aspect=Prog|Case=Abs|Mood=Ind|Person[abs]=3|Person[dat]=3 | zaizkienak | |
Aspect=Prog|Case=Ben|Mood=Ind|Person[abs]=3 | denarentzat | |
Aspect=Prog|Case=Dat|Mood=Ind|Person[abs]=3 | zirenei | |
Aspect=Prog|Case=Erg|Mood=Ind|Person[abs]=1 | garenok | |
Aspect=Prog|Case=Gen|Mood=Ind|Person[abs]=3 | zenaren | |
Aspect=Prog|Case=Ins | denez | |
Case=Abs|VerbForm=Part | izana, izandakoa | izanak |
Case=Cau | izateagatik | |
Case=Cau|VerbForm=Part | izanagatik | |
Case=Dat | izateari | |
Case=Erg | izateak | |
Case=Erg|VerbForm=Part | izanak, izandakoak | |
Case=Gen | izatearen | |
Case=Gen|VerbForm=Part | izanaren | |
Case=Ins | izateaz | |
Case=Ins|VerbForm=Part | izanaz |
AUX
153 eu-pos/AUX tokens (2% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: Person[abs]=3 (148; 97%), Mood=Ind (143; 93%), Number[dat]=EMPTY (134; 88%), Person[dat]=EMPTY (134; 88%), Number[abs]=Sing (116; 76%), Person[erg]=3 (98; 64%).
AUX
tokens may have the following values of Number
:
Plur
(54; 35% of non-emptyNumber
): dutenak, dituztenak, zirenak, direnak, dutenek, daitezkeenak, direnek, dutenen, zituztenak, dienakSing
(99; 65% of non-emptyNumber
): duena, dena, zuena, dutena, zena, dituena, duenak, denari, diona, duenarenEMPTY
(9920): zuen, da, du, zen, dira, dute, zuten, ziren, ditu, zituen
Paradigm *edun | Sing | Plur |
---|---|---|
Case=Abl|Mood=Ind|Person[abs]=3|Person[erg]=3 | dutenenetatik | |
Case=Abs|Mood=Cnd|Person[abs]=3|Person[erg]=3 | lukeena | |
Case=Abs|Mood=Ind|Person[abs]=1|Person[erg]=3 | gaituztenak | |
Case=Abs|Mood=Ind|Person[abs]=3|Person[dat]=1|Person[erg]=3 | zidatena, didana | zidatenak, dizkigunak |
Case=Abs|Mood=Ind|Person[abs]=3|Person[dat]=3|Person[erg]=3 | diona, ziona, diotena, zizkiona | diotenak, dienak |
Case=Abs|Mood=Ind|Person[abs]=3|Person[erg]=1 | nuena | |
Case=Abs|Mood=Ind|Person[abs]=3|Person[erg]=3 | duena, zuena, dutena, dituena, zutena, dutenetakoa, dutenena | dutenak, dituztenak, zituztenak, dituenak |
Case=Ben|Mood=Ind|Person[abs]=1|Person[erg]=3 | nauenarentzat | |
Case=Ben|Mood=Ind|Person[abs]=3|Person[erg]=3 | zuenarentzat | |
Case=Cau|Mood=Ind|Person[abs]=1|Person[erg]=3 | gintuenagatik | |
Case=Cau|Mood=Ind|Person[abs]=3|Person[erg]=3 | dutenagatik | |
Case=Com|Mood=Ind|Person[abs]=3|Person[dat]=3|Person[erg]=3 | zizkiotenekin | |
Case=Com|Mood=Ind|Person[abs]=3|Person[erg]=1 | dudanarekin | |
Case=Com|Mood=Ind|Person[abs]=3|Person[erg]=2 | duzunarekin | |
Case=Com|Mood=Ind|Person[abs]=3|Person[erg]=3 | dituztenekin | |
Case=Dat|Mood=Ind|Person[abs]=3|Person[erg]=1 | dugunari | |
Case=Dat|Mood=Ind|Person[abs]=3|Person[erg]=3 | duenari | zituenei |
Case=Erg|Mood=Ind|Person[abs]=3|Person[dat]=3|Person[erg]=3 | ziotenak | diotenek |
Case=Erg|Mood=Ind|Person[abs]=3|Person[erg]=1 | dudanak | |
Case=Erg|Mood=Ind|Person[abs]=3|Person[erg]=3 | duenak, zuenak | dutenek, dituztenek |
Case=Gen|Mood=Ind|Person[abs]=3|Person[erg]=3 | duenaren, dutenaren | dutenen, dituztenen |
Case=Loc|Mood=Ind|Person[abs]=3|Person[erg]=3 | zuteneko, duteneko |
PRON
27 eu-pos/PRON tokens (3% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: PronType=EMPTY (27; 100%), Definite=Def (27; 100%).
PRON
tokens may have the following values of Number
:
Plur
(8; 30% of non-emptyNumber
): geu, geure, zenbaitzuk, Geuregan, GeuriSing
(19; 70% of non-emptyNumber
): neure, norberak, neu, norberaren, Neuk, Zeu, Zeuk, heure, zeureEMPTY
(758): gure, nire, nik, euren, guk, zerbait, ezer, ni, inork, zure
ADV
18 eu-pos/ADV tokens (0% of all ADV
tokens) have a non-empty value of Number
.
ADV
tokens may have the following values of Number
:
Plur
(3; 17% of non-emptyNumber
): egungoak, goranzkoak, samarretanSing
(15; 83% of non-emptyNumber
): samarra, adinakoa, atzokoa, atzokoan, atzokoaren, aurtengoa, aurtengoak, aurtengora, gaurkoa, kontrakoaEMPTY
(5160): atzo, oso, gaur, orain, ondoren, gero, hala, bertan, beti, ondo
Paradigm samar | Sing | Plur |
---|---|---|
Case=Abs | samarra | |
Case=Ine | samarrean | samarretan |
SYM
14 eu-pos/SYM tokens (93% of all SYM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which SYM
and Number
co-occurred: Definite=Def (14; 100%), Case=Abs (10; 71%), Animacy=EMPTY (9; 64%).
SYM
tokens may have the following values of Number
:
Plur
(1; 7% of non-emptyNumber
): kvSing
(13; 93% of non-emptyNumber
): cm-ko, kg, m, m., KV, cm, kmEMPTY
(1): Kw
Paradigm kV | Sing | Plur |
---|---|---|
KV | kv |
NUM
1 eu-pos/NUM tokens (0% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumType=EMPTY (1; 100%).
NUM
tokens may have the following values of Number
:
Sing
(1; 100% of non-emptyNumber
): 21naEMPTY
(4612): bat, bi, azken, lehen, hiru, batean, bigarren, baten, batek, lau
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[nmod]–> DET (265; 53%),
ADJ –[nsubj]–> NOUN (191; 66%),
PROPN –[nmod]–> PROPN (66; 64%),
NOUN –[nsubj]–> DET (52; 62%),
PROPN –[appos]–> PROPN (50; 67%),
ADJ –[conj]–> ADJ (47; 57%),
PROPN –[appos]–> NOUN (37; 64%),
ADJ –[nsubj]–> DET (30; 81%),
NOUN –[conj]–> PROPN (30; 54%),
NOUN –[acl]–> ADJ (13; 52%).
Number in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]