home eu/feat edit page issue tracker

Number: number

The Number feature for Basque follows the standard UD guidelines for nouns, adjectives, determiners and adverbs. However, finite verbs contain agreement features on number for the subject, object and indirect object, so the Basque treebank follows the UD description for language-specific features, defining Number[erg]=Sing,Plur, Number[abs]=Sing,Plur, and Number[dat]=Sing,Plur.


Treebank Statistics (UD_Basque)

This feature is universal. It occurs with 2 different values: Plur, Sing.

This is a layered feature with the following layers: Number, Number[abs], Number[dat], Number[erg].

32262 tokens (27%) have a non-empty value of Number. 14065 types (58%) occur at least once with a non-empty value of Number. 6382 lemmas (58%) occur at least once with a non-empty value of Number. The feature is used with 11 part-of-speech tags: eu-pos/NOUN (17147; 14% instances), eu-pos/PROPN (6190; 5% instances), eu-pos/ADJ (3853; 3% instances), eu-pos/DET (2723; 2% instances), eu-pos/ADP (1423; 1% instances), eu-pos/VERB (713; 1% instances), eu-pos/AUX (153; 0% instances), eu-pos/PRON (27; 0% instances), eu-pos/ADV (18; 0% instances), eu-pos/SYM (14; 0% instances), eu-pos/NUM (1; 0% instances).

NOUN

17147 eu-pos/NOUN tokens (58% of all NOUN tokens) have a non-empty value of Number.

The most frequent other feature values with which NOUN and Number co-occurred: Definite=Def (17144; 100%), Animacy=Inan (9682; 56%).

NOUN tokens may have the following values of Number:

Paradigm taldeSingPlur
Animacy=Inan|Case=Abltaldetiktaldeetatik
Animacy=Inan|Case=Abstaldea, taldekoataldeak, taldeok
Animacy=Inan|Case=Alltalderataldeetara
Animacy=Inan|Case=Comtaldearekin
Animacy=Inan|Case=Dattaldearitaldeei
Animacy=Inan|Case=Ergtaldeaktaldeek, taldekoek, taldeok
Animacy=Inan|Case=Gentaldearentaldeen
Animacy=Inan|Case=Inetaldean, taldearengantaldeetan
Animacy=Inan|Case=Loctaldeko, talderakotaldeetako
Case=AbsTaldeaTaldeak
Case=GenTaldearen

PROPN

6190 eu-pos/PROPN tokens (63% of all PROPN tokens) have a non-empty value of Number.

The most frequent other feature values with which PROPN and Number co-occurred: Definite=Def (6184; 100%).

PROPN tokens may have the following values of Number:

Paradigm EEBBSingPlur
Case=Abs|Definite=DefEEBBak, EEBB
Case=Abs|Definite=IndEEBBetarako
Case=All|Definite=DefEEBBetara
Case=Erg|Definite=DefEEBBek, EEBB-EK
Case=Gen|Definite=DefEEBBen
Case=Ine|Definite=DefEEBBetan
Case=Loc|Definite=DefEEBBetakoEEBBetako, EEBBetarako

Number seems to be lexical feature of PROPN. 100% lemmas (1901) occur only with one value of Number.

ADJ

3853 eu-pos/ADJ tokens (65% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Definite=Def (3852; 100%), Case=Abs (2454; 64%).

ADJ tokens may have the following values of Number:

Paradigm handiSingPlur
Case=Abshandia, haundiahandiak, handikoak
Case=Abs|Degree=Cmphandiagoa, haundiagoahandiagoak, haundiagoak
Case=Abs|Degree=Suphandienahandienak
Case=Abs|Degree=Abshandiegia
Case=All|Degree=Cmphandiagora
Case=Cauhandiagatikhandiengatik
Case=Comhandiarekinhandiekin
Case=Com|Degree=Cmphandiagoarekin
Case=Erghandiek
Case=Erg|Degree=Suphandienek
Case=Genhandien
Case=Gen|Degree=Cmphandiagoaren
Case=Inehandianhandietan
Case=Ine|Degree=Cmphandiagoetan
Case=Inshandienaz
Case=Lochandikohandietako
Case=Loc|Degree=Cmphandiagoko
Case=Loc|Degree=Suphandienekohandienetariko

DET

2723 eu-pos/DET tokens (67% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: Definite=Def (2324; 85%).

DET tokens may have the following values of Number:

Paradigm beraSingPlur
Case=Ablberetik
Case=Abs|Definite=Defbera, berea, berekoabereak, berekoak
Case=Abs|Definite=Indbere
Case=All|Definite=Defberarengana
Case=Benberetzat
Case=Ben|Definite=Defberarentzat
Case=Cau|Definite=Defberagatik, berarengatik
Case=Com|Definite=Defberarekin
Case=Dat|Definite=Defberari
Case=Erg|Definite=Defberak
Case=Genbere
Case=Gen|Definite=Defberaren
Case=Ineberean
Case=Ine|Definite=Defberarengan
Case=Ins|Definite=Defberaz

ADP

1423 eu-pos/ADP tokens (76% of all ADP tokens) have a non-empty value of Number.

The most frequent other feature values with which ADP and Number co-occurred: Definite=Def (1356; 95%), Animacy=EMPTY (938; 66%).

ADP tokens may have the following values of Number:

Paradigm arteSingPlur
Animacy=Anim|Case=Ine|Definite=Defartean
Animacy=Anim|Case=Loc|Definite=Defartekoarteko
Animacy=Inan|Case=Abs|Definite=Defarte
Animacy=Inan|Case=Ine|Definite=Defartean
Animacy=Inan|Case=Loc|Definite=Defartekoarteko
Case=Abs|Definite=Defartearte
Case=Ineartean
Case=Ine|Definite=Defarteanartean
Case=Ine|Definite=Def|Degree=Supartean
Case=Ine|Definite=Def|Person=1artean
Case=Ine|Definite=Def|Person=2artean
Case=Ine|Definite=Def|Person=3artean
Case=Loc|Definite=Defartekoarteko
Case=Loc|Definite=Def|Degree=Suparteko
Case=Loc|Definite=Def|Person=3arteko

VERB

713 eu-pos/VERB tokens (3% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: Number[abs]=EMPTY (622; 87%), Person[abs]=EMPTY (622; 87%), Mood=EMPTY (622; 87%), Aspect=EMPTY (620; 87%), VerbForm=Part (445; 62%), Case=Abs (410; 58%).

VERB tokens may have the following values of Number:

Paradigm izanSingPlur
Aspect=Prog|Case=Abl|Mood=Ind|Person[abs]=1ginenekotik
Aspect=Prog|Case=Abs|Mood=Ind|Person[abs]=3dena, direnadirenak, zirenak
Aspect=Prog|Case=Abs|Mood=Ind|Person[abs]=3|Person[dat]=3zaizkienak
Aspect=Prog|Case=Ben|Mood=Ind|Person[abs]=3denarentzat
Aspect=Prog|Case=Dat|Mood=Ind|Person[abs]=3zirenei
Aspect=Prog|Case=Erg|Mood=Ind|Person[abs]=1garenok
Aspect=Prog|Case=Gen|Mood=Ind|Person[abs]=3zenaren
Aspect=Prog|Case=Insdenez
Case=Abs|VerbForm=Partizana, izandakoaizanak
Case=Cauizateagatik
Case=Cau|VerbForm=Partizanagatik
Case=Datizateari
Case=Ergizateak
Case=Erg|VerbForm=Partizanak, izandakoak
Case=Genizatearen
Case=Gen|VerbForm=Partizanaren
Case=Insizateaz
Case=Ins|VerbForm=Partizanaz

AUX

153 eu-pos/AUX tokens (2% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: Person[abs]=3 (148; 97%), Mood=Ind (143; 93%), Number[dat]=EMPTY (134; 88%), Person[dat]=EMPTY (134; 88%), Number[abs]=Sing (116; 76%), Person[erg]=3 (98; 64%).

AUX tokens may have the following values of Number:

Paradigm *edunSingPlur
Case=Abl|Mood=Ind|Person[abs]=3|Person[erg]=3dutenenetatik
Case=Abs|Mood=Cnd|Person[abs]=3|Person[erg]=3lukeena
Case=Abs|Mood=Ind|Person[abs]=1|Person[erg]=3gaituztenak
Case=Abs|Mood=Ind|Person[abs]=3|Person[dat]=1|Person[erg]=3zidatena, didanazidatenak, dizkigunak
Case=Abs|Mood=Ind|Person[abs]=3|Person[dat]=3|Person[erg]=3diona, zizkiona, diotena, zionadienak, diotenak
Case=Abs|Mood=Ind|Person[abs]=3|Person[erg]=1nuena
Case=Abs|Mood=Ind|Person[abs]=3|Person[erg]=3duena, zuena, dutena, dituena, zutena, dutenetakoa, dutenenadutenak, dituztenak, zituztenak, dituenak
Case=Ben|Mood=Ind|Person[abs]=1|Person[erg]=3nauenarentzat
Case=Ben|Mood=Ind|Person[abs]=3|Person[erg]=3zuenarentzat
Case=Cau|Mood=Ind|Person[abs]=1|Person[erg]=3gintuenagatik
Case=Cau|Mood=Ind|Person[abs]=3|Person[erg]=3dutenagatik
Case=Com|Mood=Ind|Person[abs]=3|Person[dat]=3|Person[erg]=3zizkiotenekin
Case=Com|Mood=Ind|Person[abs]=3|Person[erg]=1dudanarekin
Case=Com|Mood=Ind|Person[abs]=3|Person[erg]=2duzunarekin
Case=Com|Mood=Ind|Person[abs]=3|Person[erg]=3dituztenekin
Case=Dat|Mood=Ind|Person[abs]=3|Person[erg]=1dugunari
Case=Dat|Mood=Ind|Person[abs]=3|Person[erg]=3duenarizituenei
Case=Erg|Mood=Ind|Person[abs]=3|Person[dat]=3|Person[erg]=3ziotenakdiotenek
Case=Erg|Mood=Ind|Person[abs]=3|Person[erg]=1dudanak
Case=Erg|Mood=Ind|Person[abs]=3|Person[erg]=3duenak, zuenakdutenek, dituztenek
Case=Gen|Mood=Ind|Person[abs]=3|Person[erg]=3duenaren, dutenarendutenen, dituztenen
Case=Loc|Mood=Ind|Person[abs]=3|Person[erg]=3zuteneko, duteneko

PRON

27 eu-pos/PRON tokens (3% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Definite=Def (27; 100%), PronType=EMPTY (27; 100%).

PRON tokens may have the following values of Number:

ADV

18 eu-pos/ADV tokens (0% of all ADV tokens) have a non-empty value of Number.

ADV tokens may have the following values of Number:

Paradigm samarSingPlur
Case=Abssamarra
Case=Inesamarreansamarretan

SYM

14 eu-pos/SYM tokens (93% of all SYM tokens) have a non-empty value of Number.

The most frequent other feature values with which SYM and Number co-occurred: Definite=Def (14; 100%), Case=Abs (10; 71%), Animacy=EMPTY (9; 64%).

SYM tokens may have the following values of Number:

Paradigm kVSingPlur
KVkv

NUM

1 eu-pos/NUM tokens (0% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumType=EMPTY (1; 100%).

NUM tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[nmod]–> DET (265; 53%), ADJ –[nsubj]–> NOUN (191; 66%), PROPN –[nmod]–> PROPN (66; 64%), NOUN –[nsubj]–> DET (52; 62%), PROPN –[appos]–> PROPN (50; 67%), ADJ –[conj]–> ADJ (47; 57%), PROPN –[appos]–> NOUN (37; 64%), ADJ –[nsubj]–> DET (30; 81%), NOUN –[conj]–> PROPN (30; 54%), NOUN –[acl]–> ADJ (13; 52%).


Number in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]