Treebank Statistics: UD_Basque-BDT: Features: Definite
This feature is universal.
It occurs with 2 different values: Def, Ind.
37941 tokens (31%) have a non-empty value of Definite.
16304 types (67%) occur at least once with a non-empty value of Definite.
6993 lemmas (64%) occur at least once with a non-empty value of Definite.
The feature is used with 11 part-of-speech tags: NOUN (20577; 17% instances), PROPN (6191; 5% instances), ADJ (4579; 4% instances), DET (2965; 2% instances), VERB (1623; 1% instances), ADP (1497; 1% instances), AUX (242; 0% instances), PRON (207; 0% instances), ADV (27; 0% instances), NUM (19; 0% instances), SYM (14; 0% instances).
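Counts like the ones above can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the helper names `parse_feats` and `definite_counts` are hypothetical, and the three sample rows are invented for illustration rather than taken from UD_Basque-BDT.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Abs|Definite=Def' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def definite_counts(conllu_text):
    """Count tokens per UPOS, and tokens per UPOS that carry a Definite value."""
    total, with_def = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip malformed lines, multiword token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos = cols[3]
        total[upos] += 1
        if "Definite" in parse_feats(cols[5]):
            with_def[upos] += 1
    return total, with_def

# Toy rows, invented for illustration (not copied from the treebank).
SAMPLE = "\n".join([
    "# text = taldea handia da",
    "\t".join(["1", "taldea", "talde", "NOUN", "_",
               "Case=Abs|Definite=Def|Number=Sing", "2", "nsubj", "_", "_"]),
    "\t".join(["2", "handia", "handi", "ADJ", "_",
               "Case=Abs|Definite=Def|Number=Sing", "0", "root", "_", "_"]),
    "\t".join(["3", "da", "izan", "AUX", "_", "_", "2", "cop", "_", "_"]),
])

total, with_def = definite_counts(SAMPLE)
for upos in sorted(total):
    share = 100 * with_def[upos] / total[upos]
    print(f"{upos}: {with_def[upos]} of {total[upos]} tokens ({share:.0f}%)")
```

Run over the full treebank files instead of `SAMPLE`, the same loop would yield per-tag counts comparable to those listed above, assuming the page counts syntactic words and ignores multiword token ranges.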
NOUN
20577 NOUN tokens (69% of all NOUN tokens) have a non-empty value of Definite.
The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (12854; 62%), Animacy=Inan (11307; 55%).
NOUN tokens may have the following values of Definite:
Def (17130; 83% of non-empty Definite): taldeak, partidua, taldea, ostean, gobernuak, aurretik, aukera, garaipena, herrian, igandean
Ind (3447; 17% of non-empty Definite): behar, nahi, uste, ezin, urte, ondorioz, aldiz, espero, lagun, ahal
EMPTY (9136): euskal, talde, lan, jokalari, estatu, alderdi, urte, egun, partidu, aukera
| Paradigm talde | Ind | Def |
|---|---|---|
| Animacy=Inan|Case=Abl|Number=Sing | taldetik | |
| Animacy=Inan|Case=Abl|Number=Plur | taldeetatik | |
| Animacy=Inan|Case=Abs | talde | |
| Animacy=Inan|Case=Abs|Number=Sing | taldea, taldekoa | |
| Animacy=Inan|Case=Abs|Number=Plur | taldeak, taldeok | |
| Animacy=Inan|Case=All|Number=Sing | taldera | |
| Animacy=Inan|Case=All|Number=Plur | taldeetara | |
| Animacy=Inan|Case=Com|Number=Sing | taldearekin | |
| Animacy=Inan|Case=Dat|Number=Sing | taldeari | |
| Animacy=Inan|Case=Dat|Number=Plur | taldeei | |
| Animacy=Inan|Case=Erg | taldek | |
| Animacy=Inan|Case=Erg|Number=Sing | taldeak | |
| Animacy=Inan|Case=Erg|Number=Plur | taldeek, taldekoek, taldeok | |
| Animacy=Inan|Case=Ess | taldetzat | |
| Animacy=Inan|Case=Gen | talderen | |
| Animacy=Inan|Case=Gen|Number=Sing | taldearen | |
| Animacy=Inan|Case=Gen|Number=Plur | taldeen | |
| Animacy=Inan|Case=Ine | taldetan | |
| Animacy=Inan|Case=Ine|Number=Sing | taldean, taldearengan | |
| Animacy=Inan|Case=Ine|Number=Plur | taldeetan | |
| Animacy=Inan|Case=Ins | Taldez | |
| Animacy=Inan|Case=Loc|Number=Sing | taldeko, talderako | |
| Animacy=Inan|Case=Loc|Number=Plur | taldeetako | |
| Animacy=Inan|Case=Par | talderik | |
| Case=Abs|Number=Sing | Taldea | |
| Case=Abs|Number=Plur | Taldeak | |
| Case=Gen|Number=Sing | Taldearen | |
PROPN
6191 PROPN tokens (63% of all PROPN tokens) have a non-empty value of Definite.
The most frequent other feature values with which PROPN and Definite co-occurred: Number=Sing (6104; 99%).
PROPN tokens may have the following values of Definite:
Def (6183; 100% of non-empty Definite): Europako, Espainiako, Frantziako, Israelgo, Nafarroako, EEBBetako, Jugoslaviako, Miarritzek, EAJk, Osasunak
Ind (8; 0% of non-empty Definite): Briverekiko, EEBBetarako, Eurokoparako, Klodenekiko, Madrilen, OLBKren, Txetxeniarako, Vueltarako
EMPTY (3677): Jose, Juan, Euskal, Iñaki, Luis, Joseba, David, Mikel, Jean, Miguel
| Paradigm EEBB | Ind | Def |
|---|---|---|
| Case=Abs|Number=Plur | EEBBetarako | EEBBak, EEBB |
| Case=All|Number=Plur | EEBBetara | |
| Case=Erg|Number=Plur | EEBBek, EEBB-EK | |
| Case=Gen|Number=Plur | EEBBen | |
| Case=Ine|Number=Plur | EEBBetan | |
| Case=Loc|Number=Sing | EEBBetako | |
| Case=Loc|Number=Plur | EEBBetako, EEBBetarako | |
Definite seems to be a lexical feature of PROPN: 100% of lemmas (1898) occur with only one value of Definite.
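The "lexical feature" test above amounts to asking what share of lemmas is ever seen with just one value of the feature. A minimal sketch of that check follows; the function name `single_value_share` and the toy lemma labels are hypothetical, and the exact criterion used to generate this page is an assumption.

```python
from collections import defaultdict

def single_value_share(pairs):
    """Share of lemmas that occur with exactly one non-empty feature value.

    pairs: iterable of (lemma, value) tuples, where value is None or ""
    for tokens that lack the feature. Lemmas that never carry the feature
    are left out of the denominator.
    """
    values = defaultdict(set)
    for lemma, value in pairs:
        if value:
            values[lemma].add(value)
    if not values:
        return 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single / len(values)

# Toy input with placeholder lemmas, for illustration only.
observations = [
    ("lemma_a", "Def"), ("lemma_a", "Def"),
    ("lemma_b", "Ind"),
    ("lemma_c", "Def"), ("lemma_c", "Ind"),
    ("lemma_d", None),
]
print(f"{100 * single_value_share(observations):.0f}% of lemmas have a single value")
```

A share close to 100% suggests the value is fixed per lemma rather than chosen by inflection, which is the sense in which the feature is called lexical here.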
ADJ
4579 ADJ tokens (65% of all ADJ tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADJ and Definite co-occurred: NumType=EMPTY (4579; 100%), Case=Abs (2923; 64%), Number=Sing (2811; 61%).
ADJ tokens may have the following values of Definite:
Def (3852; 84% of non-empty Definite): handia, ona, bakarra, nagusia, zaila, berria, onena, garrantzitsua, nagusiak, osoa
Ind (727; 16% of non-empty Definite): nagusi, bizi, handirik, handiz, zuzen, ziur, berri, indartsu, sendo, ageri
EMPTY (2442): azken, lehen, bigarren, hirugarren, lehenengo, hurrengo, inolako, nazioarteko, laugarren, berri
| Paradigm handi | Ind | Def |
|---|---|---|
| Case=Abs | handi | |
| Case=Abs|Degree=Cmp|Number=Sing | handiagoa, haundiagoa | |
| Case=Abs|Degree=Cmp|Number=Plur | handiagoak, haundiagoak | |
| Case=Abs|Degree=Sup|Number=Sing | handiena | |
| Case=Abs|Degree=Sup|Number=Plur | handienak | |
| Case=Abs|Degree=Abs | handiegi | |
| Case=Abs|Degree=Abs|Number=Sing | handiegia | |
| Case=Abs|Number=Sing | handia, haundia | |
| Case=Abs|Number=Plur | handiak, handikoak | |
| Case=All|Degree=Cmp|Number=Sing | handiagora | |
| Case=Ben | handirentzat | |
| Case=Cau|Number=Sing | handiagatik | |
| Case=Cau|Number=Plur | handiengatik | |
| Case=Com|Degree=Cmp|Number=Sing | handiagoarekin | |
| Case=Com|Number=Sing | handiarekin | |
| Case=Com|Number=Plur | handiekin | |
| Case=Erg|Degree=Sup|Number=Plur | handienek | |
| Case=Erg|Number=Plur | handiek | |
| Case=Ess | handikotzat | |
| Case=Gen|Degree=Cmp|Number=Sing | handiagoaren | |
| Case=Gen|Number=Plur | handien | |
| Case=Ine | handitan | |
| Case=Ine|Degree=Cmp|Number=Plur | handiagoetan | |
| Case=Ine|Number=Sing | handian | |
| Case=Ine|Number=Plur | handietan | |
| Case=Ins | handiz | |
| Case=Ins|Number=Sing | handienaz | |
| Case=Loc | handitarako | |
| Case=Loc|Degree=Cmp|Number=Sing | handiagoko | |
| Case=Loc|Degree=Sup|Number=Sing | handieneko | |
| Case=Loc|Degree=Sup|Number=Plur | handienetariko | |
| Case=Loc|Number=Sing | handiko | |
| Case=Loc|Number=Plur | handietako | |
| Case=Par | handirik, haundirik | |
DET
2965 DET tokens (72% of all DET tokens) have a non-empty value of Definite.
The most frequent other feature values with which DET and Definite co-occurred: Case=Abs (1549; 52%), Number=Sing (1539; 52%).
DET tokens may have the following values of Definite:
Def (2339; 79% of non-empty Definite): hori, hau, horretan, batzuk, guztiak, honetan, horiek, horrek, bera, hauek
Ind (626; 21% of non-empty Definite): zer, gehiago, asko, ugari, horretarako, gutxi, beste, gehiegi, zein, askok
EMPTY (1174): bere, beste, hainbat, zenbait, berean, horren, zer, milioi, zein, horien
| Paradigm hori | Ind | Def |
|---|---|---|
| Case=Abl|Number=Sing | horretatik | |
| Case=Abs | horretarako | |
| Case=Abs|Number=Sing | hori, horixe, horretakoa | |
| Case=Abs|Number=Plur | horretakoak | |
| Case=All|Number=Sing | horretara | |
| Case=Cau|Number=Sing | horregatik, Horrexegatik, horrengatik | |
| Case=Com|Number=Sing | horrekin, horrexekin | |
| Case=Dat|Number=Sing | horri | |
| Case=Erg|Number=Sing | horrek | |
| Case=Gen|Number=Sing | horren | |
| Case=Ine|Number=Sing | horretan, horretantxe, horrexetan | |
| Case=Ins|Number=Sing | horrez, horretaz | |
| Case=Loc|Number=Sing | horretako, horretarako | |
VERB
1623 VERB tokens (9% of all VERB tokens) have a non-empty value of Definite.
The most frequent other feature values with which VERB and Definite co-occurred: Mood=EMPTY (1553; 96%), Number[abs]=EMPTY (1553; 96%), Person[abs]=EMPTY (1553; 96%), Aspect=EMPTY (1551; 96%), Case=Abs (999; 62%), VerbForm=Fin (818; 50%).
VERB tokens may have the following values of Definite:
Def (646; 40% of non-empty Definite): egina, hasia, litekeena, egiteak, azpimarratzekoa, dagokionean, irekia, izana, armatuak, bateratua
Ind (977; 60% of non-empty Definite): egiteko, emateko, izateko, eginez, erabiliz, lortzeko, irabazteko, jokatzeko, izan, jakiteko
EMPTY (17444): egin, izan, esan, egiten, du, da, izango, eman, dago, joan
| Paradigm izan | Ind | Def |
|---|---|---|
| Aspect=Prog|Case=Abs|Mood=Ind|Number=Sing|Number[abs]=Plur|Person[abs]=3|VerbForm=Fin | direna | |
| Aspect=Prog|Case=Abs|Mood=Ind|Number=Plur|Number[abs]=Plur|Person[abs]=3|VerbForm=Fin | zirenak | |
| Aspect=Prog|Case=Dat|Mood=Ind|Number=Plur|Number[abs]=Plur|Person[abs]=3|VerbForm=Fin | zirenei | |
| Aspect=Prog|Case=Gen|Mood=Ind|Number=Sing|Number[abs]=Sing|Person[abs]=3|VerbForm=Fin | zenaren | |
| Aspect=Prog|Case=Ins|Number=Plur|VerbForm=Fin | denez | |
| Case=Abs|Number=Sing|VerbForm=Part | izana | |
| Case=Abs|VerbForm=Fin | izateko | |
| Case=Abs|VerbForm=Part | izan | |
| Case=Dat|Number=Sing|VerbForm=Fin | izateari | |
| Case=Erg|Number=Sing|VerbForm=Fin | izateak | |
| Case=Erg|Number=Sing|VerbForm=Part | izanak | |
| Case=Gen|Number=Sing|VerbForm=Part | izanaren | |
| Case=Ins|Number=Sing|VerbForm=Fin | izateaz | |
| Case=Ins|VerbForm=Part | izanez | |
| Case=Par|VerbForm=Part | izanik | |
ADP
1497 ADP tokens (80% of all ADP tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADP and Definite co-occurred: Animacy=EMPTY (936; 63%), Number=Sing (898; 60%).
ADP tokens may have the following values of Definite:
Def (1355; 91% of non-empty Definite): arabera, artean, aurka, arteko, inguruan, kontra, aurrean, aurkako, buruz, zehar
Ind (142; 9% of non-empty Definite): gabe, gabeko, kontra, aurka, artean, ezean, bezala, gain, gisan, kanpo
EMPTY (378): arte, gisa, bezala, gainean, arteko, aurrean, inguru, artean, aurrera, barruan
| Paradigm arte | Ind | Def |
|---|---|---|
| Animacy=Anim|Case=Ine|Number=Plur | artean | |
| Animacy=Anim|Case=Loc|Number=Sing | arteko | |
| Animacy=Anim|Case=Loc|Number=Plur | arteko | |
| Animacy=Inan|Case=Abs|Number=Sing | arte | |
| Animacy=Inan|Case=Ine | artean | |
| Animacy=Inan|Case=Ine|Number=Plur | artean | |
| Animacy=Inan|Case=Loc|Number=Sing | arteko | |
| Animacy=Inan|Case=Loc|Number=Plur | arteko | |
| Case=Abs | arte | |
| Case=Abs|Number=Sing | arte | |
| Case=Abs|Number=Plur | arte | |
| Case=Ine | artean | |
| Case=Ine|Degree=Sup|Number=Plur | artean | |
| Case=Ine|Number=Sing | artean | |
| Case=Ine|Number=Plur | artean | |
| Case=Ine|Number=Plur|Person=1 | artean | |
| Case=Ine|Number=Plur|Person=2 | artean | |
| Case=Ine|Number=Plur|Person=3 | artean | |
| Case=Loc | arteko | |
| Case=Loc|Degree=Sup|Number=Plur | arteko | |
| Case=Loc|Number=Sing | arteko | |
| Case=Loc|Number=Plur | arteko | |
| Case=Loc|Number=Plur|Person=3 | arteko | |
AUX
242 AUX tokens (2% of all AUX tokens) have a non-empty value of Definite.
The most frequent other feature values with which AUX and Definite co-occurred: Aspect=EMPTY (218; 90%), VerbForm=Fin (203; 84%), Person[abs]=3 (167; 69%), Mood=Ind (166; 69%), Number[erg]=EMPTY (132; 55%), Person[erg]=EMPTY (132; 55%), Number[abs]=Sing (128; 53%).
AUX tokens may have the following values of Definite:
Def (220; 91% of non-empty Definite): izana, dena, duena, dutenak, zuena, direnak, izateaz, dituztenak, dutena, zena
Ind (22; 9% of non-empty Definite): izateko, izanik, izan, duenik, dutenen, egon, egoteko, izanez
EMPTY (12306): da, zuen, zen, du, dira, izan, dute, zuten, ziren, ditu
PRON
207 PRON tokens (26% of all PRON tokens) have a non-empty value of Definite.
The most frequent other feature values with which PRON and Definite co-occurred: PronType=EMPTY (207; 100%), Case=Abs (126; 61%).
PRON tokens may have the following values of Definite:
Def (27; 13% of non-empty Definite): neure, norberak, neu, norberaren, geu, geure, zenbaitzuk, Geuregan, Geuri, Neuk
Ind (180; 87% of non-empty Definite): zerbait, ezer, inork, inor, elkarrekin, norbait, elkar, elkarri, norbaitek, zertxobait
EMPTY (578): gure, nire, nik, euren, guk, ni, zure, gu, nork, beraiek
Definite seems to be a lexical feature of PRON: 100% of lemmas (17) occur with only one value of Definite.
ADV
27 ADV tokens (1% of all ADV tokens) have a non-empty value of Definite.
ADV tokens may have the following values of Definite:
Def (18; 67% of non-empty Definite): samarra, adinakoa, atzokoa, atzokoan, atzokoaren, aurtengoa, aurtengoak, aurtengora, egungoak, gaurkoa
Ind (9; 33% of non-empty Definite): seguruenik, gaurko, betirako, biharko, lehenbiziko, samar
EMPTY (5225): atzo, oso, gaur, orain, ondoren, gero, hala, bertan, beti, ondo
| Paradigm gaur | Ind | Def |
|---|---|---|
| | gaurko | |
| Number=Sing | | gaurkoa |
NUM
19 NUM tokens (1% of all NUM tokens) have a non-empty value of Definite.
The most frequent other feature values with which NUM and Definite co-occurred: NumType=Card (19; 100%).
NUM tokens may have the following values of Definite:
Ind (19; 100% of non-empty Definite): bana, 16na, 21na, 31na, banarekin, bedera
EMPTY (3547): bat, bi, hiru, batean, baten, batek, lau, batez, bost, sei
SYM
14 SYM tokens (93% of all SYM tokens) have a non-empty value of Definite.
The most frequent other feature values with which SYM and Definite co-occurred: Number=Sing (13; 93%), Case=Abs (10; 71%), Animacy=EMPTY (9; 64%).
SYM tokens may have the following values of Definite:
Def (14; 100% of non-empty Definite): cm-ko, kg, kv, m, m., cm, km
EMPTY (1): Kw
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite:
NOUN –[nmod]–> PROPN (1282; 52%),
NOUN –[conj]–> NOUN (481; 55%),
ADJ –[nsubj]–> NOUN (204; 67%),
PROPN –[nmod]–> PROPN (68; 66%),
NOUN –[nsubj]–> DET (56; 65%),
ADJ –[conj]–> ADJ (52; 53%),
PROPN –[appos]–> PROPN (50; 66%),
PROPN –[appos]–> NOUN (37; 64%),
NOUN –[conj]–> PROPN (32; 53%),
ADJ –[nsubj]–> DET (31; 79%).
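Agreement figures of this kind can be approximated by walking each dependency edge and comparing the Definite value of parent and child. The sketch below is an assumption about the procedure, not the generator's actual code: the function name `agreement_counts` and the tuple layout are hypothetical, and it takes as denominator only the edges where both nodes carry a Definite value, which may differ from how the percentages above were computed.

```python
from collections import Counter

def agreement_counts(sentence):
    """Count parent-child edges that agree in Definite.

    sentence: list of (id, head, upos, deprel, definite) tuples, where
    definite is None when the token has no Definite value.
    Returns (agree, both): Counters keyed by (parent_upos, deprel, child_upos).
    """
    by_id = {tok[0]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for tok_id, head, upos, deprel, definite in sentence:
        parent = by_id.get(head)  # None for the root, whose head is 0
        if parent is None or definite is None or parent[4] is None:
            continue
        key = (parent[2], deprel, upos)
        both[key] += 1
        if definite == parent[4]:
            agree[key] += 1
    return agree, both

# Toy sentence structure, invented for illustration.
toy = [
    (1, 2, "PROPN", "nmod", "Def"),
    (2, 0, "NOUN", "root", "Def"),
    (3, 2, "NOUN", "conj", "Ind"),
    (4, 2, "ADJ", "amod", None),
]
agree, both = agreement_counts(toy)
for key in both:
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[key]} of {both[key]} agree")
```

Summing these counters over all sentences and sorting by the agreeing count would give a ranking in the same format as the list above.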