Treebank Statistics: UD_Basque-BDT: Features: Definite
This feature is universal.
It occurs with 2 different values: Def
, Ind
.
37941 tokens (31%) have a non-empty value of Definite
.
16305 types (67%) occur at least once with a non-empty value of Definite
.
6993 lemmas (64%) occur at least once with a non-empty value of Definite
.
The feature is used with 11 part-of-speech tags: NOUN (20577; 17% instances), PROPN (6191; 5% instances), ADJ (4579; 4% instances), DET (2965; 2% instances), VERB (1623; 1% instances), ADP (1497; 1% instances), AUX (242; 0% instances), PRON (207; 0% instances), ADV (27; 0% instances), NUM (19; 0% instances), SYM (14; 0% instances).
NOUN
20577 NOUN tokens (69% of all NOUN
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which NOUN
and Definite
co-occurred: Number=Sing (12854; 62%), Animacy=Inan (11307; 55%).
NOUN
tokens may have the following values of Definite
:
Def
(17130; 83% of non-emptyDefinite
): taldeak, partidua, taldea, ostean, gobernuak, aurretik, aukera, garaipena, herrian, igandeanInd
(3447; 17% of non-emptyDefinite
): behar, nahi, uste, ezin, urte, ondorioz, aldiz, espero, lagun, ahalEMPTY
(9136): euskal, talde, lan, jokalari, estatu, alderdi, urte, egun, partidu, aukera
Paradigm talde | Ind | Def |
---|---|---|
Animacy=Inan|Case=Abl|Number=Sing | taldetik | |
Animacy=Inan|Case=Abl|Number=Plur | taldeetatik | |
Animacy=Inan|Case=Abs | talde | |
Animacy=Inan|Case=Abs|Number=Sing | taldea, taldekoa | |
Animacy=Inan|Case=Abs|Number=Plur | taldeak, taldeok | |
Animacy=Inan|Case=All|Number=Sing | taldera | |
Animacy=Inan|Case=All|Number=Plur | taldeetara | |
Animacy=Inan|Case=Com|Number=Sing | taldearekin | |
Animacy=Inan|Case=Dat|Number=Sing | taldeari | |
Animacy=Inan|Case=Dat|Number=Plur | taldeei | |
Animacy=Inan|Case=Erg | taldek | |
Animacy=Inan|Case=Erg|Number=Sing | taldeak | |
Animacy=Inan|Case=Erg|Number=Plur | taldeek, taldekoek, taldeok | |
Animacy=Inan|Case=Ess | taldetzat | |
Animacy=Inan|Case=Gen | talderen | |
Animacy=Inan|Case=Gen|Number=Sing | taldearen | |
Animacy=Inan|Case=Gen|Number=Plur | taldeen | |
Animacy=Inan|Case=Ine | taldetan | |
Animacy=Inan|Case=Ine|Number=Sing | taldean, taldearengan | |
Animacy=Inan|Case=Ine|Number=Plur | taldeetan | |
Animacy=Inan|Case=Ins | Taldez | |
Animacy=Inan|Case=Loc|Number=Sing | taldeko, talderako | |
Animacy=Inan|Case=Loc|Number=Plur | taldeetako | |
Animacy=Inan|Case=Par | talderik | |
Case=Abs|Number=Sing | Taldea | |
Case=Abs|Number=Plur | Taldeak | |
Case=Gen|Number=Sing | Taldearen |
PROPN
6191 PROPN tokens (63% of all PROPN
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which PROPN
and Definite
co-occurred: Number=Sing (6104; 99%).
PROPN
tokens may have the following values of Definite
:
Def
(6183; 100% of non-emptyDefinite
): Europako, Espainiako, Frantziako, Israelgo, Nafarroako, EEBBetako, Jugoslaviako, Miarritzek, EAJk, OsasunakInd
(8; 0% of non-emptyDefinite
): Briverekiko, EEBBetarako, Eurokoparako, Klodenekiko, Madrilen, OLBKren, Txetxeniarako, VueltarakoEMPTY
(3677): Jose, Juan, Euskal, Iñaki, Luis, Joseba, David, Mikel, Jean, Miguel
Paradigm EEBB | Ind | Def |
---|---|---|
Case=Abs|Number=Plur | EEBBetarako | EEBBak, EEBB |
Case=All|Number=Plur | EEBBetara | |
Case=Erg|Number=Plur | EEBBek, EEBB-EK | |
Case=Gen|Number=Plur | EEBBen | |
Case=Ine|Number=Plur | EEBBetan | |
Case=Loc|Number=Sing | EEBBetako | |
Case=Loc|Number=Plur | EEBBetako, EEBBetarako |
Definite
seems to be lexical feature of PROPN
. 100% lemmas (1898) occur only with one value of Definite
.
ADJ
4579 ADJ tokens (65% of all ADJ
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which ADJ
and Definite
co-occurred: NumType=EMPTY (4579; 100%), Case=Abs (2923; 64%), Number=Sing (2811; 61%).
ADJ
tokens may have the following values of Definite
:
Def
(3852; 84% of non-emptyDefinite
): handia, ona, bakarra, nagusia, zaila, berria, onena, garrantzitsua, nagusiak, osoaInd
(727; 16% of non-emptyDefinite
): nagusi, bizi, handirik, handiz, zuzen, ziur, berri, indartsu, sendo, ageriEMPTY
(2442): azken, lehen, bigarren, hirugarren, lehenengo, hurrengo, inolako, nazioarteko, laugarren, berri
Paradigm handi | Ind | Def |
---|---|---|
Case=Abs | handi | |
Case=Abs|Degree=Cmp|Number=Sing | handiagoa, haundiagoa | |
Case=Abs|Degree=Cmp|Number=Plur | handiagoak, haundiagoak | |
Case=Abs|Degree=Sup|Number=Sing | handiena | |
Case=Abs|Degree=Sup|Number=Plur | handienak | |
Case=Abs|Degree=Abs | handiegi | |
Case=Abs|Degree=Abs|Number=Sing | handiegia | |
Case=Abs|Number=Sing | handia, haundia | |
Case=Abs|Number=Plur | handiak, handikoak | |
Case=All|Degree=Cmp|Number=Sing | handiagora | |
Case=Ben | handirentzat | |
Case=Cau|Number=Sing | handiagatik | |
Case=Cau|Number=Plur | handiengatik | |
Case=Com|Degree=Cmp|Number=Sing | handiagoarekin | |
Case=Com|Number=Sing | handiarekin | |
Case=Com|Number=Plur | handiekin | |
Case=Erg|Degree=Sup|Number=Plur | handienek | |
Case=Erg|Number=Plur | handiek | |
Case=Ess | handikotzat | |
Case=Gen|Degree=Cmp|Number=Sing | handiagoaren | |
Case=Gen|Number=Plur | handien | |
Case=Ine | handitan | |
Case=Ine|Degree=Cmp|Number=Plur | handiagoetan | |
Case=Ine|Number=Sing | handian | |
Case=Ine|Number=Plur | handietan | |
Case=Ins | handiz | |
Case=Ins|Number=Sing | handienaz | |
Case=Loc | handitarako | |
Case=Loc|Degree=Cmp|Number=Sing | handiagoko | |
Case=Loc|Degree=Sup|Number=Sing | handieneko | |
Case=Loc|Degree=Sup|Number=Plur | handienetariko | |
Case=Loc|Number=Sing | handiko | |
Case=Loc|Number=Plur | handietako | |
Case=Par | handirik, haundirik |
DET
2965 DET tokens (72% of all DET
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which DET
and Definite
co-occurred: Case=Abs (1549; 52%), Number=Sing (1539; 52%).
DET
tokens may have the following values of Definite
:
Def
(2339; 79% of non-emptyDefinite
): hori, hau, horretan, batzuk, guztiak, honetan, horiek, horrek, bera, hauekInd
(626; 21% of non-emptyDefinite
): zer, gehiago, asko, ugari, horretarako, gutxi, beste, gehiegi, zein, askokEMPTY
(1174): bere, beste, hainbat, zenbait, berean, horren, zer, milioi, zein, horien
Paradigm hori | Ind | Def |
---|---|---|
Case=Abl|Number=Sing | horretatik | |
Case=Abs | horretarako | |
Case=Abs|Number=Sing | hori, horixe, horretakoa | |
Case=Abs|Number=Plur | horretakoak | |
Case=All|Number=Sing | horretara | |
Case=Cau|Number=Sing | horregatik, Horrexegatik, horrengatik | |
Case=Com|Number=Sing | horrekin, horrexekin | |
Case=Dat|Number=Sing | horri | |
Case=Erg|Number=Sing | horrek | |
Case=Gen|Number=Sing | horren | |
Case=Ine|Number=Sing | horretan, horretantxe, horrexetan | |
Case=Ins|Number=Sing | horrez, horretaz | |
Case=Loc|Number=Sing | horretako, horretarako |
VERB
1623 VERB tokens (9% of all VERB
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which VERB
and Definite
co-occurred: Mood=EMPTY (1553; 96%), Number[abs]=EMPTY (1553; 96%), Person[abs]=EMPTY (1553; 96%), Aspect=EMPTY (1551; 96%), Case=Abs (999; 62%), VerbForm=Fin (818; 50%).
VERB
tokens may have the following values of Definite
:
Def
(646; 40% of non-emptyDefinite
): egina, hasia, litekeena, egiteak, azpimarratzekoa, dagokionean, irekia, izana, armatuak, bateratuaInd
(977; 60% of non-emptyDefinite
): egiteko, emateko, izateko, eginez, erabiliz, lortzeko, irabazteko, jokatzeko, izan, jakitekoEMPTY
(17444): egin, izan, esan, egiten, du, da, izango, eman, dago, joan
Paradigm izan | Ind | Def |
---|---|---|
Aspect=Prog|Case=Abs|Mood=Ind|Number=Sing|Number[abs]=Plur|Person[abs]=3|VerbForm=Fin | direna | |
Aspect=Prog|Case=Abs|Mood=Ind|Number=Plur|Number[abs]=Plur|Person[abs]=3|VerbForm=Fin | zirenak | |
Aspect=Prog|Case=Dat|Mood=Ind|Number=Plur|Number[abs]=Plur|Person[abs]=3|VerbForm=Fin | zirenei | |
Aspect=Prog|Case=Gen|Mood=Ind|Number=Sing|Number[abs]=Sing|Person[abs]=3|VerbForm=Fin | zenaren | |
Aspect=Prog|Case=Ins|Number=Plur|VerbForm=Fin | denez | |
Case=Abs|Number=Sing|VerbForm=Part | izana | |
Case=Abs|VerbForm=Fin | izateko | |
Case=Abs|VerbForm=Part | izan | |
Case=Dat|Number=Sing|VerbForm=Fin | izateari | |
Case=Erg|Number=Sing|VerbForm=Fin | izateak | |
Case=Erg|Number=Sing|VerbForm=Part | izanak | |
Case=Gen|Number=Sing|VerbForm=Part | izanaren | |
Case=Ins|Number=Sing|VerbForm=Fin | izateaz | |
Case=Ins|VerbForm=Part | izanez | |
Case=Par|VerbForm=Part | izanik |
ADP
1497 ADP tokens (80% of all ADP
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which ADP
and Definite
co-occurred: Animacy=EMPTY (936; 63%), Number=Sing (898; 60%).
ADP
tokens may have the following values of Definite
:
Def
(1355; 91% of non-emptyDefinite
): arabera, artean, aurka, arteko, inguruan, kontra, aurrean, aurkako, buruz, zeharInd
(142; 9% of non-emptyDefinite
): gabe, gabeko, kontra, aurka, artean, ezean, bezala, gain, gisan, kanpoEMPTY
(378): arte, gisa, bezala, gainean, arteko, aurrean, inguru, artean, aurrera, barruan
Paradigm arte | Ind | Def |
---|---|---|
Animacy=Anim|Case=Ine|Number=Plur | artean | |
Animacy=Anim|Case=Loc|Number=Sing | arteko | |
Animacy=Anim|Case=Loc|Number=Plur | arteko | |
Animacy=Inan|Case=Abs|Number=Sing | arte | |
Animacy=Inan|Case=Ine | artean | |
Animacy=Inan|Case=Ine|Number=Plur | artean | |
Animacy=Inan|Case=Loc|Number=Sing | arteko | |
Animacy=Inan|Case=Loc|Number=Plur | arteko | |
Case=Abs | arte | |
Case=Abs|Number=Sing | arte | |
Case=Abs|Number=Plur | arte | |
Case=Ine | artean | |
Case=Ine|Degree=Sup|Number=Plur | artean | |
Case=Ine|Number=Sing | artean | |
Case=Ine|Number=Plur | artean | |
Case=Ine|Number=Plur|Person=1 | artean | |
Case=Ine|Number=Plur|Person=2 | artean | |
Case=Ine|Number=Plur|Person=3 | artean | |
Case=Loc | arteko | |
Case=Loc|Degree=Sup|Number=Plur | arteko | |
Case=Loc|Number=Sing | arteko | |
Case=Loc|Number=Plur | arteko | |
Case=Loc|Number=Plur|Person=3 | arteko |
AUX
242 AUX tokens (2% of all AUX
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which AUX
and Definite
co-occurred: Aspect=EMPTY (218; 90%), VerbForm=Fin (203; 84%), Person[abs]=3 (167; 69%), Mood=Ind (166; 69%), Number[erg]=EMPTY (132; 55%), Person[erg]=EMPTY (132; 55%), Number[abs]=Sing (128; 53%).
AUX
tokens may have the following values of Definite
:
Def
(220; 91% of non-emptyDefinite
): izana, dena, duena, dutenak, zuena, direnak, izateaz, dituztenak, dutena, zenaInd
(22; 9% of non-emptyDefinite
): izateko, izanik, izan, duenik, dutenen, egon, egoteko, izanezEMPTY
(12306): da, zuen, zen, du, dira, izan, dute, zuten, ziren, ditu
PRON
207 PRON tokens (26% of all PRON
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which PRON
and Definite
co-occurred: PronType=EMPTY (207; 100%), Case=Abs (126; 61%).
PRON
tokens may have the following values of Definite
:
Def
(27; 13% of non-emptyDefinite
): neure, norberak, neu, norberaren, geu, geure, zenbaitzuk, Geuregan, Geuri, NeukInd
(180; 87% of non-emptyDefinite
): zerbait, ezer, inork, inor, elkarrekin, norbait, elkar, elkarri, norbaitek, zertxobaitEMPTY
(578): gure, nire, nik, euren, guk, ni, zure, gu, nork, beraiek
Definite
seems to be lexical feature of PRON
. 100% lemmas (17) occur only with one value of Definite
.
ADV
27 ADV tokens (1% of all ADV
tokens) have a non-empty value of Definite
.
ADV
tokens may have the following values of Definite
:
Def
(18; 67% of non-emptyDefinite
): samarra, adinakoa, atzokoa, atzokoan, atzokoaren, aurtengoa, aurtengoak, aurtengora, egungoak, gaurkoaInd
(9; 33% of non-emptyDefinite
): seguruenik, gaurko, betirako, biharko, lehenbiziko, samarEMPTY
(5225): atzo, oso, gaur, orain, ondoren, gero, hala, bertan, beti, ondo
Paradigm gaur | Ind | Def |
---|---|---|
gaurko | ||
Number=Sing | gaurkoa |
NUM
19 NUM tokens (1% of all NUM
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which NUM
and Definite
co-occurred: NumType=Card (19; 100%).
NUM
tokens may have the following values of Definite
:
Ind
(19; 100% of non-emptyDefinite
): bana, 16na, 21na, 31na, banarekin, bederaEMPTY
(3547): bat, bi, hiru, batean, baten, batek, lau, batez, bost, sei
SYM
14 SYM tokens (93% of all SYM
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which SYM
and Definite
co-occurred: Number=Sing (13; 93%), Case=Abs (10; 71%), Animacy=EMPTY (9; 64%).
SYM
tokens may have the following values of Definite
:
Def
(14; 100% of non-emptyDefinite
): cm-ko, kg, kv, m, m., cm, kmEMPTY
(1): Kw
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite
:
NOUN –[nmod]–> PROPN (1282; 52%),
NOUN –[conj]–> NOUN (481; 55%),
ADJ –[nsubj]–> NOUN (204; 67%),
PROPN –[nmod]–> PROPN (68; 66%),
NOUN –[nsubj]–> DET (56; 65%),
ADJ –[conj]–> ADJ (52; 53%),
PROPN –[appos]–> PROPN (50; 66%),
PROPN –[appos]–> NOUN (37; 64%),
NOUN –[conj]–> PROPN (32; 53%),
ADJ –[nsubj]–> DET (31; 79%).