DET

This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

home sl/pos issue tracker

`DET`: determiner

Definition

Determiners are words that modify nouns or noun phrases and express the reference of the noun phrase in context. That is, a determiner may indicate whether the noun is referring to a definite or indefinite element of a class, to a closer or more distant element, to an element belonging to a specified person or thing, to a particular number or quantity, etc.

The traditional grammar of Slovenian does not define determiners as a separate word class. Instead, words that perform the syntactic function of determiners are either categorizied as adverbs (nekaj “some”, veliko “a lot of”, dovolj “enough of” etc.) or pronouns (ta “this”, ves “all”, moj “my”, vsak “each” etc.), regardless of whether they are used as attributives (To.DET besedilo je nerazumljivo. “This text is incomprehensible.”) or substantives (To.PRON sem že slišal. “I have heard this before.”).

Conversion from JOS

Since JOS morphosyntactic specifications do not distinguish substantive and attributive pronouns or quantifying and other adverbs, the conversion is done based on syntactic information. The pronouns modifying a noun are thus marked as DET, otherwise they are marked as PRON. Similarly, the list of adverbs modifying a noun was manually validated to define a closed set of quantifying adverbs marked as DET.

Examples

njegov “his”, njen “her”, naš “our”, njihov “their”, _moj “my”, _vaš “your” etc. (JOS possessive pronouns)
ta “this”, tisti “that”, takšen “such”, tak “such”, _tolikšen “so big” etc. (JOS demonstrative pronouns)
ves “all”, vsak “each”, oba “both”, vsakršen “any” (JOS general pronouns)
svoj “one’s own” (JOS reflexive pronouns)
nekateri “some”, nek “some kind”, isti “identical”, enak “same”, mnog “many” (JOS indefinite pronouns)
kakšen “what kind”, kateri “what type”, čigav “whose” (JOS interrogative pronouns)
noben “no one”, nikakršen “no kind”, nič “nothing” (JOS negative pronouns)
kakršenkoli “any kind of”, katerikoli “any type of”, čigar “whose” (JOS relative pronouns)
nekaj “some”, več “more”, veliko “a lot of”, dovolj “enough of”, pol “half of”, malo “little of” (JOS adverbs)

Treebank Statistics (UD_Slovenian)

There are 63 DET lemmas (0%), 289 DET types (1%) and 3310 DET tokens (2%). Out of 16 observed tags, the rank of DET is: 8 in number of lemmas, 7 in number of types and 13 in number of tokens.

The 10 most frequent DET lemmas: ta, svoj, ves, njegov, nekaj, naš, njen, več, njihov, vsak

The 10 most frequent DET types: nekaj, ta, več, svoje, tem, te, svojo, vse, to, vseh

The 10 most frequent ambiguous lemmas: ta (PRON 807, DET 552), svoj (DET 456, PRON 23), ves (DET 289, PRON 198, ADV 1), njegov (DET 244, PRON 6), nekaj (DET 166, PRON 65, ADV 19), naš (DET 141, PRON 3), njen (DET 135, PRON 4), več (DET 125, PART 79, ADV 54), njihov (DET 117, PRON 2), vsak (DET 117, PRON 21)

The 10 most frequent ambiguous types: nekaj (DET 147, PRON 58, ADV 17), ta (DET 85, PRON 31), več (DET 121, PART 79, ADV 50), svoje (DET 107, PRON 12), tem (PRON 177, DET 96, NOUN 1, ADV 1), te (DET 78, PRON 23), svojo (DET 83, PRON 4), vse (PRON 88, DET 74, ADV 38), to (PRON 273, DET 63), vseh (DET 61, PRON 8)

nekaj
- DET 147: Simpatična uradna stran z nekaj prav zanimivimi rubrikami .
- PRON 58: A nekaj v meni mi ni dovolilo , da bi zaploskala .
- ADV 17: Za nemoteno izvajanje zakona manjka nekaj več kot milijardo tolarjev .
ta
- DET 85: Saj država ni nekakšen moloh , ki nemo gleda na ta narod .
- PRON 31: Megan je pogledala proti Molly , ta pa je zmignila z rameni .
več
- DET 121: Za zadovoljitev pomembne želje so pripravljeni vložiti več truda .
- PART 79: Tragika te ženske : na koncu ji noben zdravnik ni več verjel .
- ADV 50: Po svojem duhu je več kot vsa snovnost .
svoje
- DET 107: Svetujemo : Bodite pozitivni do sebe in svoje okolice .
- PRON 12: Dunaj pa imenujejo tudi drugi narodi po svoje .
tem
- PRON 177: S tem nikakor ni zmanjšan pomen enotne volje državljanov Slovenije .
- DET 96: V tem trenutku je smiselno govoriti o prisilni poravnavi .
- NOUN 1: Dijaki v njem prikažejo svoje poznavanje obravnavanih tem iz književnosti .
- ADV 1: Čim boljše bo v Sloveniji znanje tujih jezikov , tem bolje se bo Slovenija sporazumevala s svetom .
te
- DET 78: Tragika te ženske : na koncu ji noben zdravnik ni več verjel .
- PRON 23: Če te ne bi bilo , ne bi pomagal niti izredno ugoden splet okoliščin .
svojo
- DET 83: • Prepoznajte svojo sitost .
- PRON 4: Počasi se je pomikala med vrstami in iskala svojo .
vse
- PRON 88: Ko vse skupaj zavre , vložimo fileje in jih dušimo dobrih 10 minut .
- DET 74: Na vse prireditve je vstop prost .
- ADV 38: Slišim namreč vse več glasov o nepravilnostih .
to
- PRON 273: Vsaj to v ničemer ne more biti sporno .
- DET 63: Za to pomoč smo jim seveda tudi plačali .
vseh
- DET 61: Te preiskovalne metode uporabljajo v vseh vrstah medicine .
- PRON 8: To je popoln poraz nekega koncepta , popoln poraz vseh nas .

Morphology

The form / lemma ratio of DET is 4.587302 (the average of all parts of speech is 1.894262).

The 1st highest number of forms (12) was observed with the lemma “tisti”: tist, tista, tiste, tistega, tistem, tistemu, tisti, tistih, tistim, tistimi, tistmu, tisto.

The 2nd highest number of forms (11) was observed with the lemma “naš”: naš, naša, naše, našega, našem, našemu, naši, naših, našim, našimi, našo.

The 3rd highest number of forms (11) was observed with the lemma “njihov”: njihov, njihova, njihove, njihovega, njihovem, njihovemu, njihovi, njihovih, njihovim, njihovimi, njihovo.

DET occurs with 10 features: sl-feat/PronType (3310; 100% instances), sl-feat/Case (2855; 86% instances), sl-feat/Gender (2855; 86% instances), sl-feat/Number (2855; 86% instances), sl-feat/Poss (1260; 38% instances), sl-feat/Number[psor] (804; 24% instances), sl-feat/Person (804; 24% instances), sl-feat/Reflex (456; 14% instances), sl-feat/Degree (455; 14% instances), sl-feat/Gender[psor] (379; 11% instances)

DET occurs with 33 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Masc, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc, Gender[psor]=Neut, Number=Dual, Number=Plur, Number=Sing, Number[psor]=Dual, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes

DET occurs with 384 feature combinations. The most frequent feature combination is Degree=Pos|PronType=Ind (288 tokens). Examples: nekaj, veliko, dovolj, malo, pol, mnogo, par, preveč, dosti, nešteto

Relations

DET nodes are attached to their parents using 1 different relations: sl-dep/det (3310; 100% instances)

Parents of DET nodes belong to 2 different parts of speech: NOUN (3289; 99% instances), PROPN (21; 1% instances)

3187 (96%) DET nodes are leaves.

111 (3%) DET nodes have one child.

12 (0%) DET nodes have two children.

The highest child degree of a DET node is 2.

Children of DET nodes are attached using 6 different relations: sl-dep/advmod (69; 51% instances), sl-dep/mwe (53; 39% instances), sl-dep/cc (5; 4% instances), sl-dep/conj (5; 4% instances), sl-dep/nmod (2; 1% instances), sl-dep/case (1; 1% instances)

Children of DET nodes belong to 7 different parts of speech: SCONJ (48; 36% instances), ADV (46; 34% instances), PART (28; 21% instances), CONJ (5; 4% instances), PRON (5; 4% instances), ADJ (2; 1% instances), ADP (1; 1% instances)

Treebank Statistics (UD_Slovenian-SST)

There are 44 DET lemmas (1%), 171 DET types (3%) and 712 DET tokens (2%). Out of 16 observed tags, the rank of DET is: 11 in number of lemmas, 7 in number of types and 14 in number of tokens.

The 10 most frequent DET lemmas: ta, naš, nek, ves, tak, tisti, moj, kakšen, vsak, svoj

The 10 most frequent DET types: ta, to, te, tem, teh, vse, tega, tej, neko, nekaj

The 10 most frequent ambiguous lemmas: ta (PRON 613, DET 208), naš (DET 47, PRON 6), nek (DET 43, PRON 3), ves (PRON 54, DET 42), tak (DET 33, PRON 16), tisti (PRON 53, DET 32), moj (DET 27, PRON 8), kakšen (DET 26, PRON 20), vsak (DET 22, PRON 7), svoj (DET 21, PRON 4)

The 10 most frequent ambiguous types: ta (DET 79, PRON 28), to (PRON 489, DET 26, X 1), te (PRON 29, DET 21, ADV 21), tem (PRON 29, DET 21), teh (DET 20, PRON 3), vse (PRON 41, DET 17, ADV 7), tega (PRON 31, DET 15), neko (DET 12, PRON 1), nekaj (PRON 29, ADV 23, DET 11), take (DET 11, PRON 4)

ta
- DET 79: eee poleg one ta glavne … poleg one ki je ki je bila najbolj in
- PRON 28: pol pa v oklepaju piše le za ta to pa to vizo lahko trideset dni ostaneš
to
- PRON 489: pol pa v oklepaju piše le za ta to pa to vizo lahko trideset dni ostaneš
- DET 26: pol pa v oklepaju piše le za ta to pa to vizo lahko trideset dni ostaneš
- X 1: mi smo kar eee kolega ni povedal v letošnjem letu intenzivirali obnavljanje in eee eee gradnjo na progi in pri tem opravili s [gap] to [gap] opisano povečanje tovornega prometa navkljub
te
- PRON 29: edino to da se nisi bal da te bo kdo počil kje
- DET 21: ja pa so vam pol tudi gnoj dali za te njive pa ?
- ADV 21: no te se pa lahko v sredo dobiva ne ?
tem
- PRON 29: [gap] s tem reče nesposobna si [speaker:laughter]
- DET 21: ni govora v tem primeru o otrocih ne
teh
- DET 20: glej točko a zrcali geotrikotnik imaš glej koliko imaš teh trikotnikov
- PRON 3: to je brat od tega [name:personal] pa od teh
vse
- PRON 41: res no vse vse jaz ne vem česa vse ni imel … ja ja ja
- DET 17: ne ja kaj pa je te ja kaj pa more more saj to eee vse hudo mine ne eee ja
- ADV 7: [gap] tole so vse karavanke in tukaj je en sicer ne to je že pohorje zdaj pa [gap]
tega
- PRON 31: sosed ima tam par sodov tega [gap]
- DET 15: ja prositi za tega duha slavljenja ne
neko
- DET 12: doživetje dogodivščino neko ljubezensko srečanje
- PRON 1: ki bi rekel ti konfliktni interesi tudi ustvarjajo potem to neko nek vladajoči razred ne iz katerega pravzaprav ko enkrat prideš vanj ne moreš več izpasti
nekaj
- PRON 29: ja zdaj n [gap] pravijo da bo še nekaj menda država dala ali kaj ne
- ADV 23: ali boš buljil skozi okoli ali boš pa tam nekaj telefon drkal
- DET 11: no v kranju pa imamo nekaj malega dežja in šest stopinj
take
- DET 11: bolje take vaje kakor pol leta [gap]
- PRON 4: samo špago sem mogel omotati pa take

Morphology

The form / lemma ratio of DET is 3.886364 (the average of all parts of speech is 1.575031).

The 1st highest number of forms (10) was observed with the lemma “svoj”: svoj, svoja, svoje, svojega, svojem, svojemu, svojih, svojim, svojimi, svojo.

The 2nd highest number of forms (9) was observed with the lemma “ta”: ta, te, tega, teh, tej, tem, temi, ti, to.

The 3rd highest number of forms (9) was observed with the lemma “ves”: ves, vsa, vse, vsega, vseh, vsem, vsemi, vsi, vso.

DET occurs with 9 features: sl-feat/PronType (712; 100% instances), sl-feat/Case (660; 93% instances), sl-feat/Gender (660; 93% instances), sl-feat/Number (660; 93% instances), sl-feat/Number[psor] (113; 16% instances), sl-feat/Person (113; 16% instances), sl-feat/Poss (113; 16% instances), sl-feat/Degree (52; 7% instances), sl-feat/Gender[psor] (9; 1% instances)

DET occurs with 30 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Masc, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc, Number=Dual, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot

DET occurs with 178 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing|PronType=Dem (42 tokens). Examples: ta, tak, tale, tisti, oni, takšen

Relations

DET nodes are attached to their parents using 3 different relations: sl-dep/det (710; 100% instances), sl-dep/dobj (1; 0% instances), sl-dep/reparandum (1; 0% instances)

Parents of DET nodes belong to 9 different parts of speech: NOUN (642; 90% instances), ADJ (30; 4% instances), PROPN (17; 2% instances), PRON (10; 1% instances), NUM (4; 1% instances), X (4; 1% instances), VERB (3; 0% instances), ADV (1; 0% instances), DET (1; 0% instances)

684 (96%) DET nodes are leaves.

23 (3%) DET nodes have one child.

2 (0%) DET nodes have two children.

3 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 3.

Children of DET nodes are attached using 6 different relations: sl-dep/advmod (15; 42% instances), sl-dep/reparandum (15; 42% instances), sl-dep/discourse:filler (3; 8% instances), sl-dep/amod (1; 3% instances), sl-dep/cc (1; 3% instances), sl-dep/conj (1; 3% instances)

Children of DET nodes belong to 9 different parts of speech: ADV (13; 36% instances), X (9; 25% instances), PRON (5; 14% instances), INTJ (3; 8% instances), PART (2; 6% instances), ADJ (1; 3% instances), ADP (1; 3% instances), CONJ (1; 3% instances), DET (1; 3% instances)

DET in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]

DET: determiner

Definition

Conversion from JOS

Examples

Treebank Statistics (UD_Slovenian)

Morphology

Relations

Treebank Statistics (UD_Slovenian-SST)

Morphology

Relations

`DET`: determiner