Statistics of DET in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Slovenian-SSJ: POS Tags: `DET`

There are 68 DET lemmas (0%), 357 DET types (1%) and 9352 DET tokens (4%). Out of 17 observed tags, the rank of DET is: 8 in number of lemmas, 8 in number of types and 10 in number of tokens.

The 10 most frequent DET lemmas: ta, ves, svoj, kateri, njegov, več, nekaj, tisti, njen, vsak

The 10 most frequent DET types: to, tem, vse, več, ta, tega, nekaj, svoje, veliko, te

The 10 most frequent ambiguous lemmas: ves (DET 894, ADV 1), več (DET 325, PART 123), nekaj (DET 290, PRON 115), isti (DET 78, PRON 1), pol (DET 49, NOUN 2), mnogo (DET 15, ADV 1), oni (DET 13, ADJ 2), par (NOUN 23, DET 5), četrt (DET 2, NOUN 1)

The 10 most frequent ambiguous types: to (DET 612, X 6), tem (DET 468, NOUN 5, ADV 2), vse (DET 290, ADV 73), več (DET 306, PART 123), nekaj (DET 261, PRON 97), veliko (DET 158, ADJ 59), te (DET 140, PRON 25), malo (DET 85, ADJ 8), ti (PRON 44, DET 29), pol (DET 48, ADV 3)

to
- DET 612: Vsaj to v ničemer ne more biti sporno .
- X 6: Push to Talk ali po slovensko Pritisni in govori .
tem
- DET 468: S tem nikakor ni zmanjšan pomen enotne volje državljanov Slovenije .
- NOUN 5: Od tem , da je treba pridobiti pisno soglasje za reeksport .
- ADV 2: Čim prej pridem domov , tem bolje . «
vse
- DET 290: Ne energija “ biti vse ali nič “ , ampak preprosto energija biti .
- ADV 73: Slišim namreč vse več glasov o nepravilnostih .
več
- DET 306: Za zadovoljitev pomembne želje so pripravljeni vložiti več truda .
- PART 123: Tragika te ženske : na koncu ji noben zdravnik ni več verjel .
nekaj
- DET 261: Simpatična uradna stran z nekaj prav zanimivimi rubrikami .
- PRON 97: A nekaj v meni mi ni dovolilo , da bi zaploskala .
veliko
- DET 158: Posebej kadar se jih nabere veliko , tako kot Janezu Janši .
- ADJ 59: Kralj Alfonz VI. je prispeval veliko donacijo za novo cerkev .
te
- DET 140: Če te ne bi bilo , ne bi pomagal niti izredno ugoden splet okoliščin .
- PRON 25: Če mož pije in te pretepa , se bo treba z
malo
- DET 85: Samo malo , malo , malo . «
- ADJ 8: Vse bolj pogumno pa se malo gospodarstvo razvija tudi na področju negospodarstva in prav tu je slutiti nadaljnji razvoj .
ti
- PRON 44: » Verjamem ti , « je mehko rekla .
- DET 29: V zadnjih 20 letih so se ti cilji stalno spreminjali .
pol
- DET 48: V zadnjih petih urah sva se premaknila za slabe pol milje .
- ADV 3: — Ma , morš izpast totalno navdušen , sam pol pa vseen zajebat .

Morphology

The form / lemma ratio of DET is 5.250000 (the average of all parts of speech is 1.932008).

The 1st highest number of forms (13) was observed with the lemma “tisti”: tist, tista, tiste, tistega, tistem, tistemu, tistga, tisti, tistih, tistim, tistimi, tistmu, tisto.

The 2nd highest number of forms (12) was observed with the lemma “kakšen”: kakšen, kakšenmu, kakšna, kakšne, kakšnega, kakšnem, kakšnemu, kakšni, kakšnih, kakšnim, kakšnimi, kakšno.

The 3rd highest number of forms (12) was observed with the lemma “naš”: naš, naša, naše, našega, našem, našemu, naši, naših, našim, našima, našimi, našo.

DET occurs with 11 features: PronType (9350; 100% instances), Case (7978; 85% instances), Gender (7978; 85% instances), Number (7978; 85% instances), Poss (2294; 25% instances), Number[psor] (1479; 16% instances), Person (1476; 16% instances), Reflex (818; 9% instances), Gender[psor] (702; 8% instances), Degree (2; 0% instances), Typo (1; 0% instances)

DET occurs with 32 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Pos, Gender=Fem, Gender=Masc, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc, Gender[psor]=Neut, Number=Dual, Number=Plur, Number=Sing, Number[psor]=Dual, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes

DET occurs with 477 feature combinations. The most frequent feature combination is PronType=Ind (1279 tokens). Examples: več, nekaj, veliko, manj, dovolj, malo, pol, preveč, največ, nekatere

Relations

DET nodes are attached to their parents using 21 different relations: det (5878; 63% instances), obl (1133; 12% instances), nsubj (871; 9% instances), advmod (568; 6% instances), obj (336; 4% instances), nmod (162; 2% instances), conj (104; 1% instances), root (71; 1% instances), orphan (43; 0% instances), parataxis (42; 0% instances), fixed (34; 0% instances), iobj (28; 0% instances), appos (22; 0% instances), acl (15; 0% instances), ccomp (15; 0% instances), advcl (12; 0% instances), xcomp (9; 0% instances), cc (4; 0% instances), csubj (3; 0% instances), amod (1; 0% instances), dislocated (1; 0% instances)

Parents of DET nodes belong to 14 different parts of speech: NOUN (6176; 66% instances), VERB (2089; 22% instances), ADJ (531; 6% instances), NUM (128; 1% instances), DET (119; 1% instances), ADV (79; 1% instances), (71; 1% instances), PROPN (68; 1% instances), PRON (47; 1% instances), CCONJ (16; 0% instances), ADP (15; 0% instances), PART (8; 0% instances), X (3; 0% instances), AUX (2; 0% instances)

7179 (77%) DET nodes are leaves.

1581 (17%) DET nodes have one child.

358 (4%) DET nodes have two children.

234 (3%) DET nodes have three or more children.

The highest child degree of a DET node is 11.

Children of DET nodes are attached using 28 different relations: case (1195; 37% instances), acl (449; 14% instances), advmod (440; 14% instances), punct (282; 9% instances), fixed (127; 4% instances), cc (109; 3% instances), nmod (97; 3% instances), cop (92; 3% instances), obl (84; 3% instances), nsubj (72; 2% instances), orphan (54; 2% instances), conj (53; 2% instances), det (45; 1% instances), advcl (37; 1% instances), appos (25; 1% instances), parataxis (22; 1% instances), aux (18; 1% instances), mark (18; 1% instances), obj (9; 0% instances), discourse (6; 0% instances), nummod (4; 0% instances), cc:preconj (3; 0% instances), amod (2; 0% instances), csubj (2; 0% instances), dep (1; 0% instances), goeswith (1; 0% instances), iobj (1; 0% instances), vocative (1; 0% instances)

Children of DET nodes belong to 16 different parts of speech: ADP (1158; 36% instances), VERB (442; 14% instances), PUNCT (282; 9% instances), PART (253; 8% instances), NOUN (238; 7% instances), ADV (210; 6% instances), SCONJ (148; 5% instances), CCONJ (127; 4% instances), DET (119; 4% instances), AUX (113; 3% instances), ADJ (89; 3% instances), PRON (31; 1% instances), PROPN (22; 1% instances), NUM (8; 0% instances), X (7; 0% instances), INTJ (2; 0% instances)

Treebank Statistics: UD_Slovenian-SSJ: POS Tags: DET

Morphology

Relations

Treebank Statistics: UD_Slovenian-SSJ: POS Tags: `DET`