home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Slovenian-SSJ: POS Tags: ADJ

There are 3879 ADJ lemmas (23%), 8176 ADJ types (25%) and 15062 ADJ tokens (11%). Out of 16 observed tags, the rank of ADJ is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent ADJ lemmas: drug, velik, nov, prvi, slovenski, dober, sam, evropski, star, zadnji

The 10 most frequent ADJ types: drugi, prvi, mogoče, druge, sam, novo, drugih, nove, različnih, veliko

The 10 most frequent ambiguous lemmas: mlad (ADJ 72, NOUN 1), pravi (ADJ 66, NOUN 2), dolg (ADJ 60, NOUN 7), svet (NOUN 133, ADJ 16), moški (NOUN 27, ADJ 14), poceni (ADJ 14, ADV 1), gost (NOUN 18, ADJ 12), razen (ADJ 11, ADP 8, CCONJ 1), gol (ADJ 7, NOUN 2), peti (ADJ 7, VERB 5)

The 10 most frequent ambiguous types: mogoče (ADJ 66, ADV 6), sam (ADJ 50, PART 7), veliko (DET 88, ADJ 36), pravi (VERB 33, ADJ 24), jasno (ADJ 21, ADV 8), dobro (ADV 49, ADJ 22, NOUN 2), težko (ADJ 21, ADV 19), prihodnje (ADJ 17, ADV 1), lepo (ADJ 15, ADV 13), delovno (ADJ 8, ADV 1)

Morphology

The form / lemma ratio of ADJ is 2.107760 (the average of all parts of speech is 1.892155).

The 1st highest number of forms (29) was observed with the lemma “velik”: največja, največje, največjega, največjem, največji, največjih, največjim, največjima, največjo, tavelzga, velik, velika, velike, velikega, velikem, velikemu, veliki, velikih, velikim, velikimi, veliko, večja, večje, večjega, večjem, večji, večjih, večjimi, večjo.

The 2nd highest number of forms (24) was observed with the lemma “dober”: boljša, boljše, boljšega, boljšem, boljši, boljših, dober, dobra, dobre, dobrega, dobrem, dobri, dobrih, dobrim, dobrimi, dobro, najboljša, najboljše, najboljšega, najboljšem, najboljši, najboljših, najboljšim, najboljšo.

The 3rd highest number of forms (24) was observed with the lemma “majhen”: majhen, majhna, majhne, majhnega, majhnemu, majhni, majhnih, majhnim, majhnimi, majhno, manjša, manjše, manjšem, manjšemu, manjši, manjših, manjšim, manjšimi, manjšo, mejhen, najmanjša, najmanjše, najmanjši, najmanjših.

ADJ occurs with 8 features: Case (15062; 100% instances), Gender (15062; 100% instances), Number (15062; 100% instances), Degree (14379; 95% instances), Definite (2074; 14% instances), VerbForm (1931; 13% instances), Poss (386; 3% instances), NumType (315; 2% instances)

ADJ occurs with 21 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Mult, NumType=Ord, Number=Dual, Number=Plur, Number=Sing, Poss=Yes, VerbForm=Part

ADJ occurs with 262 feature combinations. The most frequent feature combination is Case=Nom|Degree=Pos|Gender=Fem|Number=Sing (1031 tokens). Examples: sama, velika, slovenska, nova, edina, stara, lepa, prava, dobra, primerna

Relations

ADJ nodes are attached to their parents using 15 different relations: amod (11783; 78% instances), root (1048; 7% instances), conj (817; 5% instances), obl (280; 2% instances), acl (260; 2% instances), advcl (168; 1% instances), ccomp (166; 1% instances), parataxis (145; 1% instances), xcomp (136; 1% instances), nsubj (135; 1% instances), obj (50; 0% instances), csubj (41; 0% instances), fixed (18; 0% instances), iobj (12; 0% instances), nmod (3; 0% instances)

Parents of ADJ nodes belong to 11 different parts of speech: NOUN (11720; 78% instances), VERB (1064; 7% instances), (1048; 7% instances), ADJ (824; 5% instances), PROPN (261; 2% instances), PRON (72; 0% instances), DET (29; 0% instances), ADV (20; 0% instances), NUM (13; 0% instances), X (10; 0% instances), INTJ (1; 0% instances)

10999 (73%) ADJ nodes are leaves.

1644 (11%) ADJ nodes have one child.

447 (3%) ADJ nodes have two children.

1972 (13%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 11.

Children of ADJ nodes are attached using 24 different relations: punct (2381; 19% instances), cop (1911; 15% instances), advmod (1859; 15% instances), nsubj (1086; 9% instances), obl (1080; 9% instances), conj (851; 7% instances), cc (645; 5% instances), mark (569; 5% instances), aux (536; 4% instances), nmod (261; 2% instances), case (257; 2% instances), parataxis (224; 2% instances), csubj (220; 2% instances), obj (170; 1% instances), advcl (167; 1% instances), ccomp (73; 1% instances), nummod (59; 0% instances), amod (25; 0% instances), acl (17; 0% instances), cc:preconj (13; 0% instances), discourse (10; 0% instances), xcomp (8; 0% instances), expl (4; 0% instances), appos (1; 0% instances)

Children of ADJ nodes belong to 16 different parts of speech: AUX (2447; 20% instances), PUNCT (2381; 19% instances), NOUN (2068; 17% instances), ADV (1216; 10% instances), ADJ (824; 7% instances), CCONJ (774; 6% instances), VERB (734; 6% instances), SCONJ (568; 5% instances), PART (476; 4% instances), DET (270; 2% instances), ADP (238; 2% instances), PRON (186; 1% instances), PROPN (166; 1% instances), NUM (72; 1% instances), X (4; 0% instances), INTJ (3; 0% instances)