This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

ADJ: adjective

Definition

Adjectives are words that typically modify nouns and specify their properties or attributes. They may also function as predicates, as in Suvi on soe ‘The summer is warm’.
Pro-adjectives (e.g. selline ‘such’, niisugune ‘such’, missugune ‘which’) and attributive ordinal numerals (e.g. esimene ‘first’, teine ‘second’) are also labelled ADJ in the Estonian UD annotation.
Attributive or predicative participles, e.g. valvav mees ‘guarding man’, valvatav mees ‘man who is guarded’, möödunud nädal ‘last week’, lõhutud vaas ‘broken vase’, also get the ADJ label.


Treebank Statistics (UD_Estonian)

There are 4591 ADJ lemmas (16%), 8193 ADJ types (16%) and 19421 ADJ tokens (8%). Out of 15 observed tags, the rank of ADJ is: 2 in number of lemmas, 2 in number of types and 5 in number of tokens.

The 10 most frequent ADJ lemmas: suur, uus, suurem, selline, esimene, mõni, iga, sama, hea, viimane

The 10 most frequent ADJ types: kogu, suur, hea, sama, iga, võimalik, suurem, eesti, selline, uue

The 10 most frequent ambiguous lemmas: suur (ADJ 389, NOUN 3), esimene (ADJ 256, PRON 48), mõni (ADJ 235, PRON 9), iga (ADJ 234, PRON 7), sama (ADJ 227, PRON 5), hea (ADJ 223, NOUN 20), viimane (ADJ 206, NOUN 8), mingi (ADJ 197, PRON 3), kogu (ADJ 152, NOUN 19), väike (ADJ 131, NOUN 4)

The 10 most frequent ambiguous types: kogu (ADJ 130, NOUN 9), hea (ADJ 109, NOUN 11), sama (ADJ 99, PRON 1), iga (ADJ 81, PRON 3), esimene (ADJ 55, PRON 9), igal (ADJ 51, PRON 1), mingi (ADJ 58, PRON 1), samal (ADJ 41, PRON 1), vana (ADJ 47, NOUN 6), viimane (ADJ 32, NOUN 2)

Morphology

The form / lemma ratio of ADJ is 1.784579 (the average of all parts of speech is 1.839644).
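
The ratio is the number of distinct word forms (types) divided by the number of distinct lemmas; with the counts reported above, 8193 / 4591 ≈ 1.784579. The following is a minimal sketch of how such counts might be derived from CoNLL-U data, assuming the standard ten-column, tab-separated layout. The helper name `form_lemma_ratio` and the tiny sample are hypothetical, and treating forms as case-sensitive is an assumption (suggested by capitalized forms such as Väikegi in the lists on this page), not a documented property of the script that produced these statistics.

```python
def form_lemma_ratio(conllu_text, upos="ADJ"):
    """Return (n_types, n_lemmas, ratio) for tokens tagged with `upos`."""
    forms, lemmas = set(), set()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence boundaries and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            forms.add(cols[1])   # FORM column, kept case-sensitive (assumption)
            lemmas.add(cols[2])  # LEMMA column
    ratio = len(forms) / len(lemmas) if lemmas else 0.0
    return len(forms), len(lemmas), ratio


# Hypothetical hand-made sample: (ID, FORM, LEMMA, UPOS); other columns are "_".
SAMPLE_ROWS = [
    ("1", "suur", "suur", "ADJ"),
    ("2", "maja", "maja", "NOUN"),
    ("3", "suure", "suur", "ADJ"),
    ("4", "uus", "uus", "ADJ"),
]
SAMPLE = "\n".join(
    "\t".join([i, form, lemma, tag, "_", "_", "_", "_", "_", "_"])
    for i, form, lemma, tag in SAMPLE_ROWS
)

n_types, n_lemmas, ratio = form_lemma_ratio(SAMPLE)
print(n_types, n_lemmas, ratio)
```

In the sample, the three ADJ forms suur, suure and uus map to two lemmas, so the ratio is 3 / 2.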

The highest number of forms (22) was observed with the lemma “mõni”: mõnd, mõnda, mõndagi, mõne, mõned, mõnede, mõnedele, mõnedes, mõnedki, mõnegi, mõneks, mõnel, mõnele, mõnelegi, mõnelgi, mõnelt, mõnes, mõneski, mõnesse, mõnest, mõni, mõnigi.

The 2nd highest number of forms (21) was observed with the lemma “väike”: Väikegi, Väikestele, väike, väike-, väikese, väikesed, väikeseid, väikeseks, väikesel, väikesele, väikeses, väikesesse, väikesest, väikesi, väikest, väikeste, väikestel, väikestesse, väiksed, väikseid, väikses.

The 3rd highest number of forms (18) was observed with the lemma “selline”: selline, sellise, sellised, selliseid, selliseidki, selliseks, sellisel, sellisena, sellises, sellisest, sellist, selliste, sellistega, sellistel, sellistele, sellistelt, sellistes, sellistest.

ADJ occurs with 11 features: Case (16950; 87% instances), Number (16900; 87% instances), Degree (16252; 84% instances), VerbForm (3510; 18% instances), Voice (3507; 18% instances), Tense (3396; 17% instances), NumType (1187; 6% instances), NumForm (1106; 6% instances), PronType (1021; 5% instances), Abbr (41; 0% instances), Hyph (11; 0% instances)

ADJ occurs with 37 feature-value pairs: Abbr=Yes, Case=Abe, Case=Abl, Case=Add, Case=Ade, Case=All, Case=Com, Case=Ela, Case=Ess, Case=Gen, Case=Ill, Case=Ine, Case=Nom, Case=Par, Case=Ter, Case=Tra, Degree=Cmp, Degree=Pos, Degree=Sup, Hyph=Yes, NumForm=Digit, NumForm=Letter, NumForm=Roman, NumType=Ord, Number=Plur, Number=Sing, PronType=Dem, PronType=Ind, PronType=Int, PronType=Int,Rel, PronType=Rel, Tense=Past, Tense=Pres, VerbForm=Part, VerbForm=Sup, Voice=Act, Voice=Pass

ADJ occurs with 269 feature combinations. The most frequent feature combination is Case=Nom|Degree=Pos|Number=Sing (3634 tokens). Examples: suur, võimalik, hea, uus, raske, viimane, oluline, väike, selge, kindel
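
A feature combination such as Case=Nom|Degree=Pos|Number=Sing is the content of the FEATS column in CoNLL-U: `Name=Value` pairs joined by `|`, with `_` standing for no features. The sketch below shows one way the per-feature counts might be tallied. The function names `parse_feats` and `feature_counts` and the example strings are hypothetical, and a comma-joined value such as Int,Rel is kept as a single value here, matching how PronType=Int,Rel is listed as one feature-value pair above.

```python
from collections import Counter


def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def feature_counts(feats_column):
    """Count how many tokens carry each feature name."""
    counts = Counter()
    for feats in feats_column:
        counts.update(parse_feats(feats).keys())
    return counts


# Hypothetical FEATS values for four ADJ tokens.
EXAMPLE_FEATS = [
    "Case=Nom|Degree=Pos|Number=Sing",
    "Case=Gen|Degree=Cmp|Number=Sing",
    "PronType=Int,Rel",
    "_",
]

counts = feature_counts(EXAMPLE_FEATS)
for name, n in sorted(counts.items()):
    # Share of tokens carrying the feature, as in "Case (16950; 87% instances)".
    print(name, n, f"{100 * n / len(EXAMPLE_FEATS):.0f}%")
```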

Relations

ADJ nodes are attached to their parents using 19 different relations: amod (12728; 66% instances), acl (2783; 14% instances), root (1343; 7% instances), conj (1144; 6% instances), xcomp (449; 2% instances), dep (267; 1% instances), ccomp (122; 1% instances), nsubj (117; 1% instances), parataxis (97; 0% instances), acl:relcl (95; 0% instances), dobj (87; 0% instances), nmod (72; 0% instances), advcl (39; 0% instances), nsubj:cop (36; 0% instances), advmod:quant (14; 0% instances), csubj (14; 0% instances), list (9; 0% instances), cc:preconj (4; 0% instances), name (1; 0% instances)

Parents of ADJ nodes belong to 13 different parts of speech: NOUN (14628; 75% instances), VERB (1692; 9% instances), ROOT (1343; 7% instances), ADJ (1018; 5% instances), PROPN (389; 2% instances), PRON (149; 1% instances), NUM (109; 1% instances), ADV (81; 0% instances), SYM (7; 0% instances), AUX (2; 0% instances), ADP (1; 0% instances), INTJ (1; 0% instances), X (1; 0% instances)

13111 (68%) ADJ nodes are leaves.

3020 (16%) ADJ nodes have one child.

962 (5%) ADJ nodes have two children.

2328 (12%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 15.
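
The four child-count figures above partition all ADJ tokens (13111 + 3020 + 962 + 2328 = 19421). As a rough illustration, the sketch below buckets nodes by their number of children using only the ID, HEAD and UPOS values of each token. The function name `child_degree_buckets`, the tuple layout and the two hand-made sentences are assumptions made for this example, and the tags in the sample are illustrative rather than taken from the treebank.

```python
from collections import Counter


def child_degree_buckets(sentences, upos="ADJ"):
    """Bucket nodes tagged `upos` by number of children: 0, 1, 2, and 3 (meaning 3+)."""
    buckets = Counter()
    max_degree = 0
    for sentence in sentences:
        # Count how many tokens in this sentence point at each head ID.
        n_children = Counter(head for _, head, _ in sentence)
        for node_id, _, tag in sentence:
            if tag != upos:
                continue
            degree = n_children[node_id]
            max_degree = max(max_degree, degree)
            buckets[min(degree, 3)] += 1
    return buckets, max_degree


# Hypothetical sentences as (ID, HEAD, UPOS) triples; HEAD 0 marks the root.
SENTENCES = [
    # "Suvi on soe ." with the predicative adjective as root
    [(1, 3, "NOUN"), (2, 3, "VERB"), (3, 0, "ADJ"), (4, 3, "PUNCT")],
    # "suur maja" with the adjective as a leaf modifier
    [(1, 2, "ADJ"), (2, 0, "NOUN")],
]

buckets, max_degree = child_degree_buckets(SENTENCES)
print(dict(buckets), max_degree)
```

Counting children per sentence matters because token IDs restart at 1 in every sentence.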

Children of ADJ nodes are attached using 29 different relations: punct (2795; 17% instances), advmod (2793; 17% instances), nmod (2743; 17% instances), cop (2050; 12% instances), nsubj:cop (1598; 10% instances), conj (1126; 7% instances), cc (909; 5% instances), mark (453; 3% instances), advcl (358; 2% instances), dobj (348; 2% instances), csubj:cop (274; 2% instances), dep (221; 1% instances), parataxis (198; 1% instances), amod (144; 1% instances), xcomp (119; 1% instances), nummod (68; 0% instances), case (63; 0% instances), csubj (63; 0% instances), compound:prt (58; 0% instances), det (38; 0% instances), cc:preconj (35; 0% instances), acl:relcl (30; 0% instances), nsubj (24; 0% instances), discourse (14; 0% instances), appos (7; 0% instances), advmod:quant (6; 0% instances), list (6; 0% instances), vocative (6; 0% instances), aux (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: NOUN (3797; 23% instances), VERB (3268; 20% instances), ADV (3004; 18% instances), PUNCT (2795; 17% instances), ADJ (1018; 6% instances), CONJ (903; 5% instances), PRON (768; 5% instances), PROPN (448; 3% instances), SCONJ (377; 2% instances), NUM (82; 0% instances), ADP (63; 0% instances), INTJ (14; 0% instances), SYM (8; 0% instances), X (2; 0% instances), AUX (1; 0% instances)

