home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Estonian-EDT: POS Tags: ADJ

There are 6366 ADJ lemmas (17%), 12110 ADJ types (17%) and 30841 ADJ tokens (8%). Out of 16 observed tags, the rank of ADJ is: 2 in number of lemmas, 2 in number of types and 5 in number of tokens.

The 10 most frequent ADJ lemmas: suur, uus, suurem, esimene, viimane, hea, erinev, võimalik, oluline, väike

The 10 most frequent ADJ types: suur, hea, võimalik, suurem, eesti, uue, raske, suure, oluline, esimene

The 10 most frequent ambiguous lemmas: suur (ADJ 588, NOUN 4, PROPN 1), uus (ADJ 489, NOUN 2, PROPN 1), esimene (ADJ 393, DET 51, PRON 23, NOUN 1), viimane (ADJ 371, NOUN 19), hea (ADJ 339, NOUN 25), väike (ADJ 213, NOUN 2), keskmine (ADJ 190, NOUN 2), teine (DET 457, PRON 289, ADJ 189, NUM 1), järgmine (ADJ 180, NOUN 1), parem (ADJ 164, ADV 18)

The 10 most frequent ambiguous types: hea (ADJ 172, NOUN 14), eesti (ADJ 118, PROPN 2), esimene (ADJ 68, DET 14, PRON 3), vana (ADJ 67, NOUN 10), seotud (ADJ 75, VERB 64), esimest (ADJ 59, PRON 2), tehtud (ADJ 71, VERB 48, NOUN 1), teatud (ADJ 60, VERB 2), viimane (ADJ 41, NOUN 5), esimese (ADJ 56, DET 11, PRON 2)

Morphology

The form / lemma ratio of ADJ is 1.902293 (the average of all parts of speech is 1.912184).

The 1st highest number of forms (25) was observed with the lemma “väike”: Väikegi, väike, väike-, väikese, väikesed, väikeseid, väikeseks, väikesel, väikesele, väikesena, väikeses, väikesesse, väikesest, väikesi, väikest, väikeste, väikesteks, väikestele, väikestesse, väikestest, väikse, väiksed, väikseid, väiksel, väikses.

The 2nd highest number of forms (19) was observed with the lemma “järgmine”: järgmine, järgmise, järgmised, järgmiseid, järgmiseks, järgmisel, järgmisele, järgmisena, järgmises, järgmisesse, järgmisest, järgmisi, järgmisse, järgmist, järgmiste, järgmisteks, järgmistel, järgmistesse, järgmistest.

The 3rd highest number of forms (19) was observed with the lemma “suur”: suur, suurde, suure, suured, suureks, suurel, suurele, suurelt, suures, suurest, suuri, suurt, suurte, suurteks, suurtel, suurtele, suurtes, suurtesse, suurtest.

ADJ occurs with 12 features: Degree (27800; 90% instances), Case (26718; 87% instances), Number (26645; 86% instances), VerbForm (6528; 21% instances), Voice (6521; 21% instances), Tense (6387; 21% instances), NumType (2202; 7% instances), NumForm (2047; 7% instances), PronType (258; 1% instances), Abbr (83; 0% instances), Hyph (31; 0% instances), Foreign (1; 0% instances)

ADJ occurs with 40 feature-value pairs: Abbr=Yes, Case=Abe, Case=Abl, Case=Add, Case=Ade, Case=All, Case=Com, Case=Ela, Case=Ess, Case=Gen, Case=Ill, Case=Ine, Case=Nom, Case=Par, Case=Ter, Case=Tra, Degree=Cmp, Degree=Pos, Degree=Sup, Foreign=Yes, Hyph=Yes, NumForm=Digit, NumForm=Letter, NumForm=Roman, NumType=Card, NumType=Ord, Number=Plur, Number=Sing, PronType=Dem, PronType=Ind, PronType=Int, PronType=Int,Rel, PronType=Rel, PronType=Tot, Tense=Past, Tense=Pres, VerbForm=Part, VerbForm=Sup, Voice=Act, Voice=Pass

ADJ occurs with 260 feature combinations. The most frequent feature combination is Case=Nom|Degree=Pos|Number=Sing (5867 tokens). Examples: suur, võimalik, hea, oluline, raske, uus, väike, keskmine, viimane, selge

Relations

ADJ nodes are attached to their parents using 25 different relations: amod (19268; 62% instances), acl (5296; 17% instances), root (2079; 7% instances), conj (1934; 6% instances), xcomp (665; 2% instances), advcl (320; 1% instances), parataxis (313; 1% instances), ccomp (244; 1% instances), acl:relcl (224; 1% instances), nmod (114; 0% instances), obj (113; 0% instances), nsubj (104; 0% instances), nsubj:cop (55; 0% instances), obl (36; 0% instances), csubj (29; 0% instances), csubj:cop (18; 0% instances), appos (7; 0% instances), orphan (7; 0% instances), advmod (4; 0% instances), flat (4; 0% instances), advmod:quant (2; 0% instances), goeswith (2; 0% instances), case (1; 0% instances), compound:prt (1; 0% instances), nummod (1; 0% instances)

Parents of ADJ nodes belong to 14 different parts of speech: NOUN (23682; 77% instances), VERB (2388; 8% instances), (2079; 7% instances), ADJ (1641; 5% instances), PROPN (544; 2% instances), PRON (217; 1% instances), NUM (170; 1% instances), ADV (96; 0% instances), DET (16; 0% instances), INTJ (3; 0% instances), AUX (2; 0% instances), CCONJ (1; 0% instances), SYM (1; 0% instances), X (1; 0% instances)

19586 (64%) ADJ nodes are leaves.

6002 (19%) ADJ nodes have one child.

1429 (5%) ADJ nodes have two children.

3824 (12%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 13.

Children of ADJ nodes are attached using 32 different relations: obl (4645; 17% instances), punct (4527; 16% instances), advmod (4455; 16% instances), cop (3319; 12% instances), nsubj:cop (2640; 10% instances), conj (2026; 7% instances), cc (1504; 5% instances), advcl (786; 3% instances), mark (759; 3% instances), obj (623; 2% instances), csubj:cop (555; 2% instances), aux (427; 2% instances), parataxis (352; 1% instances), xcomp (180; 1% instances), amod (173; 1% instances), nummod (123; 0% instances), compound:prt (76; 0% instances), case (75; 0% instances), det (69; 0% instances), nmod (63; 0% instances), cc:preconj (61; 0% instances), acl:relcl (44; 0% instances), ccomp (24; 0% instances), nsubj (22; 0% instances), appos (21; 0% instances), csubj (17; 0% instances), discourse (16; 0% instances), vocative (10; 0% instances), flat (6; 0% instances), orphan (5; 0% instances), acl (4; 0% instances), fixed (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: NOUN (6717; 24% instances), ADV (4805; 17% instances), PUNCT (4527; 16% instances), AUX (3745; 14% instances), VERB (1813; 7% instances), ADJ (1641; 6% instances), CCONJ (1502; 5% instances), PRON (1176; 4% instances), PROPN (688; 2% instances), SCONJ (635; 2% instances), NUM (190; 1% instances), ADP (77; 0% instances), DET (71; 0% instances), INTJ (16; 0% instances), SYM (5; 0% instances)