home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Georgian-GLC: POS Tags: ADJ

There are 2379 ADJ lemmas (26%), 3122 ADJ types (20%) and 9175 ADJ tokens (15%). Out of 16 observed tags, the rank of ADJ is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent ADJ lemmas: ძველი, სხვადასხვა, ქიმიური, დიდი, საერთაშორისო, ბერძნული, ძირითადი, ახალი, თანამედროვე, ფიზიკური

The 10 most frequent ADJ types: სხვადასხვა, საერთაშორისო, ქიმიური, დიდი, თანამედროვე, ძირითადი, ბერძნ., უფრო, სამეცნიერო, ახალი

The 10 most frequent ambiguous lemmas: ქიმიური (ADJ 105, ADV 1), დიდი (ADJ 103, ADV 1), ბერძნული (ADJ 91, ADV 2), სახელმწიფო (ADJ 69, NOUN 26), უფრო (ADJ 66, ADV 2), მნიშვნელოვანი (ADJ 63, ADV 6), მთავარი (ADJ 60, NOUN 2), ქართული (ADJ 59, NOUN 1), ზოგადი (ADJ 57, ADV 12), ისტორიული (ADJ 50, ADV 3)

The 10 most frequent ambiguous types: უფრო (ADJ 66, ADV 2), სახელმწიფო (ADJ 49, NOUN 3), საბაჟო (ADJ 37, NOUN 1), მთავარი (ADJ 36, NOUN 1), მსოფლიო (ADJ 33, NOUN 17), გამოყენებითი (ADJ 19, NOUN 1), ბოლო (ADJ 17, NOUN 3), შუა (ADJ 17, ADP 7), შედარებით (ADJ 13, NOUN 13), ძირითად (ADJ 13, ADV 1)

Morphology

The form / lemma ratio of ADJ is 1.312316 (the average of all parts of speech is 1.677821).

The 1st highest number of forms (8) was observed with the lemma “სახელმწიფო”: სახ., სახელმწიფო, სახელმწიფოებ, სახელმწიფოები, სახელმწიფოების, სახელმწიფოთა, სახელმწიფოს, სახელმწიფოსა.

The 2nd highest number of forms (7) was observed with the lemma “ახალი”: ახ., ახალ, ახალი, ახალმა, ახლად, უახლეს, უახლესი.

The 3rd highest number of forms (7) was observed with the lemma “დიდი”: დიდ, დიდად, დიდადა, დიდი, დიდმა, უდიდეს, უდიდესი.

ADJ occurs with 6 features: Number (8931; 97% instances), Case (8930; 97% instances), Degree (1273; 14% instances), Abbr (182; 2% instances), PartType (14; 0% instances), AdpType (2; 0% instances)

ADJ occurs with 14 feature-value pairs: Abbr=Yes, AdpType=Post, Case=Dat, Case=Erg, Case=Ess, Case=Gen, Case=Ins, Case=Nom, Degree=Cmp, Degree=Pos, Degree=Sup, Number=Plur, Number=Sing, PartType=Emp

ADJ occurs with 39 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing (2836 tokens). Examples: ძირითადი, დაკავშირებული, საერთაშორისო, ცნობილი, სამეცნიერო, ქიმიური, თანამედროვე, შესაძლებელი, ფიზიკური, გავრცელებული

Relations

ADJ nodes are attached to their parents using 20 different relations: amod (7683; 84% instances), conj (644; 7% instances), root (345; 4% instances), acl (128; 1% instances), obl (76; 1% instances), appos (61; 1% instances), nsubj (49; 1% instances), advcl (41; 0% instances), parataxis (33; 0% instances), ccomp (30; 0% instances), obj (24; 0% instances), xcomp (21; 0% instances), nmod (14; 0% instances), advmod (9; 0% instances), obl:tmod (6; 0% instances), iobj (4; 0% instances), acl:relcl (3; 0% instances), csubj (2; 0% instances), nsubj:pass (1; 0% instances), orphan (1; 0% instances)

Parents of ADJ nodes belong to 13 different parts of speech: NOUN (7160; 78% instances), ADJ (1021; 11% instances), VERB (354; 4% instances), (345; 4% instances), X (145; 2% instances), PROPN (77; 1% instances), ADV (35; 0% instances), PRON (19; 0% instances), NUM (12; 0% instances), AUX (3; 0% instances), PART (2; 0% instances), ADP (1; 0% instances), SYM (1; 0% instances)

6502 (71%) ADJ nodes are leaves.

1629 (18%) ADJ nodes have one child.

346 (4%) ADJ nodes have two children.

698 (8%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 10.

Children of ADJ nodes are attached using 28 different relations: punct (1014; 19% instances), obl (696; 13% instances), conj (642; 12% instances), cop (549; 10% instances), nsubj (491; 9% instances), amod (435; 8% instances), cc (415; 8% instances), nmod (297; 6% instances), advmod (244; 5% instances), case (93; 2% instances), nummod (89; 2% instances), mark (82; 2% instances), det (67; 1% instances), appos (60; 1% instances), acl (45; 1% instances), ccomp (35; 1% instances), parataxis (34; 1% instances), advcl (29; 1% instances), det:poss (22; 0% instances), obl:tmod (19; 0% instances), obj (14; 0% instances), advmod:lmod (6; 0% instances), aux (4; 0% instances), nsubj:pass (3; 0% instances), orphan (3; 0% instances), xcomp (2; 0% instances), acl:relcl (1; 0% instances), nsubj:outer (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: NOUN (1373; 25% instances), ADJ (1021; 19% instances), PUNCT (1014; 19% instances), AUX (553; 10% instances), CCONJ (415; 8% instances), PRON (264; 5% instances), ADV (217; 4% instances), VERB (124; 2% instances), NUM (104; 2% instances), ADP (95; 2% instances), SCONJ (86; 2% instances), PROPN (83; 2% instances), PART (35; 1% instances), X (5; 0% instances), SYM (3; 0% instances)