home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Georgian-GLC: POS Tags: ADJ

There are 2377 ADJ lemmas (26%), 3118 ADJ types (20%) and 8997 ADJ tokens (15%). Out of 16 observed tags, the rank of ADJ is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent ADJ lemmas: ძველი, სხვადასხვა, ქიმიური, დიდი, საერთაშორისო, ბერძნული, ძირითადი, ახალი, თანამედროვე, ფიზიკური

The 10 most frequent ADJ types: სხვადასხვა, საერთაშორისო, ქიმიური, დიდი, თანამედროვე, ძირითადი, ბერძნ., უფრო, სამეცნიერო, ახალი

The 10 most frequent ambiguous lemmas: სახელმწიფო (ADJ 69, NOUN 26), უფრო (ADJ 66, ADV 2), მთავარი (ADJ 60, NOUN 2), ქართული (ADJ 59, NOUN 1), საბაჟო (ADJ 39, NOUN 1), ფართო (ADJ 39, ADV 22), დაკავშირებული (ADJ 37, VERB 19), მსოფლიო (ADJ 33, NOUN 30), განსხვავებული (ADJ 21, VERB 2), მიღებული (ADJ 19, VERB 10)

The 10 most frequent ambiguous types: უფრო (ADJ 66, ADV 2), სახელმწიფო (ADJ 49, NOUN 3), საბაჟო (ADJ 37, NOUN 1), მთავარი (ADJ 36, NOUN 1), მსოფლიო (ADJ 33, NOUN 17), დაკავშირებული (ADJ 27, VERB 19), გამოყენებითი (ADJ 19, NOUN 1), დამოკიდებული (ADJ 18, VERB 3), ბოლო (ADJ 17, NOUN 3), შუა (ADJ 17, ADP 7)

Morphology

The form / lemma ratio of ADJ is 1.311737 (the average of all parts of speech is 1.674782).

The 1st highest number of forms (8) was observed with the lemma “სახელმწიფო”: სახ., სახელმწიფო, სახელმწიფოებ, სახელმწიფოები, სახელმწიფოების, სახელმწიფოთა, სახელმწიფოს, სახელმწიფოსა.

The 2nd highest number of forms (7) was observed with the lemma “ახალი”: ახ., ახალ, ახალი, ახალმა, ახლად, უახლეს, უახლესი.

The 3rd highest number of forms (7) was observed with the lemma “დიდი”: დიდ, დიდად, დიდადა, დიდი, დიდმა, უდიდეს, უდიდესი.

ADJ occurs with 7 features: Number (8752; 97% instances), Case (8751; 97% instances), Degree (1274; 14% instances), Abbr (183; 2% instances), PartType (14; 0% instances), AdpType (2; 0% instances), ExtPos (1; 0% instances)

ADJ occurs with 15 feature-value pairs: Abbr=Yes, AdpType=Post, Case=Dat, Case=Erg, Case=Ess, Case=Gen, Case=Ins, Case=Nom, Degree=Cmp, Degree=Pos, Degree=Sup, ExtPos=ADV, Number=Plur, Number=Sing, PartType=Emp

ADJ occurs with 40 feature combinations. The most frequent feature combination is Case=Gen|Number=Sing (2725 tokens). Examples: სხვადასხვა, ქიმიური, საერთაშორისო, მსოფლიო, ისტორიული, ფიზიკური, ქართული, თანამედროვე, სამეცნიერო, სახელმწიფო

Relations

ADJ nodes are attached to their parents using 19 different relations: amod (7723; 86% instances), conj (637; 7% instances), root (206; 2% instances), acl (80; 1% instances), obl (74; 1% instances), appos (62; 1% instances), nsubj (49; 1% instances), advcl (32; 0% instances), parataxis (27; 0% instances), ccomp (23; 0% instances), obj (23; 0% instances), xcomp (21; 0% instances), nmod (18; 0% instances), advmod (9; 0% instances), obl:tmod (6; 0% instances), iobj (4; 0% instances), acl:relcl (1; 0% instances), nsubj:pass (1; 0% instances), orphan (1; 0% instances)

Parents of ADJ nodes belong to 13 different parts of speech: NOUN (7151; 79% instances), ADJ (1007; 11% instances), VERB (333; 4% instances), (206; 2% instances), X (145; 2% instances), PROPN (82; 1% instances), ADV (35; 0% instances), PRON (18; 0% instances), NUM (12; 0% instances), AUX (3; 0% instances), PART (3; 0% instances), ADP (1; 0% instances), SYM (1; 0% instances)

6495 (72%) ADJ nodes are leaves.

1670 (19%) ADJ nodes have one child.

348 (4%) ADJ nodes have two children.

484 (5%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 10.

Children of ADJ nodes are attached using 27 different relations: punct (819; 18% instances), conj (624; 14% instances), obl (484; 11% instances), amod (436; 10% instances), cc (407; 9% instances), nmod (325; 7% instances), cop (319; 7% instances), nsubj (305; 7% instances), advmod (193; 4% instances), case (92; 2% instances), nummod (88; 2% instances), det (66; 1% instances), mark (66; 1% instances), appos (59; 1% instances), acl (37; 1% instances), parataxis (28; 1% instances), ccomp (26; 1% instances), det:poss (22; 0% instances), advcl (21; 0% instances), obj (8; 0% instances), obl:tmod (6; 0% instances), advmod:lmod (5; 0% instances), aux (3; 0% instances), orphan (3; 0% instances), xcomp (2; 0% instances), compound (1; 0% instances), fixed (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: NOUN (1026; 23% instances), ADJ (1007; 23% instances), PUNCT (819; 18% instances), CCONJ (407; 9% instances), AUX (322; 7% instances), PRON (222; 5% instances), ADV (170; 4% instances), NUM (102; 2% instances), VERB (97; 2% instances), ADP (92; 2% instances), PROPN (78; 2% instances), SCONJ (67; 2% instances), PART (30; 1% instances), X (4; 0% instances), SYM (3; 0% instances)