home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_Georgian-GLC: POS Tags: NOUN

There are 662 NOUN lemmas (37%), 1125 NOUN types (40%) and 1645 NOUN tokens (24%). Out of 14 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: ეპისკოპოსი, ეკლესია, ღმერთი, კანონი, კრება, მამა, სიტყუაჲ, ქალაქი, დიაკონი, თავი

The 10 most frequent NOUN types: მამათა, ღმრთისა, ეპისკოპოსთა, ეპისკოპოსმან, ქალაქსა, ეკლესიათა, მეფისა, ეკლესიასა, ჴელთ, კანონნი

The 10 most frequent ambiguous lemmas: თავი (NOUN 17, PRON 1), მსახურება (NOUN 16, VERB 7), ჟამი (NOUN 14, ADV 1), ცხორება (NOUN 11, VERB 1), სამღდელოჲ (NOUN 10, ADJ 3), ქმნა (NOUN 10, VERB 5), განჩინება (NOUN 8, VERB 5), სამღდელო (ADJ 8, NOUN 8, ADV 2), დასხმა (NOUN 7, VERB 1), მოქმედება (NOUN 7, VERB 3)

The 10 most frequent ambiguous types: სამღდელოთა (NOUN 5, ADJ 4), ყოფაჲ (VERB 8, NOUN 5), სამღდელოსა (ADJ 5, NOUN 4), სამღდელოჲსა (NOUN 4, ADJ 2), მიმართისა (NOUN 3, ADP 2), თავს (NOUN 2, PRON 1), ნებსით (NOUN 2, ADV 1), სამეუფოსა (ADJ 6, NOUN 2), ყოფისა (VERB 3, NOUN 2), ამისთჳს (ADV 1, NOUN 1)

Morphology

The form / lemma ratio of NOUN is 1.699396 (the average of all parts of speech is 1.566123).

The 1st highest number of forms (13) was observed with the lemma “ეპისკოპოსი”: ეპისკოპოს, ეპისკოპოსად, ეპისკოპოსთა, ეპისკოპოსთასა, ეპისკოპოსთაჲ, ეპისკოპოსი, ეპისკოპოსისა, ეპისკოპოსისანი, ეპისკოპოსისაცა, ეპისკოპოსისაჲ, ეპისკოპოსმან, ეპისკოპოსნი, ეპისკოპოსსა.

The 2nd highest number of forms (11) was observed with the lemma “საქმე”: საქმე, საქმედ, საქმეთა, საქმეთანი, საქმეთასა, საქმეთაჲსა, საქმენი, საქმესა, საქმით, საქმისა, საქმისასა.

The 3rd highest number of forms (11) was observed with the lemma “სიტყუაჲ”: სიტყუად, სიტყუათა, სიტყუათასა, სიტყუათაჲ, სიტყუანი, სიტყუასა, სიტყუაჲ, სიტყჳსა, სიტყჳსასა, სიტყჳსაჲ, სიტყჳსაჲთა.

NOUN occurs with 4 features: Case (1645; 100% instances), Number (1645; 100% instances), Case[stack] (155; 9% instances), PartType (12; 1% instances)

NOUN occurs with 11 feature-value pairs: Case=Dat, Case=Erg, Case=Ess, Case=Gen, Case=Ins, Case=Nom, Case=Voc, Case[stack]=Gen, Number=Plur, Number=Sing, PartType=Emp

NOUN occurs with 28 feature combinations. The most frequent feature combination is Case=Gen|Number=Sing (395 tokens). Examples: ღმრთისა, მეფისა, ეპისკოპოსისა, კრებისა, მთავარეპისკოპოსისა, უფლისა, ცხორებისა, ეკლესიისა, თავისა, მიზეზისა

Relations

NOUN nodes are attached to their parents using 19 different relations: nmod (573; 35% instances), obl (376; 23% instances), conj (192; 12% instances), obj (186; 11% instances), nsubj (141; 9% instances), appos (39; 2% instances), iobj (38; 2% instances), xcomp (23; 1% instances), advcl (19; 1% instances), nsubj:pass (16; 1% instances), root (13; 1% instances), ccomp (8; 0% instances), acl:relcl (6; 0% instances), amod (5; 0% instances), compound (3; 0% instances), parataxis (3; 0% instances), vocative (2; 0% instances), acl (1; 0% instances), case (1; 0% instances)

Parents of NOUN nodes belong to 11 different parts of speech: VERB (708; 43% instances), NOUN (641; 39% instances), ADP (58; 4% instances), PROPN (58; 4% instances), ADJ (52; 3% instances), PRON (45; 3% instances), ADV (40; 2% instances), AUX (23; 1% instances), (13; 1% instances), NUM (5; 0% instances), PART (2; 0% instances)

402 (24%) NOUN nodes are leaves.

601 (37%) NOUN nodes have one child.

381 (23%) NOUN nodes have two children.

261 (16%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 25.

Children of NOUN nodes are attached using 29 different relations: nmod (580; 24% instances), amod (303; 13% instances), det (268; 11% instances), cc (217; 9% instances), case (200; 8% instances), conj (192; 8% instances), punct (111; 5% instances), advmod (99; 4% instances), obl (51; 2% instances), mark (50; 2% instances), nummod (43; 2% instances), acl:relcl (39; 2% instances), nsubj (39; 2% instances), cop (32; 1% instances), advmod:neg (30; 1% instances), appos (25; 1% instances), advcl (21; 1% instances), det:poss (16; 1% instances), obj (14; 1% instances), acl (12; 1% instances), xcomp (9; 0% instances), dep (6; 0% instances), parataxis (4; 0% instances), advmod:emph (3; 0% instances), ccomp (3; 0% instances), nsubj:pass (3; 0% instances), compound (2; 0% instances), iobj (2; 0% instances), flat (1; 0% instances)

Children of NOUN nodes belong to 14 different parts of speech: NOUN (641; 27% instances), PRON (445; 19% instances), ADP (217; 9% instances), CCONJ (217; 9% instances), ADJ (215; 9% instances), VERB (192; 8% instances), PUNCT (111; 5% instances), ADV (100; 4% instances), PROPN (61; 3% instances), NUM (52; 2% instances), SCONJ (50; 2% instances), PART (37; 2% instances), AUX (36; 2% instances), X (1; 0% instances)