home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Georgian-GLC: POS Tags: NOUN

There are 456 NOUN lemmas (41%), 597 NOUN types (43%) and 675 NOUN tokens (29%). Out of 13 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: შეფასება, ადამიანი, ენა, სიტყვა, უნივერსიტეტი, ხელისუფლება, კისერი, საქმიანობა, უფლება, ქვეყანა

The 10 most frequent NOUN types: ადამიანის, საქმიანობის, საუბარი, სექტორის, უნივერსიტეტის, შეფასება, შეფასების, ხელისუფლების, გზა, ენის

The 10 most frequent ambiguous lemmas: კავკასია (NOUN 2, PROPN 1), თავი (PRON 4, NOUN 1), მეზობელი (ADJ 1, NOUN 1), სამართალდამცავი (ADJ 2, NOUN 1), სახელმწიფო (ADJ 1, NOUN 1), ქართველი (ADJ 2, NOUN 1)

The 10 most frequent ambiguous types: დღეს (ADV 2, NOUN 2), თავი (NOUN 1, PRON 1), კავკასია (NOUN 1, PROPN 1), მეზობელი (ADJ 1, NOUN 1), სამართალდამცავი (ADJ 2, NOUN 1)

Morphology

The form / lemma ratio of NOUN is 1.309211 (the average of all parts of speech is 1.235874).

The 1st highest number of forms (5) was observed with the lemma “ენა”: ენა, ენებ, ენების, ენით, ენის.

The 2nd highest number of forms (5) was observed with the lemma “ხალხი”: ხალხების, ხალხებს, ხალხთა, ხალხის, ხალხს.

The 3rd highest number of forms (4) was observed with the lemma “მიზანი”: მიზან, მიზნად, მიზნით, მიზნის.

NOUN occurs with 5 features: Case (670; 99% instances), Number (670; 99% instances), Animacy (669; 99% instances), PartType (11; 2% instances), Abbr (6; 1% instances)

NOUN occurs with 12 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Dat, Case=Erg, Case=Ess, Case=Gen, Case=Ins, Case=Nom, Number=Plur, Number=Sing, PartType=Emp

NOUN occurs with 27 feature combinations. The most frequent feature combination is Animacy=Inan|Case=Gen|Number=Sing (164 tokens). Examples: საქმიანობის, სექტორის, უნივერსიტეტის, შეფასების, ენის, უსაფრთხოების, ბრძოლის, კისრის, პროცესის, სოციალიზმის

Relations

NOUN nodes are attached to their parents using 12 different relations: nmod (202; 30% instances), obl (153; 23% instances), nsubj (127; 19% instances), obj (108; 16% instances), conj (46; 7% instances), iobj (17; 3% instances), root (9; 1% instances), nsubj:pass (6; 1% instances), amod (3; 0% instances), acl (2; 0% instances), advcl (1; 0% instances), parataxis (1; 0% instances)

Parents of NOUN nodes belong to 9 different parts of speech: VERB (334; 49% instances), NOUN (240; 36% instances), ADJ (56; 8% instances), ADV (15; 2% instances), PROPN (9; 1% instances), (9; 1% instances), AUX (5; 1% instances), PRON (5; 1% instances), ADP (2; 0% instances)

150 (22%) NOUN nodes are leaves.

295 (44%) NOUN nodes have one child.

152 (23%) NOUN nodes have two children.

78 (12%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 7.

Children of NOUN nodes are attached using 21 different relations: nmod (304; 34% instances), amod (161; 18% instances), case (122; 14% instances), punct (70; 8% instances), conj (41; 5% instances), cc (38; 4% instances), obl (23; 3% instances), advmod (18; 2% instances), det:poss (18; 2% instances), nummod (18; 2% instances), nsubj (13; 1% instances), cop (12; 1% instances), mark (11; 1% instances), ccomp (10; 1% instances), acl (8; 1% instances), obj (6; 1% instances), advcl (3; 0% instances), flat:name (3; 0% instances), advmod:lmod (2; 0% instances), parataxis (2; 0% instances), iobj (1; 0% instances)

Children of NOUN nodes belong to 13 different parts of speech: NOUN (240; 27% instances), ADJ (239; 27% instances), ADP (124; 14% instances), PRON (73; 8% instances), PUNCT (70; 8% instances), CCONJ (38; 4% instances), ADV (20; 2% instances), NUM (19; 2% instances), VERB (18; 2% instances), PROPN (15; 2% instances), AUX (12; 1% instances), SCONJ (11; 1% instances), PART (5; 1% instances)