Treebank Statistics: UD_Georgian-GNC: POS Tags: NOUN
There are 1700 NOUN lemmas (37%), 2864 NOUN types (38%) and 4820 NOUN tokens (21%).
Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN lemmas: დედა, დრო, ადამიანი, ხელი, ოთახი, კითხვა, წელი, სახლი, თვალი, კაცი
The 10 most frequent NOUN types: დედა, ოთახ, დროს, დრო, კითხვა, წლის, სამყარო, კაცი, პასუხი, ადამიანი
The 10 most frequent ambiguous lemmas: დედა (NOUN 88, ADJ 1), კითხვა (NOUN 42, VERB 27), თვალი (NOUN 36, ADJ 2), დღე (NOUN 29, ADV 1), თავი (PRON 40, NOUN 26), ცხოვრება (NOUN 23, VERB 15), სიცოცხლე (NOUN 18, INTJ 1), ხე (NOUN 13, PROPN 1), საღამო (NOUN 12, ADJ 1), ფიქრი (VERB 24, NOUN 10)
The 10 most frequent ambiguous types: დედა (NOUN 48, ADJ 1), სიცოცხლე (NOUN 11, INTJ 1), თავს (PRON 13, NOUN 10), თავი (PRON 13, NOUN 9), დღე (NOUN 8, ADV 1), დღეს (ADV 12, NOUN 8), თვალები (NOUN 6, ADJ 2), თავ (PRON 8, NOUN 5), მოდი (NOUN 5, VERB 1), ბოლო (ADJ 6, NOUN 4)
- დედა
- სიცოცხლე
- თავს
- თავი
- დღე
- დღეს
- თვალები
- თავ
- მოდი
- ბოლო
Morphology
The form / lemma ratio of NOUN is 1.684706 (the average of all parts of speech is 1.616911).
The 1st highest number of forms (12) was observed with the lemma “ადამიანი”: ადამიან, ადამიანად, ადამიანებ, ადამიანები, ადამიანების, ადამიანებს, ადამიანებსაც, ადამიანთა, ადამიანი, ადამიანის, ადამიანმა, ადამიანს.
The 2nd highest number of forms (12) was observed with the lemma “წელი”: წელ, წელი, წელს, წლა, წლებ, წლები, წლების, წლებმა, წლი, წლის, წლისა, წლისამ.
The 3rd highest number of forms (12) was observed with the lemma “ხელი”: ხელ, ხელები, ხელებით, ხელების, ხელებიც, ხელებს, ხელთ, ხელი, ხელით, ხელის, ხელიც, ხელს.
NOUN occurs with 6 features: Case (4817; 100% instances), Number (4739; 98% instances), Animacy (769; 16% instances), VerbForm (547; 11% instances), Encl (104; 2% instances), Abbr (3; 0% instances)
NOUN occurs with 14 feature-value pairs: Abbr=Yes, Animacy=Anim, Case=Dat, Case=Erg, Case=Ess, Case=Gen, Case=Ins, Case=Nom, Case=Voc, Encl=C, Encl=Ve, Number=Plur, Number=Sing, VerbForm=Vnoun
NOUN occurs with 65 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing (1095 tokens).
Examples: პასუხი, დრო, ოთახი, სამყარო, კაბა, ამბავი, ადგილი, კარი, მიზანი, ფული
Relations
NOUN nodes are attached to their parents using 27 different relations: obl (1266; 26% instances), nsubj (951; 20% instances), obj (886; 18% instances), nmod (748; 16% instances), conj (325; 7% instances), iobj (284; 6% instances), root (150; 3% instances), xcomp (45; 1% instances), parataxis (28; 1% instances), appos (26; 1% instances), advcl (19; 0% instances), ccomp (18; 0% instances), vocative (18; 0% instances), ccomp:speech (9; 0% instances), orphan (9; 0% instances), obl:iobj (7; 0% instances), acl (6; 0% instances), csubj (6; 0% instances), fixed (6; 0% instances), obl:agent (3; 0% instances), acl:relcl (2; 0% instances), amod (2; 0% instances), nmod:pred (2; 0% instances), advcl:relcl (1; 0% instances), csubj:outer (1; 0% instances), flat (1; 0% instances), nmod:iobj (1; 0% instances)
Parents of NOUN nodes belong to 9 different parts of speech: VERB (3211; 67% instances), NOUN (1063; 22% instances), ADJ (219; 5% instances), (150; 3% instances), PROPN (93; 2% instances), ADV (39; 1% instances), PRON (39; 1% instances), DET (4; 0% instances), NUM (2; 0% instances)
1479 (31%) NOUN nodes are leaves.
1921 (40%) NOUN nodes have one child.
929 (19%) NOUN nodes have two children.
491 (10%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 10.
Children of NOUN nodes are attached using 30 different relations: amod (1064; 19% instances), case (1048; 18% instances), nmod (871; 15% instances), punct (482; 8% instances), det (365; 6% instances), conj (331; 6% instances), det:poss (270; 5% instances), cc (244; 4% instances), nummod (198; 3% instances), cop (165; 3% instances), advmod (158; 3% instances), nsubj (105; 2% instances), acl:relcl (99; 2% instances), mark (56; 1% instances), parataxis (50; 1% instances), appos (41; 1% instances), advmod:neg (40; 1% instances), obl (36; 1% instances), advcl (26; 0% instances), acl (20; 0% instances), nmod:name (10; 0% instances), orphan (8; 0% instances), csubj (4; 0% instances), discourse (4; 0% instances), nmod:iobj (3; 0% instances), nmod:pred (3; 0% instances), aux (2; 0% instances), obl:iobj (2; 0% instances), nmod:agent (1; 0% instances), obl:agent (1; 0% instances)
Children of NOUN nodes belong to 15 different parts of speech: ADP (1072; 19% instances), ADJ (1071; 19% instances), NOUN (1063; 19% instances), PRON (669; 12% instances), PUNCT (482; 8% instances), CCONJ (242; 4% instances), ADV (212; 4% instances), NUM (205; 4% instances), VERB (198; 3% instances), PROPN (192; 3% instances), AUX (167; 3% instances), DET (98; 2% instances), SCONJ (23; 0% instances), PART (9; 0% instances), INTJ (4; 0% instances)