Treebank Statistics: UD_Greek-Cretan: POS Tags: NOUN
There are 373 NOUN lemmas (36%), 437 NOUN types (30%) and 602 NOUN tokens (14%).
Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 2 in number of types and 4 in number of tokens.
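The lemma, type and token counts above can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page. It assumes that "types" means distinct word forms compared case-sensitively, and that multiword-token ranges and empty nodes are skipped. The helper names `make_row` and `pos_counts` and the three tiny sample sentences are hypothetical.

```python
def make_row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U token line (unused columns set to '_')."""
    return "\t".join(
        [str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"]
    )


def pos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Keep only regular tokens: IDs like "1-2" or "1.1" are skipped.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            types.add(cols[1])
            lemmas.add(cols[2])
            tokens += 1
    return len(lemmas), len(types), tokens


# Hypothetical sample: three one-NOUN sentences sharing the lemma "σπίτι".
SAMPLE = "\n".join([
    make_row(1, "το", "ο", "DET", 2, "det"),
    make_row(2, "σπίτι", "σπίτι", "NOUN", 0, "root"),
    "",
    make_row(1, "τα", "ο", "DET", 2, "det"),
    make_row(2, "σπίτια", "σπίτι", "NOUN", 0, "root"),
    "",
    make_row(1, "σπίτι", "σπίτι", "NOUN", 0, "root"),
])

print(pos_counts(SAMPLE, "NOUN"))  # (1, 2, 3)
```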
The 10 most frequent NOUN lemmas: σπίτι, χρόνος, βασιλιάς, χέρι, άντρας, γιατρός, γυναίκα, κουμπάρος, πόδι, άνθρωπος
The 10 most frequent NOUN types: σπίτι, βασιλιάς, χρόνια, θεια, νερό, χάρη, γάλα, κερά, κύρη, παιδί
The 10 most frequent ambiguous lemmas: σ (NOUN 3, ADP 2), Νικολής (PROPN 15, NOUN 2), Μιχάλης (NOUN 1, PROPN 1), αγάς (ADJ 1, NOUN 1), καρκάνα (NOUN 1, PROPN 1), πλούσος (ADJ 1, NOUN 1), τάδε (DET 1, NOUN 1), φορά (DET 1, NOUN 1), φτωχός (ADJ 3, NOUN 1)
The 10 most frequent ambiguous types: σ (ADP 66, NOUN 3), Μιχαλιό (PROPN 2, NOUN 1), Νικολάρος (PROPN 5, NOUN 1), καρκάνα (NOUN 1, PROPN 1), φτωχός (ADJ 2, NOUN 1)
Morphology
The form / lemma ratio of NOUN is 1.171582 (the average of all parts of speech is 1.384100).
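The reported ratio is consistent with dividing the number of NOUN types by the number of NOUN lemmas given at the top of this page. That this is how the figure was computed is an assumption; the short check below only confirms that the arithmetic matches.

```python
# Counts taken from the summary line of this page.
noun_types = 437   # distinct NOUN word forms
noun_lemmas = 373  # distinct NOUN lemmas

# Assumed definition: forms per lemma = types / lemmas.
ratio = noun_types / noun_lemmas
print(f"{ratio:.6f}")  # 1.171582
```

A ratio below the all-POS average of 1.384100 means that, in this sample, a noun lemma is attested in fewer distinct forms than the average lemma.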
The highest number of forms observed for a single lemma is 4, shared by three lemmas:
- “γιατρός”: γιάτρακας, γιατρέ, γιατρούς, γιατρός
- “κοπελιά”: κοπελίτσα, κοπελιά, κοπελιές, κοπελούδες
- “κουμπάρος”: κουμπάροι, κουμπάρος, κουμπαράκια, κουμπαρίγκο
NOUN occurs with 5 features: Gender (601; 100% instances), Number (598; 99% instances), Case (596; 99% instances), Degree (37; 6% instances), NumType (1; 0% instances)
NOUN occurs with 12 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Case=Voc, Degree=Aug, Degree=Dim, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Sets, Number=Plur, Number=Sing
NOUN occurs with 40 feature combinations.
The most frequent feature combination is Case=Acc|Gender=Fem|Number=Sing (99 tokens).
Examples: κορφή, νύχτα, πόρτα, βελόνα, γης, γριά, γυναίκα, κουβέντα, μάνα, μαζώχτρα
Relations
NOUN nodes are attached to their parents using 20 different relations: obj (169; 28% instances), obl (129; 21% instances), nsubj (113; 19% instances), conj (43; 7% instances), nmod (42; 7% instances), vocative (28; 5% instances), root (15; 2% instances), xcomp (13; 2% instances), compound (10; 2% instances), appos (8; 1% instances), iobj (7; 1% instances), parataxis (7; 1% instances), ccomp (6; 1% instances), case (3; 0% instances), advcl (2; 0% instances), fixed (2; 0% instances), orphan (2; 0% instances), acl:relcl (1; 0% instances), compound:redup (1; 0% instances), discourse (1; 0% instances)
Parents of NOUN nodes belong to 8 different parts of speech: VERB (450; 75% instances), NOUN (88; 15% instances), ADJ (18; 3% instances), PROPN (16; 3% instances), [no parent: root] (15; 2% instances), ADV (7; 1% instances), DET (4; 1% instances), PRON (4; 1% instances)
88 (15%) NOUN nodes are leaves.
176 (29%) NOUN nodes have one child.
201 (33%) NOUN nodes have two children.
137 (23%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 8.
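The child-degree figures above (leaves, one child, two children, three or more, and the maximum) can be derived by counting, for each NOUN token, how many tokens in the same sentence name it as their head. The sketch below illustrates that counting and is not the original generator. The function name `child_degree_buckets`, the helper `make_row` and the sample sentence are hypothetical. It assumes sentences are separated by blank lines and that only regular integer-ID tokens count.

```python
from collections import Counter


def make_row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U token line (unused columns set to '_')."""
    return "\t".join(
        [str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"]
    )


def child_degree_buckets(conllu_text, upos):
    """Return ({'0','1','2','3+'} -> node count, highest child degree)."""
    buckets = {"0": 0, "1": 0, "2": 0, "3+": 0}
    max_degree = 0
    for block in conllu_text.strip().split("\n\n"):
        rows = []
        for line in block.splitlines():
            cols = line.split("\t")
            # Skip comments, multiword-token ranges and empty nodes.
            if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
                continue
            rows.append(cols)
        # Number of children per head ID within this sentence.
        children = Counter(cols[6] for cols in rows)
        for cols in rows:
            if cols[3] != upos:
                continue
            degree = children[cols[0]]  # Counter yields 0 for leaves
            buckets[str(degree) if degree < 3 else "3+"] += 1
            max_degree = max(max_degree, degree)
    return buckets, max_degree


# Hypothetical sentence: "ο βασιλιάς είδε το μεγάλο σπίτι".
SENTENCE = "\n".join([
    make_row(1, "ο", "ο", "DET", 2, "det"),
    make_row(2, "βασιλιάς", "βασιλιάς", "NOUN", 3, "nsubj"),
    make_row(3, "είδε", "βλέπω", "VERB", 0, "root"),
    make_row(4, "το", "ο", "DET", 6, "det"),
    make_row(5, "μεγάλο", "μεγάλος", "ADJ", 6, "amod"),
    make_row(6, "σπίτι", "σπίτι", "NOUN", 3, "obj"),
])

print(child_degree_buckets(SENTENCE, "NOUN"))
```

As a consistency check on the figures on this page, the four buckets sum to 88 + 176 + 201 + 137 = 602, the total number of NOUN tokens.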
Children of NOUN nodes are attached using 26 different relations: det (416; 39% instances), nmod (184; 17% instances), case (112; 10% instances), punct (108; 10% instances), cc (55; 5% instances), amod (41; 4% instances), conj (40; 4% instances), acl:relcl (21; 2% instances), nummod (13; 1% instances), advmod (12; 1% instances), appos (12; 1% instances), cop (12; 1% instances), nsubj (10; 1% instances), discourse (9; 1% instances), orphan (5; 0% instances), advcl (4; 0% instances), vocative (3; 0% instances), acl (2; 0% instances), aux (2; 0% instances), compound (2; 0% instances), expl (2; 0% instances), obj (2; 0% instances), obl (2; 0% instances), parataxis (2; 0% instances), ccomp (1; 0% instances), compound:redup (1; 0% instances)
Children of NOUN nodes belong to 14 different parts of speech: DET (414; 39% instances), PRON (142; 13% instances), PUNCT (108; 10% instances), ADP (104; 10% instances), NOUN (88; 8% instances), CCONJ (56; 5% instances), ADJ (46; 4% instances), VERB (33; 3% instances), PROPN (25; 2% instances), ADV (20; 2% instances), AUX (14; 1% instances), NUM (14; 1% instances), INTJ (6; 1% instances), PART (3; 0% instances)