Treebank Statistics: UD_Greek-Lesbian: POS Tags: NOUN
There are 414 NOUN lemmas (36%), 546 NOUN types (26%) and 788 NOUN tokens (13%).
Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 2 in number of types and 4 in number of tokens.
The 10 most frequent NOUN lemmas: σπίτ, χουριό, άνθρουπους, μουρό, μάνα, μέρα, χρόνους, γναίκα, πράμα, χέρ
The 10 most frequent NOUN types: μάνα, μέρα, σπίτ’, μουρό, χουριό, χρόνια, πράμα, χωριό, ώρα, μουρά
The 10 most frequent ambiguous lemmas: παπάς (NOUN 2, ADJ 1), θιός (PROPN 4, NOUN 1)
The 10 most frequent ambiguous types: μουρά (NOUN 6, INTJ 1), παπάς (NOUN 2, ADJ 1), Κουτσλιά (NOUN 1, PROPN 1), μέσ’ (ADP 3, NOUN 1), παρά (ADP 2, NOUN 1)
- μουρά
- παπάς
- Κουτσλιά
  - NOUN 1: Άμανι βράδιασει γη Κουτσλιά πιάσει να διαλαγεί « Κύριοι γιου Δησέφ πλει 100 τσιφάλια κατσίτσις προς 200 φράγκα τη μια , όποιους ενδιαφέριτι ας πα σ’ Κουλουμαρίγιας του καφινέ να εύρ’ του Δησέφ να τα κανουνίσειν » .
  - PROPN 1: Σ του χουριό άμανι κατέφτσει , πήγι ίσια τσι ηύρι ντ’ Κουτσλιά του ντιλάς .
- μέσ’
- παρά
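The ambiguity lists above can be reproduced by tallying the UPOS tags observed for each word form. The following is a minimal sketch, not the script that generated this page. The function name `ambiguous_types` and the ordering rule are assumptions made for illustration. The token list reuses the counts reported above for μέσ’ and παρά; the two μάνα tokens are made up to provide an unambiguous control.

```python
from collections import Counter, defaultdict

# (form, UPOS) pairs. The μέσ’ and παρά counts mirror the figures listed above;
# the two μάνα tokens are invented to provide an unambiguous control.
TOKENS = (
    [("μέσ’", "ADP")] * 3 + [("μέσ’", "NOUN")]
    + [("παρά", "ADP")] * 2 + [("παρά", "NOUN")]
    + [("μάνα", "NOUN")] * 2
)


def ambiguous_types(tokens, upos):
    """Return forms tagged `upos` at least once and some other tag at least once."""
    tags_by_form = defaultdict(Counter)
    for form, tag in tokens:
        tags_by_form[form][tag] += 1
    ambiguous = {
        form: tags
        for form, tags in tags_by_form.items()
        if upos in tags and len(tags) > 1
    }
    # Assumed ordering: most frequent as `upos` first, ties broken by form.
    return sorted(ambiguous.items(), key=lambda item: (-item[1][upos], item[0]))


for form, tags in ambiguous_types(TOKENS, "NOUN"):
    summary = ", ".join(f"{tag} {n}" for tag, n in tags.most_common())
    print(f"{form} ({summary})")
```

Forms are compared case-sensitively here; whether the original statistics normalize case is not stated on this page.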
Morphology
The form / lemma ratio of NOUN is 1.318841 (the average of all parts of speech is 1.820961).
The highest number of forms (9) was observed with the lemma “άνθρουπους”: άθριπους, άθρουπι, άθρωπο, άνθρωπο, άνθρωπος, άνθρωπους, αθρώπ’, ανθρώπ’, θρώπ’.
The 2nd highest number of forms (8) was observed with the lemma “σπίτ”: σπίκι, σπίκια, σπίκ’, σπίτια, σπίτ’, σπιτ, σπιτιού, σπτελ.
The 3rd highest number of forms (7) was observed with the lemma “χρόνους”: χρονών, χρουνό, χρούνια, χρόνια, χρόνο, χρόνος, χρόνου.
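The form / lemma ratio reported above is the number of distinct NOUN types divided by the number of distinct NOUN lemmas (546 / 414 ≈ 1.318841). The sketch below shows one way to compute the ratio and the per-lemma form sets. The helper names are hypothetical; the NOUN forms and lemmas in the toy list are attested above, but the ADP row and the token sequence itself are invented for illustration.

```python
from collections import defaultdict

# Toy (form, lemma, UPOS) triples. The NOUN forms and lemmas appear in the lists
# above; the ADP row and the token order are invented for illustration.
TOKENS = [
    ("σπίτ’", "σπίτ", "NOUN"),
    ("σπίτια", "σπίτ", "NOUN"),
    ("σπίτ’", "σπίτ", "NOUN"),
    ("μάνα", "μάνα", "NOUN"),
    ("παρά", "παρά", "ADP"),
]


def forms_by_lemma(tokens, upos):
    """Map each lemma tagged `upos` to the set of word forms observed for it."""
    forms = defaultdict(set)
    for form, lemma, tag in tokens:
        if tag == upos:
            forms[lemma].add(form)
    return forms


def form_lemma_ratio(tokens, upos):
    """Distinct forms (types) divided by distinct lemmas for one UPOS tag."""
    types = {form for form, _, tag in tokens if tag == upos}
    lemmas = {lemma for _, lemma, tag in tokens if tag == upos}
    return len(types) / len(lemmas)


print(form_lemma_ratio(TOKENS, "NOUN"))  # toy data: 3 types / 2 lemmas

# Lemmas ranked by number of distinct forms, as in the list above.
ranked = sorted(forms_by_lemma(TOKENS, "NOUN").items(),
                key=lambda item: -len(item[1]))
for lemma, forms in ranked:
    print(lemma, len(forms), sorted(forms))

# The figure reported on this page follows from the type and lemma counts.
print(round(546 / 414, 6))
```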
NOUN occurs with 5 features: Number (786; 100% instances), Case (785; 100% instances), Gender (784; 99% instances), Degree (26; 3% instances), Typo (23; 3% instances)
NOUN occurs with 12 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Case=Voc, Degree=Aug, Degree=Dim, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Typo=Yes
NOUN occurs with 44 feature combinations.
The most frequent feature combination is Case=Acc|Gender=Neut|Number=Sing (186 tokens).
Examples: σπίτ’, χουριό, χέρ’, λάδ’, σπίκ’, χωριό, μουρό, στρώμα, βράδ, μωρό
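The feature figures above distinguish three levels: features (such as Case), feature-value pairs (such as Case=Acc) and full combinations (such as Case=Acc|Gender=Neut|Number=Sing). A minimal sketch of that tally follows, assuming the features are given as CoNLL-U FEATS strings. The function names and the toy FEATS values are illustrative assumptions, not data from the treebank.

```python
from collections import Counter

# Toy FEATS column values for NOUN tokens; "_" means no features.
# These rows are invented for illustration.
FEATS = [
    "Case=Acc|Gender=Neut|Number=Sing",
    "Case=Acc|Gender=Neut|Number=Sing",
    "Case=Nom|Gender=Fem|Number=Sing",
    "Case=Acc|Degree=Dim|Gender=Neut|Number=Sing",
    "_",
]


def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def summarize_feats(feats_column):
    """Count features, feature-value pairs and whole combinations."""
    feature_counts = Counter()
    pair_counts = Counter()
    combo_counts = Counter()
    for feats in feats_column:
        combo_counts[feats] += 1
        for name, value in parse_feats(feats).items():
            feature_counts[name] += 1
            pair_counts[f"{name}={value}"] += 1
    return feature_counts, pair_counts, combo_counts


feature_counts, pair_counts, combo_counts = summarize_feats(FEATS)
print(len(feature_counts), "features")
print(len(pair_counts), "feature-value pairs")
print(len(combo_counts), "feature combinations")
print("most frequent:", combo_counts.most_common(1)[0])
```

In this sketch the empty feature set "_" is counted as a combination of its own; whether the figures on this page do the same is an assumption.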
Relations
NOUN nodes are attached to their parents using 20 different relations: obl (228; 29% instances), obj (220; 28% instances), nsubj (163; 21% instances), root (49; 6% instances), conj (23; 3% instances), vocative (23; 3% instances), nmod (21; 3% instances), parataxis (12; 2% instances), xcomp (11; 1% instances), appos (7; 1% instances), orphan (6; 1% instances), ccomp (5; 1% instances), compound:redup (5; 1% instances), advcl (4; 1% instances), dislocated (4; 1% instances), nsubj:pass (3; 0% instances), amod (1; 0% instances), compound (1; 0% instances), iobj (1; 0% instances), reparandum (1; 0% instances)
Parents of NOUN nodes belong to 10 different parts of speech: VERB (628; 80% instances), NOUN (65; 8% instances), [no parent: root] (49; 6% instances), ADJ (20; 3% instances), PROPN (10; 1% instances), ADV (8; 1% instances), INTJ (3; 0% instances), NUM (2; 0% instances), PRON (2; 0% instances), DET (1; 0% instances)
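The two tallies above (relation to the parent, and the parent's part of speech) can both be read off the HEAD and DEPREL columns of a CoNLL-U file. The following sketch assumes tokens are already parsed into (id, UPOS, head, deprel) rows. The function name `parent_stats` and the toy trees are hypothetical and are not treebank sentences.

```python
from collections import Counter

# Toy dependency trees as (id, UPOS, head, deprel) rows; head 0 marks the root.
# The trees are invented for illustration and are not treebank sentences.
SENTENCES = [
    [
        (1, "DET", 2, "det"),
        (2, "NOUN", 3, "nsubj"),
        (3, "VERB", 0, "root"),
        (4, "ADP", 5, "case"),
        (5, "NOUN", 3, "obl"),
    ],
    [
        (1, "NOUN", 0, "root"),
    ],
]


def parent_stats(sentences, upos):
    """Count incoming relations and parent UPOS tags for nodes tagged `upos`."""
    relations = Counter()
    parent_pos = Counter()
    for sentence in sentences:
        pos_by_id = {tok_id: tag for tok_id, tag, _, _ in sentence}
        for _, tag, head, deprel in sentence:
            if tag != upos:
                continue
            relations[deprel] += 1
            # A root node has head 0, so it has no parent POS; record "".
            parent_pos[pos_by_id.get(head, "")] += 1
    return relations, parent_pos


relations, parent_pos = parent_stats(SENTENCES, "NOUN")
print(relations.most_common())
print(parent_pos.most_common())
```

NOUN nodes attached as root have no parent, which is why one entry in the parent tally carries no part-of-speech label and why its count (49) equals the count of the root relation.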
81 (10%) NOUN nodes are leaves.
266 (34%) NOUN nodes have one child.
272 (35%) NOUN nodes have two children.
169 (21%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 9.
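The child-degree figures above partition the 788 NOUN tokens by number of dependents (81 + 266 + 272 + 169 = 788). A minimal sketch of how such a histogram could be computed from HEAD values is shown below. The function name, the bucket labels and the toy tree are assumptions for illustration only.

```python
from collections import Counter

# Toy dependency tree as (id, UPOS, head, deprel) rows; head 0 marks the root.
# Invented for illustration; not a treebank sentence.
SENTENCES = [
    [
        (1, "DET", 2, "det"),
        (2, "NOUN", 5, "nsubj"),
        (3, "ADP", 4, "case"),
        (4, "NOUN", 2, "nmod"),
        (5, "VERB", 0, "root"),
        (6, "NOUN", 5, "obj"),
    ],
]


def child_degree_histogram(sentences, upos):
    """Bucket nodes tagged `upos` by number of children: 0, 1, 2 or 3+."""
    histogram = Counter()
    max_degree = 0
    for sentence in sentences:
        # Number of children per node id, counted from the HEAD column.
        n_children = Counter(head for _, _, head, _ in sentence)
        for tok_id, tag, _, _ in sentence:
            if tag != upos:
                continue
            degree = n_children[tok_id]
            bucket = str(degree) if degree < 3 else "3+"
            histogram[bucket] += 1
            max_degree = max(max_degree, degree)
    return histogram, max_degree


histogram, max_degree = child_degree_histogram(SENTENCES, "NOUN")
print(dict(histogram), "max degree:", max_degree)

# Consistency check on the figures reported above.
assert 81 + 266 + 272 + 169 == 788
```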
Children of NOUN nodes are attached using 24 different relations: det (628; 43% instances), case (209; 14% instances), nmod (177; 12% instances), punct (107; 7% instances), cc (57; 4% instances), amod (45; 3% instances), cop (34; 2% instances), nummod (34; 2% instances), conj (30; 2% instances), discourse (21; 1% instances), nsubj (17; 1% instances), acl:relcl (16; 1% instances), advmod (12; 1% instances), appos (12; 1% instances), parataxis (12; 1% instances), orphan (7; 0% instances), advcl (6; 0% instances), mark (6; 0% instances), compound:redup (5; 0% instances), obl (5; 0% instances), dislocated (2; 0% instances), acl (1; 0% instances), aux (1; 0% instances), reparandum (1; 0% instances)
Children of NOUN nodes belong to 16 different parts of speech: DET (634; 44% instances), ADP (188; 13% instances), PRON (139; 10% instances), PUNCT (107; 7% instances), NOUN (65; 4% instances), CCONJ (55; 4% instances), ADJ (45; 3% instances), ADV (42; 3% instances), NUM (37; 3% instances), VERB (37; 3% instances), AUX (35; 2% instances), PROPN (31; 2% instances), INTJ (20; 1% instances), SCONJ (6; 0% instances), PART (3; 0% instances), X (1; 0% instances)