Treebank Statistics: UD_Greek-GLCII: POS Tags: NUM
There are 39 NUM lemmas (2%), 43 NUM types (1%) and 80 NUM tokens (1%).
Out of 16 observed tags, the rank of NUM is: 7 in number of lemmas, 8 in number of types and 14 in number of tokens.
The 10 most frequent NUM lemmas: ένας, δύο, τρεις, πέντε, 3, δέκα, ενας, σαράντα, 4, 5
The 10 most frequent NUM types: δύο, ένα, πέντε, μια, 3, δέκα, δυο, μία, σαράντα, τρία
The 10 most frequent ambiguous lemmas: ένας (DET 93, NUM 12), ενας (DET 16, NUM 3)
The 10 most frequent ambiguous types: ένα (DET 38, NUM 4), μια (DET 39, NUM 3), μία (DET 7, NUM 2), ενα (DET 10, NUM 2)
- ένα
- μια
- μία
- ενα
Morphology
The form / lemma ratio of NUM is 1.102564 (the average of all parts of speech is 1.387814).
The 1st highest number of forms (3) was observed with the lemma “ένας”: ένα, μία, μια.
The 2nd highest number of forms (2) was observed with the lemma “δύο”: δυο, δύο.
The 3rd highest number of forms (2) was observed with the lemma “ενας”: ενα, ενασ.
NUM occurs with 5 features: NumType (74; 93% instances), Case (41; 51% instances), Gender (41; 51% instances), Number (41; 51% instances), Foreign (1; 1% instances)
NUM occurs with 10 feature-value pairs: Case=Acc, Case=Nom, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, NumType=Sets, Number=Plur, Number=Sing
NUM occurs with 13 feature combinations.
The most frequent feature combination is NumType=Card (33 tokens).
Examples: δύο, πέντε, 3, σαράντα, ένα, 4, 5, μια, δέκα, δυο
Relations
NUM nodes are attached to their parents using 10 different relations: nummod (42; 53% instances), obl (11; 14% instances), discourse (7; 9% instances), root (7; 9% instances), compound (6; 8% instances), conj (3; 4% instances), nsubj (1; 1% instances), obj (1; 1% instances), parataxis (1; 1% instances), xcomp (1; 1% instances)
Parents of NUM nodes belong to 7 different parts of speech: NOUN (42; 53% instances), VERB (20; 25% instances), NUM (7; 9% instances), (7; 9% instances), ADJ (2; 3% instances), DET (1; 1% instances), PROPN (1; 1% instances)
39 (49%) NUM nodes are leaves.
19 (24%) NUM nodes have one child.
10 (13%) NUM nodes have two children.
12 (15%) NUM nodes have three or more children.
The highest child degree of a NUM node is 5.
Children of NUM nodes are attached using 12 different relations: punct (20; 24% instances), det (12; 14% instances), obl (11; 13% instances), case (9; 11% instances), cop (8; 10% instances), advmod (6; 7% instances), compound (5; 6% instances), conj (5; 6% instances), nsubj (3; 4% instances), cc (2; 2% instances), csubj (2; 2% instances), nmod (1; 1% instances)
Children of NUM nodes belong to 11 different parts of speech: PUNCT (20; 24% instances), NOUN (13; 15% instances), DET (12; 14% instances), ADP (9; 11% instances), AUX (8; 10% instances), NUM (7; 8% instances), ADV (6; 7% instances), ADJ (3; 4% instances), VERB (3; 4% instances), CCONJ (2; 2% instances), PROPN (1; 1% instances)