Treebank Statistics: UD_Greek-GUD: POS Tags: NUM
There are 25 NUM lemmas (1%), 33 NUM types (1%) and 127 NUM tokens (0%).
Out of 17 observed tags, the rank of NUM is: 9 in number of lemmas, 10 in number of types and 14 in number of tokens.
The 10 most frequent NUM lemmas: δύο, ένας, τρεις, δέκα, πενήντα, τέσσερις, 16, εννέα, πέντε, 14/5
The 10 most frequent NUM types: δύο, δυο, τρεις, ένα, ένας, δέκα, μία, πενήντα, μια, 16
The 10 most frequent ambiguous lemmas: ένας (DET 224, NUM 17)
The 10 most frequent ambiguous types: ένα (DET 71, NUM 4), ένας (DET 10, NUM 2), μία (DET 7, NUM 4), μια (DET 99, NUM 2), έναν (DET 23, NUM 1)
Morphology
The form / lemma ratio of NUM is 1.320000 (the average of all parts of speech is 1.675929).
The highest number of forms (5) was observed with the lemma “ένας”: ένα, έναν, ένας, μία, μια.
The second-highest number of forms (3) was observed with the lemma “δύο”: δυο, δύο, ντυο.
The third-highest number of forms (3, tied with the second) was observed with the lemma “τρεις”: τρία, τρεις, τριών.
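The per-lemma form counts and the form / lemma ratio can be reproduced with a short sketch. It assumes the ratio is the number of distinct forms divided by the number of lemmas, which is consistent with 33 types / 25 lemmas = 1.32. The function names are illustrative and not part of any UD tool, and the sample contains only the three lemmas listed above.

```python
from collections import defaultdict


def forms_per_lemma(pairs):
    """Group distinct word forms under their lemma."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms


def form_lemma_ratio(pairs):
    """Distinct (lemma, form) combinations divided by the number of lemmas."""
    forms = forms_per_lemma(pairs)
    return sum(len(v) for v in forms.values()) / len(forms)


# (form, lemma) pairs for the three lemmas with the most forms, as listed above.
sample = [
    ("ένα", "ένας"), ("έναν", "ένας"), ("ένας", "ένας"),
    ("μία", "ένας"), ("μια", "ένας"),
    ("δυο", "δύο"), ("δύο", "δύο"), ("ντυο", "δύο"),
    ("τρία", "τρεις"), ("τρεις", "τρεις"), ("τριών", "τρεις"),
]

counts = {lemma: len(v) for lemma, v in forms_per_lemma(sample).items()}
print(counts)                    # forms per lemma in the sample
print(form_lemma_ratio(sample))  # ratio for this three-lemma subset only
print(33 / 25)                   # types / lemmas over all NUM tokens
```

The subset ratio is much higher than 1.32 because most NUM lemmas occur in a single form.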
NUM occurs with 4 features: NumType (127; 100% instances), Case (121; 95% instances), Number (121; 95% instances), Gender (120; 94% instances)
NUM occurs with 10 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, NumType=Frac, Number=Plur, Number=Sing
NUM occurs with 17 feature combinations.
The most frequent feature combination is Case=Acc|Gender=Neut|Number=Plur|NumType=Card (25 tokens).
Examples: δυο, δύο, δέκα, πέντε, τρακόσια, διακόσια, πενήντα, σαράντα, τέσσερα, τρία
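The feature, feature-value and feature-combination counts above can be derived from the FEATS column of a CoNLL-U file. The following is a minimal sketch: `parse_feats` is an illustrative helper, and the `num_feats` list is made-up sample input, not data from the treebank.

```python
from collections import Counter


def parse_feats(feats):
    """Parse a CoNLL-U FEATS string into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


# Hypothetical FEATS values for a handful of NUM tokens (not treebank data).
num_feats = [
    "Case=Acc|Gender=Neut|Number=Plur|NumType=Card",
    "Case=Acc|Gender=Neut|Number=Plur|NumType=Card",
    "Case=Nom|Gender=Masc|Number=Sing|NumType=Card",
    "NumType=Frac",
]

# Each full FEATS string is one feature combination.
combos = Counter(num_feats)
# How many tokens carry each feature name.
features = Counter(name for f in num_feats for name in parse_feats(f))
# Distinct feature-value pairs.
pairs = {f"{k}={v}" for f in num_feats for k, v in parse_feats(f).items()}

print(combos.most_common(1))
print(features)
print(sorted(pairs))
```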
Relations
NUM nodes are attached to their parents using 12 different relations: nummod (82; 65% instances), nsubj (12; 9% instances), obl (9; 7% instances), root (7; 6% instances), nmod (5; 4% instances), conj (3; 2% instances), compound (2; 2% instances), csubj (2; 2% instances), obj (2; 2% instances), ccomp (1; 1% instances), dislocated (1; 1% instances), xcomp (1; 1% instances)
Parents of NUM nodes belong to 9 different parts of speech, counting the empty tag of the artificial root: NOUN (79; 62% instances), VERB (25; 20% instances), [root, no tag] (7; 6% instances), ADJ (4; 3% instances), NUM (4; 3% instances), ADV (3; 2% instances), PROPN (3; 2% instances), DET (1; 1% instances), SYM (1; 1% instances)
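Relation and parent-POS tallies of this kind can be computed from the HEAD and DEPREL columns of a CoNLL-U file. The sketch below handles a single sentence. The `tally` function is illustrative, and the example sentence is invented for demonstration rather than taken from the treebank. A node whose HEAD is 0 attaches to the artificial root, which has no POS tag, so its parent is counted under an empty string.

```python
from collections import Counter

# Illustrative CoNLL-U sentence (hypothetical, not taken from the treebank).
CONLLU = "\n".join([
    "1\tΑγόρασα\tαγοράζω\tVERB\t_\t_\t0\troot\t_\t_",
    "2\tδύο\tδύο\tNUM\t_\tNumType=Card\t3\tnummod\t_\t_",
    "3\tβιβλία\tβιβλίο\tNOUN\t_\t_\t1\tobj\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_",
])


def tally(conllu, upos="NUM"):
    """Count incoming relations and parent POS tags for nodes with the given UPOS.

    Expects the lines of one sentence.
    """
    rows = [line.split("\t") for line in conllu.splitlines()
            if line and not line.startswith("#")]
    # Keep basic tokens only: skip multiword ranges (1-2) and empty nodes (1.1).
    rows = [r for r in rows if r[0].isdigit()]
    pos_by_id = {r[0]: r[3] for r in rows}
    relations, parents = Counter(), Counter()
    for r in rows:
        if r[3] == upos:
            relations[r[7]] += 1
            # HEAD 0 is the artificial root, which has no POS tag.
            parents[pos_by_id.get(r[6], "")] += 1
    return relations, parents


relations, parents = tally(CONLLU)
print(relations, parents)
```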
87 (69%) NUM nodes are leaves.
10 (8%) NUM nodes have one child.
17 (13%) NUM nodes have two children.
13 (10%) NUM nodes have three or more children.
The highest child degree of a NUM node is 6.
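The child-degree buckets above can be computed by counting how often each node id appears as a head. The sketch below uses an illustrative `degree_buckets` helper on a made-up sentence, then cross-checks that the four reported buckets add up to the 127 NUM tokens and reproduce the rounded percentages.

```python
from collections import Counter


def degree_buckets(heads, targets):
    """Bucket target nodes by number of children.

    heads: dict mapping node id -> head id (0 for root), for one sentence.
    targets: ids of the nodes of interest (e.g. the NUM nodes).
    """
    n_children = Counter(heads.values())
    buckets = Counter()
    for node in targets:
        d = n_children[node]
        buckets["3+" if d >= 3 else str(d)] += 1
    return buckets


# Hypothetical sentence: node 2 has children 1 and 3; node 4 is a leaf.
heads = {1: 2, 2: 0, 3: 2, 4: 3}
print(degree_buckets(heads, targets=[2, 4]))

# Cross-check of the reported figures.
reported = {"0": 87, "1": 10, "2": 17, "3+": 13}
total = sum(reported.values())
percentages = {k: round(100 * v / total) for k, v in reported.items()}
print(total, percentages)
```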
Children of NUM nodes are attached using 19 different relations: det (28; 29% instances), case (13; 14% instances), punct (12; 13% instances), cc (8; 8% instances), cop (7; 7% instances), nmod (6; 6% instances), conj (5; 5% instances), mark (3; 3% instances), compound (2; 2% instances), nsubj (2; 2% instances), nummod (2; 2% instances), acl:relcl (1; 1% instances), advcl (1; 1% instances), advmod (1; 1% instances), amod (1; 1% instances), appos (1; 1% instances), obl (1; 1% instances), orphan (1; 1% instances), vocative (1; 1% instances)
Children of NUM nodes belong to 13 different parts of speech: DET (28; 29% instances), ADP (12; 13% instances), PUNCT (12; 13% instances), NOUN (11; 11% instances), CCONJ (8; 8% instances), AUX (7; 7% instances), NUM (4; 4% instances), PROPN (3; 3% instances), SCONJ (3; 3% instances), VERB (3; 3% instances), ADJ (2; 2% instances), ADV (2; 2% instances), PRON (1; 1% instances)