home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Greek-GUD: POS Tags: NUM

There are 25 NUM lemmas (1%), 32 NUM types (1%) and 121 NUM tokens (0%). Out of 17 observed tags, the rank of NUM is: 10 in number of lemmas, 10 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: δύο, τρεις, ένας, δέκα, πενήντα, τέσσερις, 16, εννέα, πέντε, 14/5

The 10 most frequent NUM types: δύο, δυο, τρεις, ένα, δέκα, μία, πενήντα, 16, εννιά, μια

The 10 most frequent ambiguous lemmas: ένας (DET 227, NUM 11, SCONJ 1)

The 10 most frequent ambiguous types: ένα (DET 71, NUM 4), μία (DET 7, NUM 4), μια (DET 97, SCONJ 3, NUM 1), Ένας (DET 3, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.280000 (the average of all parts of speech is 1.660999).

The 1st highest number of forms (4) was observed with the lemma “ένας”: Ένας, ένα, μία, μια.

The 2nd highest number of forms (3) was observed with the lemma “δύο”: δυο, δύο, ντυο.

The 3rd highest number of forms (3) was observed with the lemma “τρεις”: τρία, τρεις, τριών.

NUM occurs with 4 features: NumType (117; 97% instances), Case (114; 94% instances), Number (114; 94% instances), Gender (113; 93% instances)

NUM occurs with 11 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, NumType=Frac, Number=Plur, Number=Sing

NUM occurs with 20 feature combinations. The most frequent feature combination is Case=Acc|Gender=Fem|Number=Plur|NumType=Card (24 tokens). Examples: δύο, δυο, τρεις, δέκα, έντεκα, οκτώ, πεντακόσιες

Relations

NUM nodes are attached to their parents using 12 different relations: nummod (31; 26% instances), nmod (28; 23% instances), compound (19; 16% instances), nsubj (15; 12% instances), obl (11; 9% instances), amod (5; 4% instances), conj (3; 2% instances), obj (3; 2% instances), root (2; 2% instances), xcomp (2; 2% instances), flat (1; 1% instances), orphan (1; 1% instances)

Parents of NUM nodes belong to 10 different parts of speech: NOUN (79; 65% instances), VERB (22; 18% instances), ADJ (4; 3% instances), AUX (4; 3% instances), ADV (3; 2% instances), PROPN (3; 2% instances), NUM (2; 2% instances), (2; 2% instances), DET (1; 1% instances), SYM (1; 1% instances)

84 (69%) NUM nodes are leaves.

14 (12%) NUM nodes have one child.

17 (14%) NUM nodes have two children.

6 (5%) NUM nodes have three or more children.

The highest child degree of a NUM node is 4.

Children of NUM nodes are attached using 13 different relations: det (27; 40% instances), case (11; 16% instances), cc (9; 13% instances), nmod (6; 9% instances), punct (4; 6% instances), conj (3; 4% instances), amod (2; 3% instances), acl:relcl (1; 1% instances), advmod (1; 1% instances), appos (1; 1% instances), nummod (1; 1% instances), obl (1; 1% instances), orphan (1; 1% instances)

Children of NUM nodes belong to 11 different parts of speech: DET (27; 40% instances), ADP (9; 13% instances), CCONJ (9; 13% instances), NOUN (5; 7% instances), PROPN (4; 6% instances), PUNCT (4; 6% instances), ADJ (3; 4% instances), ADV (3; 4% instances), NUM (2; 3% instances), PRON (1; 1% instances), VERB (1; 1% instances)