home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Greek-Cretan: POS Tags: NUM

There are 16 NUM lemmas (2%), 17 NUM types (1%) and 25 NUM tokens (1%). Out of 15 observed tags, the rank of NUM is: 8 in number of lemmas, 9 in number of types and 15 in number of tokens.

The 10 most frequent NUM lemmas: δύο, πέντε, σαράντα, τρεις, δεκαεφτά, διακόσιοι, δυο, είκοσι, εκατό, εκατόν

The 10 most frequent NUM types: δυο, πέντε, σαράντα, τρεις, Εκατό, δεκαεφτά, διακόσιους, δύο, είκοσι, εκατόν

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NUM is 1.062500 (the average of all parts of speech is 1.384100).

The 1st highest number of forms (2) was observed with the lemma “δύο”: δυο, δύο.

The 2nd highest number of forms (2) was observed with the lemma “τρεις”: τρία, τρεις.

The 3rd highest number of forms (1) was observed with the lemma “δεκαεφτά”: δεκαεφτά.

NUM occurs with 4 features: NumType (25; 100% instances), Number (23; 92% instances), Case (22; 88% instances), Gender (22; 88% instances)

NUM occurs with 8 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, Number=Plur

NUM occurs with 9 feature combinations. The most frequent feature combination is Case=Acc|Gender=Neut|Number=Plur|NumType=Card (8 tokens). Examples: πέντε, δεκαεφτά, δυο, είκοσι, οχτακόσα, τρία, χίλια

Relations

NUM nodes are attached to their parents using 6 different relations: nummod (15; 60% instances), compound (3; 12% instances), obl (3; 12% instances), conj (2; 8% instances), nmod (1; 4% instances), nsubj (1; 4% instances)

Parents of NUM nodes belong to 5 different parts of speech: NOUN (14; 56% instances), NUM (5; 20% instances), VERB (3; 12% instances), ADJ (2; 8% instances), DET (1; 4% instances)

15 (60%) NUM nodes are leaves.

4 (16%) NUM nodes have one child.

5 (20%) NUM nodes have two children.

1 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 3.

Children of NUM nodes are attached using 7 different relations: det (4; 24% instances), case (3; 18% instances), compound (3; 18% instances), cc (2; 12% instances), conj (2; 12% instances), punct (2; 12% instances), advmod (1; 6% instances)

Children of NUM nodes belong to 6 different parts of speech: NUM (5; 29% instances), DET (4; 24% instances), ADP (2; 12% instances), ADV (2; 12% instances), CCONJ (2; 12% instances), PUNCT (2; 12% instances)