
This page pertains to UD version 2.

Treebank Statistics: UD_Ancient_Greek-Perseus: POS Tags: NUM

There are 26 NUM lemmas (0%), 44 NUM types (0%) and 230 NUM tokens (0%). Out of 15 observed tags, the rank of NUM is: 10 in number of lemmas, 12 in number of types and 14 in number of tokens.
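
The three counts above can be reproduced from a CoNLL-U file with a few lines of Python. The following is only a minimal sketch: the `SAMPLE` fragment is a made-up illustration (not taken from the treebank), `pos_counts` is a hypothetical helper, and it assumes that a "type" is a distinct, case-sensitive word form and that multiword-token ranges and empty nodes are skipped.

```python
# Hypothetical CoNLL-U fragment for illustration only (10 tab-separated columns).
SAMPLE = """\
1\tδύο\tδύο\tNUM\t_\t_\t2\tnummod\t_\t_
2\tἄνδρες\tἀνήρ\tNOUN\t_\t_\t0\troot\t_\t_

1\tδύω\tδύο\tNUM\t_\t_\t2\tnummod\t_\t_
2\tνῆες\tναῦς\tNOUN\t_\t_\t0\troot\t_\t_
3\tδύο\tδύο\tADJ\t_\t_\t2\tamod\t_\t_
"""


def pos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens) for one UPOS tag.

    Assumption: types are distinct FORM strings, lemmas are distinct LEMMA strings.
    """
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip malformed lines, ranges such as 1-2, empty nodes such as 1.1
        if cols[3] == upos:
            lemmas.add(cols[2])
            types.add(cols[1])
            tokens += 1
    return len(lemmas), len(types), tokens


print(pos_counts(SAMPLE, "NUM"))  # (1, 2, 2)
```

In the sample, the form δύο tagged ADJ does not count towards NUM, which is the same kind of tag ambiguity reported in the ambiguous-lemma and ambiguous-type lists on this page.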

The 10 most frequent NUM lemmas: δύο, εἴκοσι, τρεῖς, τεσσαράκοντα, ἑκατόν, πέντε, ἐννέα, πεντήκοντα, τριάκοντα, δέκα

The 10 most frequent NUM types: δύω, δύο, ἑκατὸν, εἴκοσι, τεσσαράκοντα, ἐννέα, πεντήκοντα, δέκα, πέντε, τρεῖς

The 10 most frequent ambiguous lemmas: δύο (NUM 54, ADJ 40, NOUN 1), εἴκοσι (NUM 20, ADJ 17), τρεῖς (ADJ 21, NUM 16), τεσσαράκοντα (NUM 14, ADJ 6), ἑκατόν (NUM 13, ADJ 11), πέντε (NUM 12, ADJ 8), ἐννέα (NUM 12, ADJ 6), πεντήκοντα (ADJ 13, NUM 11), τριάκοντα (ADJ 17, NUM 11), δέκα (ADJ 16, NUM 10)

The 10 most frequent ambiguous types: δύω (NUM 22, VERB 3), δύο (ADJ 25, NUM 18), ἑκατὸν (NUM 13, ADJ 10, PRON 1), εἴκοσι (ADJ 17, NUM 12), τεσσαράκοντα (NUM 12, ADJ 3), ἐννέα (NUM 12, ADJ 4), πεντήκοντα (ADJ 13, NUM 11), δέκα (ADJ 16, NUM 10), πέντε (NUM 10, ADJ 8), τρεῖς (NUM 10, ADJ 7)

Morphology

The form / lemma ratio of NUM is 1.692308 (the average of all parts of speech is 3.010372).
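
The reported ratio is consistent with dividing the 44 NUM types by the 26 NUM lemmas given earlier on this page. A quick check, assuming that is how the figure is derived (`form_lemma_ratio` is a hypothetical helper name):

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct forms per lemma."""
    return n_types / n_lemmas


ratio = form_lemma_ratio(44, 26)
print(f"{ratio:.6f}")  # 1.692308
```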

The lemma “δύο” has the highest number of forms (4): δυοῖν, δύ̓, δύο, δύω.

The lemma “εἴκοσι” also has 4 forms: εἴκοσί, εἴκοσι, ἐείκοσι, ἐείκοσιν.

The lemma “τρεῖς” also has 4 forms: τρία, τρεῖς, τρισὶ, τριῶν.

NUM occurs with 3 features: Gender (5; 2% instances), Number (5; 2% instances), Case (4; 2% instances)

NUM occurs with 5 feature-value pairs: Case=Dat, Case=Nom, Gender=Masc, Gender=Neut, Number=Plur

NUM occurs with 4 feature combinations. The most frequent feature combination is _, i.e. no features (225 tokens). Examples: δύω, δύο, ἑκατὸν, εἴκοσι, τεσσαράκοντα, ἐννέα, πεντήκοντα, δέκα, πέντε, τρεῖς

Relations

NUM nodes are attached to their parents using 8 different relations: nummod (201; 87% instances), conj (12; 5% instances), nsubj (4; 2% instances), advcl (3; 1% instances), obj (3; 1% instances), obl (3; 1% instances), root (2; 1% instances), xcomp (2; 1% instances)
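
The percentages in this list follow from the raw counts, which sum to the 230 NUM tokens. A small sketch of that arithmetic, assuming the percentages are rounded to the nearest whole number (`percentages` is a hypothetical helper):

```python
# Counts copied from the relation list above.
relation_counts = {
    "nummod": 201, "conj": 12, "nsubj": 4, "advcl": 3,
    "obj": 3, "obl": 3, "root": 2, "xcomp": 2,
}


def percentages(counts):
    """Map each key to its share of the total, rounded to a whole percent."""
    total = sum(counts.values())
    return {key: round(100 * value / total) for key, value in counts.items()}


print(sum(relation_counts.values()))       # 230
print(percentages(relation_counts)["nummod"])  # 87
```

Because each share is rounded separately, the displayed percentages need not add up to exactly 100.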

Parents of NUM nodes belong to 9 different parts of speech: NOUN (184; 80% instances), VERB (13; 6% instances), ADJ (11; 5% instances), NUM (11; 5% instances), PRON (4; 2% instances), DET (3; 1% instances), no parent, for the root-attached nodes (2; 1% instances), ADP (1; 0% instances), ADV (1; 0% instances)

192 (83%) NUM nodes are leaves.

17 (7%) NUM nodes have one child.

13 (6%) NUM nodes have two children.

8 (3%) NUM nodes have three or more children.

The highest child degree of a NUM node is 8.
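
The child-degree figures above (192 + 17 + 13 + 8 = 230 nodes) amount to a histogram of how many dependents each NUM node has. A minimal sketch of how such a histogram could be computed for one tree, assuming each sentence is available as a list of (ID, HEAD, UPOS) tuples; the sample sentence and the helper name `child_degree_histogram` are hypothetical:

```python
from collections import Counter


def child_degree_histogram(sentence, upos):
    """Count nodes tagged `upos` by their number of children.

    sentence: list of (id, head, upos) tuples for a single tree.
    """
    # Number of children per node id, derived from the HEAD column.
    n_children = Counter(head for _, head, _ in sentence)
    # A Counter returns 0 for ids that never occur as a head (leaves).
    return Counter(n_children[node_id] for node_id, _, tag in sentence if tag == upos)


# Hypothetical tree: node 1 has one child, node 5 has two, node 6 is a leaf.
sentence = [
    (1, 2, "NUM"), (2, 0, "NOUN"), (3, 5, "CCONJ"),
    (4, 5, "ADV"), (5, 1, "NUM"), (6, 2, "NUM"),
]
hist = child_degree_histogram(sentence, "NUM")
print(dict(hist))
```

Summing such histograms over all sentences and taking the largest key would give the leaf count and the highest child degree.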

Children of NUM nodes are attached using 15 different relations: conj (17; 20% instances), cc (15; 18% instances), punct (14; 17% instances), advmod (10; 12% instances), cop (8; 10% instances), nmod (4; 5% instances), nsubj (3; 4% instances), appos (2; 2% instances), case (2; 2% instances), det (2; 2% instances), obl (2; 2% instances), advcl (1; 1% instances), amod (1; 1% instances), mark (1; 1% instances), nummod (1; 1% instances)

Children of NUM nodes belong to 13 different parts of speech: PUNCT (14; 17% instances), CCONJ (13; 16% instances), ADJ (12; 14% instances), NUM (11; 13% instances), AUX (8; 10% instances), ADV (6; 7% instances), PART (5; 6% instances), VERB (5; 6% instances), NOUN (3; 4% instances), ADP (2; 2% instances), DET (2; 2% instances), PRON (1; 1% instances), SCONJ (1; 1% instances)