This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

NUM: numeral

Definition

In Ancient Greek grammar, “numeral” is the part of speech reserved for cardinal and ordinal adjectives, as well as numeral adverbs such as ἅπαξ ‘once’. A list of them can be found in Smyth 1920: 102-106.

In accordance with the UD guidelines, only cardinal numbers are tagged as NUM, whether they are used as adjectives or as substantivized adjectives. Ordinal numbers are tagged as ADJ, and adverbial numerals are tagged as ADV.
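The tagging rule above can be sketched as a small lookup. The class labels (`cardinal`, `ordinal`, `adverbial`) and the function name `upos_for_numeral` are hypothetical names chosen for illustration; they are not part of any UD tool.

```python
# Hypothetical mapping from a traditional numeral class to its UD v1 POS tag,
# following the rule stated in the definition above.
NUMERAL_CLASS_TO_UPOS = {
    "cardinal": "NUM",   # e.g. δύο, including substantivized uses
    "ordinal": "ADJ",    # ordinals are tagged as adjectives
    "adverbial": "ADV",  # e.g. ἅπαξ 'once'
}


def upos_for_numeral(numeral_class: str) -> str:
    """Return the UD POS tag for a traditional numeral class."""
    try:
        return NUMERAL_CLASS_TO_UPOS[numeral_class]
    except KeyError:
        raise ValueError(f"unknown numeral class: {numeral_class!r}")
```

Deciding which class a given word belongs to is left to the annotator or to a lexicon; the sketch only encodes the final mapping.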

Examples

References

Smyth, Herbert Weir. 1920. A Greek Grammar for Colleges. New York: American Book Company (Perseus Digital Library; Internet Archive).


Treebank Statistics (UD_Ancient_Greek)

There are 26 NUM lemmas (0%), 53 NUM types (0%) and 277 NUM tokens (0%). Out of 13 observed tags, the rank of NUM is: 9 in number of lemmas, 10 in number of types and 11 in number of tokens.
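One way counts of this kind could be reproduced from a CoNLL-U file is sketched below. It assumes that a "type" is a distinct word form and that each token line has the ten tab-separated CoNLL-U columns. The function name `count_upos` and the toy sample are invented for illustration; they are not taken from the treebank or from the tool that generated this page.

```python
def count_upos(conllu_text: str, upos: str = "NUM"):
    """Return (lemmas, types, tokens) for one POS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank lines separate sentences; '#' starts a comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and malformed lines
        if cols[3] == upos:       # column 4 holds the universal POS tag
            tokens += 1
            types.add(cols[1])    # column 2: word form
            lemmas.add(cols[2])   # column 3: lemma
    return len(lemmas), len(types), tokens


# Invented toy data, not real treebank sentences.
SAMPLE = "\n".join([
    "# toy sentence 1",
    "1\tδύο\tδύο\tNUM\t_\t_\t2\tamod\t_\t_",
    "2\tἄνδρες\tἀνήρ\tNOUN\t_\t_\t0\troot\t_\t_",
    "",
    "# toy sentence 2",
    "1\tδύω\tδύο\tNUM\t_\t_\t2\tamod\t_\t_",
    "2\tἵπποι\tἵππος\tNOUN\t_\t_\t0\troot\t_\t_",
    "",
    "# toy sentence 3",
    "1\tδύο\tδύο\tNUM\t_\t_\t2\tamod\t_\t_",
    "2\tνῆες\tναῦς\tNOUN\t_\t_\t0\troot\t_\t_",
])

print(count_upos(SAMPLE))  # (1, 2, 3): one lemma, two forms, three tokens
```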

The 10 most frequent NUM lemmas: δύο, εἴκοσι, τρεῖς, δέκα, πέντε, πεντήκοντα, ἐννέα, ἑκατόν, ἕξ, δώδεκα

The 10 most frequent NUM types: δύο, δύω, πεντήκοντα, ἐννέα, δέκα, εἴκοσι, πέντε, ἓξ, δυοῖν, τρεῖς

The 10 most frequent ambiguous lemmas: δύο (NUM 71, ADJ 50, NOUN 1), εἴκοσι (NUM 24, ADJ 21), τρεῖς (ADJ 26, NUM 16), δέκα (ADJ 18, NUM 14), πέντε (ADJ 16, NUM 14), πεντήκοντα (ADJ 20, NUM 14), ἐννέα (NUM 14, ADJ 2), ἑκατόν (ADJ 20, NUM 13), ἕξ (NUM 12, ADJ 4), δώδεκα (NUM 10, ADJ 4)

The 10 most frequent ambiguous types: δύο (NUM 32, ADJ 28), πεντήκοντα (ADJ 20, NUM 14), ἐννέα (NUM 14, ADJ 1), δέκα (ADJ 18, NUM 12), εἴκοσι (ADJ 21, NUM 12), πέντε (ADJ 16, NUM 12), ἓξ (NUM 12, ADJ 4), δυοῖν (NUM 11, ADJ 6), τρεῖς (ADJ 11, NUM 10), ἑκατὸν (ADJ 17, NUM 10, PRON 1)

Morphology

The form / lemma ratio of NUM is 2.038462 (the average of all parts of speech is 3.041201).
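The reported ratio is consistent with dividing the number of NUM types (53) by the number of NUM lemmas (26) given above. A minimal check, with the hypothetical helper name `form_lemma_ratio`:

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


ratio = form_lemma_ratio(53, 26)
print(f"{ratio:.6f}")  # 2.038462
```

The same division reproduces the PROIEL figure reported further down: 183 types over 80 lemmas gives 2.287500.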

The 1st highest number of forms (5) was observed with the lemma “εἴκοσι”: εἴκοσ̓, εἴκοσί, εἴκοσι, ἐείκοσι, ἐείκοσιν.

The 2nd highest number of forms (5) was observed with the lemma “εἷς”: Μιᾶς, εἷς, ἑνὸς, ἓν, ἕνα.

The 3rd highest number of forms (5) was observed with the lemma “τρεῖς”: τρί̓, τρία, τρεῖς, τρισὶ, τριῶν.

NUM occurs with 3 features: Gender (14; 5% instances), Number (14; 5% instances), Case (13; 5% instances)

NUM occurs with 9 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

NUM occurs with 10 feature combinations. The most frequent feature combination is _ (263 tokens). Examples: δύο, δύω, πεντήκοντα, ἐννέα, δέκα, εἴκοσι, πέντε, ἓξ, δυοῖν, τρεῖς

Relations

NUM nodes are attached to their parents using 9 different relations: amod (224; 81% instances), conj (21; 8% instances), iobj (11; 4% instances), advmod (7; 3% instances), nsubj (7; 3% instances), dobj (3; 1% instances), root (2; 1% instances), nsubjpass (1; 0% instances), xcomp (1; 0% instances)

Parents of NUM nodes belong to 8 different parts of speech: NOUN (206; 74% instances), VERB (31; 11% instances), NUM (17; 6% instances), ADJ (11; 4% instances), PRON (6; 2% instances), DET (3; 1% instances), ROOT (2; 1% instances), ADP (1; 0% instances)

215 (78%) NUM nodes are leaves.

30 (11%) NUM nodes have one child.

21 (8%) NUM nodes have two children.

11 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 14.
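The leaf and child-degree figures above could be derived by counting, for every NUM node, how many tokens name it as their head. The sketch below assumes each sentence has already been reduced to `(id, upos, head)` triples; the function name `child_degree_histogram` and the toy sentences are invented for illustration.

```python
from collections import Counter


def child_degree_histogram(sentences, upos="NUM"):
    """Map child count -> number of nodes with that POS tag.

    `sentences` is a list of sentences, each a list of (id, upos, head)
    triples, where head 0 denotes the artificial root.
    """
    hist = Counter()
    for sent in sentences:
        # How many tokens point at each id as their head.
        n_children = Counter(head for _, _, head in sent)
        for tok_id, tag, _ in sent:
            if tag == upos:
                hist[n_children[tok_id]] += 1
    return hist


# Invented toy data: one NUM leaf, then a NUM root with two children.
toy = [
    [(1, "NUM", 2), (2, "NOUN", 0)],
    [(1, "NUM", 0), (2, "CONJ", 1), (3, "NUM", 1)],
]
hist = child_degree_histogram(toy)
print(hist[0], hist[2], max(hist))  # 2 1 2
```

Here `hist[0]` is the number of leaves and `max(hist)` is the highest child degree.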

Children of NUM nodes are attached using 12 different relations: cc (22; 17% instances), conj (20; 16% instances), punct (19; 15% instances), advmod (15; 12% instances), amod (9; 7% instances), case (9; 7% instances), nmod (9; 7% instances), cop (6; 5% instances), det (6; 5% instances), nsubj (5; 4% instances), xcomp (5; 4% instances), mark (1; 1% instances)

Children of NUM nodes belong to 11 different parts of speech: CONJ (19; 15% instances), PUNCT (19; 15% instances), ADJ (17; 13% instances), NUM (17; 13% instances), ADV (15; 12% instances), ADP (10; 8% instances), VERB (9; 7% instances), NOUN (7; 6% instances), DET (6; 5% instances), PRON (6; 5% instances), SCONJ (1; 1% instances)


Treebank Statistics (UD_Ancient_Greek-PROIEL)

There are 80 NUM lemmas (1%), 183 NUM types (1%) and 1485 NUM tokens (1%). Out of 13 observed tags, the rank of NUM is: 6 in number of lemmas, 7 in number of types and 11 in number of tokens.

The 10 most frequent NUM lemmas: εἷς, δύο, ἑπτά, τρεῖς, δώδεκα, δέκα, πέντε, τέσσαρες, ἑκατόν, τριάκοντα

The 10 most frequent NUM types: δύο, εἷς, ἑπτὰ, δώδεκα, ἓν, δέκα, πέντε, ἕνα, τρεῖς, μίαν

The 10 most frequent ambiguous lemmas: διακόσιοι (NUM 22, ADJ 2), τετρακόσιοι (NUM 14, ADJ 2), δέκατος (ADJ 6, NUM 1), χίλιος (ADJ 1, NUM 1), ἑκατοστός (ADJ 1, NUM 1)

The 10 most frequent ambiguous types: εἷς (NUM 101, ADP 2), τετρακόσιοι (NUM 2, ADJ 1), εἶς (VERB 6, NUM 1), τετρακοσίας (NUM 1, ADJ 1)

Morphology

The form / lemma ratio of NUM is 2.287500 (the average of all parts of speech is 3.412038).

The 1st highest number of forms (16) was observed with the lemma “εἷς”: εἶς, εἷς, μία, μίαν, μιᾶς, μιᾷ, μιῆς, μιῇ, ἐνὶ, ἑνί, ἑνός, ἑνὶ, ἑνὸς, ἓν, ἕν, ἕνα.

The 2nd highest number of forms (11) was observed with the lemma “τέσσαρες”: τέσσαρα, τέσσαρας, τέσσαρες, τέσσαρσιν, τέσσερα, τέσσερας, τέσσερες, τέσσερσι, τέτορες, τεσσάρων, τεσσέρων.

The 3rd highest number of forms (10) was observed with the lemma “διακόσιοι”: διακοσίους, διακοσίων, διηκοσίας, διηκοσίων, διηκοσιέων, διηκόσια, διηκόσιαί, διηκόσιαι, διηκόσιοί, διηκόσιοι.

NUM occurs with 3 features: Case (740; 50% instances), Number (740; 50% instances), Gender (711; 48% instances)

NUM occurs with 11 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Plur, Number=Sing

NUM occurs with 33 feature combinations. The most frequent feature combination is _ (745 tokens). Examples: δύο, ἑπτὰ, δώδεκα, δέκα, πέντε, τεσσεράκοντα, εἴκοσι, ἑκατὸν, τριήκοντα, πεντήκοντα

Relations

NUM nodes are attached to their parents using 15 different relations: nummod (919; 62% instances), nsubj (111; 7% instances), conj (98; 7% instances), nmod (75; 5% instances), dobj (72; 5% instances), remnant (47; 3% instances), iobj (46; 3% instances), root (38; 3% instances), appos (24; 2% instances), nsubjpass (16; 1% instances), xcomp (16; 1% instances), advcl (14; 1% instances), advmod (4; 0% instances), ccomp (4; 0% instances), csubjpass (1; 0% instances)

Parents of NUM nodes belong to 11 different parts of speech: NOUN (916; 62% instances), VERB (293; 20% instances), NUM (121; 8% instances), ADJ (55; 4% instances), ROOT (38; 3% instances), PRON (23; 2% instances), PROPN (17; 1% instances), ADV (13; 1% instances), SCONJ (4; 0% instances), ADP (3; 0% instances), DET (2; 0% instances)

974 (66%) NUM nodes are leaves.

260 (18%) NUM nodes have one child.

156 (11%) NUM nodes have two children.

95 (6%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 17 different relations: nmod (194; 21% instances), cc (129; 14% instances), det (125; 13% instances), conj (120; 13% instances), case (82; 9% instances), cop (53; 6% instances), nsubj (47; 5% instances), advmod (40; 4% instances), discourse (36; 4% instances), appos (30; 3% instances), remnant (22; 2% instances), acl (20; 2% instances), amod (11; 1% instances), neg (10; 1% instances), advcl (7; 1% instances), mark (7; 1% instances), nummod (3; 0% instances)

Children of NUM nodes belong to 11 different parts of speech: NOUN (136; 15% instances), CONJ (129; 14% instances), DET (125; 13% instances), NUM (121; 13% instances), VERB (96; 10% instances), ADV (86; 9% instances), ADP (84; 9% instances), ADJ (73; 8% instances), PRON (55; 6% instances), PROPN (24; 3% instances), SCONJ (7; 1% instances)

