NUM
: numeral
Definition
In Ancient Greek grammar “numeral” is the PoS reserved for cardinal and ordinal adjectives, as well as adverbs such as ἅπαξ ‘once’. A list for them can be found in Smyth 1920: 102-106.
In accordance with the UD guidelines, only cardinal numbers are tagged as NUM, whether they are adjective or substantivized adjectives. Ordinal numbers are, following the UD guidelines, tagged as adjectives, while adverb numerals receive the PoS ADV.
Examples
- τρεῖς, τρία “three”
- πεντεκαίδεκα “fifteen”
- ὀκτακόσιοι “eighty”
References
Smyth, Herbert Weir. 1920. A Greek Grammar for Colleges. New York: American Book Company (Perseus Digital Library; Internet Archive).
Treebank Statistics (UD_Ancient_Greek)
There are 26 NUM
lemmas (0%), 53 NUM
types (0%) and 277 NUM
tokens (0%).
Out of 13 observed tags, the rank of NUM
is: 9 in number of lemmas, 10 in number of types and 11 in number of tokens.
The 10 most frequent NUM
lemmas: δύο, εἴκοσι, τρεῖς, δέκα, πέντε, πεντήκοντα, ἐννέα, ἑκατόν, ἕξ, δώδεκα
The 10 most frequent NUM
types: δύο, δύω, πεντήκοντα, ἐννέα, δέκα, εἴκοσι, πέντε, ἓξ, δυοῖν, τρεῖς
The 10 most frequent ambiguous lemmas: δύο (NUM 71, ADJ 50, NOUN 1), εἴκοσι (NUM 24, ADJ 21), τρεῖς (ADJ 26, NUM 16), δέκα (ADJ 18, NUM 14), πέντε (ADJ 16, NUM 14), πεντήκοντα (ADJ 20, NUM 14), ἐννέα (NUM 14, ADJ 2), ἑκατόν (ADJ 20, NUM 13), ἕξ (NUM 12, ADJ 4), δώδεκα (NUM 10, ADJ 4)
The 10 most frequent ambiguous types: δύο (NUM 32, ADJ 28), πεντήκοντα (ADJ 20, NUM 14), ἐννέα (NUM 14, ADJ 1), δέκα (ADJ 18, NUM 12), εἴκοσι (ADJ 21, NUM 12), πέντε (ADJ 16, NUM 12), ἓξ (NUM 12, ADJ 4), δυοῖν (NUM 11, ADJ 6), τρεῖς (ADJ 11, NUM 10), ἑκατὸν (ADJ 17, NUM 10, PRON 1)
- δύο
- πεντήκοντα
- ἐννέα
- δέκα
- εἴκοσι
- πέντε
- ἓξ
- δυοῖν
- τρεῖς
- ἑκατὸν
Morphology
The form / lemma ratio of NUM
is 2.038462 (the average of all parts of speech is 3.041201).
The 1st highest number of forms (5) was observed with the lemma “εἴκοσι”: εἴκοσ̓, εἴκοσί, εἴκοσι, ἐείκοσι, ἐείκοσιν.
The 2nd highest number of forms (5) was observed with the lemma “εἷς”: Μιᾶς, εἷς, ἑνὸς, ἓν, ἕνα.
The 3rd highest number of forms (5) was observed with the lemma “τρεῖς”: τρί̓, τρία, τρεῖς, τρισὶ, τριῶν.
NUM
occurs with 3 features: Gender (14; 5% instances), Number (14; 5% instances), Case (13; 5% instances)
NUM
occurs with 9 feature-value pairs: Case=Acc
, Case=Dat
, Case=Gen
, Case=Nom
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Plur
, Number=Sing
NUM
occurs with 10 feature combinations.
The most frequent feature combination is _
(263 tokens).
Examples: δύο, δύω, πεντήκοντα, ἐννέα, δέκα, εἴκοσι, πέντε, ἓξ, δυοῖν, τρεῖς
Relations
NUM
nodes are attached to their parents using 9 different relations: amod (224; 81% instances), conj (21; 8% instances), iobj (11; 4% instances), advmod (7; 3% instances), nsubj (7; 3% instances), dobj (3; 1% instances), root (2; 1% instances), nsubjpass (1; 0% instances), xcomp (1; 0% instances)
Parents of NUM
nodes belong to 8 different parts of speech: NOUN (206; 74% instances), VERB (31; 11% instances), NUM (17; 6% instances), ADJ (11; 4% instances), PRON (6; 2% instances), DET (3; 1% instances), ROOT (2; 1% instances), ADP (1; 0% instances)
215 (78%) NUM
nodes are leaves.
30 (11%) NUM
nodes have one child.
21 (8%) NUM
nodes have two children.
11 (4%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 14.
Children of NUM
nodes are attached using 12 different relations: cc (22; 17% instances), conj (20; 16% instances), punct (19; 15% instances), advmod (15; 12% instances), amod (9; 7% instances), case (9; 7% instances), nmod (9; 7% instances), cop (6; 5% instances), det (6; 5% instances), nsubj (5; 4% instances), xcomp (5; 4% instances), mark (1; 1% instances)
Children of NUM
nodes belong to 11 different parts of speech: CONJ (19; 15% instances), PUNCT (19; 15% instances), ADJ (17; 13% instances), NUM (17; 13% instances), ADV (15; 12% instances), ADP (10; 8% instances), VERB (9; 7% instances), NOUN (7; 6% instances), DET (6; 5% instances), PRON (6; 5% instances), SCONJ (1; 1% instances)
Treebank Statistics (UD_Ancient_Greek-PROIEL)
There are 80 NUM
lemmas (1%), 183 NUM
types (1%) and 1485 NUM
tokens (1%).
Out of 13 observed tags, the rank of NUM
is: 6 in number of lemmas, 7 in number of types and 11 in number of tokens.
The 10 most frequent NUM
lemmas: εἷς, δύο, ἑπτά, τρεῖς, δώδεκα, δέκα, πέντε, τέσσαρες, ἑκατόν, τριάκοντα
The 10 most frequent NUM
types: δύο, εἷς, ἑπτὰ, δώδεκα, ἓν, δέκα, πέντε, ἕνα, τρεῖς, μίαν
The 10 most frequent ambiguous lemmas: διακόσιοι (NUM 22, ADJ 2), τετρακόσιοι (NUM 14, ADJ 2), δέκατος (ADJ 6, NUM 1), χίλιος (ADJ 1, NUM 1), ἑκατοστός (ADJ 1, NUM 1)
The 10 most frequent ambiguous types: εἷς (NUM 101, ADP 2), τετρακόσιοι (NUM 2, ADJ 1), εἶς (VERB 6, NUM 1), τετρακοσίας (NUM 1, ADJ 1)
- εἷς
- τετρακόσιοι
- εἶς
- τετρακοσίας
- NUM 1: ξεῖνόν τέ σε ποιεῦμαι ἐμὸν καὶ τὰς τετρακοσίας μυριάδας τοι τῶν στατήρων ἀποπλήσω παρ’ ἐμεωυτοῦ δοὺς τὰς ἑπτὰ χιλιάδας ἵνα μή τοι ἐπιδεέες ἔωσι αἱ τετρακόσιαι μυριάδες ἑπτὰ χιλιάδων ἀλλὰ ᾖ τοι ἀπαρτιλογίη ὑπ’ ἐμέο πεπληρωμένη
- ADJ 1: ἐπείτε γὰρ τάχιστά σε ἐπυθόμην ἐπὶ θάλασσαν καταβαίνοντα τὴν Ἑλληνίδα βουλόμενός τοι δοῦναι ἐς τὸν πόλεμον χρήματα ἐξεμάνθανον καὶ εὗρον λογιζόμενος ἀργυρίου μὲν δύο χιλιάδας ἐούσας μοι ταλάντων χρυσίου δὲ τετρακοσίας μυριάδας στατήρων Δαρεικῶν ἐπιδεούσας ἑπτὰ χιλιάδων
Morphology
The form / lemma ratio of NUM
is 2.287500 (the average of all parts of speech is 3.412038).
The 1st highest number of forms (16) was observed with the lemma “εἷς”: εἶς, εἷς, μία, μίαν, μιᾶς, μιᾷ, μιῆς, μιῇ, ἐνὶ, ἑνί, ἑνός, ἑνὶ, ἑνὸς, ἓν, ἕν, ἕνα.
The 2nd highest number of forms (11) was observed with the lemma “τέσσαρες”: τέσσαρα, τέσσαρας, τέσσαρες, τέσσαρσιν, τέσσερα, τέσσερας, τέσσερες, τέσσερσι, τέτορες, τεσσάρων, τεσσέρων.
The 3rd highest number of forms (10) was observed with the lemma “διακόσιοι”: διακοσίους, διακοσίων, διηκοσίας, διηκοσίων, διηκοσιέων, διηκόσια, διηκόσιαί, διηκόσιαι, διηκόσιοί, διηκόσιοι.
NUM
occurs with 3 features: Case (740; 50% instances), Number (740; 50% instances), Gender (711; 48% instances)
NUM
occurs with 11 feature-value pairs: Case=Acc
, Case=Dat
, Case=Gen
, Case=Nom
, Gender=Fem
, Gender=Fem,Masc
, Gender=Masc
, Gender=Masc,Neut
, Gender=Neut
, Number=Plur
, Number=Sing
NUM
occurs with 33 feature combinations.
The most frequent feature combination is _
(745 tokens).
Examples: δύο, ἑπτὰ, δώδεκα, δέκα, πέντε, τεσσεράκοντα, εἴκοσι, ἑκατὸν, τριήκοντα, πεντήκοντα
Relations
NUM
nodes are attached to their parents using 15 different relations: nummod (919; 62% instances), nsubj (111; 7% instances), conj (98; 7% instances), nmod (75; 5% instances), dobj (72; 5% instances), remnant (47; 3% instances), iobj (46; 3% instances), root (38; 3% instances), appos (24; 2% instances), nsubjpass (16; 1% instances), xcomp (16; 1% instances), advcl (14; 1% instances), advmod (4; 0% instances), ccomp (4; 0% instances), csubjpass (1; 0% instances)
Parents of NUM
nodes belong to 11 different parts of speech: NOUN (916; 62% instances), VERB (293; 20% instances), NUM (121; 8% instances), ADJ (55; 4% instances), ROOT (38; 3% instances), PRON (23; 2% instances), PROPN (17; 1% instances), ADV (13; 1% instances), SCONJ (4; 0% instances), ADP (3; 0% instances), DET (2; 0% instances)
974 (66%) NUM
nodes are leaves.
260 (18%) NUM
nodes have one child.
156 (11%) NUM
nodes have two children.
95 (6%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 7.
Children of NUM
nodes are attached using 17 different relations: nmod (194; 21% instances), cc (129; 14% instances), det (125; 13% instances), conj (120; 13% instances), case (82; 9% instances), cop (53; 6% instances), nsubj (47; 5% instances), advmod (40; 4% instances), discourse (36; 4% instances), appos (30; 3% instances), remnant (22; 2% instances), acl (20; 2% instances), amod (11; 1% instances), neg (10; 1% instances), advcl (7; 1% instances), mark (7; 1% instances), nummod (3; 0% instances)
Children of NUM
nodes belong to 11 different parts of speech: NOUN (136; 15% instances), CONJ (129; 14% instances), DET (125; 13% instances), NUM (121; 13% instances), VERB (96; 10% instances), ADV (86; 9% instances), ADP (84; 9% instances), ADJ (73; 8% instances), PRON (55; 6% instances), PROPN (24; 3% instances), SCONJ (7; 1% instances)
NUM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]