NUM

This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

home grc/pos issue tracker

`NUM`: numeral

Definition

In Ancient Greek grammar “numeral” is the PoS reserved for cardinal and ordinal adjectives, as well as adverbs such as ἅπαξ ‘once’. A list for them can be found in Smyth 1920: 102-106.

In accordance with the UD guidelines, only cardinal numbers are tagged as NUM, whether they are adjective or substantivized adjectives. Ordinal numbers are, following the UD guidelines, tagged as adjectives, while adverb numerals receive the PoS ADV.

Examples

τρεῖς, τρία “three”
πεντεκαίδεκα “fifteen”
ὀκτακόσιοι “eighty”

References

Smyth, Herbert Weir. 1920. A Greek Grammar for Colleges. New York: American Book Company (Perseus Digital Library; Internet Archive).

Treebank Statistics (UD_Ancient_Greek)

There are 26 NUM lemmas (0%), 53 NUM types (0%) and 277 NUM tokens (0%). Out of 13 observed tags, the rank of NUM is: 9 in number of lemmas, 10 in number of types and 11 in number of tokens.

The 10 most frequent NUM lemmas: δύο, εἴκοσι, τρεῖς, δέκα, πέντε, πεντήκοντα, ἐννέα, ἑκατόν, ἕξ, δώδεκα

The 10 most frequent NUM types: δύο, δύω, πεντήκοντα, ἐννέα, δέκα, εἴκοσι, πέντε, ἓξ, δυοῖν, τρεῖς

The 10 most frequent ambiguous lemmas: δύο (NUM 71, ADJ 50, NOUN 1), εἴκοσι (NUM 24, ADJ 21), τρεῖς (ADJ 26, NUM 16), δέκα (ADJ 18, NUM 14), πέντε (ADJ 16, NUM 14), πεντήκοντα (ADJ 20, NUM 14), ἐννέα (NUM 14, ADJ 2), ἑκατόν (ADJ 20, NUM 13), ἕξ (NUM 12, ADJ 4), δώδεκα (NUM 10, ADJ 4)

The 10 most frequent ambiguous types: δύο (NUM 32, ADJ 28), πεντήκοντα (ADJ 20, NUM 14), ἐννέα (NUM 14, ADJ 1), δέκα (ADJ 18, NUM 12), εἴκοσι (ADJ 21, NUM 12), πέντε (ADJ 16, NUM 12), ἓξ (NUM 12, ADJ 4), δυοῖν (NUM 11, ADJ 6), τρεῖς (ADJ 11, NUM 10), ἑκατὸν (ADJ 17, NUM 10, PRON 1)

δύο
- NUM 32: Ἄνδρες δύο ἐμάχοντο τίνες τῶν θεῶν μείζους , Θησεὺς ἢ Ἡρακλῆς .
- ADJ 28: δύο γὰρ ἅμαξαι ἐναντίαι ἀλλήλαις τοὺς λίθους ἐπῆγον .
πεντήκοντα
- ADJ 20: οὗτοι μὲν οὖν μετ̓ ἀλλήλων καλῶς πολιτευόμενοι διετέλεσαν ἔτη πεντήκοντα καὶ ὀκτώ ·
- NUM 14: αὗται μὲν Νηρῆος ἀμύμονος ἐξεγένοντο κοῦραι πεντήκοντα , ἀμύμονα ἔργα ἰδυῖαι .
ἐννέα
- NUM 14: ἐννέα νῆας στεῖλα , θοῶς δ̓ ἐσαγείρατο λαός .
- ADJ 1: τότε δὲ τὰ πολλὰ τῶν πολιτικῶν οἱ ἐννέα ἄρχοντες ἔπρασσον .
δέκα
- ADJ 18: ἀλλ̓ ἐξαπέστειλεν ἄνδρας δέκα τοὺς ἐπισκεψομένους ὑπὲρ τῶν πραγμάτων .
- NUM 12: ὄγχνας μοι δῶκας τρισκαίδεκα καὶ δέκα μηλέας , συκέας τεσσαράκοντ̓ ·
εἴκοσι
- ADJ 21: ὅδε ὁ νόμος ἴσος ἐγράφη καὶ ὅμοιος , στίχων τριακοσίων εἴκοσι τριῶν .
- NUM 12: ὧν εἴκοσι τοὺς ἐπιφανεστάτους Ἕρμιππος ἀνέγραψε ·
πέντε
- ADJ 16: μνᾶς πραξαμένη δὲ πέντε τὰς σπονδὰς ποοῦ καὶ στρῶσον ἡμῖν ἔνδον .
- NUM 12: σταδίους δὲ πέντε καὶ τεσσεράκοντα διακομίσαντες ἀπίκοντο ἐς τὸ ἱρόν .
ἓξ
- NUM 12: καλῶς ἔχει τὰ πλεῖστ̓ , ἐν ἓξ πυλώμασι ·
- ADJ 4: οὕτω μὲν τὰ τῶν Ἑλλήνων πράγματα ἐφθάρη ἓξ ἔτη πολεμήσαντα ·
δυοῖν
- NUM 11: δυοῖν λόγοιν σε θατέρῳ δωρήσομαι .
- ADJ 6: δυοῖν γὰρ εἶχε προσβολὰς μιασμάτοιν .
τρεῖς
- ADJ 11: καὶ κάνδαυλον δέ τινα ἔλεγον οἱ Λυδοί , οὐχ ἕνα ἀλλὰ τρεῖς ·
- NUM 10: τρεῖς γὰρ δή μιν νύκτας ἔχον , τρία δ̓ ἤματ̓ ἔρυξα ἐν κλισίῃ ·
ἑκατὸν
- ADJ 17: εἶχεν γὰρ οὐκ ἐλάττους ἑκατὸν ἐλεφάντων ·
- NUM 10: ἀτὰρ τὰ πάνθ̓ ὁμοῦ ἑκατὸν προσῆγε συμμιγῆ βοσκήματα .
- PRON 1: τοῦ καὶ ἀπὸ ῥίζης ἑκατὸν κάρα ἐξεπεφύκει ·

Morphology

The form / lemma ratio of NUM is 2.038462 (the average of all parts of speech is 3.041201).

The 1st highest number of forms (5) was observed with the lemma “εἴκοσι”: εἴκοσ̓, εἴκοσί, εἴκοσι, ἐείκοσι, ἐείκοσιν.

The 2nd highest number of forms (5) was observed with the lemma “εἷς”: Μιᾶς, εἷς, ἑνὸς, ἓν, ἕνα.

The 3rd highest number of forms (5) was observed with the lemma “τρεῖς”: τρί̓, τρία, τρεῖς, τρισὶ, τριῶν.

NUM occurs with 3 features: Gender (14; 5% instances), Number (14; 5% instances), Case (13; 5% instances)

NUM occurs with 9 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

NUM occurs with 10 feature combinations. The most frequent feature combination is _ (263 tokens). Examples: δύο, δύω, πεντήκοντα, ἐννέα, δέκα, εἴκοσι, πέντε, ἓξ, δυοῖν, τρεῖς

Relations

NUM nodes are attached to their parents using 9 different relations: amod (224; 81% instances), conj (21; 8% instances), iobj (11; 4% instances), advmod (7; 3% instances), nsubj (7; 3% instances), dobj (3; 1% instances), root (2; 1% instances), nsubjpass (1; 0% instances), xcomp (1; 0% instances)

Parents of NUM nodes belong to 8 different parts of speech: NOUN (206; 74% instances), VERB (31; 11% instances), NUM (17; 6% instances), ADJ (11; 4% instances), PRON (6; 2% instances), DET (3; 1% instances), ROOT (2; 1% instances), ADP (1; 0% instances)

215 (78%) NUM nodes are leaves.

30 (11%) NUM nodes have one child.

21 (8%) NUM nodes have two children.

11 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 14.

Children of NUM nodes are attached using 12 different relations: cc (22; 17% instances), conj (20; 16% instances), punct (19; 15% instances), advmod (15; 12% instances), amod (9; 7% instances), case (9; 7% instances), nmod (9; 7% instances), cop (6; 5% instances), det (6; 5% instances), nsubj (5; 4% instances), xcomp (5; 4% instances), mark (1; 1% instances)

Children of NUM nodes belong to 11 different parts of speech: CONJ (19; 15% instances), PUNCT (19; 15% instances), ADJ (17; 13% instances), NUM (17; 13% instances), ADV (15; 12% instances), ADP (10; 8% instances), VERB (9; 7% instances), NOUN (7; 6% instances), DET (6; 5% instances), PRON (6; 5% instances), SCONJ (1; 1% instances)

Treebank Statistics (UD_Ancient_Greek-PROIEL)

There are 80 NUM lemmas (1%), 183 NUM types (1%) and 1485 NUM tokens (1%). Out of 13 observed tags, the rank of NUM is: 6 in number of lemmas, 7 in number of types and 11 in number of tokens.

The 10 most frequent NUM lemmas: εἷς, δύο, ἑπτά, τρεῖς, δώδεκα, δέκα, πέντε, τέσσαρες, ἑκατόν, τριάκοντα

The 10 most frequent NUM types: δύο, εἷς, ἑπτὰ, δώδεκα, ἓν, δέκα, πέντε, ἕνα, τρεῖς, μίαν

The 10 most frequent ambiguous lemmas: διακόσιοι (NUM 22, ADJ 2), τετρακόσιοι (NUM 14, ADJ 2), δέκατος (ADJ 6, NUM 1), χίλιος (ADJ 1, NUM 1), ἑκατοστός (ADJ 1, NUM 1)

The 10 most frequent ambiguous types: εἷς (NUM 101, ADP 2), τετρακόσιοι (NUM 2, ADJ 1), εἶς (VERB 6, NUM 1), τετρακοσίας (NUM 1, ADJ 1)

εἷς
- NUM 101: Καὶ ἰδοὺ εἷς προσελθὼν αὐτῷ εἶπεν
- ADP 2: κύριος ὁ θεὸς ἡμῶν κύριος εἷς ἐστιν
τετρακόσιοι
- NUM 2: ἀπὸ δὲ Κορίνθου τετρακόσιοι καὶ ἀπὸ Φλειοῦντος διηκόσιοι καὶ Μυκηναίων ὀγδώκοντα
- ADJ 1: οὗτοι στάδιοι τῆς περιόδου τῆς πόλιος γίνονται συνάπαντες ὀγδώκοντα καὶ τετρακόσιοι
εἶς
- VERB 6: Ἀρτάβανε πατρὸς εἶς τοῦ ἐμοῦ ἀδελφεός
- NUM 1: εἶς δὲ οὐ σύ μοι τοῦδε τοῦ κακοῦ αἴτιος εἰ μὴ ὅσον ἀέκων ἐξεργάσαο ἀλλὰ θεῶν κού τις ὅς μοι καὶ πάλαι προεσήμαινε τὰ μέλλοντα ἔσεσθαι
τετρακοσίας
- NUM 1: ξεῖνόν τέ σε ποιεῦμαι ἐμὸν καὶ τὰς τετρακοσίας μυριάδας τοι τῶν στατήρων ἀποπλήσω παρ’ ἐμεωυτοῦ δοὺς τὰς ἑπτὰ χιλιάδας ἵνα μή τοι ἐπιδεέες ἔωσι αἱ τετρακόσιαι μυριάδες ἑπτὰ χιλιάδων ἀλλὰ ᾖ τοι ἀπαρτιλογίη ὑπ’ ἐμέο πεπληρωμένη
- ADJ 1: ἐπείτε γὰρ τάχιστά σε ἐπυθόμην ἐπὶ θάλασσαν καταβαίνοντα τὴν Ἑλληνίδα βουλόμενός τοι δοῦναι ἐς τὸν πόλεμον χρήματα ἐξεμάνθανον καὶ εὗρον λογιζόμενος ἀργυρίου μὲν δύο χιλιάδας ἐούσας μοι ταλάντων χρυσίου δὲ τετρακοσίας μυριάδας στατήρων Δαρεικῶν ἐπιδεούσας ἑπτὰ χιλιάδων

Morphology

The form / lemma ratio of NUM is 2.287500 (the average of all parts of speech is 3.412038).

The 1st highest number of forms (16) was observed with the lemma “εἷς”: εἶς, εἷς, μία, μίαν, μιᾶς, μιᾷ, μιῆς, μιῇ, ἐνὶ, ἑνί, ἑνός, ἑνὶ, ἑνὸς, ἓν, ἕν, ἕνα.

The 2nd highest number of forms (11) was observed with the lemma “τέσσαρες”: τέσσαρα, τέσσαρας, τέσσαρες, τέσσαρσιν, τέσσερα, τέσσερας, τέσσερες, τέσσερσι, τέτορες, τεσσάρων, τεσσέρων.

The 3rd highest number of forms (10) was observed with the lemma “διακόσιοι”: διακοσίους, διακοσίων, διηκοσίας, διηκοσίων, διηκοσιέων, διηκόσια, διηκόσιαί, διηκόσιαι, διηκόσιοί, διηκόσιοι.

NUM occurs with 3 features: Case (740; 50% instances), Number (740; 50% instances), Gender (711; 48% instances)

NUM occurs with 11 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Plur, Number=Sing

NUM occurs with 33 feature combinations. The most frequent feature combination is _ (745 tokens). Examples: δύο, ἑπτὰ, δώδεκα, δέκα, πέντε, τεσσεράκοντα, εἴκοσι, ἑκατὸν, τριήκοντα, πεντήκοντα

Relations

NUM nodes are attached to their parents using 15 different relations: nummod (919; 62% instances), nsubj (111; 7% instances), conj (98; 7% instances), nmod (75; 5% instances), dobj (72; 5% instances), remnant (47; 3% instances), iobj (46; 3% instances), root (38; 3% instances), appos (24; 2% instances), nsubjpass (16; 1% instances), xcomp (16; 1% instances), advcl (14; 1% instances), advmod (4; 0% instances), ccomp (4; 0% instances), csubjpass (1; 0% instances)

Parents of NUM nodes belong to 11 different parts of speech: NOUN (916; 62% instances), VERB (293; 20% instances), NUM (121; 8% instances), ADJ (55; 4% instances), ROOT (38; 3% instances), PRON (23; 2% instances), PROPN (17; 1% instances), ADV (13; 1% instances), SCONJ (4; 0% instances), ADP (3; 0% instances), DET (2; 0% instances)

974 (66%) NUM nodes are leaves.

260 (18%) NUM nodes have one child.

156 (11%) NUM nodes have two children.

95 (6%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 17 different relations: nmod (194; 21% instances), cc (129; 14% instances), det (125; 13% instances), conj (120; 13% instances), case (82; 9% instances), cop (53; 6% instances), nsubj (47; 5% instances), advmod (40; 4% instances), discourse (36; 4% instances), appos (30; 3% instances), remnant (22; 2% instances), acl (20; 2% instances), amod (11; 1% instances), neg (10; 1% instances), advcl (7; 1% instances), mark (7; 1% instances), nummod (3; 0% instances)

Children of NUM nodes belong to 11 different parts of speech: NOUN (136; 15% instances), CONJ (129; 14% instances), DET (125; 13% instances), NUM (121; 13% instances), VERB (96; 10% instances), ADV (86; 9% instances), ADP (84; 9% instances), ADJ (73; 8% instances), PRON (55; 6% instances), PROPN (24; 3% instances), SCONJ (7; 1% instances)

NUM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]

NUM: numeral

Definition

Examples

References

Treebank Statistics (UD_Ancient_Greek)

Morphology

Relations

Treebank Statistics (UD_Ancient_Greek-PROIEL)

Morphology

Relations

`NUM`: numeral