home la/pos edit page issue tracker

NUM: numeral

This document is a placeholder for the language-specific documentation for NUM.


Treebank Statistics (UD_Latin)

There are 52 NUM lemmas (1%), 81 NUM types (1%) and 246 NUM tokens (1%). Out of 12 observed tags, the rank of NUM is: 9 in number of lemmas, 6 in number of types and 10 in number of tokens.

The 10 most frequent NUM lemmas: septem1, quattuor1, mille1, tres1, NUMERAL1, centum1, duodecim1, decem1, unus, duo

The 10 most frequent NUM types: septem, quattuor, decem, centum, duodecim, mille, tres, milibus, quadraginta, quinque

The 10 most frequent ambiguous lemmas: mille1 (NUM 21, NOUN 3, ADJ 1), unus (NUM 8, ADV 1), duo (NUM 6, ADJ 1), primus (NUM 6, ADV 3), quinque1 (NUM 5, X 2), secundus1 (ADJ 10, NUM 4, ADP 3), decimus1 (NUM 3, ADJ 1), bis (ADV 3, NUM 2), tot1 (ADV 4, NUM 2, NOUN 1, ADJ 1), C1 (X 12, NOUN 2, NUM 1)

The 10 most frequent ambiguous types: quinque (NUM 4, X 2), tribus (NUM 5, NOUN 1), milia (NUM 4, NOUN 2, ADJ 1), duobus (ADJ 7, NUM 3), primus (ADJ 10, NUM 3), singulas (NUM 3, ADJ 2), unus (ADJ 11, NUM 2), bis (ADV 9, NUM 1), secundum (ADP 7, NUM 2, ADJ 2), singulos (NUM 2, ADJ 2)

Morphology

The form / lemma ratio of NUM is 1.557692 (the average of all parts of speech is 1.829408).

The 1st highest number of forms (10) was observed with the lemma “NUMERAL1”: CC, DC, IIII, LIII., VI, VIII, X, XII, XV, v

The 2nd highest number of forms (6) was observed with the lemma “mille1”: M, milia, milibus, milium, mille, millia

The 3rd highest number of forms (5) was observed with the lemma “unus”: una, uni, uno, unum, unus

NUM occurs with 3 features: Case (51; 21% instances), Gender (51; 21% instances), Number (49; 20% instances)

NUM occurs with 9 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

NUM occurs with 20 feature combinations. The most frequent feature combination is _ (195 tokens). Examples: septem, quattuor, decem, centum, duodecim, mille, tres, milibus, quadraginta, quinque

Relations

NUM nodes are attached to their parents using 10 different relations: nummod (192; 78% instances), advmod (13; 5% instances), dobj (13; 5% instances), nsubj (12; 5% instances), nsubjpass (6; 2% instances), conj (4; 2% instances), iobj (2; 1% instances), xcomp (2; 1% instances), acl (1; 0% instances), nmod (1; 0% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (161; 65% instances), VERB (41; 17% instances), NUM (25; 10% instances), ADJ (11; 4% instances), ADV (5; 2% instances), PUNCT (2; 1% instances), PRON (1; 0% instances)

192 (78%) NUM nodes are leaves.

38 (15%) NUM nodes have one child.

11 (4%) NUM nodes have two children.

5 (2%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 12 different relations: nummod (21; 26% instances), advmod (13; 16% instances), amod (10; 13% instances), nmod (10; 13% instances), punct (9; 11% instances), case (5; 6% instances), conj (5; 6% instances), cc (3; 4% instances), cop (1; 1% instances), mark (1; 1% instances), neg (1; 1% instances), nsubj (1; 1% instances)

Children of NUM nodes belong to 9 different parts of speech: NUM (25; 31% instances), ADV (10; 13% instances), NOUN (10; 13% instances), PUNCT (9; 11% instances), CONJ (8; 10% instances), ADJ (7; 9% instances), ADP (5; 6% instances), VERB (5; 6% instances), SCONJ (1; 1% instances)


Treebank Statistics (UD_Latin-ITT)

There are 15 NUM lemmas (0%), 190 NUM types (2%) and 1888 NUM tokens (1%). Out of 14 observed tags, the rank of NUM is: 9 in number of lemmas, 5 in number of types and 11 in number of tokens.

The 10 most frequent NUM lemmas: unus, _, duo, tres, quatuor, mille, decem, octoginta, quinque, sex

The 10 most frequent NUM types: unum, una, unius, uno, unam, unus, duo, iii, 1, 2

The 10 most frequent ambiguous lemmas: __ (NUM 614, X 498, NOUN 6), _decem (NUM 4, ADV 1)

The 10 most frequent ambiguous types: uno (NUM 112, ADJ 1), vi (NUM 15, NOUN 10), quatuor (NUM 12, NOUN 1), primum (ADJ 300, NUM 1)

Morphology

The form / lemma ratio of NUM is 12.666667 (the average of all parts of speech is 3.609102).

The 1st highest number of forms (158) was observed with the lemma “”: _1, 1-1, 1-10, 1-11, 1-13, 1-16, 1-17, 1-20, 1-26, 1-3, 1-31, 1-37, 1-5, 10, 10-2, 10-6, 10-8, 11, 11-11, 11-14, 11-21, 11-25, 11-33, 11-35, 11-36, 11-7, 12-13, 12-16, 12-17, 13-1, 13-2, 13-4, 13-9, 14, 14-21, 14-22, 14-6, 142-5, 15, 15-10, 15-11, 15-29, 16, 16-16, 16-27, 16-4, 17, 17-1, 17-3, 18-37, 19, 19-17, 2, 2-10, 2-11, 2-2, 2-21, 2-25, 2-3, 2-6, 2-9, 20, 20-3, 22, 22-17, 23-19, 23-28, 23-29, 25-1, 25-15, 25-23, 26, 26-3, 3, 3-10, 3-11, 3-14, 3-17, 3-18, 3-19, 3-25, 3-4, 3-6, 3-8, 3-9, 31-3, 32-4, 32-40, 33-15, 33-19, 33-3, 34-10, 35-11, 36-26, 37-18, 38-33, 39, 39-24, 4, 4-13, 4-17, 4-19, 4-24, 4-4, 4-5, 4-6, 40-14, 40-18, 41-1, 41-2, 42, 42-15, 42-2, 44, 45, 48-5, 5, 5-17, 5-48, 5-8, 5-9, 54-13, 55-8, 59-15, 6, 6-1, 6-21, 6-4, 6-7, 7, 7-10, 7-14, 7-21, 7-22, 8, 8-12, 8-16, 8-17, 8-7, 8-8, 9, 9-19, 9-4, 9-5, i, ii, iii, iv, ix, v, vi, vii, viii, x, xciv, xi, xii, xv

The 2nd highest number of forms (10) was observed with the lemma “unus”: duae, primum, tres, una, unam, uni, unius, uno, unum, unus

The 3rd highest number of forms (8) was observed with the lemma “duo”: duabus, duae, duarum, duas, duo, duobus, duorum, duos

NUM occurs with 7 features: NumType (1274; 67% instances), Gender (1261; 67% instances), Case (1245; 66% instances), Degree (1245; 66% instances), Number (1245; 66% instances), la-feat/NumForm (614; 33% instances), Tense (1; 0% instances)

NUM occurs with 14 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Degree=Pos, Gender=Fem, Gender=Masc, Gender=Neut, NumForm=Digit, NumType=Card, Number=Plur, Number=Sing, Tense=Pres

NUM occurs with 36 feature combinations. The most frequent feature combination is NumForm=Digit (614 tokens). Examples: iii, 1, 2, i, ii, vii, viii, iv, 3, vi

Relations

NUM nodes are attached to their parents using 19 different relations: nummod (1009; 53% instances), advmod (355; 19% instances), nsubj (100; 5% instances), root (78; 4% instances), dobj (69; 4% instances), nsubjpass (63; 3% instances), conj (43; 2% instances), dep (37; 2% instances), advcl (33; 2% instances), csubj (29; 2% instances), acl (25; 1% instances), compound (16; 1% instances), xcomp (9; 0% instances), iobj (8; 0% instances), appos (4; 0% instances), csubjpass (4; 0% instances), parataxis (3; 0% instances), ccomp (2; 0% instances), cop (1; 0% instances)

Parents of NUM nodes belong to 10 different parts of speech: NOUN (720; 38% instances), VERB (628; 33% instances), X (272; 14% instances), ROOT (78; 4% instances), PRON (65; 3% instances), ADJ (62; 3% instances), NUM (35; 2% instances), CONJ (18; 1% instances), ADV (8; 0% instances), PUNCT (2; 0% instances)

1053 (56%) NUM nodes are leaves.

238 (13%) NUM nodes have one child.

268 (14%) NUM nodes have two children.

329 (17%) NUM nodes have three or more children.

The highest child degree of a NUM node is 13.

Children of NUM nodes are attached using 24 different relations: nmod (418; 19% instances), case (380; 17% instances), punct (357; 16% instances), cop (180; 8% instances), cc (179; 8% instances), nsubj (120; 5% instances), conj (118; 5% instances), mark (88; 4% instances), amod (63; 3% instances), la-dep/advmod:emph (56; 3% instances), acl (48; 2% instances), neg (37; 2% instances), advmod (34; 2% instances), advcl (31; 1% instances), dep (21; 1% instances), dobj (19; 1% instances), compound (16; 1% instances), appos (14; 1% instances), csubj (9; 0% instances), nsubjpass (8; 0% instances), ccomp (4; 0% instances), auxpass (1; 0% instances), csubjpass (1; 0% instances), xcomp (1; 0% instances)

Children of NUM nodes belong to 13 different parts of speech: NOUN (387; 18% instances), ADP (383; 17% instances), PUNCT (357; 16% instances), VERB (271; 12% instances), PRON (158; 7% instances), CONJ (146; 7% instances), ADV (134; 6% instances), ADJ (116; 5% instances), X (113; 5% instances), SCONJ (84; 4% instances), NUM (35; 2% instances), DET (18; 1% instances), AUX (1; 0% instances)


Treebank Statistics (UD_Latin-PROIEL)

There are 69 NUM lemmas (1%), 111 NUM types (0%) and 1378 NUM tokens (1%). Out of 13 observed tags, the rank of NUM is: 6 in number of lemmas, 8 in number of types and 10 in number of tokens.

The 10 most frequent NUM lemmas: unus, duo, mille, septem, tres, duodecim, quinque, quattuor, decem, octo

The 10 most frequent NUM types: unum, unus, septem, milia, duodecim, duo, quinque, una, quattuor, uno

The 10 most frequent ambiguous lemmas: unus (NUM 395, ADJ 14), tres (NUM 76, ADJ 1), historia (NOUN 5, NUM 1, ADJ 1), octavus (ADJ 11, NOUN 1, NUM 1)

The 10 most frequent ambiguous types: unus (NUM 115, ADJ 6), una (NUM 61, ADV 28, ADJ 1), uno (NUM 41, ADJ 2), tribus (NUM 26, NOUN 6), uni (NUM 9, ADJ 2), tris (NUM 4, ADJ 1), Octaua (NOUN 1, NUM 1), historiis (NOUN 1, NUM 1), unas (NUM 1, ADJ 1), unis (NUM 1, ADJ 1)

Morphology

The form / lemma ratio of NUM is 1.608696 (the average of all parts of speech is 3.304580).

The 1st highest number of forms (9) was observed with the lemma “duo”: duabus, duae, duarum, duas, due, duo, duobus, duorum, duos

The 2nd highest number of forms (9) was observed with the lemma “unus”: una, unam, unas, uni, unis, unius, uno, unum, unus

The 3rd highest number of forms (5) was observed with the lemma “ducenti”: ducenta, ducentae, ducentis, ducentorum, ducentos

NUM occurs with 3 features: Case (837; 61% instances), Number (837; 61% instances), Gender (801; 58% instances)

NUM occurs with 12 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Plur, Number=Sing

NUM occurs with 41 feature combinations. The most frequent feature combination is _ (541 tokens). Examples: septem, duodecim, quinque, quattuor, decem, octo, sex, quadraginta, centum, triginta

Relations

NUM nodes are attached to their parents using 16 different relations: nummod (881; 64% instances), nsubj (112; 8% instances), nmod (71; 5% instances), dobj (69; 5% instances), appos (50; 4% instances), remnant (43; 3% instances), iobj (39; 3% instances), conj (34; 2% instances), root (30; 2% instances), nsubjpass (19; 1% instances), advmod (9; 1% instances), advcl (8; 1% instances), xcomp (6; 0% instances), ccomp (5; 0% instances), csubjpass (1; 0% instances), dep (1; 0% instances)

Parents of NUM nodes belong to 10 different parts of speech: NOUN (779; 57% instances), VERB (270; 20% instances), NUM (176; 13% instances), ADJ (47; 3% instances), PRON (30; 2% instances), ROOT (30; 2% instances), ADV (28; 2% instances), PROPN (15; 1% instances), INTJ (2; 0% instances), X (1; 0% instances)

939 (68%) NUM nodes are leaves.

249 (18%) NUM nodes have one child.

114 (8%) NUM nodes have two children.

76 (6%) NUM nodes have three or more children.

The highest child degree of a NUM node is 8.

Children of NUM nodes are attached using 20 different relations: nmod (221; 29% instances), nummod (105; 14% instances), advmod (71; 9% instances), case (70; 9% instances), conj (46; 6% instances), cc (43; 6% instances), cop (40; 5% instances), acl (39; 5% instances), nsubj (30; 4% instances), remnant (27; 4% instances), appos (17; 2% instances), discourse (16; 2% instances), amod (9; 1% instances), mark (8; 1% instances), advcl (7; 1% instances), ccomp (3; 0% instances), neg (3; 0% instances), dep (2; 0% instances), dobj (1; 0% instances), iobj (1; 0% instances)

Children of NUM nodes belong to 10 different parts of speech: NUM (176; 23% instances), NOUN (165; 22% instances), VERB (96; 13% instances), ADV (90; 12% instances), ADP (70; 9% instances), ADJ (53; 7% instances), PRON (44; 6% instances), CONJ (43; 6% instances), PROPN (13; 2% instances), SCONJ (9; 1% instances)


NUM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]