NUM: numeral
This document is a placeholder for the language-specific documentation for NUM.
Treebank Statistics (UD_Latin)
There are 52 NUM lemmas (1%), 81 NUM types (1%) and 246 NUM tokens (1%).
Out of 12 observed tags, the rank of NUM is: 9 in number of lemmas, 6 in number of types and 10 in number of tokens.
The 10 most frequent NUM lemmas: septem1, quattuor1, mille1, tres1, NUMERAL1, centum1, duodecim1, decem1, unus, duo
The 10 most frequent NUM types: septem, quattuor, decem, centum, duodecim, mille, tres, milibus, quadraginta, quinque
The 10 most frequent ambiguous lemmas: mille1 (NUM 21, NOUN 3, ADJ 1), unus (NUM 8, ADV 1), duo (NUM 6, ADJ 1), primus (NUM 6, ADV 3), quinque1 (NUM 5, X 2), secundus1 (ADJ 10, NUM 4, ADP 3), decimus1 (NUM 3, ADJ 1), bis (ADV 3, NUM 2), tot1 (ADV 4, NUM 2, NOUN 1, ADJ 1), C1 (X 12, NOUN 2, NUM 1)
The 10 most frequent ambiguous types: quinque (NUM 4, X 2), tribus (NUM 5, NOUN 1), milia (NUM 4, NOUN 2, ADJ 1), duobus (ADJ 7, NUM 3), primus (ADJ 10, NUM 3), singulas (NUM 3, ADJ 2), unus (ADJ 11, NUM 2), bis (ADV 9, NUM 1), secundum (ADP 7, NUM 2, ADJ 2), singulos (NUM 2, ADJ 2)
- quinque
- tribus
- milia
- NUM 4: quod si forte tuis non est contraria votis , at tibi curarum milia quanta dabit
- NOUN 2: tam multa illa meo divisa est milia lecto , quantum Hypanis Veneto dissidet Eridano
- ADJ 1: et cantabant quasi canticum novum ante sedem et ante quattuor animalia et seniores et nemo poterat discere canticum nisi illa centum quadraginta quattuor milia qui empti sunt de terra
- duobus
- primus
- singulas
- unus
- bis
- secundum
- ADP 7: in aperto loco secundum flumen paucae stationes equitum videbantur
- NUM 2: quinque medios consulatus a sexto ad decimum annuos gessit , ceteros aut nouem aut sex aut quattuor aut tribus mensibus , secundum uero paucissimis horis .
- ADJ 2: et cum aperuisset sigillum secundum audivi secundum animal dicens veni
- singulos
Morphology
The form / lemma ratio of NUM is 1.557692 (the average of all parts of speech is 1.829408).
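The reported ratio is consistent with dividing the number of distinct NUM types by the number of distinct NUM lemmas given in the treebank summary (81 and 52 for UD_Latin). The following is a minimal sketch under that assumption; the function name `form_lemma_ratio` is hypothetical and is not part of any UD tooling.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma (assumed definition)."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_types / n_lemmas


# Counts reported for NUM in UD_Latin: 81 types, 52 lemmas.
ratio = form_lemma_ratio(81, 52)
print(f"{ratio:.6f}")  # 1.557692
```

A value close to 1 means that most numerals appear in a single form, as expected for indeclinable cardinals such as septem or decem.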
The highest number of forms (10) was observed with the lemma “NUMERAL1”: CC, DC, IIII, LIII., VI, VIII, X, XII, XV, v
The second-highest number of forms (6) was observed with the lemma “mille1”: M, milia, milibus, milium, mille, millia
The third-highest number of forms (5) was observed with the lemma “unus”: una, uni, uno, unum, unus
NUM occurs with 3 features: Case (51; 21% instances), Gender (51; 21% instances), Number (49; 20% instances)
NUM occurs with 9 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing
NUM occurs with 20 feature combinations.
The most frequent feature combination is _ (195 tokens).
Examples: septem, quattuor, decem, centum, duodecim, mille, tres, milibus, quadraginta, quinque
Relations
NUM nodes are attached to their parents using 10 different relations: nummod (192; 78% instances), advmod (13; 5% instances), dobj (13; 5% instances), nsubj (12; 5% instances), nsubjpass (6; 2% instances), conj (4; 2% instances), iobj (2; 1% instances), xcomp (2; 1% instances), acl (1; 0% instances), nmod (1; 0% instances)
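The percentages in the list above can be reproduced from the raw counts, which sum to the 246 NUM tokens of the treebank. The sketch below assumes that each share is rounded to the nearest whole percent, which is why a relation seen once is shown as 0%; the helper `percent_shares` is a hypothetical name used only for illustration.

```python
# Relation counts for NUM nodes in UD_Latin, copied from the list above.
relation_counts = {
    "nummod": 192, "advmod": 13, "dobj": 13, "nsubj": 12, "nsubjpass": 6,
    "conj": 4, "iobj": 2, "xcomp": 2, "acl": 1, "nmod": 1,
}


def percent_shares(counts):
    """Map each label to its share of the total, rounded to a whole percent."""
    total = sum(counts.values())
    return {label: round(100 * n / total) for label, n in counts.items()}


shares = percent_shares(relation_counts)
print(sum(relation_counts.values()))    # 246
print(shares["nummod"], shares["acl"])  # 78 0
```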
Parents of NUM nodes belong to 7 different parts of speech: NOUN (161; 65% instances), VERB (41; 17% instances), NUM (25; 10% instances), ADJ (11; 4% instances), ADV (5; 2% instances), PUNCT (2; 1% instances), PRON (1; 0% instances)
192 (78%) NUM nodes are leaves.
38 (15%) NUM nodes have one child.
11 (4%) NUM nodes have two children.
5 (2%) NUM nodes have three or more children.
The highest child degree of a NUM node is 7.
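The leaf and child-degree figures count, for every NUM node, how many other nodes name it as their head. The following sketch shows one way such a histogram could be computed from CoNLL-U text. The sample fragment is invented for illustration and is not taken from the treebank, and the function name `child_degrees` is hypothetical.

```python
from collections import Counter

# Toy CoNLL-U fragment (invented for illustration, not treebank data).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = (
    "1\tcentum\tcentum\tNUM\t_\t_\t4\tnummod\t_\t_\n"
    "2\tquadraginta\tquadraginta\tNUM\t_\t_\t1\tnummod\t_\t_\n"
    "3\tquattuor\tquattuor\tNUM\t_\t_\t1\tnummod\t_\t_\n"
    "4\tmilia\tmille\tNUM\t_\t_\t0\troot\t_\t_\n"
)


def child_degrees(conllu: str, upos: str = "NUM") -> Counter:
    """Histogram of the number of children per node with the given UPOS tag."""
    hist = Counter()
    # Sentences are separated by blank lines; token IDs restart in each one.
    for block in conllu.strip().split("\n\n"):
        rows = [ln.split("\t") for ln in block.splitlines()
                if ln and not ln.startswith("#")]
        # Keep plain tokens only (skip multiword ranges and empty nodes).
        rows = [r for r in rows if r[0].isdigit()]
        n_children = Counter(r[6] for r in rows)  # HEAD column
        for r in rows:
            if r[3] == upos:
                hist[n_children[r[0]]] += 1
    return hist


hist = child_degrees(SAMPLE)
print(hist[0], hist[1], hist[2])  # 2 1 1
```

In the toy fragment, two NUM nodes are leaves, one has a single child and one has two children; the highest child degree is the largest key in the histogram.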
Children of NUM nodes are attached using 12 different relations: nummod (21; 26% instances), advmod (13; 16% instances), amod (10; 13% instances), nmod (10; 13% instances), punct (9; 11% instances), case (5; 6% instances), conj (5; 6% instances), cc (3; 4% instances), cop (1; 1% instances), mark (1; 1% instances), neg (1; 1% instances), nsubj (1; 1% instances)
Children of NUM nodes belong to 9 different parts of speech: NUM (25; 31% instances), ADV (10; 13% instances), NOUN (10; 13% instances), PUNCT (9; 11% instances), CONJ (8; 10% instances), ADJ (7; 9% instances), ADP (5; 6% instances), VERB (5; 6% instances), SCONJ (1; 1% instances)
Treebank Statistics (UD_Latin-ITT)
There are 15 NUM lemmas (0%), 190 NUM types (2%) and 1888 NUM tokens (1%).
Out of 14 observed tags, the rank of NUM is: 9 in number of lemmas, 5 in number of types and 11 in number of tokens.
The 10 most frequent NUM lemmas: unus, _, duo, tres, quatuor, mille, decem, octoginta, quinque, sex
The 10 most frequent NUM types: unum, una, unius, uno, unam, unus, duo, iii, 1, 2
The 10 most frequent ambiguous lemmas: _ (NUM 614, X 498, NOUN 6), decem (NUM 4, ADV 1)
The 10 most frequent ambiguous types: uno (NUM 112, ADJ 1), vi (NUM 15, NOUN 10), quatuor (NUM 12, NOUN 1), primum (ADJ 300, NUM 1)
- uno
- vi
- quatuor
- primum
Morphology
The form / lemma ratio of NUM is 12.666667 (the average of all parts of speech is 3.609102).
The highest number of forms (158) was observed with the lemma “_”: 1, 1-1, 1-10, 1-11, 1-13, 1-16, 1-17, 1-20, 1-26, 1-3, 1-31, 1-37, 1-5, 10, 10-2, 10-6, 10-8, 11, 11-11, 11-14, 11-21, 11-25, 11-33, 11-35, 11-36, 11-7, 12-13, 12-16, 12-17, 13-1, 13-2, 13-4, 13-9, 14, 14-21, 14-22, 14-6, 142-5, 15, 15-10, 15-11, 15-29, 16, 16-16, 16-27, 16-4, 17, 17-1, 17-3, 18-37, 19, 19-17, 2, 2-10, 2-11, 2-2, 2-21, 2-25, 2-3, 2-6, 2-9, 20, 20-3, 22, 22-17, 23-19, 23-28, 23-29, 25-1, 25-15, 25-23, 26, 26-3, 3, 3-10, 3-11, 3-14, 3-17, 3-18, 3-19, 3-25, 3-4, 3-6, 3-8, 3-9, 31-3, 32-4, 32-40, 33-15, 33-19, 33-3, 34-10, 35-11, 36-26, 37-18, 38-33, 39, 39-24, 4, 4-13, 4-17, 4-19, 4-24, 4-4, 4-5, 4-6, 40-14, 40-18, 41-1, 41-2, 42, 42-15, 42-2, 44, 45, 48-5, 5, 5-17, 5-48, 5-8, 5-9, 54-13, 55-8, 59-15, 6, 6-1, 6-21, 6-4, 6-7, 7, 7-10, 7-14, 7-21, 7-22, 8, 8-12, 8-16, 8-17, 8-7, 8-8, 9, 9-19, 9-4, 9-5, i, ii, iii, iv, ix, v, vi, vii, viii, x, xciv, xi, xii, xv
The second-highest number of forms (10) was observed with the lemma “unus”: duae, primum, tres, una, unam, uni, unius, uno, unum, unus
The third-highest number of forms (8) was observed with the lemma “duo”: duabus, duae, duarum, duas, duo, duobus, duorum, duos
NUM occurs with 7 features: NumType (1274; 67% instances), Gender (1261; 67% instances), Case (1245; 66% instances), Degree (1245; 66% instances), Number (1245; 66% instances), la-feat/NumForm (614; 33% instances), Tense (1; 0% instances)
NUM occurs with 14 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Degree=Pos, Gender=Fem, Gender=Masc, Gender=Neut, NumForm=Digit, NumType=Card, Number=Plur, Number=Sing, Tense=Pres
NUM occurs with 36 feature combinations.
The most frequent feature combination is NumForm=Digit (614 tokens).
Examples: iii, 1, 2, i, ii, vii, viii, iv, 3, vi
Relations
NUM nodes are attached to their parents using 19 different relations: nummod (1009; 53% instances), advmod (355; 19% instances), nsubj (100; 5% instances), root (78; 4% instances), dobj (69; 4% instances), nsubjpass (63; 3% instances), conj (43; 2% instances), dep (37; 2% instances), advcl (33; 2% instances), csubj (29; 2% instances), acl (25; 1% instances), compound (16; 1% instances), xcomp (9; 0% instances), iobj (8; 0% instances), appos (4; 0% instances), csubjpass (4; 0% instances), parataxis (3; 0% instances), ccomp (2; 0% instances), cop (1; 0% instances)
Parents of NUM nodes belong to 10 different parts of speech: NOUN (720; 38% instances), VERB (628; 33% instances), X (272; 14% instances), ROOT (78; 4% instances), PRON (65; 3% instances), ADJ (62; 3% instances), NUM (35; 2% instances), CONJ (18; 1% instances), ADV (8; 0% instances), PUNCT (2; 0% instances)
1053 (56%) NUM nodes are leaves.
238 (13%) NUM nodes have one child.
268 (14%) NUM nodes have two children.
329 (17%) NUM nodes have three or more children.
The highest child degree of a NUM node is 13.
Children of NUM nodes are attached using 24 different relations: nmod (418; 19% instances), case (380; 17% instances), punct (357; 16% instances), cop (180; 8% instances), cc (179; 8% instances), nsubj (120; 5% instances), conj (118; 5% instances), mark (88; 4% instances), amod (63; 3% instances), la-dep/advmod:emph (56; 3% instances), acl (48; 2% instances), neg (37; 2% instances), advmod (34; 2% instances), advcl (31; 1% instances), dep (21; 1% instances), dobj (19; 1% instances), compound (16; 1% instances), appos (14; 1% instances), csubj (9; 0% instances), nsubjpass (8; 0% instances), ccomp (4; 0% instances), auxpass (1; 0% instances), csubjpass (1; 0% instances), xcomp (1; 0% instances)
Children of NUM nodes belong to 13 different parts of speech: NOUN (387; 18% instances), ADP (383; 17% instances), PUNCT (357; 16% instances), VERB (271; 12% instances), PRON (158; 7% instances), CONJ (146; 7% instances), ADV (134; 6% instances), ADJ (116; 5% instances), X (113; 5% instances), SCONJ (84; 4% instances), NUM (35; 2% instances), DET (18; 1% instances), AUX (1; 0% instances)
Treebank Statistics (UD_Latin-PROIEL)
There are 69 NUM lemmas (1%), 111 NUM types (0%) and 1378 NUM tokens (1%).
Out of 13 observed tags, the rank of NUM is: 6 in number of lemmas, 8 in number of types and 10 in number of tokens.
The 10 most frequent NUM lemmas: unus, duo, mille, septem, tres, duodecim, quinque, quattuor, decem, octo
The 10 most frequent NUM types: unum, unus, septem, milia, duodecim, duo, quinque, una, quattuor, uno
The 10 most frequent ambiguous lemmas: unus (NUM 395, ADJ 14), tres (NUM 76, ADJ 1), historia (NOUN 5, NUM 1, ADJ 1), octavus (ADJ 11, NOUN 1, NUM 1)
The 10 most frequent ambiguous types: unus (NUM 115, ADJ 6), una (NUM 61, ADV 28, ADJ 1), uno (NUM 41, ADJ 2), tribus (NUM 26, NOUN 6), uni (NUM 9, ADJ 2), tris (NUM 4, ADJ 1), Octaua (NOUN 1, NUM 1), historiis (NOUN 1, NUM 1), unas (NUM 1, ADJ 1), unis (NUM 1, ADJ 1)
- unus
- una
- uno
- tribus
- uni
- tris
- NUM 4: accepi tuas tris iam epistulas
- ADJ 1: a..d..v.Kal..Decembr. tris epistulas a te accepi unam datam a..d..viii.Kal..Novembris in qua me hortaris ut forti animo mensem Ianuarium exspectem ea que quae ad spem putas pertinere de Lentuli studio de Metelli voluntate de tota Pompei ratione perscribis
- Octaua
- historiis
- NOUN 1: sed quia volgo pragmatici homines omnibus historiis praeceptis versibus denique cavere iubent et vetant credere alterum facio ut caveam alterum ut non credam facere non possum
- NUM 1: in quo si quid erit quod homini Attico minus Graecum eruditum que videatur non dicam quod tibi ut opinor Panhormi Lucullus de suis historiis dixerat se quo facilius illas probaret Romani hominis esse idcirco barbara quaedam et σόλοικα dispersisse
- unas
- NUM 1: sed abs te ipso qui me accusas unas mihi scito litteras redditas esse cum et oti ad scribendum plus et facultatem dandi maiorem habueris
- ADJ 1: de me imperatore scribis te ex Philotimi litteris cognosse sed credo te iam in Epiro cum esses binas meas de omnibus rebus accepisse unas a Pindenisso capto alteras Laodicea utrasque tuis pueris datas
- unis
Morphology
The form / lemma ratio of NUM is 1.608696 (the average of all parts of speech is 3.304580).
The highest number of forms (9) was observed with the lemma “duo”: duabus, duae, duarum, duas, due, duo, duobus, duorum, duos
The second-highest number of forms (9) was observed with the lemma “unus”: una, unam, unas, uni, unis, unius, uno, unum, unus
The third-highest number of forms (5) was observed with the lemma “ducenti”: ducenta, ducentae, ducentis, ducentorum, ducentos
NUM occurs with 3 features: Case (837; 61% instances), Number (837; 61% instances), Gender (801; 58% instances)
NUM occurs with 12 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Plur, Number=Sing
NUM occurs with 41 feature combinations.
The most frequent feature combination is _ (541 tokens).
Examples: septem, duodecim, quinque, quattuor, decem, octo, sex, quadraginta, centum, triginta
Relations
NUM nodes are attached to their parents using 16 different relations: nummod (881; 64% instances), nsubj (112; 8% instances), nmod (71; 5% instances), dobj (69; 5% instances), appos (50; 4% instances), remnant (43; 3% instances), iobj (39; 3% instances), conj (34; 2% instances), root (30; 2% instances), nsubjpass (19; 1% instances), advmod (9; 1% instances), advcl (8; 1% instances), xcomp (6; 0% instances), ccomp (5; 0% instances), csubjpass (1; 0% instances), dep (1; 0% instances)
Parents of NUM nodes belong to 10 different parts of speech: NOUN (779; 57% instances), VERB (270; 20% instances), NUM (176; 13% instances), ADJ (47; 3% instances), PRON (30; 2% instances), ROOT (30; 2% instances), ADV (28; 2% instances), PROPN (15; 1% instances), INTJ (2; 0% instances), X (1; 0% instances)
939 (68%) NUM nodes are leaves.
249 (18%) NUM nodes have one child.
114 (8%) NUM nodes have two children.
76 (6%) NUM nodes have three or more children.
The highest child degree of a NUM node is 8.
Children of NUM nodes are attached using 20 different relations: nmod (221; 29% instances), nummod (105; 14% instances), advmod (71; 9% instances), case (70; 9% instances), conj (46; 6% instances), cc (43; 6% instances), cop (40; 5% instances), acl (39; 5% instances), nsubj (30; 4% instances), remnant (27; 4% instances), appos (17; 2% instances), discourse (16; 2% instances), amod (9; 1% instances), mark (8; 1% instances), advcl (7; 1% instances), ccomp (3; 0% instances), neg (3; 0% instances), dep (2; 0% instances), dobj (1; 0% instances), iobj (1; 0% instances)
Children of NUM nodes belong to 10 different parts of speech: NUM (176; 23% instances), NOUN (165; 22% instances), VERB (96; 13% instances), ADV (90; 12% instances), ADP (70; 9% instances), ADJ (53; 7% instances), PRON (44; 6% instances), CONJ (43; 6% instances), PROPN (13; 2% instances), SCONJ (9; 1% instances)
NUM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]