home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latin-Perseus: POS Tags: NUM

There are 28 NUM lemmas (1%), 50 NUM types (0%) and 169 NUM tokens (1%). Out of 16 observed tags, the rank of NUM is: 9 in number of lemmas, 8 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: duo, unus, mille, septem, quattuor, tres, viginti, centum, decem, sex

The 10 most frequent NUM types: quattuor, septem, duo, viginti, centum, decem, mille, millia, una, uno

The 10 most frequent ambiguous lemmas: unus (ADJ 21, NUM 21, ADV 1), octingenti (ADJ 1, NUM 1)

The 10 most frequent ambiguous types: una (NUM 6, ADJ 4), uno (NUM 5, ADJ 1), unum (ADJ 6, NUM 4), tribus (NUM 3, NOUN 1), unus (ADJ 6, NUM 2), M (PROPN 13, NUM 1), unius (ADJ 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.785714 (the average of all parts of speech is 2.102438).

The 1st highest number of forms (7) was observed with the lemma “duo”: duabus, duae, duas, duo, duobus, duorum, duos.

The 2nd highest number of forms (6) was observed with the lemma “mille”: M, milia, milibus, mille, millia, millibus.

The 3rd highest number of forms (6) was observed with the lemma “unus”: una, uni, unius, uno, unum, unus.

NUM occurs with 5 features: NumForm (169; 100% instances), NumType (168; 99% instances), Case (66; 39% instances), Gender (66; 39% instances), Number (64; 38% instances)

NUM occurs with 14 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, NumForm=Roman, NumForm=Word, NumType=Card, NumType=Dist, Number=Plur, Number=Sing

NUM occurs with 27 feature combinations. The most frequent feature combination is NumForm=Word|NumType=Card (97 tokens). Examples: quattuor, septem, viginti, centum, decem, mille, quinque, sex, tres, triginta

Relations

NUM nodes are attached to their parents using 10 different relations: nummod (132; 78% instances), nsubj (9; 5% instances), obj (7; 4% instances), flat (6; 4% instances), conj (5; 3% instances), obl (5; 3% instances), nsubj:pass (2; 1% instances), conj:expl (1; 1% instances), nummod:gov (1; 1% instances), xcomp (1; 1% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (115; 68% instances), VERB (19; 11% instances), NUM (18; 11% instances), ADJ (8; 5% instances), DET (4; 2% instances), PROPN (3; 2% instances), ADV (2; 1% instances)

133 (79%) NUM nodes are leaves.

24 (14%) NUM nodes have one child.

12 (7%) NUM nodes have two children.

The highest child degree of a NUM node is 2.

Children of NUM nodes are attached using 11 different relations: nmod (10; 21% instances), nummod (7; 15% instances), flat (6; 13% instances), advmod (5; 10% instances), amod (5; 10% instances), cc (5; 10% instances), conj (5; 10% instances), det (2; 4% instances), acl:relcl (1; 2% instances), case (1; 2% instances), obl (1; 2% instances)

Children of NUM nodes belong to 8 different parts of speech: NUM (18; 38% instances), NOUN (10; 21% instances), ADV (5; 10% instances), CCONJ (5; 10% instances), VERB (5; 10% instances), ADJ (2; 4% instances), DET (2; 4% instances), ADP (1; 2% instances)