home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latin-LLCT: POS Tags: NUM

There are 32 NUM lemmas (1%), 88 NUM types (1%) and 2417 NUM tokens (1%). Out of 15 observed tags, the rank of NUM is: 8 in number of lemmas, 8 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: unus, duo, uiginti, triginta, quinquaginta, tres, quattuor, decem, sex, duodecim

The 10 most frequent NUM types: uno, duas, una, duo, viginti, triginta, tres, quinquaginta, unum, decem

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: una (ADV 597, NUM 278), unam (NUM 27, ADV 1)

Morphology

The form / lemma ratio of NUM is 2.750000 (the average of all parts of speech is 2.628626).

The 1st highest number of forms (9) was observed with the lemma “duo”: dua, duabus, duae, duas, due, dues, duo, duobus, duos.

The 2nd highest number of forms (8) was observed with the lemma “quadraginta”: quadraginta, quaraginta, quatragenta, quatragentas, quatraginta, quatragintas, quatraientas, quatroginta.

The 3rd highest number of forms (6) was observed with the lemma “unus”: una, unam, uno, unu, unum, unus.

NUM occurs with 4 features: NumType (2417; 100% instances), Case (1545; 64% instances), Gender (1545; 64% instances), Number (1545; 64% instances)

NUM occurs with 9 feature-value pairs: Case=Abl, Case=Acc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, Number=Plur, Number=Sing

NUM occurs with 18 feature combinations. The most frequent feature combination is NumType=Card (872 tokens). Examples: uno, duas, una, viginti, triginta, quinquaginta, duo, decem, sex, quattuor

Relations

NUM nodes are attached to their parents using 16 different relations: nummod (2165; 90% instances), conj (81; 3% instances), compound (61; 3% instances), nsubj (40; 2% instances), nsubj:pass (33; 1% instances), obl (15; 1% instances), obj (7; 0% instances), xcomp (4; 0% instances), dislocated:nsubj (3; 0% instances), parataxis (2; 0% instances), acl:relcl (1; 0% instances), ccomp (1; 0% instances), dislocated:obj (1; 0% instances), dislocated:obl (1; 0% instances), iobj (1; 0% instances), root (1; 0% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (2164; 90% instances), NUM (138; 6% instances), VERB (108; 4% instances), PRON (3; 0% instances), DET (2; 0% instances), ADJ (1; 0% instances), (1; 0% instances)

2092 (87%) NUM nodes are leaves.

240 (10%) NUM nodes have one child.

55 (2%) NUM nodes have two children.

30 (1%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 14 different relations: nmod (142; 31% instances), conj (106; 23% instances), cc (83; 18% instances), compound (61; 13% instances), punct (20; 4% instances), case (14; 3% instances), acl:relcl (13; 3% instances), advmod (6; 1% instances), acl (4; 1% instances), cop (2; 0% instances), obl (2; 0% instances), nsubj (1; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)

Children of NUM nodes belong to 11 different parts of speech: NUM (138; 30% instances), CCONJ (85; 19% instances), DET (83; 18% instances), NOUN (78; 17% instances), PUNCT (20; 4% instances), VERB (19; 4% instances), ADP (14; 3% instances), PRON (8; 2% instances), ADJ (5; 1% instances), ADV (4; 1% instances), AUX (2; 0% instances)