home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latin-CIRCSE: POS Tags: NUM

There are 11 NUM lemmas (0%), 22 NUM types (0%) and 76 NUM tokens (0%). Out of 14 observed tags, the rank of NUM is: 12 in number of lemmas, 11 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: unus, duo, mille, decem, septem, tres, centum, ducenti, quinque, sex

The 10 most frequent NUM types: una, uno, unum, decem, duos, mille, septem, unus, uni, unius

The 10 most frequent ambiguous lemmas: unus (NUM 40, DET 18)

The 10 most frequent ambiguous types: una (NUM 14, ADV 2, DET 2), uno (NUM 10, DET 3), unum (DET 7, NUM 6), unus (NUM 4, DET 2), uni (NUM 3, DET 2)

Morphology

The form / lemma ratio of NUM is 2.000000 (the average of all parts of speech is 2.409205).

The 1st highest number of forms (6) was observed with the lemma “unus”: una, uni, unius, uno, unum, unus.

The 2nd highest number of forms (4) was observed with the lemma “duo”: duas, duo, duobus, duos.

The 3rd highest number of forms (3) was observed with the lemma “tres”: tria, tribus, tris.

NUM occurs with 6 features: NumType (76; 100% instances), InflClass (71; 93% instances), Number (71; 93% instances), Case (56; 74% instances), Gender (56; 74% instances), NumForm (9; 12% instances)

NUM occurs with 19 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Fem,Masc,Neut, Gender=Masc, Gender=Masc,Neut, Gender=Neut, InflClass=Ind, InflClass=IndEurA, InflClass=IndEurI, InflClass=IndEurO, InflClass=LatPron, NumForm=Word, NumType=Card, Number=Plur, Number=Sing

NUM occurs with 27 feature combinations. The most frequent feature combination is InflClass=Ind|Number=Plur|NumType=Card (15 tokens). Examples: mille, septem, decem, centum

Relations

NUM nodes are attached to their parents using 9 different relations: nummod (58; 76% instances), obj (5; 7% instances), nsubj (4; 5% instances), conj (2; 3% instances), obl (2; 3% instances), root (2; 3% instances), advcl:pred (1; 1% instances), flat (1; 1% instances), obl:arg (1; 1% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (51; 67% instances), VERB (13; 17% instances), PROPN (4; 5% instances), PRON (3; 4% instances), NUM (2; 3% instances), (2; 3% instances), DET (1; 1% instances)

62 (82%) NUM nodes are leaves.

10 (13%) NUM nodes have one child.

1 (1%) NUM nodes have two children.

3 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.

Children of NUM nodes are attached using 15 different relations: cc (4; 17% instances), acl:relcl (2; 8% instances), advmod:neg (2; 8% instances), conj (2; 8% instances), cop (2; 8% instances), nsubj (2; 8% instances), orphan (2; 8% instances), acl (1; 4% instances), advcl (1; 4% instances), advcl:cmp (1; 4% instances), advcl:relcl (1; 4% instances), case (1; 4% instances), flat (1; 4% instances), nmod (1; 4% instances), obl:lmod (1; 4% instances)

Children of NUM nodes belong to 8 different parts of speech: VERB (7; 29% instances), CCONJ (4; 17% instances), NOUN (4; 17% instances), ADV (2; 8% instances), AUX (2; 8% instances), NUM (2; 8% instances), PART (2; 8% instances), ADP (1; 4% instances)