home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latin-CIRCSE: POS Tags: NUM

There are 11 NUM lemmas (0%), 20 NUM types (0%) and 48 NUM tokens (0%). Out of 14 observed tags, the rank of NUM is: 12 in number of lemmas, 11 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: unus, mille, tres, decem, centum, duo, septem, ducenti, quinque, sex

The 10 most frequent NUM types: una, uno, decem, mille, unum, unus, centum, septem, tria, tris

The 10 most frequent ambiguous lemmas: unus (NUM 24, DET 11)

The 10 most frequent ambiguous types: una (NUM 8, ADV 1, DET 1), uno (NUM 5, DET 2), unum (DET 4, NUM 4), unus (NUM 4, DET 1), uni (DET 2, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.818182 (the average of all parts of speech is 2.130677).

The 1st highest number of forms (6) was observed with the lemma “unus”: una, uni, unius, uno, unum, unus.

The 2nd highest number of forms (3) was observed with the lemma “tres”: tria, tribus, tris.

The 3rd highest number of forms (2) was observed with the lemma “duo”: duo, duos.

NUM occurs with 6 features: NumType (48; 100% instances), InflClass (43; 90% instances), Number (43; 90% instances), Case (33; 69% instances), Gender (33; 69% instances), NumForm (9; 19% instances)

NUM occurs with 19 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Fem,Masc,Neut, Gender=Masc, Gender=Masc,Neut, Gender=Neut, InflClass=Ind, InflClass=IndEurA, InflClass=IndEurI, InflClass=IndEurO, InflClass=LatPron, NumForm=Word, NumType=Card, Number=Plur, Number=Sing

NUM occurs with 21 feature combinations. The most frequent feature combination is InflClass=Ind|Number=Plur|NumType=Card (10 tokens). Examples: mille, decem, septem, centum

Relations

NUM nodes are attached to their parents using 7 different relations: nummod (40; 83% instances), conj (2; 4% instances), nsubj (2; 4% instances), flat (1; 2% instances), obj (1; 2% instances), obl (1; 2% instances), obl:arg (1; 2% instances)

Parents of NUM nodes belong to 6 different parts of speech: NOUN (35; 73% instances), VERB (6; 13% instances), PROPN (3; 6% instances), NUM (2; 4% instances), DET (1; 2% instances), PRON (1; 2% instances)

40 (83%) NUM nodes are leaves.

6 (13%) NUM nodes have one child.

1 (2%) NUM nodes have two children.

1 (2%) NUM nodes have three or more children.

The highest child degree of a NUM node is 3.

Children of NUM nodes are attached using 9 different relations: cc (3; 27% instances), acl:relcl (1; 9% instances), advmod:neg (1; 9% instances), case (1; 9% instances), conj (1; 9% instances), cop (1; 9% instances), flat (1; 9% instances), nsubj (1; 9% instances), obl:lmod (1; 9% instances)

Children of NUM nodes belong to 7 different parts of speech: CCONJ (3; 27% instances), NOUN (2; 18% instances), NUM (2; 18% instances), ADP (1; 9% instances), AUX (1; 9% instances), PART (1; 9% instances), VERB (1; 9% instances)