home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latin-LLCT: POS Tags: NUM

There are 31 NUM lemmas (1%), 82 NUM types (1%) and 1501 NUM tokens (1%). Out of 15 observed tags, the rank of NUM is: 8 in number of lemmas, 8 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: duo, uiginti, triginta, quinquaginta, tres, quattuor, decem, sex, duodecim, quinque

The 10 most frequent NUM types: duas, duo, viginti, triginta, tres, quinquaginta, decem, sex, quattuor, quinque

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NUM is 2.645161 (the average of all parts of speech is 2.623423).

The 1st highest number of forms (9) was observed with the lemma “duo”: dua, duabus, duae, duas, due, dues, duo, duobus, duos.

The 2nd highest number of forms (8) was observed with the lemma “quadraginta”: quadraginta, quaraginta, quatragenta, quatragentas, quatraginta, quatragintas, quatraientas, quatroginta.

The 3rd highest number of forms (5) was observed with the lemma “sexaginta”: sesaginta, sessaginta, sexaginta, sexagintam, sexsaginta.

NUM occurs with 5 features: NumForm (1501; 100% instances), NumType (1501; 100% instances), Case (629; 42% instances), Gender (629; 42% instances), Number (629; 42% instances)

NUM occurs with 10 feature-value pairs: Case=Abl, Case=Acc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, NumForm=Roman, NumForm=Word, NumType=Card, Number=Plur

NUM occurs with 11 feature combinations. The most frequent feature combination is NumForm=Word|NumType=Card (870 tokens). Examples: viginti, triginta, quinquaginta, decem, sex, quattuor, quinque, duodecim, centum, octo

Relations

NUM nodes are attached to their parents using 8 different relations: nummod (1361; 91% instances), conj (74; 5% instances), flat (59; 4% instances), xcomp (3; 0% instances), acl:relcl (1; 0% instances), advcl:cmp (1; 0% instances), nsubj:pass (1; 0% instances), root (1; 0% instances)

Parents of NUM nodes belong to 6 different parts of speech: NOUN (1360; 91% instances), NUM (130; 9% instances), VERB (6; 0% instances), DET (3; 0% instances), ADJ (1; 0% instances), (1; 0% instances)

1283 (85%) NUM nodes are leaves.

208 (14%) NUM nodes have one child.

7 (0%) NUM nodes have two children.

3 (0%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 13 different relations: conj (79; 33% instances), cc (75; 31% instances), flat (59; 25% instances), nmod (12; 5% instances), advmod (4; 2% instances), cop (2; 1% instances), obl (2; 1% instances), punct (2; 1% instances), advmod:emph (1; 0% instances), dep (1; 0% instances), mark (1; 0% instances), nsubj (1; 0% instances), xcomp (1; 0% instances)

Children of NUM nodes belong to 12 different parts of speech: NUM (130; 54% instances), CCONJ (76; 32% instances), NOUN (16; 7% instances), DET (5; 2% instances), ADV (4; 2% instances), AUX (2; 1% instances), PUNCT (2; 1% instances), ADJ (1; 0% instances), PRON (1; 0% instances), SCONJ (1; 0% instances), VERB (1; 0% instances), X (1; 0% instances)