home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-ParTUT: POS Tags: NUM

There are 268 NUM lemmas (4%), 271 NUM types (3%) and 810 NUM tokens (1%). Out of 15 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: due, tre, 1, 6, quattro, 2000, 1999, cinque, 3, 18

The 10 most frequent NUM types: due, tre, 1, 6, quattro, 2000, 1999, cinque, 3, 18

The 10 most frequent ambiguous lemmas: 1 (NUM 21, ADJ 2), uno (DET 938, PRON 41, NUM 2)

The 10 most frequent ambiguous types: 1 (NUM 21, ADJ 2), sei (NUM 8, AUX 1), 5 (NUM 7, CCONJ 1), venti (NUM 3, NOUN 1), un (DET 463, NUM 1, PRON 1), una (DET 338, PRON 10, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.011194 (the average of all parts of speech is 1.488064).

The 1st highest number of forms (2) was observed with the lemma “cinquanta”: cinquant’, cinquanta.

The 2nd highest number of forms (2) was observed with the lemma “uno”: un, una.

The 3rd highest number of forms (2) was observed with the lemma “venti”: vent’, venti.

NUM occurs with 1 features: NumType (810; 100% instances)

NUM occurs with 1 feature-value pairs: NumType=Card

NUM occurs with 1 feature combinations. The most frequent feature combination is NumType=Card (810 tokens). Examples: due, tre, 1, 6, quattro, 2000, 1999, cinque, 3, 18

Relations

NUM nodes are attached to their parents using 11 different relations: nummod (492; 61% instances), obl (131; 16% instances), flat (75; 9% instances), nmod (65; 8% instances), conj (33; 4% instances), nsubj (5; 1% instances), obj (4; 0% instances), appos (2; 0% instances), acl:relcl (1; 0% instances), ccomp (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of NUM nodes belong to 8 different parts of speech: NOUN (424; 52% instances), VERB (166; 20% instances), NUM (120; 15% instances), PROPN (37; 5% instances), SYM (36; 4% instances), ADJ (17; 2% instances), PRON (6; 1% instances), X (4; 0% instances)

419 (52%) NUM nodes are leaves.

64 (8%) NUM nodes have one child.

212 (26%) NUM nodes have two children.

115 (14%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 19 different relations: punct (266; 29% instances), det (209; 23% instances), case (182; 20% instances), flat (98; 11% instances), nummod (38; 4% instances), conj (36; 4% instances), nmod (34; 4% instances), advmod (25; 3% instances), cc (23; 2% instances), fixed (6; 1% instances), amod (2; 0% instances), acl (1; 0% instances), advcl (1; 0% instances), appos (1; 0% instances), cop (1; 0% instances), det:predet (1; 0% instances), mark (1; 0% instances), nsubj (1; 0% instances), parataxis (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: PUNCT (266; 29% instances), DET (211; 23% instances), ADP (182; 20% instances), NUM (120; 13% instances), NOUN (63; 7% instances), ADV (25; 3% instances), CCONJ (23; 2% instances), PROPN (23; 2% instances), X (6; 1% instances), ADJ (2; 0% instances), VERB (2; 0% instances), AUX (1; 0% instances), PRON (1; 0% instances), SCONJ (1; 0% instances), SYM (1; 0% instances)