home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-Old: POS Tags: NUM

There are 23 NUM lemmas (0%), 30 NUM types (0%) and 255 NUM tokens (0%). Out of 16 observed tags, the rank of NUM is: 11 in number of lemmas, 12 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: due, tre, mille, sette, quattro, ambo, cento, cinque, dieci, sei

The 10 most frequent NUM types: due, tre, mille, sette, quattro, ambo, cento, cinque, diece, sei

The 10 most frequent ambiguous lemmas: ambo (NUM 13, PRON 1), dieci (NUM 7, NOUN 1)

The 10 most frequent ambiguous types: diece (NUM 6, NOUN 1), nona (NOUN 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.304348 (the average of all parts of speech is 1.902154).

The 1st highest number of forms (2) was observed with the lemma “cento”: cent’, cento.

The 2nd highest number of forms (2) was observed with the lemma “cinque”: cinqu’, cinque.

The 3rd highest number of forms (2) was observed with the lemma “cinquecento”: cinquecent’, cinquecento.

NUM occurs with 1 features: NumType (255; 100% instances)

NUM occurs with 1 feature-value pairs: NumType=Card

NUM occurs with 1 feature combinations. The most frequent feature combination is NumType=Card (255 tokens). Examples: due, tre, mille, sette, quattro, ambo, cento, cinque, diece, sei

Relations

NUM nodes are attached to their parents using 16 different relations: nummod (183; 72% instances), obj (19; 7% instances), obl (12; 5% instances), conj (11; 4% instances), nsubj (10; 4% instances), nmod (4; 2% instances), flat (3; 1% instances), flat:redup (2; 1% instances), obl:lmod (2; 1% instances), parataxis (2; 1% instances), root (2; 1% instances), acl:relcl (1; 0% instances), advcl:cmp (1; 0% instances), ccomp (1; 0% instances), obl:agent (1; 0% instances), obl:arg (1; 0% instances)

Parents of NUM nodes belong to 8 different parts of speech: NOUN (173; 68% instances), VERB (45; 18% instances), DET (12; 5% instances), NUM (9; 4% instances), PRON (7; 3% instances), ADV (6; 2% instances), (2; 1% instances), PROPN (1; 0% instances)

178 (70%) NUM nodes are leaves.

43 (17%) NUM nodes have one child.

13 (5%) NUM nodes have two children.

21 (8%) NUM nodes have three or more children.

The highest child degree of a NUM node is 6.

Children of NUM nodes are attached using 24 different relations: case (28; 19% instances), det (24; 16% instances), punct (14; 9% instances), cc (12; 8% instances), conj (12; 8% instances), acl:relcl (8; 5% instances), cop (7; 5% instances), advmod (6; 4% instances), nmod (6; 4% instances), advmod:tmod (5; 3% instances), nsubj (4; 3% instances), acl (3; 2% instances), flat (3; 2% instances), mark (3; 2% instances), advcl (2; 1% instances), flat:redup (2; 1% instances), obl:lmod (2; 1% instances), advmod:lmod (1; 1% instances), advmod:neg (1; 1% instances), amod (1; 1% instances), det:poss (1; 1% instances), flat:name (1; 1% instances), obl (1; 1% instances), parataxis (1; 1% instances)

Children of NUM nodes belong to 12 different parts of speech: DET (29; 20% instances), ADP (28; 19% instances), ADV (16; 11% instances), CCONJ (14; 9% instances), PUNCT (14; 9% instances), VERB (12; 8% instances), NOUN (11; 7% instances), NUM (9; 6% instances), AUX (7; 5% instances), PRON (4; 3% instances), ADJ (3; 2% instances), SCONJ (1; 1% instances)