home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-MarkIT: POS Tags: NUM

There are 83 NUM lemmas (2%), 84 NUM types (1%) and 163 NUM tokens (0%). Out of 15 observed tags, the rank of NUM is: 6 in number of lemmas, 9 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: due, tre, 2001, 2016, 2013, novecento, quattro, 1, 13, 2

The 10 most frequent NUM types: due, tre, 2001, 2016, 2013, Novecento, quattro, 1, 13, 2

The 10 most frequent ambiguous lemmas: novecento (NUM 4, NOUN 1), uno (DET 857, PRON 25, NUM 2, ADJ 1)

The 10 most frequent ambiguous types: Novecento (NUM 4, NOUN 2), sei (NUM 1, VERB 1), un (DET 438, NUM 1, PRON 1), uno (DET 26, PRON 17, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.012048 (the average of all parts of speech is 1.453995).

The 1st highest number of forms (2) was observed with the lemma “uno”: un, uno.

The 2nd highest number of forms (1) was observed with the lemma “’900”: ‘900.

The 3rd highest number of forms (1) was observed with the lemma “1”: 1.

NUM occurs with 1 features: NumType (70; 43% instances)

NUM occurs with 1 feature-value pairs: NumType=Card

NUM occurs with 2 feature combinations. The most frequent feature combination is _ (93 tokens). Examples: 2001, 2016, 2013, 1, 13, 2, 9, 15, 1600, 1776

Relations

NUM nodes are attached to their parents using 10 different relations: nummod (93; 57% instances), obl (25; 15% instances), nmod (23; 14% instances), flat (11; 7% instances), conj (3; 2% instances), nsubj (3; 2% instances), parataxis (2; 1% instances), compound (1; 1% instances), dislocated (1; 1% instances), root (1; 1% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (108; 66% instances), VERB (27; 17% instances), NUM (12; 7% instances), PROPN (8; 5% instances), ADJ (6; 4% instances), (1; 1% instances), X (1; 1% instances)

92 (56%) NUM nodes are leaves.

15 (9%) NUM nodes have one child.

39 (24%) NUM nodes have two children.

17 (10%) NUM nodes have three or more children.

The highest child degree of a NUM node is 6.

Children of NUM nodes are attached using 14 different relations: det (50; 32% instances), case (45; 29% instances), flat (19; 12% instances), punct (16; 10% instances), advmod (6; 4% instances), nmod (5; 3% instances), amod (3; 2% instances), cc (3; 2% instances), conj (2; 1% instances), obl (2; 1% instances), advcl (1; 1% instances), appos (1; 1% instances), cop (1; 1% instances), nsubj (1; 1% instances)

Children of NUM nodes belong to 11 different parts of speech: DET (50; 32% instances), ADP (44; 28% instances), NOUN (18; 12% instances), PUNCT (16; 10% instances), NUM (12; 8% instances), ADV (6; 4% instances), ADJ (3; 2% instances), CCONJ (3; 2% instances), AUX (1; 1% instances), PRON (1; 1% instances), X (1; 1% instances)