home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-TWITTIRO: POS Tags: NUM

There are 130 NUM lemmas (3%), 131 NUM types (2%) and 300 NUM tokens (1%). Out of 16 observed tags, the rank of NUM is: 7 in number of lemmas, 7 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: due, 3, 2, mila, tre, 5, 1, 12, 7, 10

The 10 most frequent NUM types: due, 3, 2, mila, tre, 5, 1, 12, 7, 10

The 10 most frequent ambiguous lemmas: 5 (NUM 9, PROPN 1), 4 (NUM 3, PROPN 1), quattro (NUM 3, ADP 1), guli1979 (NUM 2, SYM 1), @user1 (SYM 76, NUM 1), uno (DET 416, PRON 12, NUM 1)

The 10 most frequent ambiguous types: 3 (NUM 16, ADJ 1), 5 (NUM 9, PROPN 1), 1 (NUM 8, DET 1), 6 (NUM 6, AUX 1), 4 (NUM 3, PROPN 1), guli1979 (NUM 2, SYM 1), @user1 (SYM 76, NUM 1), licei (NOUN 1, NUM 1), sei (AUX 13, NUM 1), sette (NOUN 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.007692 (the average of all parts of speech is 1.274961).

The 1st highest number of forms (2) was observed with the lemma “venti”: vent’, vent’.

The 2nd highest number of forms (1) was observed with the lemma “’10”: ‘10.

The 3rd highest number of forms (1) was observed with the lemma “’50”: ‘50.

NUM occurs with 1 features: NumType (298; 99% instances)

NUM occurs with 1 feature-value pairs: NumType=Card

NUM occurs with 2 feature combinations. The most frequent feature combination is NumType=Card (298 tokens). Examples: due, 3, 2, mila, tre, 1, 12, 5, 7, 10

Relations

NUM nodes are attached to their parents using 13 different relations: nummod (221; 74% instances), obl (23; 8% instances), flat (19; 6% instances), conj (9; 3% instances), nmod (7; 2% instances), parataxis (7; 2% instances), nsubj (3; 1% instances), nsubj:pass (3; 1% instances), obj (3; 1% instances), flat:name (2; 1% instances), dislocated (1; 0% instances), orphan (1; 0% instances), vocative:mention (1; 0% instances)

Parents of NUM nodes belong to 9 different parts of speech: NOUN (197; 66% instances), VERB (43; 14% instances), NUM (24; 8% instances), SYM (13; 4% instances), PROPN (10; 3% instances), ADJ (6; 2% instances), PRON (3; 1% instances), INTJ (2; 1% instances), X (2; 1% instances)

207 (69%) NUM nodes are leaves.

45 (15%) NUM nodes have one child.

35 (12%) NUM nodes have two children.

13 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.

Children of NUM nodes are attached using 12 different relations: case (42; 26% instances), det (35; 22% instances), punct (30; 19% instances), flat (27; 17% instances), conj (7; 4% instances), nmod (6; 4% instances), advmod (5; 3% instances), cc (3; 2% instances), amod (2; 1% instances), acl (1; 1% instances), fixed (1; 1% instances), nummod (1; 1% instances)

Children of NUM nodes belong to 11 different parts of speech: ADP (41; 26% instances), DET (35; 22% instances), PUNCT (30; 19% instances), NUM (24; 15% instances), NOUN (15; 9% instances), ADV (6; 4% instances), CCONJ (3; 2% instances), ADJ (2; 1% instances), SYM (2; 1% instances), PROPN (1; 1% instances), VERB (1; 1% instances)