Treebank Statistics: UD_Italian-KIParlaForest: POS Tags: NUM
There are 32 NUM lemmas (2%), 35 NUM types (2%) and 92 NUM tokens (1%).
Out of 15 observed tags, the rank of NUM is: 9 in number of lemmas, 10 in number of types and 14 in number of tokens.
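The lemma, type and token counts above can be reproduced from a CoNLL-U file along the following lines. This is a minimal sketch, not the script that generated this page: the function name `count_upos`, the helper `make_row` and the toy sentences are hypothetical, and counting types case-insensitively is an assumption.

```python
def make_row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U token line (unused columns set to '_')."""
    return "\t".join([idx, form, lemma, upos, "_", "_", head, deprel, "_", "_"])


# Toy data for illustration only; these sentences are not taken from the treebank.
SAMPLE = "\n".join([
    "# text = le prime due case",
    make_row("1", "le", "il", "DET", "4", "det"),
    make_row("2", "prime", "primo", "NUM", "4", "amod"),
    make_row("3", "due", "due", "NUM", "4", "nummod"),
    make_row("4", "case", "casa", "NOUN", "0", "root"),
    "",
    "# text = il primo",
    make_row("1", "il", "il", "DET", "2", "det"),
    make_row("2", "primo", "primo", "NUM", "0", "root"),
])


def count_upos(conllu_text, upos):
    """Return (distinct lemmas, distinct types, tokens) for one UPOS tag."""
    lemmas, forms, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            forms.add(cols[1].lower())   # assumption: types are case-insensitive
            lemmas.add(cols[2].lower())
    return len(lemmas), len(forms), tokens


print(count_upos(SAMPLE, "NUM"))  # (2, 3, 3)
```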
The 10 most frequent NUM lemmas: due, primo, quattro, tre, cinquanta, dodici, quattordici, secondo, trenta, undici
The 10 most frequent NUM types: due, quattro, prima, primi, tre, undici, cinquanta, dodici, quattordici, venti
The 10 most frequent ambiguous lemmas: secondo (ADP 14, NOUN 3, NUM 3)
The 10 most frequent ambiguous types: prima (ADV 14, NUM 6), secondo (ADP 14, NUM 1), sei (AUX 11, NUM 1)
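An ambiguous type, as listed above, is a word form that occurs with more than one part-of-speech tag. One way to find such forms is sketched below; the function name `ambiguous_types` and the toy token list are hypothetical, and the counts in the example are not the treebank's.

```python
from collections import Counter, defaultdict


def ambiguous_types(tagged_tokens, upos):
    """Map each form tagged `upos` at least once, and some other tag too, to its tag counts."""
    by_form = defaultdict(Counter)
    for form, tag in tagged_tokens:
        by_form[form.lower()][tag] += 1
    return {
        form: dict(tags)
        for form, tags in by_form.items()
        if upos in tags and len(tags) > 1
    }


# Toy (form, UPOS) pairs for illustration only.
toy_tokens = [
    ("prima", "ADV"), ("prima", "ADV"), ("prima", "NUM"),
    ("sei", "AUX"), ("sei", "NUM"),
    ("due", "NUM"),
]

print(ambiguous_types(toy_tokens, "NUM"))
```

Here "due" is not reported because it only ever appears as NUM in the toy data.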
Morphology
The form / lemma ratio of NUM is 1.093750 (the average of all parts of speech is 1.372894).
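The reported ratio is consistent with dividing the number of NUM types (35) by the number of NUM lemmas (32). A minimal check, assuming that definition (the function name `form_lemma_ratio` is hypothetical):

```python
def form_lemma_ratio(n_forms, n_lemmas):
    """Average number of distinct word forms per lemma."""
    return n_forms / n_lemmas


# 35 NUM types and 32 NUM lemmas, as reported at the top of this page.
print(f"{form_lemma_ratio(35, 32):.6f}")  # 1.093750
```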
The 1st highest number of forms (3) was observed with the lemma “primo”: prima, primi, primo.
The 2nd highest number of forms (3) was observed with the lemma “secondo”: seconda, secondo, secron~.
The 3rd highest number of forms (2) was observed with the lemma “quattro”: quattro, qua~.
NUM occurs with 3 features: NumType (84; 91% instances), Gender (13; 14% instances), Number (13; 14% instances)
NUM occurs with 6 feature-value pairs: Gender=Fem, Gender=Masc, NumType=Card, NumType=Ord, Number=Plur, Number=Sing
NUM occurs with 7 feature combinations.
The most frequent feature combination is NumType=Card (71 tokens).
Examples: due, quattro, tre, dodici, quattordici, undici, venti, cinquanta, cinque, dieci
Relations
NUM nodes are attached to their parents using 9 different relations: nummod (65; 71% instances), obl (9; 10% instances), conj (4; 4% instances), obj (3; 3% instances), reparandum (3; 3% instances), root (3; 3% instances), amod (2; 2% instances), nsubj (2; 2% instances), parataxis (1; 1% instances)
Parents of NUM nodes belong to 6 different parts of speech: NOUN (62; 67% instances), VERB (17; 18% instances), NUM (6; 7% instances), ADJ (3; 3% instances), [root] (3; 3% instances), X (1; 1% instances)
68 (74%) NUM nodes are leaves.
11 (12%) NUM nodes have one child.
6 (7%) NUM nodes have two children.
7 (8%) NUM nodes have three or more children.
The highest child degree of a NUM node is 7.
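The child-degree percentages above follow from the counts and the 92 NUM tokens. The sketch below recomputes them; the helper `percent_half_up` is hypothetical, and rounding halves up is an assumption about how the page rounds.

```python
def percent_half_up(count, total):
    """Integer percentage of count/total, rounding halves up (integer arithmetic only)."""
    return (200 * count + total) // (2 * total)


# Counts reported above for NUM nodes, keyed by number of children.
child_degree_counts = {"0": 68, "1": 11, "2": 6, "3+": 7}
total_nodes = sum(child_degree_counts.values())  # matches the 92 NUM tokens

shares = {k: percent_half_up(v, total_nodes) for k, v in child_degree_counts.items()}
print(total_nodes, shares)  # 92 {'0': 74, '1': 12, '2': 7, '3+': 8}
```

Because each share is rounded independently, the percentages sum to 101 rather than 100.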
Children of NUM nodes are attached using 15 different relations: det (11; 23% instances), case (10; 21% instances), conj (4; 8% instances), nmod (4; 8% instances), cc (3; 6% instances), obl (3; 6% instances), reparandum (3; 6% instances), discourse (2; 4% instances), parataxis (2; 4% instances), advmod (1; 2% instances), amod (1; 2% instances), cop (1; 2% instances), iobj (1; 2% instances), nummod (1; 2% instances), obj (1; 2% instances)
Children of NUM nodes belong to 12 different parts of speech: DET (11; 23% instances), ADP (8; 17% instances), NOUN (7; 15% instances), NUM (6; 13% instances), CCONJ (4; 8% instances), ADV (3; 6% instances), ADJ (2; 4% instances), INTJ (2; 4% instances), PRON (2; 4% instances), AUX (1; 2% instances), VERB (1; 2% instances), X (1; 2% instances)