Treebank Statistics: UD_Italian-PUD: POS Tags: NOUN
There are 1799 NOUN
lemmas (36%), 2112 NOUN
types (32%) and 4392 NOUN
tokens (19%).
Out of 16 observed tags, the rank of NOUN
is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN
lemmas: anno, parte, volta, città, persona, giorno, governo, tempo, guerra, stato
The 10 most frequent NOUN
types: anni, parte, città, anno, persone, governo, volta, guerra, secolo, stato
The 10 most frequent ambiguous lemmas: anno (NOUN 74, PROPN 2), guerra (NOUN 21, PROPN 4), secolo (NOUN 18, PROPN 2), fine (NOUN 17, ADJ 1), mondo (NOUN 17, PROPN 1), media (NOUN 11, X 2), pubblico (NOUN 11, ADJ 8, ADV 1), elezione (NOUN 9, VERB 1), fiume (NOUN 9, PROPN 1), altro (DET 28, ADJ 9, NOUN 8)
The 10 most frequent ambiguous types: anni (NOUN 51, PROPN 2), guerra (NOUN 20, PROPN 4), secolo (NOUN 18, PROPN 2), stato (AUX 30, NOUN 17, VERB 4), mondo (NOUN 17, PROPN 1), media (NOUN 13, X 2, ADJ 1), pubblico (NOUN 11, ADJ 4), seguito (NOUN 11, VERB 2), dati (NOUN 10, VERB 1), elezioni (NOUN 7, VERB 1)
- anni
- guerra
- secolo
- stato
- mondo
- media
- NOUN 13: È il tumore più comune riscontrato in i neonati , con una media di uno su 35.000 nascite .
- X 2: Per chiunque segua le transizioni di Capitol Hill su i social media , questo sarà leggermente diverso .
- ADJ 1: Ma la crescente classe media cinese è stata stranamente esplicita in le sue proteste contro la tossicità di l’ aria in città come Pechino , dove ogni giorno lo smog mette sempre più a rischio i polmoni di i cittadini .
- pubblico
- seguito
- dati
- elezioni
Morphology
The form / lemma ratio of NOUN
is 1.173986 (the average of all parts of speech is 1.285799).
The 1st highest number of forms (5) was observed with the lemma “ragazzo”: ragazza, ragazze, ragazzi, ragazzini, ragazzo.
The 2nd highest number of forms (4) was observed with the lemma “figlio”: figli, figlia, figlie, figlio.
The 3rd highest number of forms (3) was observed with the lemma “altro”: altra, altri, altro.
NOUN
occurs with 3 features: Number (4385; 100% instances), Gender (4384; 100% instances), PronType (2; 0% instances)
NOUN
occurs with 5 feature-value pairs: Gender=Fem
, Gender=Masc
, Number=Plur
, Number=Sing
, PronType=Ind
NOUN
occurs with 8 feature combinations.
The most frequent feature combination is Gender=Masc|Number=Sing
(1692 tokens).
Examples: anno, governo, stato, tempo, giorno, mondo, numero, secolo, periodo, lavoro
Relations
NOUN
nodes are attached to their parents using 23 different relations: nmod (1197; 27% instances), obl (1154; 26% instances), obj (697; 16% instances), nsubj (632; 14% instances), conj (279; 6% instances), nsubj:pass (115; 3% instances), root (89; 2% instances), appos (54; 1% instances), xcomp (42; 1% instances), compound (30; 1% instances), flat (25; 1% instances), ccomp (15; 0% instances), fixed (13; 0% instances), parataxis (13; 0% instances), acl:relcl (11; 0% instances), advcl (10; 0% instances), obl:tmod (8; 0% instances), acl (2; 0% instances), vocative (2; 0% instances), amod (1; 0% instances), case (1; 0% instances), mark (1; 0% instances), orphan (1; 0% instances)
Parents of NOUN
nodes belong to 12 different parts of speech: VERB (2407; 55% instances), NOUN (1510; 34% instances), ADJ (186; 4% instances), (89; 2% instances), PROPN (80; 2% instances), NUM (64; 1% instances), PRON (15; 0% instances), ADP (12; 0% instances), SYM (11; 0% instances), DET (10; 0% instances), ADV (7; 0% instances), AUX (1; 0% instances)
144 (3%) NOUN
nodes are leaves.
911 (21%) NOUN
nodes have one child.
1471 (33%) NOUN
nodes have two children.
1866 (42%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 11.
Children of NOUN
nodes are attached using 30 different relations: det (3028; 28% instances), case (2389; 22% instances), nmod (1664; 16% instances), amod (1234; 12% instances), punct (553; 5% instances), conj (278; 3% instances), det:poss (229; 2% instances), cc (228; 2% instances), acl (190; 2% instances), acl:relcl (185; 2% instances), nummod (149; 1% instances), cop (132; 1% instances), advmod (123; 1% instances), nsubj (97; 1% instances), compound (46; 0% instances), obl (43; 0% instances), appos (36; 0% instances), flat (26; 0% instances), mark (26; 0% instances), obl:tmod (13; 0% instances), aux (12; 0% instances), parataxis (12; 0% instances), advcl (9; 0% instances), ccomp (7; 0% instances), csubj (6; 0% instances), xcomp (4; 0% instances), fixed (3; 0% instances), expl (1; 0% instances), obj (1; 0% instances), orphan (1; 0% instances)
Children of NOUN
nodes belong to 15 different parts of speech: DET (3034; 28% instances), ADP (2373; 22% instances), NOUN (1510; 14% instances), ADJ (1259; 12% instances), PUNCT (553; 5% instances), PROPN (526; 5% instances), VERB (402; 4% instances), PRON (267; 2% instances), NUM (230; 2% instances), CCONJ (224; 2% instances), ADV (151; 1% instances), AUX (144; 1% instances), SCONJ (20; 0% instances), SYM (19; 0% instances), X (13; 0% instances)