Treebank Statistics: UD_Italian-VIT: POS Tags: PROPN
There are 4072 PROPN lemmas (23% of all lemmas), 4074 PROPN types (16% of all types) and 11885 PROPN tokens (4% of all tokens).
Out of 17 observed tags, PROPN ranks 2nd in number of lemmas, 4th in number of types and 7th in number of tokens.
The 10 most frequent PROPN lemmas: Italia, Roma, Milano, Francesco, Europa, De, d’, Berlusconi, Stati, Uniti
The 10 most frequent PROPN types: Italia, Roma, Milano, Francesco, Europa, De, d’, Berlusconi, Stati, Uniti
The 10 most frequent ambiguous lemmas: Berlusconi (PROPN 54, NOUN 1), Camera (PROPN 51, NOUN 1), Carlo (PROPN 46, PRON 1), Di (PROPN 30, ADP 1), Enel (PROPN 27, NOUN 3), Spa (PROPN 25, NOUN 1), Dc (PROPN 22, NOUN 1), Mosca (PROPN 16, NOUN 1), Psi (PROPN 15, NOUN 1), Iva (PROPN 14, NOUN 1)
The 10 most frequent ambiguous types: d’ (ADP 373, PROPN 5), Berlusconi (PROPN 54, NOUN 1), Carlo (PROPN 46, PRON 1), Via (PROPN 38, ADP 1, ADV 1, NOUN 1), Di (ADP 77, PROPN 30), Enel (PROPN 27, NOUN 3), gran (ADJ 25, PROPN 2), Spa (PROPN 25, NOUN 1), Piazza (PROPN 24, NOUN 2), Centro (PROPN 22, NOUN 1)
- d’
  - ADP 373: E che cosa fanno i dirigenti in le ore d’ ufficio ?
  - PROPN 5: Bruxelles - il governo belga ha avviato una trattativa per cedere la propria quota di l’ 82,3 % in il capitale di la holding pubblica Société Nationale d’ Investissement ( Sni ) a il gruppo privato Ackermans & Van Haaren di Anversa .
- Berlusconi
- Carlo
- Via
  - PROPN 38: Arrivammo in il tardo pomeriggio a l’ Excelsior di Via Veneto .
  - ADP 1: Via radio le comunicazioni tra i diversi “ posti di blocco “ : “ scuoiate li subito , altrimenti con questo caldo marciscono “ .
  - ADV 1: Via i bottoni via anche il ricamo , sì a l’ uso fresco di le piume , di il tulle , di il merletto in tutti i suoi più divertenti disegni .
  - NOUN 1: Via libera a i Boc : per i risparmiatori ora ci sono anche i titoli di città .
- Di
- Enel
- gran
- Spa
- Piazza
- Centro
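A minimal sketch of how ambiguous types like those listed above could be detected: a form counts as ambiguous here when it occurs with PROPN and with at least one other tag. The `observations` list and the `ambiguous_types` helper are hypothetical and not taken from the treebank or its tooling; forms are compared case-sensitively, which is an assumption.

```python
from collections import Counter, defaultdict

# Hypothetical (form, UPOS) observations for illustration only.
observations = [
    ("Via", "PROPN"), ("Via", "PROPN"), ("Via", "ADP"),
    ("Roma", "PROPN"), ("Roma", "PROPN"),
    ("Di", "ADP"), ("Di", "ADP"), ("Di", "PROPN"),
]

def ambiguous_types(pairs, tag="PROPN"):
    """Map each form seen with `tag` and at least one other tag to its tag counts."""
    tags_by_form = defaultdict(Counter)
    for form, upos in pairs:
        tags_by_form[form][upos] += 1
    return {
        form: tags
        for form, tags in tags_by_form.items()
        if tag in tags and len(tags) > 1
    }

result = ambiguous_types(observations)
for form, tags in result.items():
    # most_common() lists the tags from most to least frequent.
    print(form, tags.most_common())
```

"Roma" is excluded because it only ever appears as PROPN in the sample.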
Morphology
The form / lemma ratio of PROPN is 1.000491 (the average of all parts of speech is 1.502699).
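The reported ratio is consistent with dividing the number of PROPN types by the number of PROPN lemmas given at the top of this page. The sketch below only checks that arithmetic; the function name is hypothetical, and whether the statistics were originally computed exactly this way is an assumption.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas

# Counts reported for PROPN on this page: 4074 types, 4072 lemmas.
ratio = form_lemma_ratio(4074, 4072)
print(f"{ratio:.6f}")
```

A ratio this close to 1 indicates that proper nouns almost never vary in form.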
The highest number of forms (2) was observed with the lemma “Bbc”: BBC, Bbc.
The 2nd highest number of forms (1) was observed with the lemma “&”: &.
The 3rd highest number of forms (1) was observed with the lemma “1”: 1.
PROPN does not occur with any features.
Relations
PROPN nodes are attached to their parents using 21 different relations: nmod (3731; 31% instances), flat:name (2895; 24% instances), nsubj (1646; 14% instances), obl (1238; 10% instances), conj (1036; 9% instances), appos (398; 3% instances), root (284; 2% instances), obj (205; 2% instances), obl:agent (194; 2% instances), nsubj:pass (123; 1% instances), parataxis (83; 1% instances), xcomp (12; 0% instances), vocative (10; 0% instances), ccomp (8; 0% instances), advcl (6; 0% instances), acl:relcl (4; 0% instances), compound (3; 0% instances), flat (3; 0% instances), acl (2; 0% instances), dislocated (2; 0% instances), list (2; 0% instances)
Parents of PROPN nodes belong to 13 different parts of speech: PROPN (4240; 36% instances), NOUN (3818; 32% instances), VERB (3151; 27% instances), [no POS: artificial root] (284; 2% instances), ADJ (157; 1% instances), PRON (102; 1% instances), NUM (47; 0% instances), ADV (31; 0% instances), SYM (29; 0% instances), X (21; 0% instances), ADP (2; 0% instances), INTJ (2; 0% instances), CCONJ (1; 0% instances)
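Counts like the relation and parent-POS figures above could be derived from a CoNLL-U file roughly as follows. This is a sketch under stated assumptions: the two sample sentences are invented for illustration (they are not treebank data), and `parse_conllu` and `propn_attachment_counts` are hypothetical helpers, not part of any UD tool. A PROPN attached to the root has head 0, so its parent has no POS and is counted under an empty label.

```python
from collections import Counter

# Two hypothetical sentences in CoNLL-U column order
# (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).
ROWS = [
    [
        ("1", "Carlo", "Carlo", "PROPN", "_", "_", "2", "nsubj", "_", "_"),
        ("2", "visita", "visitare", "VERB", "_", "_", "0", "root", "_", "_"),
        ("3", "Via", "Via", "PROPN", "_", "_", "2", "obj", "_", "_"),
        ("4", "Veneto", "Veneto", "PROPN", "_", "_", "3", "flat:name", "_", "_"),
        ("5", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"),
    ],
    [
        ("1", "Roma", "Roma", "PROPN", "_", "_", "0", "root", "_", "_"),
        ("2", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"),
    ],
]
# Tab-separated columns, blank line between sentences.
SAMPLE = "\n\n".join("\n".join("\t".join(r) for r in s) for s in ROWS) + "\n"

def parse_conllu(text):
    """Parse CoNLL-U text into a list of sentences (lists of token dicts)."""
    sentences, current = [], []
    for line in text.splitlines():
        if not line.strip():  # a blank line ends the current sentence
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):  # skip comment lines
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():  # skip multiword ranges and empty nodes
            continue
        current.append({
            "id": int(cols[0]), "upos": cols[3],
            "head": int(cols[6]), "deprel": cols[7],
        })
    if current:
        sentences.append(current)
    return sentences

def propn_attachment_counts(sentences):
    """Count relations and parent POS tags of PROPN nodes."""
    deprels, parent_pos = Counter(), Counter()
    for sent in sentences:
        upos_by_id = {tok["id"]: tok["upos"] for tok in sent}
        for tok in sent:
            if tok["upos"] != "PROPN":
                continue
            deprels[tok["deprel"]] += 1
            # Head 0 is the artificial root, which has no POS tag.
            parent_pos[upos_by_id.get(tok["head"], "")] += 1
    return deprels, parent_pos

deprels, parent_pos = propn_attachment_counts(parse_conllu(SAMPLE))
print(deprels.most_common())
print(parent_pos.most_common())
```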
4231 (36%) PROPN nodes are leaves.
2788 (23%) PROPN nodes have one child.
2430 (20%) PROPN nodes have two children.
2436 (20%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 14.
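The child-degree breakdown above can be reproduced in principle by counting, for each PROPN node, how many tokens name it as their head. The sketch below uses a hypothetical five-token sentence given as (id, UPOS, head) triples; the data and function names are illustrative assumptions, not treebank content.

```python
from collections import Counter

# Hypothetical sentence "Carlo visita Via Veneto ." as (id, upos, head) triples.
SENTENCE = [
    (1, "PROPN", 2),  # Carlo
    (2, "VERB", 0),   # visita
    (3, "PROPN", 2),  # Via
    (4, "PROPN", 3),  # Veneto, attached to Via
    (5, "PUNCT", 2),  # .
]

def propn_child_degrees(sentence):
    """Return the number of children of each PROPN node, in sentence order."""
    n_children = Counter(head for _, _, head in sentence)
    return [n_children[tok_id] for tok_id, upos, _ in sentence if upos == "PROPN"]

def bucket(degrees):
    """Group degrees into the four classes used in the statistics above."""
    labels = {0: "leaf", 1: "one child", 2: "two children"}
    return Counter(labels.get(d, "three or more children") for d in degrees)

degrees = propn_child_degrees(SENTENCE)
buckets = bucket(degrees)
print(buckets)
print("highest child degree:", max(degrees))
```

On the full treebank the four buckets should sum to the PROPN token count; the reported figures do (4231 + 2788 + 2430 + 2436 = 11885).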
Children of PROPN nodes are attached using 31 different relations: case (4226; 24% instances), det (2906; 17% instances), flat:name (2869; 17% instances), punct (2578; 15% instances), conj (1114; 6% instances), nmod (930; 5% instances), appos (616; 4% instances), cc (577; 3% instances), amod (436; 3% instances), acl:relcl (280; 2% instances), nummod (157; 1% instances), advmod (155; 1% instances), advcl (95; 1% instances), cop (71; 0% instances), parataxis (55; 0% instances), acl (54; 0% instances), nsubj (45; 0% instances), compound (25; 0% instances), det:poss (21; 0% instances), obl (20; 0% instances), aux (14; 0% instances), ccomp (14; 0% instances), mark (12; 0% instances), det:predet (5; 0% instances), xcomp (4; 0% instances), flat (3; 0% instances), list (2; 0% instances), obj (2; 0% instances), discourse (1; 0% instances), obl:agent (1; 0% instances), orphan (1; 0% instances)
Children of PROPN nodes belong to 17 different parts of speech: PROPN (4240; 25% instances), ADP (4196; 24% instances), DET (2932; 17% instances), PUNCT (2578; 15% instances), NOUN (1223; 7% instances), CCONJ (575; 3% instances), ADJ (454; 3% instances), VERB (451; 3% instances), NUM (225; 1% instances), ADV (198; 1% instances), AUX (85; 0% instances), PRON (53; 0% instances), SYM (47; 0% instances), X (18; 0% instances), SCONJ (11; 0% instances), INTJ (2; 0% instances), PART (1; 0% instances)