home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-PoSTWITA: POS Tags: PROPN

There are 2622 PROPN lemmas (18%), 2643 PROPN types (14%) and 10489 PROPN tokens (8%). Out of 16 observed tags, the rank of PROPN is: 3 in number of lemmas, 4 in number of types and 7 in number of tokens.

The 10 most frequent PROPN lemmas: monti, mario, italia, berlusconi, roma, pd, lega, pdl, twitter, napolitano

The 10 most frequent PROPN types: monti, mario, italia, berlusconi, roma, pd, lega, pdl, twitter, napolitano

The 10 most frequent ambiguous lemmas: lega (PROPN 17, NOUN 1), di (ADP 4341, PROPN 8, DET 4, X 1), porta (NOUN 12, PROPN 4), il (DET 11006, PROPN 5, PRON 2), euro (NOUN 29, PROPN 18, X 1), grillo (NOUN 3, PROPN 1), repubblica (NOUN 8, PROPN 1), la (PRON 150, PROPN 10, ADP 1, X 1), fatto (NOUN 35, VERB 2, PROPN 1), 5 (NUM 63, PROPN 26)

The 10 most frequent ambiguous types: monti (PROPN 168, NOUN 4), lega (PROPN 17, NOUN 1, VERB 1), di (ADP 4192, PROPN 8, PRON 2, VERB 1, X 1), porta (VERB 7, NOUN 6, PROPN 4), il (DET 3956, PROPN 5), letta (PROPN 2, VERB 1), passera (PROPN 3, NOUN 2), euro (NOUN 28, PROPN 18, X 1), repubblica (NOUN 5, PROPN 1), la (DET 2320, PRON 142, PROPN 10, ADP 1, X 1)

Morphology

The form / lemma ratio of PROPN is 1.008009 (the average of all parts of speech is 1.304759).

The 1st highest number of forms (3) was observed with the lemma “michele”: MIchele, Michel, Michele.

The 2nd highest number of forms (2) was observed with the lemma “Beppe”: Beppe, Beppeee.

The 3rd highest number of forms (2) was observed with the lemma “Fornero”: FORNERO, Fornero.

PROPN occurs with 1 features: Gender (1; 0% instances)

PROPN occurs with 1 feature-value pairs: Gender=Masc

PROPN occurs with 2 feature combinations. The most frequent feature combination is _ (10488 tokens). Examples: monti, mario, italia, berlusconi, roma, pd, lega, pdl, twitter, napolitano

Relations

PROPN nodes are attached to their parents using 32 different relations: nmod (3256; 31% instances), flat:name (2438; 23% instances), nsubj (1167; 11% instances), parataxis (782; 7% instances), obl (765; 7% instances), conj (454; 4% instances), root (359; 3% instances), obj (348; 3% instances), flat (193; 2% instances), vocative (192; 2% instances), appos (137; 1% instances), list (131; 1% instances), nsubj:pass (51; 0% instances), obl:agent (45; 0% instances), xcomp (24; 0% instances), dep (22; 0% instances), parataxis:nsubj (17; 0% instances), parataxis:appos (15; 0% instances), ccomp (14; 0% instances), dislocated (13; 0% instances), discourse (12; 0% instances), compound (11; 0% instances), advcl (10; 0% instances), flat:foreign (9; 0% instances), parataxis:obj (7; 0% instances), acl:relcl (5; 0% instances), orphan (5; 0% instances), amod (3; 0% instances), acl (1; 0% instances), csubj (1; 0% instances), goeswith (1; 0% instances), iobj (1; 0% instances)

Parents of PROPN nodes belong to 14 different parts of speech: NOUN (3656; 35% instances), PROPN (3193; 30% instances), VERB (2449; 23% instances), (359; 3% instances), SYM (262; 2% instances), ADJ (217; 2% instances), PRON (103; 1% instances), INTJ (91; 1% instances), X (76; 1% instances), ADV (64; 1% instances), NUM (15; 0% instances), ADP (2; 0% instances), AUX (1; 0% instances), DET (1; 0% instances)

4849 (46%) PROPN nodes are leaves.

2470 (24%) PROPN nodes have one child.

1780 (17%) PROPN nodes have two children.

1390 (13%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 19.

Children of PROPN nodes are attached using 40 different relations: flat:name (2521; 22% instances), case (2218; 19% instances), punct (1978; 17% instances), det (1365; 12% instances), conj (502; 4% instances), nmod (491; 4% instances), dep (310; 3% instances), cc (299; 3% instances), parataxis (298; 3% instances), appos (232; 2% instances), amod (192; 2% instances), nummod (182; 2% instances), advmod (148; 1% instances), parataxis:hashtag (128; 1% instances), flat (125; 1% instances), vocative:mention (100; 1% instances), cop (92; 1% instances), nsubj (67; 1% instances), discourse (61; 1% instances), acl:relcl (43; 0% instances), flat:foreign (33; 0% instances), acl (31; 0% instances), obl (30; 0% instances), list (29; 0% instances), mark (22; 0% instances), parataxis:appos (22; 0% instances), discourse:emo (17; 0% instances), det:poss (15; 0% instances), compound (12; 0% instances), vocative (7; 0% instances), det:predet (5; 0% instances), orphan (5; 0% instances), aux (4; 0% instances), parataxis:insert (3; 0% instances), advcl (2; 0% instances), csubj (1; 0% instances), iobj (1; 0% instances), obj (1; 0% instances), parataxis:discourse (1; 0% instances), parataxis:nsubj (1; 0% instances)

Children of PROPN nodes belong to 16 different parts of speech: PROPN (3193; 28% instances), ADP (2200; 19% instances), PUNCT (1978; 17% instances), DET (1388; 12% instances), SYM (764; 7% instances), NOUN (642; 6% instances), CCONJ (298; 3% instances), NUM (223; 2% instances), ADJ (217; 2% instances), VERB (185; 2% instances), ADV (174; 2% instances), X (101; 1% instances), AUX (96; 1% instances), PRON (58; 1% instances), INTJ (54; 0% instances), SCONJ (23; 0% instances)