
This page pertains to UD version 2.

Treebank Statistics: UD_Italian-PoSTWITA: POS Tags: PROPN

There are 2611 PROPN lemmas (18%), 2641 PROPN types (14%) and 10486 PROPN tokens (8%). Out of 16 observed tags, the rank of PROPN is: 3 in number of lemmas, 4 in number of types and 7 in number of tokens.

The 10 most frequent PROPN lemmas: monti, mario, italia, berlusconi, roma, pd, lega, pdl, twitter, napolitano

The 10 most frequent PROPN types: monti, mario, italia, berlusconi, roma, pd, lega, pdl, twitter, napolitano

The 10 most frequent ambiguous lemmas: lega (PROPN 17, NOUN 1), di (ADP 4343, PROPN 8, DET 4, X 1), porta (NOUN 12, PROPN 4), il (DET 11006, PROPN 5, PRON 2), euro (NOUN 29, PROPN 18, X 1), grillo (NOUN 3, PROPN 1), repubblica (NOUN 8, PROPN 1), la (PRON 150, PROPN 10, ADP 1, X 1), fatto (NOUN 35, VERB 2, PROPN 1), 5 (NUM 63, PROPN 26)

The 10 most frequent ambiguous types: monti (PROPN 168, NOUN 4), lega (PROPN 17, NOUN 1, VERB 1), di (ADP 4192, PROPN 8, PRON 2, VERB 1, X 1), porta (VERB 7, NOUN 6, PROPN 4), il (DET 3956, PROPN 5), letta (PROPN 2, VERB 1), passera (PROPN 3, NOUN 2), euro (NOUN 28, PROPN 18, X 1), repubblica (NOUN 5, PROPN 1), la (DET 2320, PRON 142, PROPN 10, ADP 1, X 1)
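The lists above suggest that a lemma or type counts as ambiguous when it occurs with PROPN and with at least one other tag. The following is a minimal sketch of that reading, not the script that generated this page; the function name `ambiguous_lemmas` and the toy data are hypothetical.

```python
from collections import Counter, defaultdict

def ambiguous_lemmas(tokens, target="PROPN"):
    """Return lemmas seen with `target` and at least one other tag, with tag counts."""
    tags_by_lemma = defaultdict(Counter)
    for lemma, upos in tokens:
        tags_by_lemma[lemma][upos] += 1
    return {
        lemma: counts
        for lemma, counts in tags_by_lemma.items()
        if target in counts and len(counts) > 1
    }

# Toy (lemma, UPOS) pairs for illustration only, not real treebank counts.
toy_tokens = [
    ("lega", "PROPN"), ("lega", "PROPN"), ("lega", "NOUN"),
    ("roma", "PROPN"), ("di", "ADP"),
]
ambiguous = ambiguous_lemmas(toy_tokens)
for lemma, counts in ambiguous.items():
    summary = ", ".join(f"{tag} {n}" for tag, n in counts.most_common())
    print(f"{lemma} ({summary})")
```

The same function works for types if word forms are passed in place of lemmas.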

Morphology

The form / lemma ratio of PROPN is 1.011490 (the average of all parts of speech is 1.310744).
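The reported ratio is consistent with dividing the number of PROPN types by the number of PROPN lemmas given at the top of this page (2641 / 2611). A small sketch of that arithmetic follows, assuming the ratio is simply distinct forms over distinct lemmas; `form_lemma_ratio` is a hypothetical helper and the page's exact normalization of forms is not stated.

```python
def form_lemma_ratio(pairs):
    """Return (#distinct forms) / (#distinct lemmas) for (form, lemma) pairs."""
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    if not lemmas:
        raise ValueError("no lemmas given")
    return len(forms) / len(lemmas)

# Toy pairs built from forms listed in the Morphology section of this page.
pairs = [
    ("Beppe", "Beppe"), ("Beppeee", "Beppe"), ("Beppeeeeee", "Beppe"),
    ("EURO", "EURO"), ("EUROOO", "EURO"),
]
toy_ratio = form_lemma_ratio(pairs)  # 5 forms over 2 lemmas

# Check against the page totals: 2641 types and 2611 lemmas.
page_ratio = 2641 / 2611
print(f"{page_ratio:.6f}")
```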

The highest number of forms (3) was observed with the lemma “Beppe”: Beppe, Beppeee, Beppeeeeee.

The second highest number of forms (3) was observed with the lemma “michele”: MIchele, Michel, Michele.

The third highest number of forms (2) was observed with the lemma “EURO”: EURO, EUROOO.

PROPN occurs with 1 feature: Gender (1; 0% instances)

PROPN occurs with 1 feature-value pair: Gender=Masc

PROPN occurs with 2 feature combinations. The most frequent feature combination is _, i.e. no features (10485 tokens). Examples: monti, mario, italia, berlusconi, roma, pd, lega, pdl, twitter, napolitano

Relations

PROPN nodes are attached to their parents using 31 different relations: nmod (3257; 31% instances), flat:name (2437; 23% instances), nsubj (1161; 11% instances), parataxis (792; 8% instances), obl (766; 7% instances), conj (458; 4% instances), root (359; 3% instances), obj (352; 3% instances), vocative (194; 2% instances), flat (191; 2% instances), appos (137; 1% instances), list (131; 1% instances), nsubj:pass (51; 0% instances), obl:agent (45; 0% instances), xcomp (24; 0% instances), parataxis:nsubj (17; 0% instances), dep (15; 0% instances), parataxis:appos (15; 0% instances), ccomp (14; 0% instances), dislocated (13; 0% instances), compound (11; 0% instances), advcl (10; 0% instances), discourse (8; 0% instances), flat:foreign (8; 0% instances), acl:relcl (5; 0% instances), orphan (5; 0% instances), parataxis:obj (4; 0% instances), amod (3; 0% instances), acl (1; 0% instances), csubj (1; 0% instances), iobj (1; 0% instances)
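Counts of this kind can be reproduced from a CoNLL-U file by tallying the DEPREL column for every word whose UPOS is PROPN. The sketch below assumes the standard 10-column CoNLL-U layout; `propn_relations` and the toy sentence are hypothetical and are not the tooling behind this page.

```python
from collections import Counter

def propn_relations(conllu_text, upos="PROPN"):
    """Count the DEPREL values of all words tagged `upos` in CoNLL-U text."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # sentence break or comment line
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1)
        if cols[3] == upos:       # column 4 is UPOS
            counts[cols[7]] += 1  # column 8 is DEPREL
    return counts

# Hypothetical toy sentence, not taken from the treebank.
rows = [
    ("1", "Mario", "Mario", "PROPN", "_", "_", "3", "nsubj", "_", "_"),
    ("2", "Monti", "Monti", "PROPN", "_", "_", "1", "flat:name", "_", "_"),
    ("3", "parla", "parlare", "VERB", "_", "_", "0", "root", "_", "_"),
    ("4", "di", "di", "ADP", "_", "_", "5", "case", "_", "_"),
    ("5", "Roma", "Roma", "PROPN", "_", "_", "3", "obl", "_", "_"),
]
toy = "# text = Mario Monti parla di Roma\n" + "\n".join("\t".join(r) for r in rows) + "\n"

relations = propn_relations(toy)
total = sum(relations.values())
for rel, n in relations.most_common():
    print(f"{rel} ({n}; {100 * n / total:.0f}% instances)")
```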

Parents of PROPN nodes belong to 14 different parts of speech: NOUN (3655; 35% instances), PROPN (3194; 30% instances), VERB (2449; 23% instances), [no parent, i.e. root] (359; 3% instances), SYM (261; 2% instances), ADJ (218; 2% instances), PRON (104; 1% instances), INTJ (90; 1% instances), X (72; 1% instances), ADV (65; 1% instances), NUM (15; 0% instances), ADP (2; 0% instances), AUX (1; 0% instances), DET (1; 0% instances)

4846 (46%) PROPN nodes are leaves.

2471 (24%) PROPN nodes have one child.

1781 (17%) PROPN nodes have two children.

1388 (13%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 19.
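The child-degree figures above can be sketched as a histogram over the number of dependents of each PROPN node. This is an illustrative sketch only: `child_degree_histogram` and the toy tree are hypothetical, and the final lines merely check that the four counts reported on this page add up to the 10486 PROPN tokens.

```python
from collections import Counter

def child_degree_histogram(sentences, upos="PROPN"):
    """Bucket nodes tagged `upos` by number of children: 0, 1, 2, or 3 (three or more)."""
    hist = Counter()
    for sentence in sentences:
        # Each token is (id, upos, head); count how often each id is used as a head.
        n_children = Counter(head for _, _, head in sentence)
        for tok_id, tag, _ in sentence:
            if tag == upos:
                hist[min(n_children[tok_id], 3)] += 1
    return hist

# Hypothetical toy tree: token 1 has one child (2), token 5 has one child (4).
toy_sentence = [(1, "PROPN", 3), (2, "PROPN", 1), (3, "VERB", 0), (4, "ADP", 5), (5, "PROPN", 3)]
hist = child_degree_histogram([toy_sentence])

# Consistency check on the figures reported on this page.
page_counts = [4846, 2471, 1781, 1388]  # leaves, one child, two children, three or more
page_total = sum(page_counts)
page_percentages = [round(100 * n / page_total) for n in page_counts]
```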

Children of PROPN nodes are attached using 38 different relations: flat:name (2520; 22% instances), case (2214; 19% instances), punct (2028; 17% instances), det (1364; 12% instances), parataxis (539; 5% instances), conj (506; 4% instances), nmod (491; 4% instances), cc (299; 3% instances), appos (233; 2% instances), amod (191; 2% instances), nummod (182; 2% instances), advmod (149; 1% instances), parataxis:hashtag (128; 1% instances), flat (124; 1% instances), vocative (107; 1% instances), cop (92; 1% instances), discourse (75; 1% instances), nsubj (67; 1% instances), acl:relcl (42; 0% instances), flat:foreign (33; 0% instances), acl (31; 0% instances), obl (30; 0% instances), list (29; 0% instances), dep (23; 0% instances), mark (22; 0% instances), parataxis:appos (22; 0% instances), det:poss (15; 0% instances), compound (12; 0% instances), det:predet (5; 0% instances), orphan (5; 0% instances), aux (4; 0% instances), parataxis:insert (3; 0% instances), advcl (2; 0% instances), csubj (1; 0% instances), iobj (1; 0% instances), obj (1; 0% instances), parataxis:discourse (1; 0% instances), parataxis:nsubj (1; 0% instances)

Children of PROPN nodes belong to 16 different parts of speech: PROPN (3194; 28% instances), ADP (2199; 19% instances), PUNCT (2028; 17% instances), DET (1387; 12% instances), SYM (733; 6% instances), NOUN (627; 5% instances), CCONJ (298; 3% instances), NUM (223; 2% instances), ADJ (215; 2% instances), VERB (186; 2% instances), ADV (174; 2% instances), X (98; 1% instances), AUX (96; 1% instances), PRON (58; 1% instances), INTJ (53; 0% instances), SCONJ (23; 0% instances)