home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-ParTUT: POS Tags: PROPN

There are 771 PROPN lemmas (13%), 775 PROPN types (9%) and 2034 PROPN tokens (4%). Out of 15 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 8 in number of tokens.

The 10 most frequent PROPN lemmas: Shakespeare, Balzac, Facebook, Europa, Ucraina, Pericle, Stati, Uniti, Europea, Unione

The 10 most frequent PROPN types: Shakespeare, Balzac, Facebook, Europa, Ucraina, Pericle, Stati, Uniti, Europea, Unione

The 10 most frequent ambiguous lemmas: de (ADP 15, X 4, PROPN 2), New (PROPN 4, X 1), Lord (PROPN 3, X 1), Men (X 7, PROPN 3), Chamberlain’s (PROPN 2, X 1), Comédie (X 5, PROPN 2), School (PROPN 2, X 1), Cousin (PROPN 1, X 1), Goriot (X 2, PROPN 1), His (ADJ 1, PROPN 1)

The 10 most frequent ambiguous types: Unione (PROPN 18, NOUN 2), Creative (PROPN 13, ADJ 1), de (ADP 18, X 4, PROPN 2), Lord (PROPN 3, X 1), Men (X 7, PROPN 3), Chamberlain’s (PROPN 2, X 1), Comédie (X 5, PROPN 2), New (PROPN 2, X 1), School (PROPN 2, X 1), Usa (PROPN 2, VERB 2)

Morphology

The form / lemma ratio of PROPN is 1.005188 (the average of all parts of speech is 1.488064).

The 1st highest number of forms (2) was observed with the lemma “Cambridge”: CAMBRIDGE, Cambridge.

The 2nd highest number of forms (2) was observed with the lemma “Londra”: LONDRA, Londra.

The 3rd highest number of forms (2) was observed with the lemma “New”: NEW, New.

PROPN occurs with 2 features: Gender (1; 0% instances), Number (1; 0% instances)

PROPN occurs with 2 feature-value pairs: Gender=Fem, Number=Plur

PROPN occurs with 2 feature combinations. The most frequent feature combination is _ (2033 tokens). Examples: Shakespeare, Balzac, Facebook, Europa, Ucraina, Pericle, Stati, Uniti, Europea, Unione

Relations

PROPN nodes are attached to their parents using 16 different relations: nmod (788; 39% instances), flat:name (359; 18% instances), nsubj (298; 15% instances), obl (175; 9% instances), conj (150; 7% instances), flat (73; 4% instances), obj (65; 3% instances), nsubj:pass (32; 2% instances), obl:agent (27; 1% instances), appos (22; 1% instances), root (19; 1% instances), amod (12; 1% instances), xcomp (6; 0% instances), advcl (3; 0% instances), compound (3; 0% instances), orphan (2; 0% instances)

Parents of PROPN nodes belong to 10 different parts of speech: NOUN (805; 40% instances), PROPN (592; 29% instances), VERB (542; 27% instances), ADJ (28; 1% instances), NUM (23; 1% instances), (19; 1% instances), PRON (18; 1% instances), X (5; 0% instances), ADV (1; 0% instances), DET (1; 0% instances)

787 (39%) PROPN nodes are leaves.

561 (28%) PROPN nodes have one child.

364 (18%) PROPN nodes have two children.

322 (16%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 15.

Children of PROPN nodes are attached using 26 different relations: case (744; 30% instances), det (406; 16% instances), flat:name (362; 14% instances), punct (353; 14% instances), conj (176; 7% instances), nmod (155; 6% instances), cc (98; 4% instances), amod (60; 2% instances), flat (36; 1% instances), acl:relcl (29; 1% instances), appos (24; 1% instances), nummod (23; 1% instances), advmod (18; 1% instances), acl (5; 0% instances), cop (5; 0% instances), compound (4; 0% instances), det:poss (4; 0% instances), nsubj (4; 0% instances), mark (3; 0% instances), det:predet (2; 0% instances), fixed (2; 0% instances), advcl (1; 0% instances), dep (1; 0% instances), obj (1; 0% instances), obl (1; 0% instances), orphan (1; 0% instances)

Children of PROPN nodes belong to 14 different parts of speech: ADP (752; 30% instances), PROPN (592; 24% instances), DET (412; 16% instances), PUNCT (353; 14% instances), NOUN (141; 6% instances), CCONJ (98; 4% instances), ADJ (64; 3% instances), NUM (37; 1% instances), VERB (29; 1% instances), ADV (21; 1% instances), PRON (10; 0% instances), AUX (5; 0% instances), SCONJ (2; 0% instances), X (2; 0% instances)