
This page pertains to UD version 2.

Treebank Statistics: UD_Italian-PoSTWITA: POS Tags: PRON

There are 71 PRON lemmas (0%), 150 PRON types (1%) and 6486 PRON tokens (5%). Out of 16 observed tags, the rank of PRON is: 11 in number of lemmas, 11 in number of types and 8 in number of tokens.
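Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes that "types" means distinct word forms (column 2) and "lemmas" means distinct values of column 3, and it runs on a small hand-made sample rather than on the treebank. The name `upos_counts` and the sample rows are hypothetical.

```python
# Hypothetical CoNLL-U rows (10 tab-separated columns), not taken from the treebank.
SAMPLE_ROWS = [
    ["1", "io", "io", "PRON", "_", "PronType=Prs", "2", "nsubj", "_", "_"],
    ["2", "dico", "dire", "VERB", "_", "_", "0", "root", "_", "_"],
    ["3", "che", "che", "SCONJ", "_", "_", "5", "mark", "_", "_"],
    ["4", "ci", "ci", "PRON", "_", "PronType=Prs", "5", "expl", "_", "_"],
    ["5", "sono", "essere", "VERB", "_", "_", "2", "ccomp", "_", "_"],
    ["6", "tutti", "tutto", "PRON", "_", "PronType=Ind", "5", "nsubj", "_", "_"],
]
SAMPLE = "# text = io dico che ci sono tutti\n" + "\n".join(
    "\t".join(row) for row in SAMPLE_ROWS
)


def upos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, forms, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (e.g. 1-2) and empty nodes (e.g. 1.1)
        if cols[3] == upos:
            forms.add(cols[1])
            lemmas.add(cols[2])
            tokens += 1
    return len(lemmas), len(forms), tokens


n_lemmas, n_types, n_tokens = upos_counts(SAMPLE, "PRON")
print(n_lemmas, n_types, n_tokens)
```

Whether the published figures treat forms case-sensitively is not stated here, so the sketch leaves forms unchanged.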

The 10 most frequent PRON lemmas: che, si, ci, mi, tutto, lo, ti, io, chi, quello

The 10 most frequent PRON types: che, si, mi, ci, lo, ti, tutti, io, chi, c’

The 10 most frequent ambiguous lemmas: che (SCONJ 802, PRON 704, DET 162, ADP 19, PROPN 5, CCONJ 3), si (PRON 627, INTJ 1, X 1), tutto (PRON 474, DET 274, ADJ 1), lo (PRON 407, DET 22, PROPN 1), quello (PRON 193, DET 61), me (PRON 166, X 5, PROPN 1), ne (PRON 153, CCONJ 6, X 1), la (PRON 150, PROPN 10, ADP 1, X 1), cosa (NOUN 146, PRON 127), questo (DET 307, PRON 102)

The 10 most frequent ambiguous types: che (SCONJ 762, PRON 660, DET 114, ADP 19, PROPN 5, CCONJ 3, X 1), si (PRON 556, INTJ 42, ADV 4, AUX 1, X 1), lo (PRON 302, DET 145, PROPN 1), tutti (PRON 247, DET 120), io (PRON 191, DET 1), tutto (PRON 156, DET 76), me (PRON 154, X 5, PROPN 1), la (DET 2320, PRON 142, PROPN 10, ADP 1, X 1), ne (PRON 141, CCONJ 8, X 1), cosa (NOUN 83, PRON 76)
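The ambiguity lists above pair each lemma or type with the tags it was observed under. One way to derive such a list is sketched below, assuming that an item counts as ambiguous when it occurs with more than one tag, and that items are ranked by their PRON count (which matches the ordering shown above). The function name `ambiguous_items` and the input pairs are hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical (item, UPOS) observations; item can be a lemma or a word form.
pairs = [
    ("che", "SCONJ"), ("che", "PRON"), ("che", "PRON"),
    ("si", "PRON"),
    ("cosa", "NOUN"), ("cosa", "PRON"),
]


def ambiguous_items(pairs, upos):
    """Map each item seen with `upos` and at least one other tag to its tag counts."""
    by_item = defaultdict(Counter)
    for item, tag in pairs:
        by_item[item][tag] += 1
    ambiguous = {
        item: tags
        for item, tags in by_item.items()
        if len(tags) > 1 and upos in tags
    }
    # Rank by how often the item carries the tag of interest, highest first.
    return dict(sorted(ambiguous.items(), key=lambda kv: kv[1][upos], reverse=True))


result = ambiguous_items(pairs, "PRON")
for item, tags in result.items():
    print(item, dict(tags))
```

In this sample, "si" is left out because it only ever appears as PRON.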

Morphology

The form / lemma ratio of PRON is 2.112676 (the average of all parts of speech is 1.310882).
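The reported ratio is consistent with dividing the number of PRON types by the number of PRON lemmas given earlier on this page. A quick check, assuming that is how the figure is derived:

```python
pron_types = 150   # distinct PRON word forms, from the summary above
pron_lemmas = 71   # distinct PRON lemmas, from the summary above

ratio = pron_types / pron_lemmas
print(f"{ratio:.6f}")  # prints 2.112676
```

On average, then, each PRON lemma appears in a little over two distinct forms, well above the treebank-wide average.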

The highest number of forms (9) was observed with the lemma “quello”: kvelli, qll, quel, quella, quelle, quelle/i, quelli, quello, quellp.

The second highest number of forms (7) was observed with the lemma “tutto”: tt, tutt’, tutta, tutte, tutteeeeee, tutti, tutto.

The third highest number of forms (5) was observed with the lemma “che”: che, chè, k, ke, que.

PRON occurs with 8 features: PronType (6485; 100% instances), Number (4352; 67% instances), Person (3880; 60% instances), Clitic (3253; 50% instances), Gender (1931; 30% instances), Poss (39; 1% instances), Typo (3; 0% instances), Definite (2; 0% instances)

PRON occurs with 18 feature-value pairs: Clitic=Yes, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel, Typo=Yes

PRON occurs with 57 feature combinations. The most frequent feature combination is PronType=Rel (937 tokens). Examples: che, chi, cui, quanto, ke, quale, chiunque, cha, k, quanta
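The three feature counts above (features, feature-value pairs, and full combinations) all come from the FEATS column, which holds `Name=Value` pairs separated by `|`, or `_` when empty. The sketch below shows how the three tallies differ; the FEATS strings are hypothetical examples, and `parse_feats` is an illustrative helper, not part of any UD tool.

```python
from collections import Counter


def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {name: value} dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Hypothetical FEATS values of four PRON tokens.
feats_column = [
    "PronType=Rel",
    "Clitic=Yes|Number=Sing|Person=3|PronType=Prs",
    "Gender=Masc|Number=Plur|PronType=Ind",
    "_",
]

feature_counts = Counter()  # e.g. PronType, Number
pair_counts = Counter()     # e.g. PronType=Rel, Number=Sing
combo_counts = Counter()    # the whole FEATS string of a token

for feats in feats_column:
    parsed = parse_feats(feats)
    feature_counts.update(parsed.keys())
    pair_counts.update(f"{name}={value}" for name, value in parsed.items())
    combo_counts[feats] += 1

print(feature_counts.most_common())
print(len(pair_counts), "feature-value pairs")
print(len(combo_counts), "feature combinations")
```

A token with an empty FEATS column contributes to the combination tally but to neither of the other two, which is one way a feature such as PronType can fall just short of covering every token.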

Relations

PRON nodes are attached to their parents using 34 different relations: obj (1486; 23% instances), nsubj (1430; 22% instances), expl (1057; 16% instances), iobj (936; 14% instances), obl (586; 9% instances), expl:impers (191; 3% instances), nmod (170; 3% instances), parataxis (149; 2% instances), root (149; 2% instances), conj (109; 2% instances), expl:pass (56; 1% instances), ccomp (23; 0% instances), nsubj:pass (19; 0% instances), advcl (15; 0% instances), dislocated (15; 0% instances), appos (14; 0% instances), discourse (13; 0% instances), vocative (12; 0% instances), parataxis:appos (10; 0% instances), det (8; 0% instances), acl:relcl (7; 0% instances), obl:agent (7; 0% instances), xcomp (5; 0% instances), amod (3; 0% instances), dep (3; 0% instances), det:predet (2; 0% instances), orphan (2; 0% instances), parataxis:insert (2; 0% instances), parataxis:obj (2; 0% instances), acl (1; 0% instances), compound (1; 0% instances), det:poss (1; 0% instances), fixed (1; 0% instances), nsubj:outer (1; 0% instances)

Parents of PRON nodes belong to 15 different parts of speech: VERB (5481; 85% instances), NOUN (356; 5% instances), ADJ (161; 2% instances), the artificial root node, which has no part of speech (149; 2% instances), PRON (113; 2% instances), PROPN (58; 1% instances), INTJ (45; 1% instances), SYM (41; 1% instances), ADV (37; 1% instances), X (26; 0% instances), AUX (8; 0% instances), NUM (5; 0% instances), DET (3; 0% instances), ADP (2; 0% instances), CCONJ (1; 0% instances)

4902 (76%) PRON nodes are leaves.

836 (13%) PRON nodes have one child.

389 (6%) PRON nodes have two children.

359 (6%) PRON nodes have three or more children.

The highest child degree of a PRON node is 15.
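The child-degree figures above can be derived from the HEAD column alone: count how many tokens point to each node, then bucket the PRON nodes by that count. A minimal sketch under that assumption, using a hypothetical one-sentence token list and an illustrative function name:

```python
from collections import Counter

# Hypothetical sentence as (id, UPOS, head) triples; head 0 is the artificial root.
tokens = [
    (1, "PRON", 2),
    (2, "VERB", 0),
    (3, "ADP", 4),
    (4, "PRON", 2),
    (5, "PUNCT", 4),
]


def child_degree_histogram(tokens, upos):
    """Count nodes of one UPOS by number of children (bucket 3 means three or more)."""
    children = Counter(head for _, _, head in tokens)
    histogram = Counter()
    for token_id, tag, _ in tokens:
        if tag == upos:
            histogram[min(children[token_id], 3)] += 1
    return histogram


hist = child_degree_histogram(tokens, "PRON")
print(dict(hist))
```

In a real treebank the tally has to be kept per sentence, since token ids restart at 1 in each one. The four buckets reported above sum to the PRON token total: 4902 + 836 + 389 + 359 = 6486.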

Children of PRON nodes are attached using 38 different relations: case (702; 22% instances), punct (388; 12% instances), acl:relcl (326; 10% instances), advmod (285; 9% instances), nmod (200; 6% instances), cop (166; 5% instances), det (164; 5% instances), parataxis (150; 5% instances), nsubj (127; 4% instances), cc (100; 3% instances), discourse (98; 3% instances), vocative (94; 3% instances), conj (83; 3% instances), acl (45; 1% instances), amod (43; 1% instances), parataxis:hashtag (42; 1% instances), appos (34; 1% instances), mark (27; 1% instances), obl (24; 1% instances), advcl (22; 1% instances), orphan (13; 0% instances), dep (12; 0% instances), aux (6; 0% instances), nummod (5; 0% instances), csubj (4; 0% instances), parataxis:appos (4; 0% instances), compound (3; 0% instances), det:predet (3; 0% instances), dislocated (3; 0% instances), det:poss (2; 0% instances), fixed (2; 0% instances), iobj (2; 0% instances), ccomp (1; 0% instances), flat (1; 0% instances), flat:name (1; 0% instances), parataxis:discourse (1; 0% instances), parataxis:insert (1; 0% instances), xcomp (1; 0% instances)

Children of PRON nodes belong to 16 different parts of speech: ADP (695; 22% instances), VERB (418; 13% instances), PUNCT (388; 12% instances), ADV (302; 9% instances), NOUN (257; 8% instances), SYM (238; 7% instances), AUX (180; 6% instances), DET (176; 6% instances), PRON (113; 4% instances), CCONJ (111; 3% instances), PROPN (104; 3% instances), ADJ (79; 2% instances), INTJ (60; 2% instances), SCONJ (26; 1% instances), X (22; 1% instances), NUM (16; 1% instances)