
This page pertains to UD version 2.

Treebank Statistics: UD_Italian-PoSTWITA: POS Tags: PRON

There are 68 PRON lemmas (0%), 145 PRON types (1%) and 6470 PRON tokens (5%). Out of 16 observed tags, the rank of PRON is: 11 in number of lemmas, 11 in number of types and 8 in number of tokens.
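As a rough illustration of how lemma, type and token counts of this kind can be derived, the sketch below reads CoNLL-U text and counts distinct lemmas, distinct word forms and total tokens for one tag. The function name `upos_counts` and the toy sample are hypothetical. Counting forms case-sensitively and skipping multiword-token ranges are assumptions, not necessarily the conventions used to generate this page.

```python
def upos_counts(conllu_text, upos="PRON"):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "4-5") and empty nodes (e.g. "4.1").
        if not cols[0].isdigit() or cols[3] != upos:
            continue
        types.add(cols[1])   # FORM column
        lemmas.add(cols[2])  # LEMMA column
        tokens += 1
    return len(lemmas), len(types), tokens


# Toy data invented for illustration; it is not taken from the treebank.
ROWS = [
    ("1", "Tutti", "tutto", "PRON", "_", "_", "3", "nsubj", "_", "_"),
    ("2", "lo", "lo", "PRON", "_", "_", "3", "obj", "_", "_"),
    ("3", "sanno", "sapere", "VERB", "_", "_", "0", "root", "_", "_"),
    ("4-5", "dirlo", "_", "_", "_", "_", "_", "_", "_", "_"),
    ("4", "dir", "dire", "VERB", "_", "_", "3", "xcomp", "_", "_"),
    ("5", "lo", "lo", "PRON", "_", "_", "4", "obj", "_", "_"),
    ("6", "tutto", "tutto", "PRON", "_", "_", "3", "obj", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

print(upos_counts(SAMPLE))  # (2, 3, 4)
```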

The 10 most frequent PRON lemmas: che, si, ci, mi, tutto, lo, ti, io, chi, quello

The 10 most frequent PRON types: che, si, mi, ci, lo, ti, tutti, io, chi, c’

The 10 most frequent ambiguous lemmas: che (SCONJ 803, PRON 703, DET 161, ADP 19, PROPN 5, CCONJ 3, X 1), si (PRON 626, INTJ 1, X 1), tutto (PRON 476, DET 271, ADV 2), lo (PRON 406, DET 10, PROPN 1), quello (PRON 192, DET 61), me (PRON 165, X 6, PROPN 1), ne (PRON 153, CCONJ 8, X 1), la (PRON 150, PROPN 10, ADP 1, X 1), cosa (NOUN 146, PRON 124), questo (DET 308, PRON 102)

The 10 most frequent ambiguous types: che (SCONJ 762, PRON 661, DET 113, ADP 19, PROPN 5, CCONJ 3, X 1), si (PRON 556, INTJ 45, ADV 1, AUX 1, X 1), lo (PRON 301, DET 144, PROPN 1), tutti (PRON 247, DET 120), io (PRON 189, DET 1), tutto (PRON 158, DET 73, ADV 1), me (PRON 154, X 5, PROPN 1), la (DET 2317, PRON 142, PROPN 10, ADP 1, X 1), ne (PRON 141, CCONJ 8, X 1), cosa (NOUN 83, PRON 76)
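The two ambiguity lists appear to be ordered by the PRON count of each item. A minimal sketch of that computation for lemmas follows. The name `ambiguous_lemmas` and the toy rows are hypothetical, and treating a lemma as ambiguous whenever it occurs with more than one tag is an assumption about how the page was generated.

```python
from collections import Counter, defaultdict


def ambiguous_lemmas(conllu_text, upos="PRON"):
    """List lemmas seen with `upos` and at least one other tag, most frequent first."""
    tags_by_lemma = defaultdict(Counter)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        tags_by_lemma[cols[2]][cols[3]] += 1
    ambiguous = [
        (lemma, tags) for lemma, tags in tags_by_lemma.items()
        if upos in tags and len(tags) > 1
    ]
    # Order by how often the lemma carries the tag of interest.
    return sorted(ambiguous, key=lambda item: -item[1][upos])


# Toy data invented for illustration; it is not taken from the treebank.
ROWS = [
    ("1", "che", "che", "SCONJ", "_", "_", "0", "root", "_", "_"),
    ("2", "che", "che", "PRON", "_", "_", "1", "dep", "_", "_"),
    ("3", "che", "che", "PRON", "_", "_", "1", "dep", "_", "_"),
    ("4", "la", "la", "DET", "_", "_", "1", "dep", "_", "_"),
    ("5", "la", "la", "PRON", "_", "_", "1", "dep", "_", "_"),
    ("6", "io", "io", "PRON", "_", "_", "1", "dep", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

for lemma, tags in ambiguous_lemmas(SAMPLE):
    print(lemma, dict(tags))
```

The same function gives the type-based list if column 1 (FORM) is used as the key instead of column 2 (LEMMA).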

Morphology

The form / lemma ratio of PRON is 2.132353 (the average of all parts of speech is 1.303101).
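The reported ratio is consistent with dividing the number of PRON types by the number of PRON lemmas given earlier on this page. The check below uses only those two published figures; reading the ratio as types divided by lemmas is an inference from the numbers, not a documented definition.

```python
# Figures taken from the PRON summary on this page.
pron_types = 145
pron_lemmas = 68

# Assumed definition: distinct forms per distinct lemma.
ratio = pron_types / pron_lemmas
print(f"{ratio:.6f}")  # 2.132353
```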

The 1st highest number of forms (9) was observed with the lemma “quello”: kvelli, qll, quel, quella, quelle, quelle/i, quelli, quello, quellp.

The 2nd highest number of forms (7) was observed with the lemma “tutto”: tt, tutt’, tutta, tutte, tutteeeeee, tutti, tutto.

The 3rd highest number of forms (5) was observed with the lemma “ci”: c, c’, ci, c’, di.

PRON occurs with 7 features: PronType (6469; 100% instances), Number (4353; 67% instances), Person (3879; 60% instances), Clitic (3247; 50% instances), Gender (1933; 30% instances), Poss (39; 1% instances), Definite (2; 0% instances)

PRON occurs with 17 feature-value pairs: Clitic=Yes, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel

PRON occurs with 55 feature combinations. The most frequent feature combination is PronType=Rel (939 tokens). Examples: che, chi, cui, quanto, ke, quale, chiunque, cha, k, quanta

Relations

PRON nodes are attached to their parents using 33 different relations: obj (1482; 23% instances), nsubj (1434; 22% instances), expl (1056; 16% instances), iobj (933; 14% instances), obl (583; 9% instances), expl:impers (190; 3% instances), nmod (169; 3% instances), root (146; 2% instances), parataxis (145; 2% instances), conj (109; 2% instances), expl:pass (56; 1% instances), ccomp (20; 0% instances), nsubj:pass (19; 0% instances), advcl (15; 0% instances), appos (14; 0% instances), dislocated (14; 0% instances), discourse (13; 0% instances), parataxis:appos (11; 0% instances), vocative (11; 0% instances), det (8; 0% instances), acl:relcl (7; 0% instances), obl:agent (7; 0% instances), parataxis:obj (7; 0% instances), xcomp (5; 0% instances), amod (3; 0% instances), dep (3; 0% instances), det:predet (2; 0% instances), orphan (2; 0% instances), parataxis:insert (2; 0% instances), acl (1; 0% instances), compound (1; 0% instances), det:poss (1; 0% instances), goeswith (1; 0% instances)

Parents of PRON nodes belong to 15 different parts of speech: VERB (5467; 84% instances), NOUN (353; 5% instances), ADJ (162; 3% instances), no parent, i.e. root (146; 2% instances), PRON (113; 2% instances), PROPN (58; 1% instances), INTJ (45; 1% instances), SYM (40; 1% instances), ADV (37; 1% instances), X (29; 0% instances), AUX (8; 0% instances), NUM (5; 0% instances), ADP (3; 0% instances), DET (3; 0% instances), CCONJ (1; 0% instances)
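The relation and parent-tag distributions can be tallied per sentence from the HEAD and DEPREL columns. The sketch below is one way to do it; `attachment_stats` and the `(root)` placeholder for nodes whose head is 0 are hypothetical names chosen for illustration.

```python
from collections import Counter


def attachment_stats(conllu_text, upos="PRON"):
    """Count incoming relations and parent tags for all nodes tagged `upos`."""
    relations, parent_tags = Counter(), Counter()
    sentence = {}  # token id -> (upos, head id, deprel)
    # The extra "" flushes the last sentence if the text lacks a final blank line.
    for line in conllu_text.splitlines() + [""]:
        if not line.strip():
            for tag, head, deprel in sentence.values():
                if tag == upos:
                    relations[deprel] += 1
                    # Head "0" is not a token, so root nodes get a placeholder.
                    parent = sentence[head][0] if head in sentence else "(root)"
                    parent_tags[parent] += 1
            sentence = {}
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if cols[0].isdigit():  # skip multiword-token ranges and empty nodes
            sentence[cols[0]] = (cols[3], cols[6], cols[7])
    return relations, parent_tags


# Toy data invented for illustration; it is not taken from the treebank.
SENTENCES = [
    [
        ("1", "quello", "quello", "PRON", "_", "_", "0", "root", "_", "_"),
        ("2", "che", "che", "PRON", "_", "_", "3", "obj", "_", "_"),
        ("3", "dici", "dire", "VERB", "_", "_", "1", "acl:relcl", "_", "_"),
        ("4", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"),
    ],
    [
        ("1", "lo", "lo", "PRON", "_", "_", "2", "obj", "_", "_"),
        ("2", "so", "sapere", "VERB", "_", "_", "0", "root", "_", "_"),
    ],
]
SAMPLE = "\n\n".join(
    "\n".join("\t".join(row) for row in sent) for sent in SENTENCES
)

relations, parent_tags = attachment_stats(SAMPLE)
print(relations.most_common())
print(parent_tags.most_common())
```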

4895 (76%) PRON nodes are leaves.

829 (13%) PRON nodes have one child.

390 (6%) PRON nodes have two children.

356 (6%) PRON nodes have three or more children.

The highest child degree of a PRON node is 15.

Children of PRON nodes are attached using 41 different relations: case (703; 22% instances), punct (383; 12% instances), acl:relcl (326; 10% instances), advmod (282; 9% instances), nmod (203; 6% instances), cop (163; 5% instances), det (163; 5% instances), nsubj (125; 4% instances), parataxis (105; 3% instances), cc (100; 3% instances), conj (82; 3% instances), vocative:mention (75; 2% instances), discourse (67; 2% instances), dep (54; 2% instances), acl (44; 1% instances), amod (43; 1% instances), parataxis:hashtag (40; 1% instances), appos (35; 1% instances), discourse:emo (30; 1% instances), mark (27; 1% instances), obl (24; 1% instances), advcl (22; 1% instances), vocative (16; 1% instances), orphan (13; 0% instances), aux (6; 0% instances), nummod (5; 0% instances), csubj (4; 0% instances), parataxis:appos (4; 0% instances), compound (3; 0% instances), det:predet (3; 0% instances), det:poss (2; 0% instances), dislocated (2; 0% instances), iobj (2; 0% instances), ccomp (1; 0% instances), fixed (1; 0% instances), flat (1; 0% instances), flat:name (1; 0% instances), parataxis:discourse (1; 0% instances), parataxis:insert (1; 0% instances), parataxis:obj (1; 0% instances), xcomp (1; 0% instances)

Children of PRON nodes belong to 16 different parts of speech: ADP (694; 22% instances), VERB (417; 13% instances), PUNCT (383; 12% instances), ADV (299; 9% instances), NOUN (256; 8% instances), SYM (236; 7% instances), AUX (177; 6% instances), DET (175; 6% instances), PRON (113; 4% instances), CCONJ (110; 3% instances), PROPN (102; 3% instances), ADJ (79; 2% instances), INTJ (59; 2% instances), SCONJ (26; 1% instances), X (21; 1% instances), NUM (17; 1% instances)