home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-PoSTWITA: POS Tags: PRON

There are 70 PRON lemmas (0%), 147 PRON types (1%) and 6483 PRON tokens (5%). Out of 16 observed tags, the rank of PRON is: 11 in number of lemmas, 11 in number of types and 8 in number of tokens.

The 10 most frequent PRON lemmas: che, si, ci, mi, tutto, lo, ti, io, chi, quello

The 10 most frequent PRON types: che, si, mi, ci, lo, ti, tutti, io, chi, c’

The 10 most frequent ambiguous lemmas: che (SCONJ 803, PRON 702, DET 162, ADP 19, PROPN 5, CCONJ 3, X 1), si (PRON 627, INTJ 1, X 1), tutto (PRON 476, DET 272, ADJ 1), lo (PRON 408, DET 22, PROPN 1), quello (PRON 193, DET 61), me (PRON 165, X 6, PROPN 1), ne (PRON 153, CCONJ 6, X 1), la (PRON 150, PROPN 10, ADP 1, X 1), cosa (NOUN 146, PRON 127), questo (DET 307, PRON 102)

The 10 most frequent ambiguous types: che (SCONJ 762, PRON 660, DET 114, ADP 19, PROPN 5, CCONJ 3, X 1), si (PRON 556, INTJ 42, ADV 4, AUX 1, X 1), lo (PRON 302, DET 145, PROPN 1), tutti (PRON 247, DET 120), io (PRON 191, DET 1), tutto (PRON 158, DET 74), me (PRON 154, X 5, PROPN 1), la (DET 2320, PRON 142, PROPN 10, ADP 1, X 1), ne (PRON 141, CCONJ 8, X 1), cosa (NOUN 83, PRON 76)

Morphology

The form / lemma ratio of PRON is 2.100000 (the average of all parts of speech is 1.304759).

The 1st highest number of forms (9) was observed with the lemma “quello”: kvelli, qll, quel, quella, quelle, quelle/i, quelli, quello, quellp.

The 2nd highest number of forms (7) was observed with the lemma “tutto”: tt, tutt’, tutta, tutte, tutteeeeee, tutti, tutto.

The 3rd highest number of forms (6) was observed with the lemma “lo”: gli, l, l’, li, lo, qual.

PRON occurs with 7 features: PronType (6482; 100% instances), Number (4353; 67% instances), Person (3879; 60% instances), Clitic (3254; 50% instances), Gender (1933; 30% instances), Poss (39; 1% instances), Definite (2; 0% instances)

PRON occurs with 17 feature-value pairs: Clitic=Yes, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel

PRON occurs with 56 feature combinations. The most frequent feature combination is PronType=Rel (938 tokens). Examples: che, chi, cui, quanto, ke, quale, chiunque, cha, k, quanta

Relations

PRON nodes are attached to their parents using 33 different relations: obj (1479; 23% instances), nsubj (1439; 22% instances), expl (1058; 16% instances), iobj (936; 14% instances), obl (584; 9% instances), expl:impers (191; 3% instances), nmod (170; 3% instances), root (148; 2% instances), parataxis (145; 2% instances), conj (110; 2% instances), expl:pass (56; 1% instances), ccomp (21; 0% instances), nsubj:pass (19; 0% instances), advcl (15; 0% instances), appos (14; 0% instances), dislocated (14; 0% instances), discourse (13; 0% instances), parataxis:appos (11; 0% instances), vocative (11; 0% instances), det (8; 0% instances), acl:relcl (7; 0% instances), obl:agent (7; 0% instances), parataxis:obj (6; 0% instances), xcomp (5; 0% instances), amod (3; 0% instances), dep (3; 0% instances), det:predet (2; 0% instances), orphan (2; 0% instances), parataxis:insert (2; 0% instances), acl (1; 0% instances), compound (1; 0% instances), det:poss (1; 0% instances), goeswith (1; 0% instances)

Parents of PRON nodes belong to 15 different parts of speech: VERB (5478; 84% instances), NOUN (355; 5% instances), ADJ (161; 2% instances), (148; 2% instances), PRON (113; 2% instances), PROPN (58; 1% instances), INTJ (45; 1% instances), SYM (41; 1% instances), ADV (37; 1% instances), X (27; 0% instances), AUX (8; 0% instances), NUM (5; 0% instances), ADP (3; 0% instances), DET (3; 0% instances), CCONJ (1; 0% instances)

4900 (76%) PRON nodes are leaves.

835 (13%) PRON nodes have one child.

390 (6%) PRON nodes have two children.

358 (6%) PRON nodes have three or more children.

The highest child degree of a PRON node is 15.

Children of PRON nodes are attached using 41 different relations: case (702; 22% instances), punct (384; 12% instances), acl:relcl (326; 10% instances), advmod (285; 9% instances), nmod (202; 6% instances), cop (166; 5% instances), det (163; 5% instances), nsubj (129; 4% instances), parataxis (106; 3% instances), cc (100; 3% instances), conj (83; 3% instances), vocative:mention (76; 2% instances), discourse (68; 2% instances), dep (54; 2% instances), acl (44; 1% instances), amod (43; 1% instances), parataxis:hashtag (42; 1% instances), appos (35; 1% instances), discourse:emo (30; 1% instances), mark (27; 1% instances), obl (24; 1% instances), advcl (23; 1% instances), vocative (16; 1% instances), orphan (13; 0% instances), aux (6; 0% instances), nummod (5; 0% instances), csubj (4; 0% instances), parataxis:appos (4; 0% instances), compound (3; 0% instances), det:predet (3; 0% instances), dislocated (3; 0% instances), det:poss (2; 0% instances), iobj (2; 0% instances), ccomp (1; 0% instances), fixed (1; 0% instances), flat (1; 0% instances), flat:name (1; 0% instances), parataxis:discourse (1; 0% instances), parataxis:insert (1; 0% instances), parataxis:obj (1; 0% instances), xcomp (1; 0% instances)

Children of PRON nodes belong to 16 different parts of speech: ADP (695; 22% instances), VERB (420; 13% instances), PUNCT (384; 12% instances), ADV (302; 9% instances), NOUN (259; 8% instances), SYM (238; 7% instances), AUX (180; 6% instances), DET (175; 6% instances), PRON (113; 4% instances), CCONJ (111; 3% instances), PROPN (103; 3% instances), ADJ (79; 2% instances), INTJ (59; 2% instances), SCONJ (26; 1% instances), X (20; 1% instances), NUM (17; 1% instances)