home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-PoSTWITA: POS Tags: X

There are 693 X lemmas (5%), 698 X types (4%) and 978 X tokens (1%). Out of 16 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: of, Prince, i, live, _, a, and, on, the, in

The 10 most frequent X types: of, Prince, i, live, a, and, on, the, in, my

The 10 most frequent ambiguous lemmas: of (X 3, PROPN 2), Prince (X 21, PROPN 7), i (X 15, INTJ 5, PROPN 3, PRON 1), live (X 12, NOUN 2, ADV 1), _ (X 10, PUNCT 6), a (ADP 2901, PROPN 16, X 10, INTJ 3, SYM 1), and (X 9, PROPN 1), the (X 8, NOUN 1, PROPN 1), in (ADP 1237, X 5, PROPN 1), me (PRON 166, X 5, PROPN 1)

The 10 most frequent ambiguous types: of (X 3, PROPN 2), Prince (X 21, PROPN 7), i (DET 1152, INTJ 5, PROPN 3, X 3, PRON 1), live (X 12, ADV 1, NOUN 1), a (ADP 2760, PROPN 16, X 10, AUX 5, INTJ 2, DET 1), and (X 9, PROPN 1), the (X 5, NOUN 1, PROPN 1), in (ADP 1138, X 5, ADV 1, DET 1, PROPN 1), me (PRON 154, X 5, PROPN 1), o (CCONJ 154, X 3)

Morphology

The form / lemma ratio of X is 1.007215 (the average of all parts of speech is 1.310689).

The 1st highest number of forms (7) was observed with the lemma “_”: che, i, ignora, sera, up, vano, è.

The 2nd highest number of forms (3) was observed with the lemma “i”: i, mausolei, moltissimi.

The 3rd highest number of forms (2) was observed with the lemma “for”: 4, for.

X occurs with 5 features: Foreign (583; 60% instances), Clitic (1; 0% instances), Number (1; 0% instances), Person (1; 0% instances), PronType (1; 0% instances)

X occurs with 5 feature-value pairs: Clitic=Yes, Foreign=Yes, Number=Sing, Person=2, PronType=Prs

X occurs with 3 feature combinations. The most frequent feature combination is Foreign=Yes (583 tokens). Examples: of, Prince, live, i, and, on, a, the, in, my

Relations

X nodes are attached to their parents using 29 different relations: flat:foreign (284; 29% instances), dep (137; 14% instances), parataxis (110; 11% instances), nmod (64; 7% instances), obl (49; 5% instances), conj (48; 5% instances), root (44; 4% instances), flat (43; 4% instances), obj (37; 4% instances), discourse (31; 3% instances), list (24; 2% instances), nsubj (16; 2% instances), appos (13; 1% instances), flat:name (13; 1% instances), amod (11; 1% instances), case (9; 1% instances), advcl (7; 1% instances), vocative (7; 1% instances), goeswith (6; 1% instances), ccomp (5; 1% instances), xcomp (5; 1% instances), acl:relcl (3; 0% instances), compound (2; 0% instances), fixed (2; 0% instances), iobj (2; 0% instances), parataxis:hashtag (2; 0% instances), parataxis:obj (2; 0% instances), dislocated (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of X nodes belong to 13 different parts of speech: X (291; 30% instances), VERB (254; 26% instances), NOUN (161; 16% instances), PROPN (91; 9% instances), SYM (58; 6% instances), (44; 4% instances), ADJ (24; 2% instances), PRON (22; 2% instances), INTJ (14; 1% instances), ADV (11; 1% instances), NUM (5; 1% instances), DET (2; 0% instances), SCONJ (1; 0% instances)

548 (56%) X nodes are leaves.

145 (15%) X nodes have one child.

109 (11%) X nodes have two children.

176 (18%) X nodes have three or more children.

The highest child degree of a X node is 12.

Children of X nodes are attached using 38 different relations: flat:foreign (251; 22% instances), punct (196; 17% instances), det (110; 10% instances), case (84; 7% instances), flat (56; 5% instances), parataxis (56; 5% instances), nummod (36; 3% instances), cop (35; 3% instances), advmod (34; 3% instances), nsubj (34; 3% instances), cc (32; 3% instances), nmod (31; 3% instances), vocative (31; 3% instances), conj (24; 2% instances), obl (22; 2% instances), discourse (17; 1% instances), amod (12; 1% instances), aux (12; 1% instances), mark (12; 1% instances), parataxis:hashtag (12; 1% instances), flat:name (8; 1% instances), obj (7; 1% instances), dep (6; 1% instances), expl (6; 1% instances), advcl (4; 0% instances), appos (4; 0% instances), det:poss (3; 0% instances), iobj (3; 0% instances), acl:relcl (2; 0% instances), aux:pass (2; 0% instances), nsubj:pass (2; 0% instances), acl (1; 0% instances), ccomp (1; 0% instances), dislocated (1; 0% instances), expl:impers (1; 0% instances), orphan (1; 0% instances), parataxis:appos (1; 0% instances), parataxis:insert (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: X (291; 25% instances), PUNCT (196; 17% instances), DET (113; 10% instances), SYM (88; 8% instances), ADP (84; 7% instances), PROPN (72; 6% instances), NOUN (58; 5% instances), AUX (49; 4% instances), NUM (40; 3% instances), ADV (38; 3% instances), CCONJ (32; 3% instances), VERB (28; 2% instances), PRON (26; 2% instances), ADJ (17; 1% instances), SCONJ (11; 1% instances), INTJ (8; 1% instances)