home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-PoSTWITA: POS Tags: X

There are 770 X lemmas (5%), 774 X types (4%) and 1064 X tokens (1%). Out of 16 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: of, Prince, i, live, and, a, on, in, the, me

The 10 most frequent X types: of, Prince, i, live, and, a, on, in, the, my

The 10 most frequent ambiguous lemmas: of (X 3, PROPN 2), Prince (X 22, PROPN 7), i (X 15, INTJ 6, PROPN 3), live (X 13, NOUN 2), and (X 10, PROPN 1), a (ADP 2899, PROPN 17, X 9, INTJ 3, SYM 1), in (ADP 1236, X 6, PROPN 1), the (X 8, NOUN 1, PROPN 1), me (PRON 165, X 6, PROPN 1), new (X 4, PROPN 3)

The 10 most frequent ambiguous types: of (X 3, PROPN 2), Prince (X 22, PROPN 7), i (DET 1151, INTJ 6, PROPN 3, X 2), live (X 13, NOUN 1), and (X 10, PROPN 1), a (ADP 2759, PROPN 17, X 9, AUX 5, INTJ 2, DET 1), in (ADP 1137, X 6, DET 1, PROPN 1), the (X 5, NOUN 1, PROPN 1), me (PRON 154, X 5, PROPN 1), video (NOUN 56, X 4)

Morphology

The form / lemma ratio of X is 1.005195 (the average of all parts of speech is 1.303101).

The 1st highest number of forms (3) was observed with the lemma “i”: i, mausolei, moltissimi.

The 2nd highest number of forms (2) was observed with the lemma “andare”: Vamoooooosssssss, namo.

The 3rd highest number of forms (2) was observed with the lemma “for”: 4, for.

X occurs with 5 features: Number (2; 0% instances), Clitic (1; 0% instances), Gender (1; 0% instances), Person (1; 0% instances), PronType (1; 0% instances)

X occurs with 5 feature-value pairs: Clitic=Yes, Gender=Masc, Number=Sing, Person=2, PronType=Prs

X occurs with 3 feature combinations. The most frequent feature combination is _ (1062 tokens). Examples: of, Prince, i, live, and, a, on, in, the, my

Relations

X nodes are attached to their parents using 34 different relations: flat:foreign (291; 27% instances), dep (156; 15% instances), parataxis (105; 10% instances), nmod (78; 7% instances), obl (56; 5% instances), root (50; 5% instances), conj (49; 5% instances), obj (45; 4% instances), flat (43; 4% instances), discourse (32; 3% instances), list (25; 2% instances), nsubj (24; 2% instances), amod (16; 2% instances), appos (13; 1% instances), flat:name (13; 1% instances), case (10; 1% instances), advcl (8; 1% instances), vocative (7; 1% instances), ccomp (6; 1% instances), xcomp (6; 1% instances), discourse:emo (5; 0% instances), parataxis:obj (4; 0% instances), acl:relcl (3; 0% instances), fixed (3; 0% instances), nsubj:pass (3; 0% instances), compound (2; 0% instances), goeswith (2; 0% instances), iobj (2; 0% instances), parataxis:hashtag (2; 0% instances), advmod (1; 0% instances), dislocated (1; 0% instances), mark (1; 0% instances), obl:agent (1; 0% instances), parataxis:discourse (1; 0% instances)

Parents of X nodes belong to 13 different parts of speech: VERB (293; 28% instances), X (286; 27% instances), NOUN (183; 17% instances), PROPN (107; 10% instances), SYM (68; 6% instances), (50; 5% instances), ADJ (27; 3% instances), PRON (21; 2% instances), INTJ (13; 1% instances), ADV (8; 1% instances), NUM (6; 1% instances), ADP (1; 0% instances), DET (1; 0% instances)

590 (55%) X nodes are leaves.

168 (16%) X nodes have one child.

119 (11%) X nodes have two children.

187 (18%) X nodes have three or more children.

The highest child degree of a X node is 12.

Children of X nodes are attached using 43 different relations: flat:foreign (248; 20% instances), punct (198; 16% instances), det (123; 10% instances), case (94; 8% instances), flat (53; 4% instances), nsubj (44; 4% instances), advmod (38; 3% instances), cop (37; 3% instances), nummod (37; 3% instances), cc (35; 3% instances), dep (35; 3% instances), parataxis (35; 3% instances), nmod (34; 3% instances), vocative:mention (30; 2% instances), conj (27; 2% instances), obl (24; 2% instances), parataxis:hashtag (14; 1% instances), amod (13; 1% instances), aux (13; 1% instances), mark (13; 1% instances), obj (13; 1% instances), discourse (12; 1% instances), flat:name (10; 1% instances), discourse:emo (9; 1% instances), vocative (8; 1% instances), advcl (6; 0% instances), expl (6; 0% instances), acl:relcl (4; 0% instances), appos (4; 0% instances), det:poss (3; 0% instances), iobj (3; 0% instances), nsubj:pass (3; 0% instances), aux:pass (2; 0% instances), goeswith (2; 0% instances), parataxis:appos (2; 0% instances), acl (1; 0% instances), ccomp (1; 0% instances), compound (1; 0% instances), dislocated (1; 0% instances), expl:impers (1; 0% instances), orphan (1; 0% instances), parataxis:insert (1; 0% instances), xcomp (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: X (286; 23% instances), PUNCT (198; 16% instances), DET (126; 10% instances), SYM (102; 8% instances), ADP (93; 8% instances), NOUN (83; 7% instances), PROPN (82; 7% instances), AUX (52; 4% instances), ADV (42; 3% instances), NUM (40; 3% instances), CCONJ (38; 3% instances), VERB (33; 3% instances), PRON (29; 2% instances), ADJ (17; 1% instances), SCONJ (12; 1% instances), INTJ (7; 1% instances)