Treebank Statistics: UD_Italian-VIT: POS Tags: X
There are 220 X
lemmas (1%), 219 X
types (1%) and 402 X
tokens (0%).
Out of 17 observed tags, the rank of X
is: 7 in number of lemmas, 7 in number of types and 15 in number of tokens.
The 10 most frequent X
lemmas: joint, venture, work, personal, baby, cd, computer, sitter, station, condicio
The 10 most frequent X
types: joint, venture, personal, station, work, baby, cd, computer, sitter, condicio
The 10 most frequent ambiguous lemmas: work (X 9, NOUN 2), personal (NOUN 8, X 8), computer (NOUN 20, X 7), station (X 7, NOUN 2), facile (ADJ 18, X 5), bond (X 4, NOUN 3), top (NOUN 4, X 4, ADJ 2), business (NOUN 5, X 3), pro (X 3, ADP 2, ADV 1), the (X 3, DET 2)
The 10 most frequent ambiguous types: personal (NOUN 8, X 8), station (X 8, NOUN 2), work (X 8, NOUN 2), computer (NOUN 20, X 7), facile (ADJ 17, X 5), bond (X 4, NOUN 2), c’ (PRON 174, X 3), top (X 4, NOUN 3, ADJ 2), business (NOUN 5, X 3), pro (X 3, ADP 2, ADV 1)
- personal
- station
- work
- computer
- facile
- bond
- X 4: Il prezzo e la cedola di i titoli saranno fissati oggi , garantendo un premio di rendimento in l’ area di 350 basis point ( il 3,5 % ) su il treasury bond di riferimento .
- NOUN 2: A metà seduta , il bond trentennale era in rialzo di 25,32 e aveva toccato quota , pari a un rendimento di il 7,36 per cento .
- c’
- top
- X 4: Niente top issime , ma le top model ci saranno .
- NOUN 3: Niente top issime , ma le top model ci saranno .
- ADJ 2: Durante un convegno sponsorizzato da la rivista Forbes a Annapolis , molti top managers hanno detto di puntare su i mercati esteri come motore per la crescita e di non aspettar si per il momento una ripresa forte di i consumi in gli Stati Uniti .
- business
- pro
- X 3: I valori di il Pil pro capite di la provincia di Oristano si collocano a il di sotto di la media meridionale ( 14,7 milioni per abitante contro il 16,3 relativo a il Sud in generale ) , mentre la graduatoria nazionale non mostra mutamenti di posizione in il decennio ottanta ( star88 posto ) .
- ADP 2: Lettere , pro memoria , dati e previsioni relative a il suo settore di responsabilità a l’ interno di l’ azienda .
- ADV 1: Sarebbe sbagliato leggere tutto pro o contro Berlusconi .
Morphology
The form / lemma ratio of X
is 0.995455 (the average of all parts of speech is 1.501662).
The 1st highest number of forms (2) was observed with the lemma “work”: station, work.
The 2nd highest number of forms (1) was observed with the lemma “ANTIHOOLIGANS”: antihooligans.
The 3rd highest number of forms (1) was observed with the lemma “Allons”: allons.
X
occurs with 3 features: Foreign (393; 98% instances), Gender (59; 15% instances), Number (27; 7% instances)
X
occurs with 5 feature-value pairs: Foreign=Yes
, Gender=Fem
, Gender=Masc
, Number=Plur
, Number=Sing
X
occurs with 8 feature combinations.
The most frequent feature combination is Foreign=Yes
(334 tokens).
Examples: joint, venture, baby, cd, sitter, rom, condicio, par, est, facile
Relations
X
nodes are attached to their parents using 15 different relations: flat:foreign (178; 44% instances), nmod (94; 23% instances), obl (25; 6% instances), nsubj (20; 5% instances), obj (17; 4% instances), conj (15; 4% instances), appos (13; 3% instances), compound (13; 3% instances), root (13; 3% instances), flat:name (4; 1% instances), nsubj:pass (3; 1% instances), flat (2; 0% instances), parataxis (2; 0% instances), xcomp (2; 0% instances), acl:relcl (1; 0% instances)
Parents of X
nodes belong to 9 different parts of speech: X (180; 45% instances), NOUN (110; 27% instances), VERB (65; 16% instances), PROPN (22; 5% instances), (13; 3% instances), NUM (4; 1% instances), ADJ (3; 1% instances), PRON (3; 1% instances), ADV (2; 0% instances)
197 (49%) X
nodes are leaves.
35 (9%) X
nodes have one child.
48 (12%) X
nodes have two children.
122 (30%) X
nodes have three or more children.
The highest child degree of a X
node is 11.
Children of X
nodes are attached using 25 different relations: flat:foreign (174; 27% instances), det (111; 17% instances), case (100; 15% instances), punct (97; 15% instances), nmod (48; 7% instances), amod (19; 3% instances), appos (13; 2% instances), acl:relcl (11; 2% instances), cc (10; 2% instances), conj (8; 1% instances), nummod (8; 1% instances), advmod (6; 1% instances), compound (6; 1% instances), flat:name (6; 1% instances), advcl (5; 1% instances), cop (5; 1% instances), nsubj (4; 1% instances), acl (3; 0% instances), det:poss (3; 0% instances), fixed (3; 0% instances), csubj (2; 0% instances), flat (2; 0% instances), parataxis (2; 0% instances), aux (1; 0% instances), obl:agent (1; 0% instances)
Children of X
nodes belong to 14 different parts of speech: X (180; 28% instances), DET (114; 18% instances), ADP (103; 16% instances), PUNCT (97; 15% instances), NOUN (50; 8% instances), VERB (23; 4% instances), ADJ (22; 3% instances), PROPN (21; 3% instances), NUM (11; 2% instances), CCONJ (10; 2% instances), ADV (7; 1% instances), AUX (6; 1% instances), PRON (2; 0% instances), SYM (2; 0% instances)