Statistics of X in UD


This page pertains to UD version 2.


Treebank Statistics: UD_English-GUM: POS Tags: `X`

There are 224 X lemmas (1%), 240 X types (1%) and 362 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: _, et, al., de, 1, 1., 2., in, situ, 2

The 10 most frequent X types: et, al., de, 1, 1., 2., in, situ, 2, 3

The 10 most frequent ambiguous lemmas: et (X 15, PROPN 3), de (PROPN 24, X 8), 1 (NUM 148, X 6), 1. (X 6, NUM 2), 2. (X 6, NUM 2), in (ADP 3423, SCONJ 104, ADV 25, X 5, NOUN 1), 2 (NUM 115, X 4), 3 (NUM 76, X 4), 4 (NUM 61, X 4), 3. (X 3, NUM 1)

The 10 most frequent ambiguous types: et (X 15, PROPN 3), de (PROPN 24, X 8), 1 (NUM 149, X 6), 1. (X 6, NUM 2), 2. (X 6, NUM 2), in (ADP 3086, SCONJ 93, ADV 25, X 5), 2 (NUM 115, X 4), 3 (NUM 76, X 4), 4 (NUM 61, X 4), 3. (X 3, NUM 1)

et
- X 15: You know , kissing , et cetera , et cetera .
- PROPN 3: François Truffaut ‘s New Wave film Jules et Jim ( 1962 ) , her biggest success internationally , is centered on her magnetic starring role . [ 2 ]
de
- PROPN 24: Sir de Villiers Graaff ,
- X 8: J’ ai besoin de tout mon courage pour mourir à vingt ans ! ”
1
- NUM 149: Part 1 of 2 :
- X 6: 1 Introduction
1.
- X 6: 1. Matter is composed of exceedingly small particles called atoms .
- NUM 2: 1. Legislation to lower the interest rate on SBA loans :
2.
- X 6: 2. GUJJOLAAY EEGIMAA , ITS SPEAKERS AND THEIR NEIGHBOURS
- NUM 2: 2. Legislation to increase the amount of a loan which does not have to be repaid :
in
- ADP 3086: Emperor Joshua Norton , in full military regalia , circa 1880 or earlier
- SCONJ 93: We must act today in order to preserve tomorrow .
- ADV 25: But if you like slugs , you ‘re in – you ‘re in luck .
- X 5: The Jacob cycle at Auckland Castle is the only UK example of a continental collection preserved in situ in purpose - built surroundings .
2
- NUM 115: Part 1 of 2 :
- X 4: 2 Chaharbagh Boulevard .
3
- NUM 76: That is a Category 3 storm .
- X 4: 3 Prepare seed containers .
4
- NUM 61: It can take as long as 4 to 4 years before you get flowers .
- X 4: 4 Sheikh Lotf Allah Mosque , Naqsh-e Jahan Square , east side .
3.
- X 3: 3. Conclusions
- NUM 1: 3. Legislation to rebuild public recreation areas :

Morphology

The form / lemma ratio of X is 1.071429 (the average of all parts of speech is 1.236974).
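The reported ratio matches the number of X types divided by the number of X lemmas given at the top of this page (240 and 224). The sketch below reproduces that arithmetic; treating "forms" as "types" is an assumption that happens to fit the published figure, and the function name is hypothetical.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma (assumed definition)."""
    return n_types / n_lemmas


# Counts taken from this page: 240 X types, 224 X lemmas.
ratio = form_lemma_ratio(240, 224)
print(f"{ratio:.6f}")  # prints 1.071429
```

A ratio this close to 1 suggests that most X lemmas appear in a single form, which fits a tag that mostly covers foreign words, list markers and fragments.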

The 1st highest number of forms (17) was observed with the lemma “_”: 400, age, arranged, balls, cent, ever, ey, hand, is, less, m, more, n, na, rhaps, right, the.

The 2nd highest number of forms (1) was observed with the lemma “(a)”: (a).

The 3rd highest number of forms (1) was observed with the lemma “(b)”: (b).

X occurs with 3 features: Foreign (80; 22% instances), Abbr (15; 4% instances), Polarity (1; 0% instances)

X occurs with 3 feature-value pairs: Abbr=Yes, Foreign=Yes, Polarity=Neg

X occurs with 4 feature combinations. The most frequent feature combination is _ (267 tokens). Examples: et, 1, 1., 2., in, situ, 2, 3, 4, Formica

Relations

X nodes are attached to their parents using 25 different relations: discourse (79; 22% instances), flat (49; 14% instances), conj (36; 10% instances), compound (30; 8% instances), appos (26; 7% instances), goeswith (21; 6% instances), root (20; 6% instances), nmod (18; 5% instances), cc (15; 4% instances), nsubj (11; 3% instances), obl (11; 3% instances), amod (9; 2% instances), obj (7; 2% instances), xcomp (7; 2% instances), case (5; 1% instances), orphan (4; 1% instances), dep (3; 1% instances), parataxis (3; 1% instances), nmod:poss (2; 1% instances), advcl (1; 0% instances), ccomp (1; 0% instances), fixed (1; 0% instances), list (1; 0% instances), nmod:tmod (1; 0% instances), obl:agent (1; 0% instances)
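A relation distribution like the one above can be recomputed from a CoNLL-U file by counting the DEPREL column for every token whose UPOS is X. The following is a minimal sketch using only the standard library; the function name is hypothetical and the two-token sample is illustrative, not copied from the treebank.

```python
from collections import Counter

# Toy CoNLL-U fragment; the annotations are illustrative, not taken from GUM.
SAMPLE = """\
# text = et cetera
1\tet\tet\tX\tFW\tForeign=Yes\t0\troot\t_\t_
2\tcetera\tcetera\tX\tFW\tForeign=Yes\t1\tflat\t_\t_

"""


def deprels_for_upos(conllu_text: str, upos: str) -> Counter:
    """Count dependency relations of all tokens carrying the given UPOS tag."""
    counts = Counter()
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep only regular tokens: 10 columns and a plain integer ID
        # (this skips multiword ranges such as "1-2" and empty nodes such as "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:      # column 4 is UPOS
            counts[cols[7]] += 1  # column 8 is DEPREL
    return counts


x_relations = deprels_for_upos(SAMPLE, "X")
print(x_relations.most_common())
```

To run it on a real treebank file, read the file contents as UTF-8 text and pass them in place of `SAMPLE`.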

Parents of X nodes belong to 10 different parts of speech: X (120; 33% instances), NOUN (72; 20% instances), VERB (70; 19% instances), PROPN (61; 17% instances), no parent, i.e. the X node is the sentence root (20; 6% instances), ADV (10; 3% instances), NUM (3; 1% instances), PRON (3; 1% instances), CCONJ (2; 1% instances), INTJ (1; 0% instances)

194 (54%) X nodes are leaves.

71 (20%) X nodes have one child.

38 (10%) X nodes have two children.

59 (16%) X nodes have three or more children.

The highest child degree of an X node is 9.
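The four child-count groups above add up to the 362 X tokens, and the percentages follow from dividing each count by that total. The sketch below checks this; the dictionary keys are labels chosen for illustration.

```python
# Counts of X nodes by number of children, as reported on this page.
degree_counts = {
    "leaf": 194,
    "one child": 71,
    "two children": 38,
    "three or more": 59,
}

total = sum(degree_counts.values())

# Share of each group, rounded to whole percent as on the page.
shares = {label: round(100 * n / total) for label, n in degree_counts.items()}

print(total)
for label, pct in shares.items():
    print(f"{label}: {pct}%")
```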

Children of X nodes are attached using 28 different relations: punct (139; 35% instances), flat (57; 14% instances), case (33; 8% instances), cc (31; 8% instances), compound (25; 6% instances), conj (21; 5% instances), nmod (15; 4% instances), appos (13; 3% instances), det (12; 3% instances), dep (8; 2% instances), cop (5; 1% instances), advmod (4; 1% instances), mark (4; 1% instances), nsubj (4; 1% instances), reparandum (4; 1% instances), amod (3; 1% instances), discourse (3; 1% instances), parataxis (3; 1% instances), acl (2; 1% instances), acl:relcl (2; 1% instances), nmod:poss (2; 1% instances), nmod:tmod (2; 1% instances), obj (2; 1% instances), xcomp (2; 1% instances), aux (1; 0% instances), fixed (1; 0% instances), nummod (1; 0% instances), vocative (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: PUNCT (139; 35% instances), X (120; 30% instances), ADP (27; 7% instances), NOUN (23; 6% instances), PROPN (21; 5% instances), CCONJ (17; 4% instances), DET (12; 3% instances), ADJ (10; 3% instances), AUX (6; 2% instances), NUM (6; 2% instances), ADV (5; 1% instances), VERB (5; 1% instances), INTJ (4; 1% instances), PART (2; 1% instances), SCONJ (2; 1% instances), PRON (1; 0% instances)
