Statistics of X in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_English-GUM: POS Tags: `X`

There are 252 X lemmas (1%), 271 X types (1%) and 419 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: _, et, al., de, 1, 1., 2., in, 2, 3

The 10 most frequent X types: et, al., de, 1, 1., 2., in, 2, 3, situ

The 10 most frequent ambiguous lemmas: et (X 18, PROPN 3), de (PROPN 24, X 9), 1 (NUM 153, X 7), 1. (X 7, NUM 2), 2. (X 7, NUM 2), in (ADP 4102, SCONJ 105, ADV 41, X 6, NOUN 1), 2 (NUM 130, X 5), 3 (NUM 80, X 5), 3. (X 4, NUM 1), 4 (NUM 66, X 4)

The 10 most frequent ambiguous types: et (X 18, PROPN 3), de (PROPN 24, X 9), 1 (NUM 154, X 7), 1. (X 7, NUM 2), 2. (X 7, NUM 2), in (ADP 3705, SCONJ 101, ADV 41, X 6), 2 (NUM 130, X 5), 3 (NUM 80, X 5), 3. (X 4, NUM 1), 4 (NUM 66, X 4)

et
- X 18: You know , kissing , et cetera , et cetera .
- PROPN 3: François Truffaut ‘s New Wave film Jules et Jim ( 1962 ) , her biggest success internationally , is centered on her magnetic starring role . [ 2 ]
de
- PROPN 24: Sir de Villiers Graaff ,
- X 9: J’ ai besoin de tout mon courage pour mourir à vingt ans ! ”
1
- NUM 154: Part 1 of 2 :
- X 7: 1 Introduction
1.
- X 7: 1. Matter is composed of exceedingly small particles called atoms .
- NUM 2: 1. Legislation to lower the interest rate on SBA loans :
2.
- X 7: 2. GUJJOLAAY EEGIMAA , ITS SPEAKERS AND THEIR NEIGHBOURS
- NUM 2: 2. Legislation to increase the amount of a loan which does not have to be repaid :
in
- ADP 3705: Emperor Joshua Norton , in full military regalia , circa 1880 or earlier
- SCONJ 101: I ‘m just interested in when did she get that .
- ADV 41: But if you like slugs , you ‘re in – you ‘re in luck .
- X 6: The Jacob cycle at Auckland Castle is the only UK example of a continental collection preserved in situ in purpose - built surroundings .
2
- NUM 130: Part 1 of 2 :
- X 5: 2 Chaharbagh Boulevard .
3
- NUM 80: If needed , that ‘s in my condensed book at Tab 3 .
- X 5: 3 Prepare seed containers .
3.
- X 4: 3. Conclusions
- NUM 1: 3. Legislation to rebuild public recreation areas :
4
- NUM 66: It can take as long as 4 to 4 years before you get flowers .
- X 4: 4 Sheikh Lotf Allah Mosque , Naqsh-e Jahan Square , east side .

Morphology

The form / lemma ratio of X is 1.075397 (the average of all parts of speech is 1.248450).

The 1st highest number of forms (20) was observed with the lemma “_”: 400, 70, age, arranged, balls, cent, etings, ever, ey, hand, is, less, m, more, n, rhaps, right, stand, the, yyone.

The 2nd highest number of forms (1) was observed with the lemma “(a)”: (a).

The 3rd highest number of forms (1) was observed with the lemma “(b)”: (b).

X occurs with 2 features: Foreign (96; 23% instances), Abbr (22; 5% instances)

X occurs with 2 feature-value pairs: Abbr=Yes, Foreign=Yes

X occurs with 4 feature combinations. The most frequent feature combination is _ (302 tokens). Examples: et, 1, 1., 2., in, 2, 3, situ, 3., 4

Relations

X nodes are attached to their parents using 23 different relations: discourse (94; 22% instances), flat (68; 16% instances), conj (40; 10% instances), appos (27; 6% instances), compound (27; 6% instances), goeswith (24; 6% instances), root (23; 5% instances), nmod (19; 5% instances), cc (18; 4% instances), obl (15; 4% instances), nsubj (13; 3% instances), amod (9; 2% instances), obj (8; 2% instances), xcomp (8; 2% instances), case (7; 2% instances), parataxis (7; 2% instances), orphan (4; 1% instances), dep (2; 0% instances), nmod:poss (2; 0% instances), advcl (1; 0% instances), ccomp (1; 0% instances), list (1; 0% instances), obl:agent (1; 0% instances)

Parents of X nodes belong to 10 different parts of speech: X (135; 32% instances), NOUN (85; 20% instances), VERB (84; 20% instances), PROPN (69; 16% instances), (23; 5% instances), ADV (12; 3% instances), PRON (5; 1% instances), NUM (3; 1% instances), CCONJ (2; 0% instances), INTJ (1; 0% instances)

222 (53%) X nodes are leaves.

89 (21%) X nodes have one child.

44 (11%) X nodes have two children.

64 (15%) X nodes have three or more children.

The highest child degree of a X node is 9.

Children of X nodes are attached using 26 different relations: punct (156; 35% instances), flat (76; 17% instances), case (38; 8% instances), cc (34; 8% instances), conj (21; 5% instances), compound (19; 4% instances), nmod (17; 4% instances), appos (14; 3% instances), det (14; 3% instances), dep (7; 2% instances), cop (6; 1% instances), advmod (5; 1% instances), mark (5; 1% instances), parataxis (5; 1% instances), amod (4; 1% instances), nsubj (4; 1% instances), reparandum (4; 1% instances), discourse (3; 1% instances), nmod:poss (3; 1% instances), acl (2; 0% instances), acl:relcl (2; 0% instances), nmod:unmarked (2; 0% instances), obj (2; 0% instances), vocative (2; 0% instances), xcomp (2; 0% instances), aux (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: PUNCT (156; 35% instances), X (135; 30% instances), ADP (30; 7% instances), NOUN (25; 6% instances), PROPN (25; 6% instances), CCONJ (17; 4% instances), DET (14; 3% instances), ADJ (11; 2% instances), AUX (7; 2% instances), ADV (6; 1% instances), NUM (6; 1% instances), VERB (5; 1% instances), INTJ (4; 1% instances), PART (3; 1% instances), PRON (2; 0% instances), SCONJ (2; 0% instances)

Treebank Statistics: UD_English-GUM: POS Tags: X

Morphology

Relations

Treebank Statistics: UD_English-GUM: POS Tags: `X`