Statistics of X in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_English-EWT: POS Tags: `X`

There are 140 X lemmas (1%), 245 X types (1%) and 447 X tokens (0%). Out of 17 observed tags, the rank of X is: 8 in number of lemmas, 7 in number of types and 17 in number of tokens.

The 10 most frequent X lemmas: _, .doc, (, ), -, Analysis_0712, access, and, pricing, transmission

The 10 most frequent X types: .doc, -, (, ), Access, Analysis_0712, COMMUNICATIONS, Oct, Pricing, Transmission

The 10 most frequent ambiguous lemmas: _ (X 168, PUNCT 5), ( (PUNCT 1030, X 7), ) (PUNCT 1067, X 7), - (PUNCT 1647, SYM 119, X 6), access (NOUN 32, VERB 6, X 6), and (CCONJ 6111, X 6), pricing (NOUN 13, X 6), transmission (X 6, NOUN 5), mid (X 5, ADJ 1, ADV 1), & (CCONJ 139, X 4)

The 10 most frequent ambiguous types: - (PUNCT 1626, SYM 119, X 8), ( (PUNCT 1030, X 7), ) (PUNCT 1067, X 7), Oct (PROPN 8, X 6), Pricing (X 6, VERB 1), Transmission (X 6, PROPN 2, NOUN 1), a (DET 4542, ADP 7, NUM 6, NOUN 4, ADV 2, X 2, ADJ 1, AUX 1, CCONJ 1, PART 1), and (CCONJ 5915, X 6, DET 5, ADP 2), for (ADP 2029, SCONJ 183, CCONJ 5, X 5), mid (X 4, NOUN 2, ADJ 1, ADV 1)

-
- PUNCT 1626: TEHRAN ( AFP ) -
- SYM 119: Intercept : - 0.3931 ( 0.0076 )
- X 8: « Compaq.com - notebook.url »
(
- PUNCT 1030: TEHRAN ( AFP ) -
- X 7: - ENRON-CPS ( GISB rev1 ) .doc
)
- PUNCT 1067: TEHRAN ( AFP ) -
- X 7: - ENRON-CPS ( GISB rev1 ) .doc
Oct
- PROPN 8: The Effective Date ( or start date ) is 01 Oct 2001 .
- X 6: Mary « MEH-risk Oct 20 »
Pricing
- X 6: « Alberta Transmission Access and Pricing Analysis_0712 .doc »
- VERB 1: I used to e-mail Vince Kaminski about the advice on his article “ The Challenge of Pricing and Risk Managing Electricity Derivatives “ and he had mailed me the copy .
Transmission
- X 6: « Alberta Transmission Access and Pricing Analysis_0712 .doc »
- PROPN 2: I was so impressed with the honesty and integrity of Mike and everyone at Eagle Transmission !
- NOUN 1: Transmission Expansion and Systems in Transition Conference Feb. 5 - 8 , 2002 , Miami , Florida
a
- DET 4542: Read the entire article ; there ‘s a punchline , too .
- ADP 7: Big deal kind a stuff .
- NUM 6: 2 ) I would like to say on a island with an a ) all inclusive resort ( if possible ) , and a beach front room
- NOUN 4: Top range of bike , cheap prices , excellent a +++
- ADV 2: Also , any tour recommendations would be very helpful a well .
- X 2: A la guerre c’est comme a la guerre !
- ADJ 1: there will be talent and opportunity a plenty on the market soon .
- AUX 1: yea i guess but rabbits a easily escape a pen or another rabbit could get in there and that rabbit could be the opposite gender .
- CCONJ 1: But word of advice if you ‘re get your girlfriend a laptop make sure it s a good brand a not something like DELL , Acer , Asus , eMachines etc .
- PART 1: I feel X - BOX is a very smooth system i own it like 3 years , it s very compatible to previous versions and mostly important i was very comfortable with the User Interface and the JOYSTICK …. coz you do nt wan a hold a joystick that gives you discomfort .
and
- CCONJ 5915: Right now that seems to be the US , EU , and IAEA .
- X 6: « Alberta Transmission Access and Pricing Analysis_0712 .doc »
- DET 5: it s your cat you can pick and name you want
- ADP 2: The people there attempt to come across and professional and nice , but I was disappointed with their customer service .
for
- ADP 2029: Yet we did n’t charge them for the evacuation .
- SCONJ 183: Thanks for thinking of me to send it to .
- CCONJ 5: Neither was this day less fortunate to his father Philip ; for on the same day he took Potidea ; » - JOHN AUBREY , F.R.S.
- X 5: So they see the pictures flicker slower and there for it seems choppy to them .
mid
- X 4: Otherwise , I will be sending it to Peoples as our final revision by mid morning .
- NOUN 2: Otherwise , I will be sending it to Peoples as our final revision by mid morning .
- ADJ 1: We did not have big percent of Chinese migration until the mid 90s .
- ADV 1: So I kept reading and then I saw the dates , it was from mid day Friday and arriving home mid day monday . :(

Morphology

The form / lemma ratio of X is 1.750000 (the average of all parts of speech is 1.236432).

The 1st highest number of forms (111) was observed with the lemma “_”: -, 20, 3-5290, @, A, Abramo@ENRON, Akin@ECT, Alatorre@ENRON, Bertone@ENRON_DEVELOPMENT, Blaine@ENRON_DEVELOPMENT, Bryngelson@AZURIX, C, COMMUNICATIONS, Castagnola@ENRON_DEVELOPMENT, Castano@EES, Delainey@ECT, Diebner@ECT, Do@ENRON_DEVELOPMENT, Dorsey@ENRON_DEVELOPMENT, E, ECT, Edison@ENRON, Forster@ENRON, Garcia@ENRON, Griffith@ENRON, Hansen@ENRON, Hopkinson@ENRON_DEVELOPMENT, Horton@ENRON, Huble@ENRON, J, Jacoby@ECT, Johnson@ENRON, Kaminski@ECT, Kaufman@ECT, Khan@TRANSREDES, Kindall@ENRON, Lamb@ENRON, Leibman@ENRON, Leigh, Luan, Mann@ENRON, Martinez@ENRON, McConnell@ECT, Montgomery@ENRON, Oct, Olsen@ENRON, P, Palmer@ENRON, Patel@ENRON, Perry@ENRON_DEVELOPMENT, Rance@ENRON, Rice@ENRON, Salinardo@ENRON, Schwartzenburg@ENRON_DEVELOPMENT, Shackleton@ECT, Stephens@ENRON, Sullivan@ENRON, W, Ward, Warner@ENRON, Williams@ENRON_DEVELOPMENT, back, cent, cooked, d, day, deed, donald, dramatic, educated, ever, expose, fall, for, full, get, going, h, hill, ible, in, informed, ive, line, mail, mentioned, morning, night, notebook.url, o, one, order, out, plenty, power, priced, r, respect, s, self, ship, side, standing, t, time, to, together, u, way, were, where.

The 2nd highest number of forms (2) was observed with the lemma “al.”: al, al..

The 3rd highest number of forms (2) was observed with the lemma “et”: et, et..

X occurs with 3 features: Foreign (52; 12% instances), Number (35; 8% instances), Typo (1; 0% instances)

X occurs with 3 feature-value pairs: Foreign=Yes, Number=Sing, Typo=Yes

X occurs with 5 feature combinations. The most frequent feature combination is _ (360 tokens). Examples: -, (, ), Access, Analysis_0712, COMMUNICATIONS, Oct, Pricing, Transmission, and

Relations

X nodes are attached to their parents using 18 different relations: flat (195; 44% instances), goeswith (168; 38% instances), compound (44; 10% instances), amod (7; 2% instances), root (5; 1% instances), appos (4; 1% instances), case (4; 1% instances), conj (4; 1% instances), parataxis (4; 1% instances), cc (2; 0% instances), nmod (2; 0% instances), obl (2; 0% instances), dep (1; 0% instances), discourse (1; 0% instances), nmod:tmod (1; 0% instances), obj (1; 0% instances), obl:npmod (1; 0% instances), reparandum (1; 0% instances)

Parents of X nodes belong to 12 different parts of speech: NOUN (221; 49% instances), PROPN (88; 20% instances), X (57; 13% instances), ADJ (23; 5% instances), VERB (19; 4% instances), ADV (17; 4% instances), PRON (7; 2% instances), ADP (6; 1% instances), (5; 1% instances), AUX (2; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)

402 (90%) X nodes are leaves.

22 (5%) X nodes have one child.

8 (2%) X nodes have two children.

15 (3%) X nodes have three or more children.

The highest child degree of a X node is 12.

Children of X nodes are attached using 12 different relations: flat (38; 34% instances), punct (38; 34% instances), compound (12; 11% instances), conj (6; 5% instances), case (4; 4% instances), nmod (3; 3% instances), nummod (3; 3% instances), cc (2; 2% instances), nmod:tmod (2; 2% instances), cop (1; 1% instances), nsubj (1; 1% instances), parataxis (1; 1% instances)

Children of X nodes belong to 9 different parts of speech: X (57; 51% instances), PUNCT (38; 34% instances), NOUN (6; 5% instances), ADP (3; 3% instances), NUM (2; 2% instances), VERB (2; 2% instances), ADJ (1; 1% instances), AUX (1; 1% instances), PRON (1; 1% instances)

Treebank Statistics: UD_English-EWT: POS Tags: X

Morphology

Relations

Treebank Statistics: UD_English-EWT: POS Tags: `X`