Statistics of X in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_English-EWT: POS Tags: `X`

There are 151 X lemmas (1%), 263 X types (1%) and 501 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 17 in number of tokens.

The 10 most frequent X lemmas: _, .doc, s, (, ), alberta, -, Analysis_0712, MEH-risk, Oct

The 10 most frequent X types: .doc, s, -, (, ), Alberta, Access, Analysis_0712, COMMUNICATIONS, MEH-risk

The 10 most frequent ambiguous lemmas: _ (X 172, PUNCT 5), s (X 10, NOUN 2, PROPN 1), ( (PUNCT 1030, X 7), ) (PUNCT 1067, X 7), - (PUNCT 1648, SYM 119, X 6), access (NOUN 32, VERB 6, X 6), and (CCONJ 6112, X 6), pricing (NOUN 13, X 6), transmission (X 6, NOUN 4), enron (PROPN 7, X 5)

The 10 most frequent ambiguous types: s (AUX 104, PART 99, X 11, PRON 7, VERB 5, NOUN 2, PROPN 1), - (PUNCT 1627, SYM 119, X 8), ( (PUNCT 1030, X 7), ) (PUNCT 1067, X 7), Alberta (X 7, PROPN 1), Oct (PROPN 8, X 6), Pricing (X 6, VERB 1), Transmission (X 6, PROPN 3), a (DET 4542, ADP 7, NUM 6, NOUN 4, ADV 2, X 2, ADJ 1, AUX 1, CCONJ 1, PART 1), and (CCONJ 5916, X 6, DET 5, ADP 2)

s
- AUX 104: It s all interesting stuff .
- PART 99: you guys want to watch the game at woodrow s tomorrow ?
- X 11: Just to let you all know Matt has confirmed the booking for 3rd Dec i s OK .
- PRON 7: Let s get together soon .
- VERB 5: i always thought there s no custom charges for gifts .
- NOUN 2: In clause ( e ) , delete the “ s “ : from the word consolidation .
- PROPN 1: s
-
- PUNCT 1627: TEHRAN ( AFP ) -
- SYM 119: Intercept : - 0.3931 ( 0.0076 )
- X 8: « Compaq.com - notebook.url »
(
- PUNCT 1030: TEHRAN ( AFP ) -
- X 7: - ENRON-CPS ( GISB rev1 ) .doc
)
- PUNCT 1067: TEHRAN ( AFP ) -
- X 7: - ENRON-CPS ( GISB rev1 ) .doc
Alberta
- X 7: « File : Tabors Conflict Letter Alberta Export 050901.doc »
- PROPN 1: Yes , we should add compliance with OTC Derivatives and / or Commodity Contracts and Qualified Party requirements of the Securities Act ( Alberta ) , Securities Act ( British Columbia ) and Securities Act ( Ontario ) .
Oct
- PROPN 8: The Effective Date ( or start date ) is 01 Oct 2001 .
- X 6: Mary « MEH-risk Oct 20 »
Pricing
- X 6: « Alberta Transmission Access and Pricing Analysis_0712 .doc »
- VERB 1: I used to e-mail Vince Kaminski about the advice on his article “ The Challenge of Pricing and Risk Managing Electricity Derivatives “ and he had mailed me the copy .
Transmission
- X 6: « Alberta Transmission Access and Pricing Analysis_0712 .doc »
- PROPN 3: I was so impressed with the honesty and integrity of Mike and everyone at Eagle Transmission !
a
- DET 4542: Read the entire article ; there ‘s a punchline , too .
- ADP 7: Big deal kind a stuff .
- NUM 6: 2 ) I would like to say on a island with an a ) all inclusive resort ( if possible ) , and a beach front room
- NOUN 4: Top range of bike , cheap prices , excellent a +++
- ADV 2: Also , any tour recommendations would be very helpful a well .
- X 2: A la guerre c’est comme a la guerre !
- ADJ 1: there will be talent and opportunity a plenty on the market soon .
- AUX 1: yea i guess but rabbits a easily escape a pen or another rabbit could get in there and that rabbit could be the opposite gender .
- CCONJ 1: But word of advice if you ‘re get your girlfriend a laptop make sure it s a good brand a not something like DELL , Acer , Asus , eMachines etc .
- PART 1: I feel X - BOX is a very smooth system i own it like 3 years , it s very compatible to previous versions and mostly important i was very comfortable with the User Interface and the JOYSTICK …. coz you do nt wan a hold a joystick that gives you discomfort .
and
- CCONJ 5916: Right now that seems to be the US , EU , and IAEA .
- X 6: « Alberta Transmission Access and Pricing Analysis_0712 .doc »
- DET 5: it s your cat you can pick and name you want
- ADP 2: The people there attempt to come across and professional and nice , but I was disappointed with their customer service .

Morphology

The form / lemma ratio of X is 1.741722 (the average of all parts of speech is 1.250484).

The 1st highest number of forms (116) was observed with the lemma “_”: -, 3-5290, @, A, Abramo@ENRON, Akin@ECT, Alatorre@ENRON, Bertone@ENRON_DEVELOPMENT, Blaine@ENRON_DEVELOPMENT, Bryngelson@AZURIX, C, COMMUNICATIONS, Castagnola@ENRON_DEVELOPMENT, Castano@EES, Delainey@ECT, Diebner@ECT, Do@ENRON_DEVELOPMENT, Dorsey@ENRON_DEVELOPMENT, E, ECT, Edison@ENRON, Forster@ENRON, Garcia@ENRON, Griffith@ENRON, Hansen@ENRON, Hopkinson@ENRON_DEVELOPMENT, Horton@ENRON, Huble@ENRON, J, Jacoby@ECT, Johnson@ENRON, Kaminski@ECT, Kaufman@ECT, Khan@TRANSREDES, Kindall@ENRON, Lamb@ENRON, Leibman@ENRON, Leigh, Luan, Mann@ENRON, Martinez@ENRON, McConnell@ECT, Montgomery@ENRON, Olsen@ENRON, P, Palmer@ENRON, Patel@ENRON, Perry@ENRON_DEVELOPMENT, Rance@ENRON, Rice@ENRON, Salinardo@ENRON, Schwartzenburg@ENRON_DEVELOPMENT, Shackleton@ECT, Stephens@ENRON, Sullivan@ENRON, W, Ward, Warner@ENRON, Williams@ENRON_DEVELOPMENT, back, buy, cent, charged, cooked, d, day, deed, donald, dramatic, educated, ever, expose, fall, for, full, get, going, h, hill, ible, in, informed, ive, line, mail, mentioned, morning, night, notebook.url, o, one, oone, order, out, paid, perform, pixel, plenty, power, priced, r, respect, s, self, ship, side, standing, structure, t, time, to, together, u, way, were, where.

The 2nd highest number of forms (2) was observed with the lemma “al.”: al, al..

The 3rd highest number of forms (2) was observed with the lemma “enron”: ENRON, Enron.

X occurs with 3 features: Foreign (51; 10% instances), ExtPos (38; 8% instances), Typo (1; 0% instances)

X occurs with 3 feature-value pairs: ExtPos=PROPN, Foreign=Yes, Typo=Yes

X occurs with 4 feature combinations. The most frequent feature combination is _ (411 tokens). Examples: .doc, s, -, (, ), Access, Analysis_0712, COMMUNICATIONS, Oct, Pricing

Relations

X nodes are attached to their parents using 19 different relations: flat (206; 41% instances), goeswith (172; 34% instances), compound (39; 8% instances), root (28; 6% instances), amod (14; 3% instances), appos (14; 3% instances), case (5; 1% instances), parataxis (5; 1% instances), conj (4; 1% instances), cc (2; 0% instances), list (2; 0% instances), nmod (2; 0% instances), obl (2; 0% instances), dep (1; 0% instances), discourse (1; 0% instances), nmod:unmarked (1; 0% instances), obj (1; 0% instances), obl:unmarked (1; 0% instances), reparandum (1; 0% instances)

Parents of X nodes belong to 11 different parts of speech: X (221; 44% instances), PROPN (87; 17% instances), NOUN (82; 16% instances), (28; 6% instances), ADJ (24; 5% instances), VERB (22; 4% instances), ADV (18; 4% instances), PRON (10; 2% instances), ADP (6; 1% instances), AUX (2; 0% instances), SCONJ (1; 0% instances)

415 (83%) X nodes are leaves.

19 (4%) X nodes have one child.

17 (3%) X nodes have two children.

50 (10%) X nodes have three or more children.

The highest child degree of a X node is 12.

Children of X nodes are attached using 12 different relations: flat (204; 61% instances), punct (97; 29% instances), compound (12; 4% instances), conj (6; 2% instances), case (4; 1% instances), list (4; 1% instances), nmod (3; 1% instances), cc (2; 1% instances), nmod:unmarked (2; 1% instances), cop (1; 0% instances), nsubj (1; 0% instances), parataxis (1; 0% instances)

Children of X nodes belong to 9 different parts of speech: X (221; 66% instances), PUNCT (97; 29% instances), NOUN (9; 3% instances), ADP (3; 1% instances), NUM (2; 1% instances), VERB (2; 1% instances), ADJ (1; 0% instances), AUX (1; 0% instances), PRON (1; 0% instances)

Treebank Statistics: UD_English-EWT: POS Tags: X

Morphology

Relations

Treebank Statistics: UD_English-EWT: POS Tags: `X`