Statistics of X in UD

This page pertains to UD version 2.

Treebank Statistics: UD_Naija-NSC: POS Tags: `X`

There are 81 X lemmas (2%), 289 X types (5%) and 544 X tokens (0%). Out of 16 observed tags, the rank of X is: 7 in number of lemmas, 5 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: X, olorun, sannu, si, pe, saanu, Boko, Haram, ma, oloun

The 10 most frequent X types: X, s~, f~, d~, ma, wo~, b~, a~, be~, co~

The 10 most frequent ambiguous lemmas: X (X 411, INTJ 6, DET 3, NUM 1), ma (PART 28, X 3), per (ADP 6, X 3), cup (NOUN 3, X 2), dem (PRON 2220, PART 42, X 2), na (AUX 2025, X 2, ADP 1), |r (PUNCT 551, X 2), di (DET 2816, X 1), gbogbo (ADV 1, X 1), ka (PRON 1, X 1)

The 10 most frequent ambiguous types: ma (PART 28, X 8), b~ (X 7, DET 1), ba (PART 15, X 5), e (PRON 1564, X 5, DET 1), a (DET 127, PRON 22, X 3, NOUN 1), o~ (X 3, NUM 1), per (ADP 6, X 3), wa (INTJ 12, X 3), da (X 2, DET 1), de (PRON 1224, X 2)

ma
- PART 28: # na dis year own ma >+ con worse pass //
- X 8: under dis government naa ni olorun ma wa saanu wa //
b~
- X 7: { b~ || but } I prefer di time wen I still dey Ondo State dere //
- DET 1: dis one o < { b~ || di } Bible does not change o //
ba
- PART 15: as in con see di kind knuckles wey full di girl nails ba !//
- X 5: won ni tawon Kini Kan ba wole { pelu ibon |c pelu bullet } ogbeni dubule //
e
- PRON 1564: # sey e wan carry me go meh e go train me //
- X 5: afojudi re e //
- DET 1: { how || I } wan know { e || di } agreement wey e go carry //
a
- DET 127: # complete almost half a year < # I no see any work do //
- PRON 22: # meh me a go for school first //
- X 3: # { I wonder || I wonder why || I wonder } # as mgbe a gasi na uwa ne eme ntughari na eziokwu true //
- NOUN 1: dey write { a b c |c and one two three } put //
o~
- X 3: o~ //
- NUM 1: # buy # { dis kashi # { { o~ || hundred } naira |c { two || two } hundred } # |c di garden egg fifty naira # |c { hip~ || spinach } fifty naira # |c di oder leaves } //
per
- ADP 6: # so # when di lecture start now < # as per di usual parade < # con see chicks //= see guys //
- X 3: everywhere < # per se < na secretariat //
wa
- INTJ 12: na wa for dat woman o !//
- X 3: under dis government naa ni olorun ma wa saanu wa //
da
- X 2: # de da &//
- DET 1: # when e say [ { da || dat } time < # di guy go ] &//
de
- PRON 1224: # den < ending of di year < # de go con dey pay us # our money //
- X 2: so dat { de || after } di wedding < you go just say [ is dat all ?//] //

Morphology

The form / lemma ratio of X is 3.567901 (the average of all parts of speech is 1.162049).
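The reported ratio is consistent with dividing the number of X types by the number of X lemmas given at the top of this page. The sketch below reproduces that arithmetic; that the statistics were actually computed this way is an assumption, and the variable names are illustrative.

```python
# Counts reported on this page for the X tag in UD_Naija-NSC.
x_types = 289   # distinct word forms tagged X
x_lemmas = 81   # distinct lemmas tagged X

# Assumption: form / lemma ratio = types / lemmas.
ratio = x_types / x_lemmas
print(f"{ratio:.6f}")  # prints 3.567901
```

A ratio this far above the all-tag average (1.162049) reflects the single lemma “X”, which collects most of the truncated and foreign forms.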

The 1st highest number of forms (210) was observed with the lemma “X”: Adigo, Agwan~, Alap~, Ala~, Ea~, Fren~, Fr~, Had~, Kw~, Lil~, Max~, Om~, Oria~, RI~, STP~, X, a, abf, ab~, ak~, ala, almo~, al~, anyb~, ar~, avera~, aw~, a~, ba, ban~, ba~, be~, bi~, brin~, bro~, br~, bu~, b~, ca, ca~, chea~, checkli~, chi~, ch~, cle~, conne~, con~, coun~, co~, cr~, cu~, c~, da, de~, di~, do~, du~, d~, e, eh, en~, epurutepu, etin~, ev, everyti~, everyt~, ev~, exa~, e~, fa, fai~, fe~, fini~, fin~, fi~, fore~, for~, fo~, f~, ga~, gbu~, ge, gene~, ge~, ghet~, giti, gi~, gm~, gover~, go~, gu~, g~, hav~, hel~, hip~, ho~, hub~, huma~, h~, im~, inf~, ingred~, insi~, inst~, i~, kambia, k~, lafs~, la~, le~, lit~, li~, ma, mad~, mana~, ma~, med~, me~, mil~, mir~, mon~, mor~, mow~, mo~, mu~, m~, nai~, ne, nikan, norm~, not~, no~, nso, nu, num~, n~, ogbeni, origi~, ori~, oro~, over~, o~, pala~, pa~, pelu, peo~, pers~, pe~, pik~, pi~, pla~, pol~, post, pre~, profe~, pro~, pur~, pu~, p~, re, reach, repre~, res~, re~, r~, sab~, sa~, se~, shere, sh~, sin~, sis~, si~, sle~, sm~, som~, so~, spe~, spu~, sp~, st~, su~, swe~, sy~, s~, tawon, ta~, thirt~, thou~, ti~, traffi~, tre~, tri~, tu, t~, una, under~, un~, wa, wa~, wet~, we~, wit~, wi~, wom~, wor~, wo~, wu~, w~, zaga.

The 2nd highest number of forms (2) was observed with the lemma “cup”: Cup, Cupa.

The 3rd highest number of forms (1) was observed with the lemma “Boko”: Boko.

X occurs with 6 features: ExtPos (7; 1% instances), Case (1; 0% instances), NumType (1; 0% instances), Number (1; 0% instances), Person (1; 0% instances), PronType (1; 0% instances)

X occurs with 7 feature-value pairs: Case=Nom, ExtPos=ADV, ExtPos=PROPN, NumType=Card, Number=Plur, Person=3, PronType=Prs

X occurs with 5 feature combinations. The most frequent feature combination is _ (535 tokens). Examples: X, s~, f~, d~, ma, wo~, b~, a~, be~, co~

Relations

X nodes are attached to their parents using 25 different relations: reparandum (285; 52% instances), flat:foreign (97; 18% instances), root (41; 8% instances), obj (25; 5% instances), flat (15; 3% instances), dep (11; 2% instances), nmod (9; 2% instances), obl:mod (8; 1% instances), discourse (7; 1% instances), compound (5; 1% instances), conj (5; 1% instances), fixed (4; 1% instances), nsubj (4; 1% instances), parataxis (4; 1% instances), xcomp (4; 1% instances), acl:relcl (3; 1% instances), compound:svc (3; 1% instances), obl:arg (3; 1% instances), advcl:cleft (2; 0% instances), compound:redup (2; 0% instances), parataxis:conj (2; 0% instances), parataxis:parenth (2; 0% instances), advcl (1; 0% instances), advmod (1; 0% instances), vocative (1; 0% instances)
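Counts like these can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, assuming the standard 10-column CoNLL-U layout (UPOS in column 4, DEPREL in column 8); the function name `deprel_counts` and the toy sentence are hypothetical and are not taken from the treebank.

```python
from collections import Counter

def deprel_counts(conllu_text, upos="X"):
    """Count the relations that attach tokens with the given UPOS to their parents."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a well-formed token line
        token_id, tag, deprel = cols[0], cols[3], cols[7]
        if "-" in token_id or "." in token_id:
            continue  # skip multiword-token ranges and empty nodes
        if tag == upos:
            counts[deprel] += 1
    return counts

# Hypothetical toy annotation: (ID, FORM, LEMMA, UPOS, HEAD, DEPREL).
rows = [
    ("1", "b~", "X", "X", "2", "reparandum"),
    ("2", "but", "but", "CCONJ", "4", "cc"),
    ("3", "I", "I", "PRON", "4", "nsubj"),
    ("4", "prefer", "prefer", "VERB", "0", "root"),
]
sample = "\n".join(
    "\t".join([i, form, lemma, tag, "_", "_", head, rel, "_", "_"])
    for i, form, lemma, tag, head, rel in rows
)

print(deprel_counts(sample))
```

Dividing each count by the total number of X tokens (544 on this page) would give the percentages listed above.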

Parents of X nodes belong to 16 different parts of speech: NOUN (158; 29% instances), VERB (135; 25% instances), X (96; 18% instances), the artificial root node, which has no part of speech (41; 8% instances), PROPN (30; 6% instances), ADJ (23; 4% instances), PRON (22; 4% instances), AUX (10; 2% instances), PART (7; 1% instances), ADP (6; 1% instances), NUM (6; 1% instances), ADV (4; 1% instances), CCONJ (2; 0% instances), SCONJ (2; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances)

166 (31%) X nodes are leaves.

177 (33%) X nodes have one child.

69 (13%) X nodes have two children.

132 (24%) X nodes have three or more children.

The highest child degree of an X node is 26.
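The child-degree percentages above can be checked against the 544 X tokens reported at the top of the page. This is a small sketch of that check; the bucket labels are illustrative, and rounding to the nearest whole percent is an assumption about how the page was generated.

```python
# Child-degree counts for X nodes, as reported on this page.
buckets = {
    "leaves": 166,
    "one child": 177,
    "two children": 69,
    "three or more children": 132,
}

total = sum(buckets.values())  # should equal the number of X tokens

# Assumption: percentages are rounded to the nearest integer.
shares = {name: round(100 * n / total) for name, n in buckets.items()}

print(total)
for name, pct in shares.items():
    print(f"{name}: {pct}%")
```

The four buckets sum to 544, so every X token falls into exactly one of them.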

Children of X nodes are attached using 33 different relations: punct (493; 55% instances), flat:foreign (112; 12% instances), det (44; 5% instances), nsubj (43; 5% instances), discourse (28; 3% instances), aux (27; 3% instances), case (21; 2% instances), nmod (17; 2% instances), cop (14; 2% instances), flat (13; 1% instances), advmod (12; 1% instances), conj (12; 1% instances), amod (11; 1% instances), reparandum (9; 1% instances), cc (6; 1% instances), dislocated (6; 1% instances), acl (4; 0% instances), nmod:poss (4; 0% instances), advcl (3; 0% instances), compound (3; 0% instances), acl:relcl (2; 0% instances), compound:redup (2; 0% instances), dep (2; 0% instances), mark (2; 0% instances), nummod (2; 0% instances), parataxis:conj (2; 0% instances), appos (1; 0% instances), compound:svc (1; 0% instances), fixed (1; 0% instances), obj (1; 0% instances), obl:mod (1; 0% instances), parataxis:discourse (1; 0% instances), xcomp (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: PUNCT (493; 55% instances), X (96; 11% instances), PRON (56; 6% instances), DET (45; 5% instances), AUX (41; 5% instances), NOUN (38; 4% instances), INTJ (21; 2% instances), ADP (18; 2% instances), VERB (18; 2% instances), ADJ (16; 2% instances), ADV (16; 2% instances), PART (13; 1% instances), SCONJ (13; 1% instances), CCONJ (8; 1% instances), PROPN (7; 1% instances), NUM (2; 0% instances)
