home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Naija-NSC: POS Tags: X

There are 81 X lemmas (2%), 289 X types (5%) and 544 X tokens (0%). Out of 16 observed tags, the rank of X is: 7 in number of lemmas, 5 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: X, olorun, sannu, si, pe, saanu, Boko, Haram, ma, oloun

The 10 most frequent X types: X, s~, f~, d~, ma, wo~, b~, a~, be~, co~

The 10 most frequent ambiguous lemmas: X (X 411, INTJ 6, DET 3, NUM 1), ma (PART 28, X 3), per (ADP 6, X 3), cup (NOUN 3, X 2), dem (PRON 2220, PART 42, X 2), na (AUX 2025, X 2, ADP 1), |r (PUNCT 551, X 2), di (DET 2816, X 1), gbogbo (ADV 1, X 1), ka (PRON 1, X 1)

The 10 most frequent ambiguous types: ma (PART 28, X 8), b~ (X 7, DET 1), ba (PART 15, X 5), e (PRON 1564, X 5, DET 1), a (DET 127, PRON 22, X 3, NOUN 1), o~ (X 3, NUM 1), per (ADP 6, X 3), wa (INTJ 12, X 3), da (X 2, DET 1), de (PRON 1224, X 2)

Morphology

The form / lemma ratio of X is 3.567901 (the average of all parts of speech is 1.162049).

The 1st highest number of forms (210) was observed with the lemma “X”: Adigo, Agwan~, Alap~, Ala~, Ea~, Fren~, Fr~, Had~, Kw~, Lil~, Max~, Om~, Oria~, RI~, STP~, X, a, abf, ab~, ak~, ala, almo~, al~, anyb~, ar~, avera~, aw~, a~, ba, ban~, ba~, be~, bi~, brin~, bro~, br~, bu~, b~, ca, ca~, chea~, checkli~, chi~, ch~, cle~, conne~, con~, coun~, co~, cr~, cu~, c~, da, de~, di~, do~, du~, d~, e, eh, en~, epurutepu, etin~, ev, everyti~, everyt~, ev~, exa~, e~, fa, fai~, fe~, fini~, fin~, fi~, fore~, for~, fo~, f~, ga~, gbu~, ge, gene~, ge~, ghet~, giti, gi~, gm~, gover~, go~, gu~, g~, hav~, hel~, hip~, ho~, hub~, huma~, h~, im~, inf~, ingred~, insi~, inst~, i~, kambia, k~, lafs~, la~, le~, lit~, li~, ma, mad~, mana~, ma~, med~, me~, mil~, mir~, mon~, mor~, mow~, mo~, mu~, m~, nai~, ne, nikan, norm~, not~, no~, nso, nu, num~, n~, ogbeni, origi~, ori~, oro~, over~, o~, pala~, pa~, pelu, peo~, pers~, pe~, pik~, pi~, pla~, pol~, post, pre~, profe~, pro~, pur~, pu~, p~, re, reach, repre~, res~, re~, r~, sab~, sa~, se~, shere, sh~, sin~, sis~, si~, sle~, sm~, som~, so~, spe~, spu~, sp~, st~, su~, swe~, sy~, s~, tawon, ta~, thirt~, thou~, ti~, traffi~, tre~, tri~, tu, t~, una, under~, un~, wa, wa~, wet~, we~, wit~, wi~, wom~, wor~, wo~, wu~, w~, zaga.

The 2nd highest number of forms (2) was observed with the lemma “cup”: Cup, Cupa.

The 3rd highest number of forms (1) was observed with the lemma “Boko”: Boko.

X occurs with 6 features: ExtPos (7; 1% instances), Case (1; 0% instances), NumType (1; 0% instances), Number (1; 0% instances), Person (1; 0% instances), PronType (1; 0% instances)

X occurs with 7 feature-value pairs: Case=Nom, ExtPos=ADV, ExtPos=PROPN, NumType=Card, Number=Plur, Person=3, PronType=Prs

X occurs with 5 feature combinations. The most frequent feature combination is _ (535 tokens). Examples: X, s~, f~, d~, ma, wo~, b~, a~, be~, co~

Relations

X nodes are attached to their parents using 25 different relations: reparandum (285; 52% instances), flat:foreign (97; 18% instances), root (41; 8% instances), obj (25; 5% instances), flat (15; 3% instances), dep (11; 2% instances), nmod (9; 2% instances), obl:mod (8; 1% instances), discourse (7; 1% instances), compound (5; 1% instances), conj (5; 1% instances), fixed (4; 1% instances), nsubj (4; 1% instances), parataxis (4; 1% instances), xcomp (4; 1% instances), acl:relcl (3; 1% instances), compound:svc (3; 1% instances), obl:arg (3; 1% instances), advcl:cleft (2; 0% instances), compound:redup (2; 0% instances), parataxis:conj (2; 0% instances), parataxis:parenth (2; 0% instances), advcl (1; 0% instances), advmod (1; 0% instances), vocative (1; 0% instances)

Parents of X nodes belong to 16 different parts of speech: NOUN (158; 29% instances), VERB (135; 25% instances), X (96; 18% instances), (41; 8% instances), PROPN (30; 6% instances), ADJ (23; 4% instances), PRON (22; 4% instances), AUX (10; 2% instances), PART (7; 1% instances), ADP (6; 1% instances), NUM (6; 1% instances), ADV (4; 1% instances), CCONJ (2; 0% instances), SCONJ (2; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances)

166 (31%) X nodes are leaves.

177 (33%) X nodes have one child.

69 (13%) X nodes have two children.

132 (24%) X nodes have three or more children.

The highest child degree of a X node is 26.

Children of X nodes are attached using 33 different relations: punct (493; 55% instances), flat:foreign (112; 12% instances), det (44; 5% instances), nsubj (43; 5% instances), discourse (28; 3% instances), aux (27; 3% instances), case (21; 2% instances), nmod (17; 2% instances), cop (14; 2% instances), flat (13; 1% instances), advmod (12; 1% instances), conj (12; 1% instances), amod (11; 1% instances), reparandum (9; 1% instances), cc (6; 1% instances), dislocated (6; 1% instances), acl (4; 0% instances), nmod:poss (4; 0% instances), advcl (3; 0% instances), compound (3; 0% instances), acl:relcl (2; 0% instances), compound:redup (2; 0% instances), dep (2; 0% instances), mark (2; 0% instances), nummod (2; 0% instances), parataxis:conj (2; 0% instances), appos (1; 0% instances), compound:svc (1; 0% instances), fixed (1; 0% instances), obj (1; 0% instances), obl:mod (1; 0% instances), parataxis:discourse (1; 0% instances), xcomp (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: PUNCT (493; 55% instances), X (96; 11% instances), PRON (56; 6% instances), DET (45; 5% instances), AUX (41; 5% instances), NOUN (38; 4% instances), INTJ (21; 2% instances), ADP (18; 2% instances), VERB (18; 2% instances), ADJ (16; 2% instances), ADV (16; 2% instances), PART (13; 1% instances), SCONJ (13; 1% instances), CCONJ (8; 1% instances), PROPN (7; 1% instances), NUM (2; 0% instances)