
This page pertains to UD version 2.

Treebank Statistics: UD_Naija-NSC: POS Tags: NUM

There are 39 NUM lemmas (1%), 39 NUM types (1%) and 1367 NUM tokens (1%). Out of 15 observed tags, the rank of NUM is: 10 in number of lemmas, 11 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: one, two, five, three, hundred, thousand, twenty, six, seven, fifty

The 10 most frequent NUM types: one, two, five, three, hundred, thousand, twenty, six, seven, fifty

The 10 most frequent ambiguous lemmas: one (NOUN 435, NUM 234, DET 232, PRON 3, ADJ 1), sixteen (NUM 4, NOUN 1), k (NUM 2, NOUN 1), o (PART 700, NUM 1), X (X 410, INTJ 6, DET 3, NUM 1)

The 10 most frequent ambiguous types: one (NOUN 411, NUM 233, DET 232, PRON 3, ADJ 1), sixteen (NUM 4, NOUN 1), k (NOUN 1, NUM 1), o (PART 699, NUM 1), o~ (X 3, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.163032).

The 1st highest number of forms (1) was observed with the lemma “101”: 101.

The 2nd highest number of forms (1) was observed with the lemma “O”: O.

The 3rd highest number of forms (1) was observed with the lemma “X”: o~.
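The form / lemma ratio above can be reproduced with a short sketch. It assumes the ratio is the number of distinct (lemma, form) pairs observed with a tag, divided by the number of distinct lemmas with that tag; the exact definition used by the statistics generator is not stated on this page. The function name form_lemma_ratio and the toy tokens are hypothetical and are not taken from the treebank.

```python
from collections import defaultdict

def form_lemma_ratio(tokens, upos):
    """Average number of distinct forms per lemma for one UPOS tag.

    tokens: iterable of (form, lemma, upos) triples.
    Assumption: ratio = distinct (lemma, form) pairs / distinct lemmas.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma, tag in tokens:
        if tag == upos:
            forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return 0.0
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)

# Toy data for illustration only (not real treebank counts).
sample = [
    ("one", "one", "NUM"),
    ("two", "two", "NUM"),
    ("o~", "X", "NUM"),
    ("one", "one", "NOUN"),
    ("ones", "one", "NOUN"),
]

print(form_lemma_ratio(sample, "NUM"))   # 1.0: every NUM lemma has one form
print(form_lemma_ratio(sample, "NOUN"))  # 2.0: the NOUN lemma "one" has two forms
```

A ratio of exactly 1.000000, as reported for NUM, means that under this definition no NUM lemma was observed with more than one form.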

NUM occurs with 1 feature: NumType (1305; 95% instances)

NUM occurs with 1 feature-value pair: NumType=Card

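NUM occurs with 2 feature combinations. The most frequent feature combination is NumType=Card (1305 tokens). Examples: one, two, five, three, hundred, thousand, twenty, six, seven, fifty. The other combination is the empty feature set, which covers the remaining 62 of the 1367 NUM tokens.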

Relations

NUM nodes are attached to their parents using 24 different relations: nummod (555; 41% instances), flat (330; 24% instances), conj (148; 11% instances), obj (55; 4% instances), root (53; 4% instances), nmod (42; 3% instances), obl:mod (39; 3% instances), nsubj (35; 3% instances), reparandum (20; 1% instances), compound:redup (15; 1% instances), dislocated (14; 1% instances), obl:arg (14; 1% instances), xcomp (14; 1% instances), appos (11; 1% instances), ccomp (8; 1% instances), acl:relcl (4; 0% instances), iobj (3; 0% instances), advcl (1; 0% instances), advcl:cleft (1; 0% instances), compound:prt (1; 0% instances), discourse (1; 0% instances), parataxis (1; 0% instances), parataxis:conj (1; 0% instances), parataxis:parenth (1; 0% instances)

Parents of NUM nodes belong to 13 different parts of speech: NOUN (570; 42% instances), NUM (503; 37% instances), VERB (169; 12% instances), [root, no parent] (53; 4% instances), PROPN (37; 3% instances), ADJ (11; 1% instances), ADV (6; 0% instances), PRON (5; 0% instances), X (5; 0% instances), ADP (3; 0% instances), AUX (3; 0% instances), DET (1; 0% instances), PART (1; 0% instances)
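Counts like the relation and parent statistics above can be derived from a CoNLL-U file with a few lines of code. The following is a minimal sketch, not the script that produced this page: the helper names parse_conllu and relation_and_parent_counts are hypothetical, the parser reads only the ID, UPOS, HEAD and DEPREL columns, and the one-sentence sample is invented for illustration. A NUM token whose HEAD is 0 has no parent word, so the sketch labels that case ROOT.

```python
from collections import Counter

def parse_conllu(text):
    """Yield sentences as lists of (id, upos, head, deprel) tuples."""
    sentence = []
    for line in text.splitlines():
        line = line.strip()
        if not line:                      # blank line ends a sentence
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):          # sentence-level comment
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():         # skip ranges (1-2) and empty nodes (1.1)
            continue
        sentence.append((int(cols[0]), cols[3], int(cols[6]), cols[7]))
    if sentence:
        yield sentence

def relation_and_parent_counts(text, upos="NUM"):
    """Count incoming relations and parent UPOS tags for nodes tagged `upos`."""
    relations, parents = Counter(), Counter()
    for sentence in parse_conllu(text):
        upos_by_id = {tid: tag for tid, tag, _, _ in sentence}
        for _, tag, head, deprel in sentence:
            if tag == upos:
                relations[deprel] += 1
                parents[upos_by_id.get(head, "ROOT")] += 1
    return relations, parents

# Invented sample sentence, for illustration only.
rows = [
    ("1", "I", "I", "PRON", "_", "_", "2", "nsubj", "_", "_"),
    ("2", "buy", "buy", "VERB", "_", "_", "0", "root", "_", "_"),
    ("3", "two", "two", "NUM", "_", "NumType=Card", "4", "nummod", "_", "_"),
    ("4", "book", "book", "NOUN", "_", "_", "2", "obj", "_", "_"),
]
sample = "\n".join("\t".join(row) for row in rows) + "\n"

relations, parents = relation_and_parent_counts(sample)
print(dict(relations), dict(parents))  # {'nummod': 1} {'NOUN': 1}
```

The percentages on this page are consistent with dividing each count by the 1367 NUM tokens and rounding to a whole number, for example 555 / 1367 for nummod gives 41%.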

744 (54%) NUM nodes are leaves.

308 (23%) NUM nodes have one child.

111 (8%) NUM nodes have two children.

204 (15%) NUM nodes have three or more children.

The highest child degree of a NUM node is 15.
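The child-degree breakdown above (leaves, one child, two children, three or more) can be sketched as a histogram over the number of tokens whose HEAD points at each NUM node. This is an illustrative sketch under stated assumptions: child_degree_histogram is a hypothetical helper, sentences are assumed to be pre-parsed into (id, upos, head) triples, and the toy sentence is invented. The final check uses only the four bucket counts reported on this page.

```python
from collections import Counter

def child_degree_histogram(sentences, upos="NUM"):
    """Bucket nodes tagged `upos` by number of children: 0, 1, 2, or 3 (= three or more).

    sentences: iterable of lists of (id, upos, head) triples.
    """
    histogram = Counter()
    for sentence in sentences:
        # Number of tokens attached to each head id within this sentence.
        children = Counter(head for _, _, head in sentence)
        for tid, tag, _ in sentence:
            if tag == upos:
                histogram[min(children[tid], 3)] += 1
    return histogram

# Invented toy sentence: token 1 heads four other tokens.
toy = [[(1, "NUM", 0), (2, "NUM", 1), (3, "NUM", 1), (4, "NOUN", 1), (5, "NUM", 1)]]
hist = child_degree_histogram(toy)
print(dict(hist))  # {3: 1, 0: 3}

# The four buckets reported on this page add up to the 1367 NUM tokens.
page_buckets = {"leaf": 744, "one": 308, "two": 111, "three_or_more": 204}
assert sum(page_buckets.values()) == 1367
```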

Children of NUM nodes are attached using 30 different relations: dep (459; 31% instances), flat (348; 24% instances), conj (154; 11% instances), case (77; 5% instances), advmod (65; 4% instances), cc (65; 4% instances), nmod (64; 4% instances), cop (51; 3% instances), discourse (36; 2% instances), dislocated (19; 1% instances), nsubj (17; 1% instances), advcl:cleft (15; 1% instances), compound:redup (15; 1% instances), obl:mod (13; 1% instances), det (11; 1% instances), mark (11; 1% instances), reparandum (11; 1% instances), acl:relcl (9; 1% instances), advcl (5; 0% instances), amod (4; 0% instances), appos (4; 0% instances), parataxis:parenth (3; 0% instances), aux (2; 0% instances), acl (1; 0% instances), compound (1; 0% instances), expl:subj (1; 0% instances), obl:arg (1; 0% instances), parataxis:conj (1; 0% instances), parataxis:discourse (1; 0% instances), vocative (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: NUM (503; 34% instances), X (453; 31% instances), ADP (102; 7% instances), NOUN (100; 7% instances), CCONJ (59; 4% instances), ADV (56; 4% instances), AUX (53; 4% instances), VERB (32; 2% instances), PRON (28; 2% instances), INTJ (24; 2% instances), PROPN (13; 1% instances), DET (12; 1% instances), SCONJ (12; 1% instances), PART (10; 1% instances), ADJ (8; 1% instances)