home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Naija-NSC: POS Tags: NUM

There are 39 NUM lemmas (1%), 39 NUM types (1%) and 1367 NUM tokens (1%). Out of 15 observed tags, the rank of NUM is: 10 in number of lemmas, 11 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: one, two, five, three, hundred, thousand, twenty, six, seven, fifty

The 10 most frequent NUM types: one, two, five, three, hundred, thousand, twenty, six, seven, fifty

The 10 most frequent ambiguous lemmas: one (NOUN 434, NUM 234, DET 232, PRON 3, ADJ 1), sixteen (NUM 4, NOUN 1), k (NUM 2, NOUN 1), o (PART 700, NUM 1), X (X 411, INTJ 6, DET 3, NUM 1)

The 10 most frequent ambiguous types: one (NOUN 411, NUM 233, DET 232, PRON 3, ADJ 1), sixteen (NUM 4, NOUN 1), k (NOUN 1, NUM 1), o (PART 699, NUM 1), o~ (X 3, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.162376).

The 1st highest number of forms (1) was observed with the lemma “101”: 101.

The 2nd highest number of forms (1) was observed with the lemma “O”: O.

The 3rd highest number of forms (1) was observed with the lemma “X”: o~.

NUM occurs with 1 features: NumType (1305; 95% instances)

NUM occurs with 1 feature-value pairs: NumType=Card

NUM occurs with 2 feature combinations. The most frequent feature combination is NumType=Card (1305 tokens). Examples: one, two, five, three, hundred, thousand, twenty, six, seven, fifty

Relations

NUM nodes are attached to their parents using 25 different relations: nummod (556; 41% instances), flat (330; 24% instances), conj (149; 11% instances), obj (58; 4% instances), root (54; 4% instances), nmod (42; 3% instances), obl:mod (40; 3% instances), nsubj (34; 2% instances), obl:arg (16; 1% instances), compound:redup (15; 1% instances), dislocated (14; 1% instances), xcomp (14; 1% instances), appos (11; 1% instances), reparandum (11; 1% instances), ccomp (9; 1% instances), acl:relcl (4; 0% instances), parataxis:parenth (2; 0% instances), advcl (1; 0% instances), advcl:cleft (1; 0% instances), compound:prt (1; 0% instances), discourse (1; 0% instances), nsubj:outer (1; 0% instances), parataxis (1; 0% instances), parataxis:conj (1; 0% instances), parataxis:dislocated (1; 0% instances)

Parents of NUM nodes belong to 12 different parts of speech: NOUN (571; 42% instances), NUM (503; 37% instances), VERB (173; 13% instances), (54; 4% instances), PROPN (37; 3% instances), ADJ (11; 1% instances), ADV (6; 0% instances), PRON (5; 0% instances), AUX (3; 0% instances), X (2; 0% instances), ADP (1; 0% instances), PART (1; 0% instances)

744 (54%) NUM nodes are leaves.

304 (22%) NUM nodes have one child.

117 (9%) NUM nodes have two children.

202 (15%) NUM nodes have three or more children.

The highest child degree of a NUM node is 15.

Children of NUM nodes are attached using 30 different relations: dep (460; 31% instances), flat (348; 24% instances), conj (154; 10% instances), case (77; 5% instances), advmod (65; 4% instances), cc (65; 4% instances), nmod (62; 4% instances), cop (51; 3% instances), discourse (35; 2% instances), reparandum (23; 2% instances), dislocated (19; 1% instances), nsubj (17; 1% instances), advcl:cleft (15; 1% instances), compound:redup (15; 1% instances), obl:mod (14; 1% instances), det (11; 1% instances), mark (11; 1% instances), acl:relcl (9; 1% instances), advcl (5; 0% instances), amod (4; 0% instances), appos (4; 0% instances), parataxis:parenth (3; 0% instances), aux (2; 0% instances), acl (1; 0% instances), compound (1; 0% instances), expl:subj (1; 0% instances), obl:arg (1; 0% instances), parataxis:conj (1; 0% instances), parataxis:discourse (1; 0% instances), vocative (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: NUM (503; 34% instances), X (457; 31% instances), ADP (106; 7% instances), NOUN (101; 7% instances), CCONJ (59; 4% instances), ADV (56; 4% instances), AUX (53; 4% instances), VERB (35; 2% instances), PRON (27; 2% instances), INTJ (23; 2% instances), DET (13; 1% instances), PROPN (13; 1% instances), SCONJ (12; 1% instances), PART (10; 1% instances), ADJ (8; 1% instances)