NUM
: numeral
Definition
A numeral is a word, functioning most typically as a determiner, adjective or pronoun, that expresses a number and a relation to the number, such as quantity, sequence, frequency or fraction.
Examples
- 0, 1, 2, 3, 4, 5, 2014, 1000000, 3.14159265359
- tre “three”, femtito “fifty-two”, fire-fem “four-five”, tusen “thousand”
Treebank Statistics (UD_Norwegian)
There are 688 NUM
lemmas (3%), 693 NUM
types (2%) and 3962 NUM
tokens (1%).
Out of 17 observed tags, the rank of NUM
is: 5 in number of lemmas, 5 in number of types and 14 in number of tokens.
The 10 most frequent NUM
lemmas: to, tre, én, fire, eneste, 2, fem, ti, 20, seks
The 10 most frequent NUM
types: to, tre, fire, eneste, ett, 2, fem, ti, 20, seks
The 10 most frequent ambiguous lemmas: to (NUM 356, X 11), tre (NUM 174, NOUN 10, VERB 7), 2 (NUM 84, PROPN 1), 3 (NUM 51, PROPN 1), hundre (NUM 15, NOUN 13), null (NUM 11, NOUN 1), tusen (NOUN 25, NUM 11), 32 (NUM 7, X 1), 34 (NUM 5, X 1), fire-fem (NUM 3, DET 1)
The 10 most frequent ambiguous types: to (NUM 331, X 11), tre (NUM 155, VERB 3, NOUN 2), ett (NUM 86, X 1, DET 1), 2 (NUM 83, PROPN 1), 3 (NUM 51, PROPN 1), hundre (NUM 15, NOUN 13), null (NUM 11, NOUN 1), tusen (NOUN 22, NUM 8), 32 (NUM 7, X 1), 34 (NUM 5, X 1)
- to
- tre
- ett
- 2
- 3
- hundre
- null
- tusen
- 32
- 34
Morphology
The form / lemma ratio of NUM
is 1.007267 (the average of all parts of speech is 1.382778).
The 1st highest number of forms (3) was observed with the lemma “én”: ett, Èn, én.
The 2nd highest number of forms (2) was observed with the lemma “2”: 2, 2s.
The 3rd highest number of forms (2) was observed with the lemma “2011”: 2011, 2011s.
NUM
occurs with 5 features: NumType (3962; 100% instances), Number (3759; 95% instances), Gender (164; 4% instances), Definite (137; 3% instances), Case (3; 0% instances)
NUM
occurs with 8 feature-value pairs: Case=Gen
, Definite=Def
, Gender=Fem
, Gender=Masc
, Gender=Neut
, NumType=Card
, Number=Plur
, Number=Sing
NUM
occurs with 9 feature combinations.
The most frequent feature combination is Number=Plur|NumType=Card
(3519 tokens).
Examples: to, tre, fire, 2, fem, ti, 20, seks, 3, 50
Relations
NUM
nodes are attached to their parents using 18 different relations: nummod (2300; 58% instances), nmod (1069; 27% instances), conj (139; 4% instances), name (86; 2% instances), nsubj (76; 2% instances), compound (64; 2% instances), root (54; 1% instances), dobj (47; 1% instances), appos (35; 1% instances), xcomp (34; 1% instances), remnant (16; 0% instances), advcl (12; 0% instances), acl (8; 0% instances), nsubjpass (8; 0% instances), parataxis (8; 0% instances), acl:relcl (4; 0% instances), csubj (1; 0% instances), dislocated (1; 0% instances)
Parents of NUM
nodes belong to 10 different parts of speech: NOUN (2739; 69% instances), VERB (569; 14% instances), PROPN (255; 6% instances), NUM (208; 5% instances), ADJ (109; 3% instances), ROOT (54; 1% instances), DET (9; 0% instances), ADP (7; 0% instances), PRON (7; 0% instances), ADV (5; 0% instances)
2455 (62%) NUM
nodes are leaves.
943 (24%) NUM
nodes have one child.
385 (10%) NUM
nodes have two children.
179 (5%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 11.
Children of NUM
nodes are attached using 23 different relations: case (778; 32% instances), punct (462; 19% instances), nmod (351; 14% instances), advmod (262; 11% instances), conj (156; 6% instances), det (83; 3% instances), cc (80; 3% instances), cop (58; 2% instances), compound (55; 2% instances), nsubj (53; 2% instances), acl:relcl (31; 1% instances), mark (25; 1% instances), appos (10; 0% instances), amod (8; 0% instances), advcl (6; 0% instances), acl (5; 0% instances), expl (4; 0% instances), name (4; 0% instances), xcomp (4; 0% instances), aux (2; 0% instances), dobj (1; 0% instances), neg (1; 0% instances), parataxis (1; 0% instances)
Children of NUM
nodes belong to 15 different parts of speech: ADP (803; 33% instances), PUNCT (462; 19% instances), NOUN (306; 13% instances), NUM (208; 9% instances), ADV (178; 7% instances), ADJ (108; 4% instances), VERB (99; 4% instances), CONJ (80; 3% instances), DET (79; 3% instances), PROPN (49; 2% instances), PRON (48; 2% instances), SCONJ (15; 1% instances), AUX (2; 0% instances), SYM (2; 0% instances), PART (1; 0% instances)
NUM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]