NumType: numeral type
Some languages (especially Slavic) have a complex system of numerals. For example, in the school grammar of Czech, the main part of speech is “numeral”, it includes almost everything where counting is involved and there are various subtypes. It also includes interrogative, relative, indefinite and demonstrative words referring to numbers (words like kolik / how many, tolik / so many, několik / some, a few), so at the same time we may have a non-empty value of PronType. (In English, these words are called quantifiers and they are considered a subgroup of determiners.)
From the syntactic point of view, some numtypes behave like adjectives
and some behave like adverbs. We tag them u-pos/ADJ and
u-pos/ADV respectively. Thus the
NumType feature applies to
several different parts of speech:
- u-pos/NUM: cardinal numerals
- u-pos/DET: quantifiers
- u-pos/ADJ: definite adjectival, e.g. ordinal numerals
- u-pos/ADV: adverbial (e.g. ordinal and multiplicative) numerals, both definite and pronominal
Card: cardinal number or corresponding interrogative / relative / indefinite / demonstrative word
Note that in some Indo-European languages there is a fuzzy borderline between numerals and nouns for thousand, million and billion.
- [en] one, two, three
- [cs] jeden, dva, tři “one, two, three”; kolik “how many”; několik “some”; tolik “so many”; mnoho “many”; málo “few”
- [cs] čtvero, patero, desatero (specific forms of four, five, ten;
they are morphologically, syntactically and stylistically distinct from the
default forms čtyři, pět, deset; in Czech grammar they are classified
as “generic numerals”, which also encompasses some other rare types;
Cardis the closest match for them among the universal types.
Ord: ordinal number or corresponding interrogative / relative / indefinite / demonstrative word
This is a subtype of adjective or (in some languages) of adverb.
- [en] first, second, third;
- [cs] adjectival: první “first”; druhý “second”, třetí “third”; kolikátý lit. how manieth “which rank”; několikátý “some rank”; tolikátý “this/that rank”
- [cs] adverbial: poprvé “for the first time”; podruhé “for the second time”; potřetí “for the third time”; pokolikáté “for which time”, poněkolikáté “for x-th time”, potolikáté
Mult: multiplicative numeral or corresponding interrogative / relative / indefinite / demonstrative word
This is subtype of adjective or adverb.
- [sl] dvojen “double, twofold”; trojen “triple, threefold”; četveren “fourfold”
- [cs] dvojí “twofold”; trojí “threefold” (multiplicative adjectives)
- [cs] jednou “once”; dvakrát “twice”; třikrát “three times”; kolikrát “how many times”, několikrát “several times”; tolikrát “so many times” (multiplicative adverbs)
This is a subtype of cardinal numbers, occasionally distinguished in corpora. It may denote a fraction or just the denominator of the fraction. In various languages these words may behave morphologically and syntactically as nouns or ordinal numerals.
- [en] three-quarters
- [cs] půl / polovina “half”; třetina “one third”; čtvrt / čtvrtina “quarter”
Sets: number of sets of things; collective numeral
Morphologically distinct class of numerals used to count sets of things, or nouns that are pluralia tantum. Some authors call this type collective numeral.
- [cs] dvoje / troje boty “two / three [pairs of] shoes”; as opposed to normal cardinal numbers: dvě / tři boty “two / three shoes”
Dist: distributive numeral
Used to express that the same quantity is distributed to each member in a set of targets.
- [hu] három-három in gyermekenként három-három ezer forinttal “three thousand forint per child”
Range: range of values
This could be considered a subtype of cardinal numbers, occasionally distinguished in corpora.
- [en] two-five “two to five” (provided tokenization leaves it as one token.)
NumType in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [u] [ug] [uk] [ur] [urj] [vi] [yue] [zh]