Number: number
Number is an inflectional feature of nouns and of other parts of speech (adjectives, verbs) that mark agreement with nouns.
Sing: singular number
A singular noun denotes one person, animal or thing.
Examples
- старий чоловік прийшов “an old man came”
- молода жінка прийшла “a young woman came”
- мале курча “a small chicken”
Plur: plural number
A plural noun denotes several persons, animals or things.
Examples
- старі чоловіки (жінки) прийшли “old men (women) came”
Ptan: plurale tantum
Some nouns appear only in the plural form even though they denote one thing (semantic singular); some tagsets mark this distinction. Grammatically they behave like plurals, so Plur is the natural back-off value here; however, because no singular form exists, the gender is sometimes unknown. In Ukrainian, a special type of numeral is used when counting plurale tantum nouns (NumType=Sets).
Examples
- ножиці, штани “scissors, pants”
Coll: collective / mass / singulare tantum
Collective, mass, or singulare tantum is a special case of singular. It applies to words that use the grammatical singular to describe sets of objects, i.e. a semantic plural. Although in theory they might be able to form a plural, in practice it would rarely be semantically plausible. Sometimes the plural form exists and means “several sorts of” or “several packages of”.
Examples
- людство “mankind”
Diffs
Ukrainian Dependency Treebank
The UDT tagset does not distinguish Ptan from Plur or Coll from Sing, so these distinctions are not made in the converted data.
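The back-off relationship described above (Ptan behaves like Plur, Coll is a special case of Sing) can be sketched as a small mapping over a CoNLL-U FEATS string. This is only an illustration: the helper name `back_off_number` and the `BACKOFF` table are hypothetical and are not part of any UD tool or of the treebank conversion.

```python
# Hypothetical helper, not part of any UD tool: collapse fine-grained
# Number values to the back-off values described in the text.
BACKOFF = {"Ptan": "Plur", "Coll": "Sing"}


def back_off_number(feats: str) -> str:
    """Return a FEATS string with Number replaced by its back-off value."""
    if feats == "_":  # CoNLL-U uses "_" for an empty FEATS column
        return feats
    items = []
    for item in feats.split("|"):
        name, sep, value = item.partition("=")
        if sep and name == "Number":
            value = BACKOFF.get(value, value)  # Sing and Plur pass through
            item = f"{name}={value}"
        items.append(item)
    return "|".join(items)
```

For example, `back_off_number("Case=Nom|Number=Ptan")` yields `"Case=Nom|Number=Plur"`, while a string that already uses Sing or Plur is returned unchanged.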
Treebank Statistics (UD_Ukrainian)
This feature is universal.
It occurs with 2 different values: Plur, Sing.
312 tokens (19%) have a non-empty value of Number.
204 types (29%) occur at least once with a non-empty value of Number.
157 lemmas (26%) occur at least once with a non-empty value of Number.
The feature is used with 8 part-of-speech tags: VERB (143; 9% instances), NOUN (79; 5% instances), PRON (49; 3% instances), ADJ (14; 1% instances), DET (11; 1% instances), NUM (11; 1% instances), ADV (4; 0% instances), PROPN (1; 0% instances).
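Per-tag counts of this kind could be gathered from a CoNLL-U file roughly as follows. This is a minimal sketch using only the standard library; the function names are hypothetical, and the three sample rows are an illustrative annotation of the example phrase from this page, not data taken from the treebank.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict ('_' means no features)."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def number_counts_by_upos(conllu_text):
    """Count, per UPOS tag, all tokens and the tokens that carry Number."""
    with_number, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        upos = cols[3]
        total[upos] += 1
        if "Number" in parse_feats(cols[5]):
            with_number[upos] += 1
    return with_number, total


# Illustrative rows only (assumed annotation, not copied from UD_Ukrainian).
SAMPLE_ROWS = [
    ("1", "старі", "старий", "ADJ", "_", "Case=Nom|Number=Plur", "2", "amod", "_", "_"),
    ("2", "чоловіки", "чоловік", "NOUN", "_", "Case=Nom|Number=Plur", "3", "nsubj", "_", "_"),
    ("3", "прийшли", "прийти", "VERB", "_", "_", "0", "root", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in SAMPLE_ROWS)

with_number, total = number_counts_by_upos(SAMPLE)
```

Dividing `with_number[tag]` by `total[tag]` gives the share of a tag's tokens that carry Number, which is the kind of percentage reported in the per-tag sections below.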
VERB
143 VERB tokens (45% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (143; 100%), Gender=EMPTY (143; 100%), Mood=Ind (130; 91%), Aspect=Imp (123; 86%), Tense=Pres (104; 73%), Person=3 (88; 62%).
VERB tokens may have the following values of Number:
- Plur (47; 33% of non-empty Number): навчають, Кажуть, мусять, Почекайте, Сподіваємося, запилюють, запилюються, запізнилися, змінять, мають
- Sing (96; 67% of non-empty Number): є, каже, плаває, буде, любиш, сплю, Іди, Вважаю, Запитаю, думай
- EMPTY (176): був, пішов, сказав, було, зберігати, зароблено, копати, плавати, розмовляти, чекати
| Paradigm бути | Sing | Plur |
|---|---|---|
| Person=3\|Tense=Fut | буде | |
| Person=3\|Tense=Pres | є | |
| Tense=Past | | були |
Number seems to be a lexical feature of VERB. 92% of lemmas (59) occur with only one value of Number.
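The "lexical feature" heuristic rests on the share of lemmas that are attested with a single Number value. A minimal sketch of that computation is given below; the function name is hypothetical, the source does not say how its tool counts lemmas, and the lemma assignments in the sample are illustrative rather than taken from the treebank.

```python
from collections import defaultdict


def single_value_lemma_share(pairs):
    """Given (lemma, Number value) pairs, return (share, single, total),
    where `single` counts lemmas seen with exactly one Number value."""
    values = defaultdict(set)
    for lemma, number in pairs:
        if number:  # ignore tokens with an empty Number
            values[lemma].add(number)
    total = len(values)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    share = single / total if total else 0.0
    return share, single, total


# Illustrative pairs: бути is attested with both values, the others with one.
SAMPLE_PAIRS = [
    ("бути", "Sing"),
    ("бути", "Plur"),
    ("плавати", "Sing"),
    ("навчати", "Plur"),
    ("чекати", None),  # empty Number, not counted
]

share, single, total = single_value_lemma_share(SAMPLE_PAIRS)
```

In this sample two of the three lemmas with a non-empty Number occur with only one value. A high share in a small treebank may simply reflect that most lemmas are attested once, so the heuristic should be read with that in mind.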
NOUN
79 NOUN tokens (31% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Animacy=Inan (53; 67%).
NOUN tokens may have the following values of Number:
- Plur (79; 100% of non-empty Number): доларів, квіти, груші, дітей, бджолами, бджоли, гривень, математики, проблеми, хлопців
- EMPTY (179): спокій, хлопець, Начальник, футболіст, модель, яблуку, Село, брат, водій, вокзалі
Number seems to be a lexical feature of NOUN. 100% of lemmas (58) occur with only one value of Number.
PRON
49 PRON tokens (32% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: PronType=Prs (49; 100%), Gender=EMPTY (49; 100%), Animacy=Anim (39; 80%).
PRON tokens may have the following values of Number:
- Plur (21; 43% of non-empty Number): вони, ми, нас, них, Їх, ви, нами, вас, їм
- Sing (28; 57% of non-empty Number): ти, я, мені, мене, тебе, тобі
- EMPTY (104): він, це, його, вона, що, дехто, її, те, все, ніхто
ADJ
14 ADJ tokens (15% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Gender=EMPTY (14; 100%), NumType=EMPTY (14; 100%), Degree=EMPTY (12; 86%), Voice=EMPTY (11; 79%), VerbForm=EMPTY (11; 79%), Aspect=EMPTY (11; 79%).
ADJ tokens may have the following values of Number:
- Plur (14; 100% of non-empty Number): Бородаті, Кольорові, Об’єднаних, Українські, американські, даними, делікатних, зловлені, злотих, незнайомими
- EMPTY (78): Важливим, перший, швидший, далекому, мила, минулому, першим, повинен, розташоване, чесний
Number seems to be a lexical feature of ADJ. 100% of lemmas (13) occur with only one value of Number.
DET
11 DET tokens (32% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: Gender=EMPTY (11; 100%), Person=EMPTY (9; 82%), Reflex=EMPTY (9; 82%), Poss=EMPTY (7; 64%).
DET tokens may have the following values of Number:
- Plur (11; 100% of non-empty Number): Ті, Ваші, своїми, своїх, такими, усіх, цих, іншими, її
- EMPTY (23): свою, той, його, цей, Котра, Котру, Та, Ця, кожному, котрого
NUM
11 NUM tokens (30% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (11; 100%), Gender=EMPTY (11; 100%).
NUM tokens may have the following values of Number:
- Plur (11; 100% of non-empty Number): П’ять, багатьма, Сім, Сімом, багатьох, три, четверо, чотири, чотирма
- EMPTY (26): 50, мільйонів, 5, 200, 3, 8, дві, 1, 14, 2016
ADV
4 ADV tokens (4% of all ADV tokens) have a non-empty value of Number.
The most frequent other feature values with which ADV and Number co-occurred: Degree=EMPTY (4; 100%).
ADV tokens may have the following values of Number:
- Plur (4; 100% of non-empty Number): скільки, багато, стільки
- EMPTY (101): непогано, важливо, вже, раніше, коли, завтра, так, тоді, треба, ще
PROPN
1 PROPN token (2% of all PROPN tokens) has a non-empty value of Number.
The most frequent other feature values with which PROPN and Number co-occurred: Gender=EMPTY (1; 100%), Case=Loc (1; 100%), Animacy=Inan (1; 100%).
PROPN tokens may have the following values of Number:
- Plur (1; 100% of non-empty Number): Карпатах
- EMPTY (50): Микола, Павло, Богдан, Кеннеді, Крушельниця, Петро, Стрий, С’юзі, Іван, Ігоря
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
VERB –[ccomp]–> VERB (14; 67%),
VERB –[nsubj]–> NOUN (12; 63%),
NOUN –[det]–> DET (6; 100%),
NOUN –[amod]–> ADJ (5; 83%),
NOUN –[conj]–> NOUN (4; 80%),
VERB –[parataxis]–> VERB (3; 75%),
VERB –[nsubj]–> DET (3; 100%),
DET –[acl]–> VERB (2; 67%),
VERB –[iobj]–> PRON (2; 67%),
VERB –[conj]–> VERB (2; 100%).
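Agreement counts like those listed above could be computed per sentence along the following lines. This is a sketch under stated assumptions: the function names are hypothetical, the sample rows are an illustrative annotation rather than treebank data, and the denominator is assumed to be the pairs in which both parent and child carry a Number value, which the source does not state.

```python
from collections import Counter


def number_of(feats):
    """Return the Number value from a FEATS string, or None if absent."""
    if feats == "_":
        return None
    for item in feats.split("|"):
        name, _, value = item.partition("=")
        if name == "Number":
            return value
    return None


def agreement_by_relation(rows):
    """rows: 10-column CoNLL-U token rows of one sentence.
    Returns (agree, total) Counters keyed by (parent UPOS, deprel, child UPOS)."""
    by_id = {row[0]: row for row in rows}
    agree, total = Counter(), Counter()
    for row in rows:
        head = by_id.get(row[6])
        if head is None:
            continue  # the root has no parent token
        child_num, head_num = number_of(row[5]), number_of(head[5])
        if child_num is None or head_num is None:
            continue  # assumption: only pairs where both carry Number count
        key = (head[3], row[7], row[3])
        total[key] += 1
        if child_num == head_num:
            agree[key] += 1
    return agree, total


# Illustrative rows only (assumed annotation, not copied from UD_Ukrainian).
SENTENCE = [
    ["1", "старі", "старий", "ADJ", "_", "Number=Plur", "2", "amod", "_", "_"],
    ["2", "чоловіки", "чоловік", "NOUN", "_", "Number=Plur", "3", "nsubj", "_", "_"],
    ["3", "прийшли", "прийти", "VERB", "_", "Number=Plur", "0", "root", "_", "_"],
]

agree, total = agreement_by_relation(SENTENCE)
```

Summing the two counters over all sentences and dividing `agree[key]` by `total[key]` would give a percentage per relation type.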