Animacy: animacy
Like Gender, Animacy is a lexical feature of nouns and an inflectional feature of other parts of speech that mark agreement with nouns. It is independent of gender, so some tagsets encode it separately (e.g. all the Multext-East tagsets). In Czech, however, (almost) all of its grammatical implications occur within the masculine gender, which is why the PDT tagset does not treat animacy as a separate feature and instead defines four genders: masculine animate, masculine inanimate, feminine and neuter.
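The relationship between the four PDT genders and a separate Gender plus Animacy encoding can be sketched as a small lookup table. This is only an illustration: the single-letter codes `M`, `I`, `F`, `N` and the helper name `split_gender` are assumptions made for this example, not something defined on this page.

```python
# Hypothetical mapping from the four PDT-style genders to separate
# Gender and Animacy features. The one-letter keys are assumed here
# purely for illustration.
PDT_GENDER_TO_FEATURES = {
    "M": {"Gender": "Masc", "Animacy": "Anim"},  # masculine animate
    "I": {"Gender": "Masc", "Animacy": "Inan"},  # masculine inanimate
    "F": {"Gender": "Fem"},                      # feminine: no Animacy value
    "N": {"Gender": "Neut"},                     # neuter: no Animacy value
}


def split_gender(pdt_gender: str) -> str:
    """Return a FEATS-style string, e.g. 'Animacy=Anim|Gender=Masc'."""
    features = PDT_GENDER_TO_FEATURES[pdt_gender]
    # Feature names are sorted alphabetically, as is usual in a FEATS column.
    return "|".join(f"{name}={value}" for name, value in sorted(features.items()))


if __name__ == "__main__":
    for code in "MIFN":
        print(code, "->", split_gender(code))
```

Only the two masculine genders receive an Animacy value in this sketch, which mirrors the observation that the distinction matters (almost) exclusively within the masculine gender.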
Anim: animate
Human beings, animals, fictional characters, names of professions etc. are all animate. Even nouns that are normally inanimate can be inflected as animate if they are personified. For instance, in a children’s story where cars live and talk like people, the cars may be treated as animate and inflected accordingly.
PDT examples of masculine animate nouns:
- člověk “man”, ministr “minister”, prezident “president”, předseda “chairman”, ředitel “director”
Inan: inanimate
Nouns that are not animate are inanimate.
PDT examples of masculine inanimate nouns:
- rok “year”, zákon “law”, stát “state”, případ “case”, milión “million”
Treebank Statistics (UD_Czech)
This feature is universal.
It occurs with 2 different values: Anim, Inan.
312931 tokens (21%) have a non-empty value of Animacy.
62654 types (49%) occur at least once with a non-empty value of Animacy.
28568 lemmas (49%) occur at least once with a non-empty value of Animacy.
The feature is used with 8 part-of-speech tags: cs-pos/NOUN (163546; 11% of instances), cs-pos/ADJ (73924; 5% of instances), cs-pos/PROPN (48949; 3% of instances), cs-pos/VERB (15642; 1% of instances), cs-pos/PRON (7235; 0% of instances), cs-pos/DET (2621; 0% of instances), cs-pos/AUX (711; 0% of instances), cs-pos/NUM (303; 0% of instances).
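Counts of this kind can be approximated by scanning the FEATS column of a CoNLL-U file. The sketch below is a minimal illustration under stated assumptions: the four sample rows are hand-made (not taken from the treebank), and the function names `parse_feats` and `animacy_by_upos` are hypothetical. It is not the script that produced the figures above.

```python
from collections import Counter

# Hand-made CoNLL-U rows (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL,
# DEPS, MISC); illustrative only.
ROWS = [
    ("1", "český", "český", "ADJ", "_",
     "Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing", "2", "amod", "_", "_"),
    ("2", "ministr", "ministr", "NOUN", "_",
     "Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing", "3", "nsubj", "_", "_"),
    ("3", "podepsal", "podepsat", "VERB", "_",
     "Gender=Masc|Number=Sing|Tense=Past|VerbForm=Part", "0", "root", "_", "_"),
    ("4", "zákon", "zákon", "NOUN", "_",
     "Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing", "3", "dobj", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)


def parse_feats(feats: str) -> dict:
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def animacy_by_upos(conllu: str):
    """Count all tokens and tokens with a non-empty Animacy, per UPOS tag."""
    total = Counter()
    with_animacy = Counter()
    for line in conllu.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        upos = cols[3]
        total[upos] += 1
        if "Animacy" in parse_feats(cols[5]):
            with_animacy[upos] += 1
    return total, with_animacy


if __name__ == "__main__":
    total, with_animacy = animacy_by_upos(SAMPLE)
    for upos in sorted(total):
        print(f"{upos}: {with_animacy[upos]}/{total[upos]} tokens with Animacy")
```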
NOUN
163546 cs-pos/NOUN tokens (44% of all NOUN tokens) have a non-empty value of Animacy.
The most frequent other feature values with which NOUN and Animacy co-occurred: Gender=Masc (163546; 100%), Negative=Pos (163534; 100%), Number=Sing (106848; 65%).
NOUN tokens may have the following values of Animacy:
- Anim (42328; 26% of non-empty Animacy): lidí, ministr, předseda, lidé, ředitel, prezident, trenér, ministra, prezidenta, premiér
- Inan (121218; 74% of non-empty Animacy): roku, roce, případě, zákona, rok, světa, trhu, zákon, zájem, státu
- EMPTY (208820): korun, let, strany, procent, společnosti, době, firmy, Kč, práce, jednání
| Paradigm člen | Anim | Inan |
|---|---|---|
| Case=Acc|Number=Sing | člena | |
| Case=Acc|Number=Plur | členy | |
| Case=Dat|Number=Sing | členu, členovi | |
| Case=Dat|Number=Plur | členům | členům |
| Case=Gen|Number=Sing | člena | |
| Case=Gen|Number=Plur | členů | členů |
| Case=Ins|Number=Sing | členem | ČLENEM |
| Case=Ins|Number=Plur | členy | členy |
| Case=Loc|Number=Sing | členu, členovi | |
| Case=Loc|Number=Plur | členech | |
| Case=Nom|Number=Sing | člen | člen |
| Case=Nom|Number=Plur | členové | |
Animacy seems to be a lexical feature of NOUN. 99% of lemmas (6964) occur with only one value of Animacy.
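The "lexical feature" test amounts to asking how many lemmas are ever seen with more than one value. A minimal sketch, assuming a list of (lemma, Animacy) observations; the observations and the function name `single_valued_share` are made up for illustration, with člen included as a lemma that occurs with both values, as in the paradigm table above.

```python
from collections import defaultdict

# Made-up (lemma, Animacy) observations for illustration.
OBSERVATIONS = [
    ("člen", "Anim"), ("člen", "Anim"), ("člen", "Inan"),
    ("ministr", "Anim"),
    ("rok", "Inan"), ("rok", "Inan"),
    ("zákon", "Inan"),
]


def single_valued_share(observations):
    """Return (lemmas seen with exactly one value, all lemmas seen)."""
    values_by_lemma = defaultdict(set)
    for lemma, animacy in observations:
        values_by_lemma[lemma].add(animacy)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)


if __name__ == "__main__":
    single, total = single_valued_share(OBSERVATIONS)
    print(f"{single} of {total} lemmas occur with only one value of Animacy")
```

A share close to 100% suggests that the value is a property of the lemma rather than of the individual word form.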
ADJ
73924 cs-pos/ADJ tokens (41% of all ADJ tokens) have a non-empty value of Animacy.
The most frequent other feature values with which ADJ and Animacy co-occurred: Gender=Masc (73853; 100%), Negative=Pos (69257; 94%), Degree=Pos (65402; 88%), Number=Sing (46254; 63%).
ADJ tokens may have the following values of Animacy:
- Anim (18640; 25% of non-empty Animacy): každý, další, bývalý, mnozí, domácí, první, generální, český, americký, českých
- Inan (55284; 75% of non-empty Animacy): další, první, nový, českého, celý, český, každý, velký, letošního, státního
- EMPTY (106887): české, první, další, druhé, nové, možné, poslední, česká, třeba, státní
| Paradigm český | Anim | Inan |
|---|---|---|
| Case=Acc|Number=Sing | českého | český |
| Case=Acc|Number=Plur | české | české |
| Case=Dat|Number=Sing | českému | českému |
| Case=Dat|Number=Plur | českým, českých | českým |
| Case=Gen|Number=Sing | českého | českého |
| Case=Gen|Number=Plur | českých | českých |
| Case=Ins|Number=Sing | českým | českým |
| Case=Ins|Number=Plur | českými | českými |
| Case=Loc|Number=Sing | českém | českém |
| Case=Loc|Number=Plur | českých | českých |
| Case=Nom|Number=Sing | český | český |
| Case=Nom|Number=Plur | čeští | české |
PROPN
48949 cs-pos/PROPN tokens (58% of all PROPN tokens) have a non-empty value of Animacy.
The most frequent other feature values with which PROPN and Animacy co-occurred: Negative=Pos (48949; 100%), Gender=Masc (48949; 100%), Abbr=EMPTY (45594; 93%), Number=Sing (41680; 85%), Case=Nom (27584; 56%).
PROPN tokens may have the following values of Animacy:
- Anim (37154; 76% of non-empty Animacy): Jiří, J, Jan, Václav, Petr, Pavel, Josef, M, Vladimír, V
- Inan (11795; 24% of non-empty Animacy): USA, York, Zlín, Liberec, FNM, SSSR, Hradec, Izrael, Londýn, Yorku
- EMPTY (35082): Praha, ČR, Praze, LN, ODS, OSN, Evropy, Brno, Prahy, ODA
| Paradigm York | Anim | Inan |
|---|---|---|
| Case=Acc|NameType=Geo | York | |
| Case=Gen|NameType=Geo | Yorku | |
| Case=Loc|NameType=Geo | Yorku, YORKU | |
| Case=Loc|NameType=Sur | Yorku | |
| Case=Nom|NameType=Geo | York, YORK | |
| Case=Nom|NameType=Sur | YORK | |
Animacy seems to be a lexical feature of PROPN. 99% of lemmas (10355) occur with only one value of Animacy.
VERB
15642 cs-pos/VERB tokens (9% of all VERB tokens) have a non-empty value of Animacy.
The most frequent other feature values with which VERB and Animacy co-occurred: VerbForm=Part (15642; 100%), Number=Plur (15642; 100%), Mood=EMPTY (15642; 100%), Person=EMPTY (15642; 100%), Negative=Pos (14429; 92%), Voice=Act (13279; 85%), Tense=Past (13279; 85%), Gender=Masc (9294; 59%).
VERB tokens may have the following values of Animacy:
- Anim (9294; 59% of non-empty Animacy): měli, byli, mohli, chtěli, začali, museli, dostali, získali, rozhodli, přišli
- Inan (6348; 41% of non-empty Animacy): byly, měly, mohly, začaly, nebyly, objevily, dosáhly, získaly, neměly, staly
- EMPTY (149992): je, jsou, má, není, byl, být, může, bylo, řekl, měl
| Paradigm být | Anim | Inan |
|---|---|---|
| Gender=Masc|Negative=Neg | nebyli | |
| Gender=Masc|Negative=Pos | byli | |
| Gender=Fem,Masc|Negative=Neg | nebyly | |
| Gender=Fem,Masc|Negative=Pos | byly | |
PRON
7235 cs-pos/PRON tokens (10% of all PRON tokens) have a non-empty value of Animacy.
The most frequent other feature values with which PRON and Animacy co-occurred: Variant=EMPTY (7235; 100%), Reflex=EMPTY (7228; 100%), Person=EMPTY (7136; 99%), Gender=Masc (5403; 75%), PronType=Int,Rel (5037; 70%), Case=Nom (4945; 68%).
PRON tokens may have the following values of Animacy:
- Anim (3823; 53% of non-empty Animacy): kteří, kdo, nikdo, všichni, někdo, ti, sami, oni, kterého, někteří
- Inan (3412; 47% of non-empty Animacy): co, které, který, čím, všechny, ty, čem, čeho, jež, čemu
- EMPTY (65313): se, to, si, které, který, která, tím, tom, nás, tomu
| Paradigm ten | Anim | Inan |
|---|---|---|
| Case=Acc|Number=Sing | toho | ten |
| Case=Acc|Number=Plur | ty | ty |
| Case=Nom|Number=Plur | ti | ty |
DET
2621 cs-pos/DET tokens (9% of all DET tokens) have a non-empty value of Animacy.
The most frequent other feature values with which DET and Animacy co-occurred: Gender=Masc (2621; 100%), Gender[psor]=EMPTY (2547; 97%), Number[psor]=EMPTY (2254; 86%), Person=EMPTY (2254; 86%), Reflex=EMPTY (2049; 78%), Case=Acc (1789; 68%), Poss=EMPTY (1682; 64%), Number=Sing (1578; 60%).
DET tokens may have the following values of Animacy:
- Anim (635; 24% of non-empty Animacy): někteří, naši, svého, tito, ti, ty, tyto, takového, našeho, tohoto
- Inan (1986; 76% of non-empty Animacy): svůj, tento, tyto, některé, náš, takový, nějaký, ten, žádný, takové
- EMPTY (25192): jeho, jejich, své, této, její, tohoto, svou, tato, těchto, svých
| Paradigm tento | Anim | Inan |
|---|---|---|
| Case=Acc|Number=Sing | tohoto | tento |
| Case=Acc|Number=Plur | tyto | tyto |
| Case=Nom|Number=Plur | tito | tyto |
AUX
711 cs-pos/AUX tokens (3% of all AUX tokens) have a non-empty value of Animacy.
The most frequent other feature values with which AUX and Animacy co-occurred: Voice=Act (711; 100%), VerbForm=Part (711; 100%), Number=Plur (711; 100%), Mood=EMPTY (711; 100%), Tense=Past (711; 100%), Person=EMPTY (711; 100%), Negative=Pos (653; 92%), Gender=Fem,Masc (535; 75%).
AUX tokens may have the following values of Animacy:
- Anim (176; 25% of non-empty Animacy): byli, nebyli
- Inan (535; 75% of non-empty Animacy): byly, nebyly, bývaly
- EMPTY (20084): by, bude, jsem, jsme, byl, budou, byla, být, je, bylo
| Paradigm být | Anim | Inan |
|---|---|---|
| Gender=Masc|Negative=Neg | nebyli | |
| Gender=Masc|Negative=Pos | byli | |
| Gender=Fem,Masc|Negative=Neg | nebyly | |
| Gender=Fem,Masc|Negative=Pos | byly | |
NUM
303 cs-pos/NUM tokens (1% of all NUM tokens) have a non-empty value of Animacy.
The most frequent other feature values with which NUM and Animacy co-occurred: Case=Acc (303; 100%), Number=Sing (303; 100%), Gender=Masc (303; 100%), NumType=Card (303; 100%), NumValue=1,2,3 (303; 100%), NumForm=Word (303; 100%).
NUM tokens may have the following values of Animacy:
- Anim (84; 28% of non-empty Animacy): jednoho
- Inan (219; 72% of non-empty Animacy): jeden
- EMPTY (41207): 1, 2, 3, dva, tři, 4, 6, dvě, tisíc, 5
| Paradigm jeden | Anim | Inan |
|---|---|---|
| | jednoho | jeden |
Relations with Agreement in Animacy
The 10 most frequent relations where parent and child node agree in Animacy:
NOUN –[amod]–> ADJ (62538; 98%),
PROPN –[name]–> PROPN (11952; 99%),
PROPN –[nmod]–> NOUN (7243; 88%),
PROPN –[conj]–> PROPN (2978; 67%),
ADJ –[conj]–> ADJ (2133; 90%),
PROPN –[amod]–> ADJ (1850; 74%),
ADJ –[nsubj]–> NOUN (1555; 88%),
PROPN –[appos]–> NOUN (681; 77%),
NOUN –[nsubj]–> PROPN (277; 55%),
NOUN –[case]–> NOUN (253; 51%).
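One way to obtain figures of this shape is to look at every parent and child pair in which both nodes carry Animacy and to count how often the values match. The sketch below assumes that interpretation of the percentages, which this page does not spell out; the token tuples and the function name `agreement_counts` are hypothetical.

```python
from collections import Counter

# Hand-made tokens: (id, upos, Animacy or None, head id, deprel).
TOKENS = [
    (1, "ADJ", "Anim", 2, "amod"),
    (2, "NOUN", "Anim", 3, "nsubj"),
    (3, "VERB", None, 0, "root"),
    (4, "ADJ", "Inan", 5, "amod"),
    (5, "NOUN", "Inan", 3, "dobj"),
    (6, "PROPN", "Inan", 2, "nmod"),  # child disagrees with its Anim parent
]


def agreement_counts(tokens):
    """Count agreeing pairs and all comparable pairs per relation type.

    Keys are (parent UPOS, deprel, child UPOS). A pair is comparable only
    if both parent and child have a non-empty Animacy value.
    """
    by_id = {tok[0]: tok for tok in tokens}
    agree = Counter()
    both = Counter()
    for _, upos, animacy, head, deprel in tokens:
        parent = by_id.get(head)  # None for the root (head id 0)
        if parent is None or animacy is None or parent[2] is None:
            continue
        key = (parent[1], deprel, upos)
        both[key] += 1
        if parent[2] == animacy:
            agree[key] += 1
    return agree, both


if __name__ == "__main__":
    agree, both = agreement_counts(TOKENS)
    for key in sorted(both):
        parent_pos, deprel, child_pos = key
        print(f"{parent_pos} -[{deprel}]-> {child_pos}: {agree[key]} of {both[key]} agree")
```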