home cs/feat edit page issue tracker

Animacy: animacy

Similarly to Gender, animacy is a lexical feature of nouns and inflectional feature of other parts of speech that mark agreement with nouns. It is independent of gender, therefore it is encoded separately in some tagsets (e.g. all the Multext-East tagsets). On the other hand, in Czech the (almost) only grammatical implications occur within the masculine gender, which is why the PDT tagset does not have animacy as separate feature and instead defines four genders: masculine animate, masculine inanimate, feminine and neuter.

Anim: animate

Human beings, animals, fictional characters, names of professions etc. are all animate. Even nouns that are normally inanimate can be inflected as animate if they are personified. For instance, consider a children’s story about cars where cars live and talk as people; then the cars may become and be inflected as animates.

PDT examples of masculine animate nouns:

Inan: inanimate

Nouns that are not animate are inanimate.

PDT examples of masculine inanimate nouns:


Treebank Statistics (UD_Czech)

This feature is universal. It occurs with 2 different values: Anim, Inan.

312931 tokens (21%) have a non-empty value of Animacy. 62654 types (49%) occur at least once with a non-empty value of Animacy. 28568 lemmas (49%) occur at least once with a non-empty value of Animacy. The feature is used with 8 part-of-speech tags: cs-pos/NOUN (163546; 11% instances), cs-pos/ADJ (73924; 5% instances), cs-pos/PROPN (48949; 3% instances), cs-pos/VERB (15642; 1% instances), cs-pos/PRON (7235; 0% instances), cs-pos/DET (2621; 0% instances), cs-pos/AUX (711; 0% instances), cs-pos/NUM (303; 0% instances).

NOUN

163546 cs-pos/NOUN tokens (44% of all NOUN tokens) have a non-empty value of Animacy.

The most frequent other feature values with which NOUN and Animacy co-occurred: Gender=Masc (163546; 100%), Negative=Pos (163534; 100%), Number=Sing (106848; 65%).

NOUN tokens may have the following values of Animacy:

Paradigm členAnimInan
Case=Acc|Number=Singčlena
Case=Acc|Number=Plurčleny
Case=Dat|Number=Singčlenu, členovi
Case=Dat|Number=Plurčlenůmčlenům
Case=Gen|Number=Singčlena
Case=Gen|Number=Plurčlenůčlenů
Case=Ins|Number=SingčlenemČLENEM
Case=Ins|Number=Plurčlenyčleny
Case=Loc|Number=Singčlenu, členovi
Case=Loc|Number=Plurčlenech
Case=Nom|Number=Singčlenčlen
Case=Nom|Number=Plurčlenové

Animacy seems to be lexical feature of NOUN. 99% lemmas (6964) occur only with one value of Animacy.

ADJ

73924 cs-pos/ADJ tokens (41% of all ADJ tokens) have a non-empty value of Animacy.

The most frequent other feature values with which ADJ and Animacy co-occurred: Gender=Masc (73853; 100%), Negative=Pos (69257; 94%), Degree=Pos (65402; 88%), Number=Sing (46254; 63%).

ADJ tokens may have the following values of Animacy:

Paradigm českýAnimInan
Case=Acc|Number=Singčeskéhočeský
Case=Acc|Number=Plurčeskéčeské
Case=Dat|Number=Singčeskémučeskému
Case=Dat|Number=Plurčeským, českýchčeským
Case=Gen|Number=Singčeskéhočeského
Case=Gen|Number=Plurčeskýchčeských
Case=Ins|Number=Singčeskýmčeským
Case=Ins|Number=Plurčeskýmičeskými
Case=Loc|Number=Singčeskémčeském
Case=Loc|Number=Plurčeskýchčeských
Case=Nom|Number=Singčeskýčeský
Case=Nom|Number=Plurčeštíčeské

PROPN

48949 cs-pos/PROPN tokens (58% of all PROPN tokens) have a non-empty value of Animacy.

The most frequent other feature values with which PROPN and Animacy co-occurred: Negative=Pos (48949; 100%), Gender=Masc (48949; 100%), Abbr=EMPTY (45594; 93%), Number=Sing (41680; 85%), Case=Nom (27584; 56%).

PROPN tokens may have the following values of Animacy:

Paradigm YorkAnimInan
Case=Acc|NameType=GeoYork
Case=Gen|NameType=GeoYorku
Case=Loc|NameType=GeoYorku, YORKU
Case=Loc|NameType=SurYorku
Case=Nom|NameType=GeoYork, YORK
Case=Nom|NameType=SurYORK

Animacy seems to be lexical feature of PROPN. 99% lemmas (10355) occur only with one value of Animacy.

VERB

15642 cs-pos/VERB tokens (9% of all VERB tokens) have a non-empty value of Animacy.

The most frequent other feature values with which VERB and Animacy co-occurred: VerbForm=Part (15642; 100%), Number=Plur (15642; 100%), Mood=EMPTY (15642; 100%), Person=EMPTY (15642; 100%), Negative=Pos (14429; 92%), Voice=Act (13279; 85%), Tense=Past (13279; 85%), Gender=Masc (9294; 59%).

VERB tokens may have the following values of Animacy:

Paradigm býtAnimInan
Gender=Masc|Negative=Negnebyli
Gender=Masc|Negative=Posbyli
Gender=Fem,Masc|Negative=Negnebyly
Gender=Fem,Masc|Negative=Posbyly

PRON

7235 cs-pos/PRON tokens (10% of all PRON tokens) have a non-empty value of Animacy.

The most frequent other feature values with which PRON and Animacy co-occurred: Variant=EMPTY (7235; 100%), Reflex=EMPTY (7228; 100%), Person=EMPTY (7136; 99%), Gender=Masc (5403; 75%), PronType=Int,Rel (5037; 70%), Case=Nom (4945; 68%).

PRON tokens may have the following values of Animacy:

Paradigm tenAnimInan
Case=Acc|Number=Singtohoten
Case=Acc|Number=Plurtyty
Case=Nom|Number=Plurtity

DET

2621 cs-pos/DET tokens (9% of all DET tokens) have a non-empty value of Animacy.

The most frequent other feature values with which DET and Animacy co-occurred: Gender=Masc (2621; 100%), Gender[psor]=EMPTY (2547; 97%), Number[psor]=EMPTY (2254; 86%), Person=EMPTY (2254; 86%), Reflex=EMPTY (2049; 78%), Case=Acc (1789; 68%), Poss=EMPTY (1682; 64%), Number=Sing (1578; 60%).

DET tokens may have the following values of Animacy:

Paradigm tentoAnimInan
Case=Acc|Number=Singtohototento
Case=Acc|Number=Plurtytotyto
Case=Nom|Number=Plurtitotyto

AUX

711 cs-pos/AUX tokens (3% of all AUX tokens) have a non-empty value of Animacy.

The most frequent other feature values with which AUX and Animacy co-occurred: Voice=Act (711; 100%), VerbForm=Part (711; 100%), Number=Plur (711; 100%), Mood=EMPTY (711; 100%), Tense=Past (711; 100%), Person=EMPTY (711; 100%), Negative=Pos (653; 92%), Gender=Fem,Masc (535; 75%).

AUX tokens may have the following values of Animacy:

Paradigm býtAnimInan
Gender=Masc|Negative=Negnebyli
Gender=Masc|Negative=Posbyli
Gender=Fem,Masc|Negative=Negnebyly
Gender=Fem,Masc|Negative=Posbyly

NUM

303 cs-pos/NUM tokens (1% of all NUM tokens) have a non-empty value of Animacy.

The most frequent other feature values with which NUM and Animacy co-occurred: Case=Acc (303; 100%), Number=Sing (303; 100%), Gender=Masc (303; 100%), NumType=Card (303; 100%), NumValue=1,2,3 (303; 100%), NumForm=Word (303; 100%).

NUM tokens may have the following values of Animacy:

Paradigm jedenAnimInan
jednohojeden

Relations with Agreement in Animacy

The 10 most frequent relations where parent and child node agree in Animacy: NOUN –[amod]–> ADJ (62538; 98%), PROPN –[name]–> PROPN (11952; 99%), PROPN –[nmod]–> NOUN (7243; 88%), PROPN –[conj]–> PROPN (2978; 67%), ADJ –[conj]–> ADJ (2133; 90%), PROPN –[amod]–> ADJ (1850; 74%), ADJ –[nsubj]–> NOUN (1555; 88%), PROPN –[appos]–> NOUN (681; 77%), NOUN –[nsubj]–> PROPN (277; 55%), NOUN –[case]–> NOUN (253; 51%).


Animacy in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]