Treebank Statistics: UD_Czech-CAC: Features: NameType
This feature is language-specific.
It occurs with 7 different values: Com, Geo, Giv, Nat, Oth, Pro, Sur.
Some words have combined values of the feature; 12 combinations have been observed: Com|Geo, Com|Giv, Com|Pro, Com|Sur, Geo|Giv, Geo|Oth, Geo|Sur, Giv|Oth, Giv|Pro, Giv|Sur, Nat|Sur, Pro|Sur.
10592 tokens (2%) have a non-empty value of NameType.
4886 types (8%) occur at least once with a non-empty value of NameType.
3816 lemmas (13%) occur at least once with a non-empty value of NameType.
The feature is used with 6 part-of-speech tags: PROPN (9814; 2% instances), ADJ (768; 0% instances), ADP (4; 0% instances), PART (4; 0% instances), CCONJ (1; 0% instances), PRON (1; 0% instances).
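Per-tag counts like the ones above can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function name `count_nametype_by_upos` and the sample sentence in `SAMPLE` are hypothetical and serve only as an illustration.

```python
from collections import Counter


def count_nametype_by_upos(conllu_text):
    """Count tokens with a non-empty NameType, grouped by UPOS tag."""
    counts = Counter()
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence breaks) and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        upos, feats = cols[3], cols[5]
        if feats == "_":
            continue
        features = dict(f.split("=", 1) for f in feats.split("|"))
        if "NameType" in features:
            counts[upos] += 1
    return counts


# Hypothetical sample sentence, not taken from the treebank.
SAMPLE = "\n".join([
    "# text = Karel Svoboda žije v Praze",
    "1\tKarel\tKarel\tPROPN\t_\tCase=Nom|NameType=Giv\t3\tnsubj\t_\t_",
    "2\tSvoboda\tSvoboda\tPROPN\t_\tCase=Nom|NameType=Sur\t1\tflat\t_\t_",
    "3\tžije\tžít\tVERB\t_\tMood=Ind|Tense=Pres\t0\troot\t_\t_",
    "4\tv\tv\tADP\t_\tAdpType=Prep|Case=Loc\t5\tcase\t_\t_",
    "5\tPraze\tPraha\tPROPN\t_\tCase=Loc|NameType=Geo\t3\tobl\t_\t_",
])

if __name__ == "__main__":
    for upos, n in count_nametype_by_upos(SAMPLE).most_common():
        print(f"{upos}: {n}")
```

Only the ID, UPOS and FEATS columns are read, so the sketch does not depend on the rest of the annotation being complete.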
PROPN
9814 PROPN tokens (100% of all PROPN tokens) have a non-empty value of NameType.
The most frequent other feature values with which PROPN and NameType co-occurred: Abbr=EMPTY (7936; 81%), Number=Sing (7187; 73%), Gender=Masc (5431; 55%).
PROPN tokens may have the following values of NameType:
- Com (1750; 18% of non-empty NameType): KSČ, ROH, ÚJČ, SSM, ČSAV, TIBA, NF, OV, GŘ, VÚM
- Com,Geo (2; 0% of non-empty NameType): Böhmen, Prag
- Com,Giv (2; 0% of non-empty NameType): Konstruktiva
- Com,Pro (3; 0% of non-empty NameType): Dermacol, Gambrinus, Prazdroj
- Com,Sur (4; 0% of non-empty NameType): Bell, Bramsch, Ventura
- Geo (3418; 35% of non-empty NameType): Praze, SSSR, Praha, ČSSR, Prahy, ČSR, Československa, NDR, Země, USA
- Geo,Giv (16; 0% of non-empty NameType): Boleslav, Mořici, Kuby, Louis, Lucie, Mořice
- Geo,Oth (1; 0% of non-empty NameType): Österreich
- Geo,Sur (26; 0% of non-empty NameType): Pavlov, Hejná, Tigridem, Blatná, Hlinka, Hracholusk, Janského, Lachaise, Lhota, Lhotu
- Giv (1221; 12% of non-empty NameType): Karel, Julius, Václav, Jana, Karla, Jaroslav, Josef, Jiří, Zdeněk, Klement
- Giv,Oth (1; 0% of non-empty NameType): Lucii
- Giv,Pro (8; 0% of non-empty NameType): Claudio, Othelo, Pascal, Pascalu, Radka, Raduna, Sandy, Zora
- Giv,Sur (24; 0% of non-empty NameType): Ariadna, Kosmy, Perry, Joy, Dante, Dantem, Eliot, Figara, James, Jamese
- Nat (129; 1% of non-empty NameType): Adygejci, Pražané, Angličan, Egypťané, Keltové, Afričanů, Američana, Američané, Američanů, Asyřané
- Nat,Sur (1; 0% of non-empty NameType): Srba
- Oth (21; 0% of non-empty NameType): Opeplatis, SNP, Plastex, Rena, Erotissimo, Gaudeamus, Intermóda, Invex, Luna, Musikbuch
- Pro (290; 3% of non-empty NameType): Škoda, Merkur, Duha, Octavia, FSČ, Klad, Romatic, SAPO, SaS, Tatramat
- Pro,Sur (3; 0% of non-empty NameType): Baracchi, Burda, Mendoza
- Sur (2894; 29% of non-empty NameType): Fučík, Erben, Horálek, Knappová, Němec, Těšitelová, Lenin, Záveský, Kraus, Fučíka

| Paradigm Svoboda | Com | Geo | Pro | Sur |
|---|---|---|---|---|
| Animacy=Anim\|Case=Acc\|Gender=Masc | Svobodu | | | |
| Animacy=Anim\|Case=Gen\|Gender=Masc | Svobody | | | |
| Animacy=Anim\|Case=Nom\|Gender=Masc | Svoboda | | | |
| Case=Loc\|Gender=Fem | Svobodě | | | |
| Case=Nom\|Gender=Fem | Svoboda | Svoboda | | |
NameType seems to be a lexical feature of PROPN: 98% of lemmas (3400) occur with only one value of NameType.
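The share of lemmas that occur with a single NameType value can be derived from (lemma, value) pairs collected per token. This is a minimal sketch under stated assumptions: the helper name `single_value_share` and the observations in `OBSERVATIONS` are hypothetical, and a combined value such as `Geo,Giv` is assumed to count as one distinct value string.

```python
from collections import defaultdict


def single_value_share(observations):
    """Return (lemmas with exactly one NameType value, all lemmas seen).

    observations: iterable of (lemma, nametype) pairs, one pair per token
    that carries a non-empty NameType.
    """
    values = defaultdict(set)
    for lemma, nametype in observations:
        values[lemma].add(nametype)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Hypothetical observations; Svoboda is shown with two values because the
# paradigm table lists it under more than one NameType column.
OBSERVATIONS = [
    ("Praha", "Geo"),
    ("Praha", "Geo"),
    ("Karel", "Giv"),
    ("Svoboda", "Sur"),
    ("Svoboda", "Com"),
]

if __name__ == "__main__":
    single, total = single_value_share(OBSERVATIONS)
    print(f"{single} of {total} lemmas occur with only one value")
```

A high share suggests the value is tied to the lemma rather than to the context, which is the sense in which the feature is called lexical here.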
ADJ
768 ADJ tokens (1% of all ADJ tokens) have a non-empty value of NameType.
The most frequent other feature values with which ADJ and NameType co-occurred: VerbForm=EMPTY (768; 100%), Voice=EMPTY (768; 100%), Degree=EMPTY (565; 74%), Polarity=EMPTY (565; 74%), Number=Sing (520; 68%), Animacy=EMPTY (489; 64%).
ADJ tokens may have the following values of NameType:
- Com (18; 2% of non-empty NameType): Koh, i, Telephone, Tonkünstler, Červeného, Deutscher, Jazykovedným, Povážské, Pražská, United
- Com,Pro (1; 0% of non-empty NameType): Verein
- Geo (165; 21% of non-empty NameType): Králové, Kutná, Kašperských, Lužických, České, Mariánských, Nové, Vrátné, Bassa, Janské
- Geo,Giv (16; 2% of non-empty NameType): Karlovy, Josefův, Karlových, Františkových, Jindřichova, Konstantinovy
- Geo,Sur (28; 4% of non-empty NameType): Gottwaldově, Vančurově, Jiráskova, Melantrichova, Alšově, Chotkově, Engelsových, Fučíkovy, Fučíkově, Gottwaldovy
- Giv (51; 7% of non-empty NameType): Karlovy, Karlův, Karlově, Angelino, Anglický, Ariadnin, Bozděchovy, Božetěchově, Buridanův, Chefrenovy
- Oth (1; 0% of non-empty NameType): Santa
- Pro (15; 2% of non-empty NameType): Rudého, Illustrierte, Mladé, Prague, Rouge, Schweizer, Super, Touring, Tourist, linguistique
- Sur (473; 62% of non-empty NameType): Erbenových, Erbenovy, Fučíkova, Mohorovičićovy, Bohrův, Erbenova, Erbenově, Fučíkovy, Fučíkův, Fučíkovo

| Paradigm Karlův | Geo,Giv | Geo,Sur | Giv |
|---|---|---|---|
| Animacy=Anim\|Case=Nom\|Gender=Masc\|Number=Sing | Karlův | | |
| Animacy=Inan\|Case=Acc\|Gender=Masc\|Number=Sing | Karlův | | |
| Animacy=Inan\|Case=Acc\|Gender=Masc\|Number=Plur | Karlovy | | |
| Animacy=Inan\|Case=Loc\|Gender=Masc\|Number=Plur | Karlových | | |
| Animacy=Inan\|Case=Nom\|Gender=Masc\|Number=Sing | Karlův | | |
| Animacy=Inan\|Case=Nom\|Gender=Masc\|Number=Plur | Karlovy | Karlovy | |
| Case=Gen\|Gender=Fem\|Number=Sing | Karlovy | | |
| Case=Gen\|Gender=Fem\|Number=Plur | Karlových | | |
| Case=Loc\|Gender=Fem\|Number=Sing | Karlově | | |
| Case=Nom\|Gender=Neut\|Number=Sing | Karlovo | | |
NameType seems to be a lexical feature of ADJ: 97% of lemmas (346) occur with only one value of NameType.
ADP
4 ADP tokens (0% of all ADP tokens) have a non-empty value of NameType.
The most frequent other feature values with which ADP and NameType co-occurred: AdpType=Prep (4; 100%), Case=EMPTY (3; 75%).
ADP tokens may have the following values of NameType:
- Com (2; 50% of non-empty NameType): Pro, pour
- Oth (1; 25% of non-empty NameType): aus
- Pro (1; 25% of non-empty NameType): della
PART
4 PART tokens (0% of all PART tokens) have a non-empty value of NameType.
PART tokens may have the following values of NameType:
- Geo (3; 75% of non-empty NameType): el, La
- Oth (1; 25% of non-empty NameType): Al
CCONJ
1 CCONJ token (0% of all CCONJ tokens) has a non-empty value of NameType.
CCONJ tokens may have the following values of NameType:
- Com (1; 100% of non-empty NameType): and
PRON
1 PRON token (0% of all PRON tokens) has a non-empty value of NameType.
The most frequent other feature values with which PRON and NameType co-occurred: Case=Acc (1; 100%), Gender=Masc (1; 100%), Number=Plur (1; 100%), Person=EMPTY (1; 100%), PrepCase=EMPTY (1; 100%), PronType=Tot (1; 100%), Reflex=EMPTY (1; 100%), Variant=EMPTY (1; 100%).
PRON tokens may have the following values of NameType:
- Com (1; 100% of non-empty NameType): Tous
Relations with Agreement in NameType
The 10 most frequent relations where parent and child node agree in NameType:
PROPN –[conj]–> PROPN (1114; 94%),
ADJ –[conj]–> ADJ (24; 89%),
PROPN –[advcl:pred]–> PROPN (7; 100%),
PROPN –[dep]–> PROPN (7; 54%),
ADJ –[flat]–> PROPN (5; 56%),
PROPN –[flat]–> ADJ (4; 100%),
PROPN –[appos]–> PROPN (3; 60%),
PROPN –[nsubj]–> PROPN (2; 67%),
ADJ –[flat]–> ADJ (1; 100%),
PRON –[case]–> ADP (1; 100%).
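
Agreement counts of this kind can be approximated by walking each dependency tree and comparing the NameType of every child with that of its parent. The sketch below is an illustration only: the function name `nametype_agreement`, the token dictionaries and the sample sentence in `SENTENCE` are hypothetical. It also assumes that the percentage is the share of agreeing pairs among parent-child pairs of the same type in which both nodes carry NameType, which this page does not state.

```python
from collections import Counter


def nametype_agreement(sentence):
    """Count parent-child pairs that agree in NameType.

    sentence: list of dicts with keys id, head, upos, deprel, nametype
    (nametype is None when the feature is absent).
    Returns (agree, total), both keyed by (parent UPOS, deprel, child UPOS).
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:  # the root has head 0, which is not a token
            continue
        if child["nametype"] is None or parent["nametype"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        total[key] += 1
        if child["nametype"] == parent["nametype"]:
            agree[key] += 1
    return agree, total


# Hypothetical sentence "Karel Svoboda a Jan", not taken from the treebank.
SENTENCE = [
    {"id": 1, "head": 0, "upos": "PROPN", "deprel": "root", "nametype": "Giv"},
    {"id": 2, "head": 1, "upos": "PROPN", "deprel": "flat", "nametype": "Sur"},
    {"id": 3, "head": 4, "upos": "CCONJ", "deprel": "cc", "nametype": None},
    {"id": 4, "head": 1, "upos": "PROPN", "deprel": "conj", "nametype": "Giv"},
]

if __name__ == "__main__":
    agree, total = nametype_agreement(SENTENCE)
    for (parent, rel, child), n in total.items():
        share = round(100 * agree[(parent, rel, child)] / n)
        print(f"{parent} -[{rel}]-> {child} ({agree[(parent, rel, child)]}; {share}%)")
```

Pairs where either node lacks NameType are left out of the denominator, so a relation with few qualifying pairs can reach 100% on a single instance.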