Abbr
: abbreviation
Boolean feature. Is this an abbreviation? Note that the abbreviated word typically belongs to a part of speech other than cs-pos/X.
Yes: it is abbreviation
Examples
- Acronyms: ČR (Česká republika) “Czech Republic”, LN (Lidové noviny) (a newspaper), ODS (Občanská demokratická strana) “Civic Democratic Party”, OSN (Organizace spojených národů) “United Nations Organization”, ODA (Občanská demokratická aliance) “Civic Democratic Alliance”
- Initials: J, M, V, A, C
- Abbreviations: r. (rok) “year”, např. (například) “for example”, tzv. (takzvaný) “so-called”, a. s. (akciová společnost) “joint-stock company”, tel. (telefon) “phone”
Treebank Statistics (UD_Czech)
This feature is language-specific.
It occurs with 1 different values: Yes
.
21743 tokens (1%) have a non-empty value of Abbr
.
1755 types (1%) occur at least once with a non-empty value of Abbr
.
1812 lemmas (3%) occur at least once with a non-empty value of Abbr
.
The feature is used with 11 part-of-speech tags: cs-pos/PROPN (13042; 1% instances), cs-pos/NOUN (5768; 0% instances), cs-pos/ADJ (1714; 0% instances), cs-pos/ADV (956; 0% instances), cs-pos/CONJ (182; 0% instances), cs-pos/ADP (23; 0% instances), cs-pos/VERB (22; 0% instances), cs-pos/DET (15; 0% instances), cs-pos/X (12; 0% instances), cs-pos/PRON (6; 0% instances), cs-pos/PART (3; 0% instances).
PROPN
13042 cs-pos/PROPN tokens (16% of all PROPN
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which PROPN
and Abbr
co-occurred: Negative=Pos (13042; 100%), Case=EMPTY (13010; 100%), Number=EMPTY (12219; 94%), Animacy=EMPTY (9687; 74%), Gender=Fem (6911; 53%), NameType=Com (6803; 52%).
PROPN
tokens may have the following values of Abbr
:
Yes
(13042; 100% of non-emptyAbbr
): ČR, LN, ODS, J, OSN, ODA, M, ČSFR, V, A
Abbr
seems to be lexical feature of PROPN
. 100% lemmas (1236) occur only with one value of Abbr
.
NOUN
5768 cs-pos/NOUN tokens (2% of all NOUN
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which NOUN
and Abbr
co-occurred: Negative=Pos (5768; 100%), Case=EMPTY (5608; 97%), Number=EMPTY (5538; 96%), Gender=Masc (3042; 53%).
NOUN
tokens may have the following values of Abbr
:
Yes
(5768; 100% of non-emptyAbbr
): r, s, tel, m, č, km, MS, mil, Kčs, cm
Abbr
seems to be lexical feature of NOUN
. 100% lemmas (490) occur only with one value of Abbr
.
ADJ
1714 cs-pos/ADJ tokens (1% of all ADJ
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which ADJ
and Abbr
co-occurred: Negative=Pos (1714; 100%), Animacy=EMPTY (1713; 100%), Degree=Pos (1705; 99%), Number=EMPTY (1601; 93%), Case=EMPTY (1601; 93%), Gender=EMPTY (1598; 93%).
ADJ
tokens may have the following values of Abbr
:
Yes
(1714; 100% of non-emptyAbbr
): tzv, a, čs, o, sv, RM, US, Č, n, k
Abbr
seems to be lexical feature of ADJ
. 100% lemmas (185) occur only with one value of Abbr
.
ADV
956 cs-pos/ADV tokens (1% of all ADV
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which ADV
and Abbr
co-occurred: Degree=EMPTY (956; 100%), Negative=EMPTY (956; 100%).
ADV
tokens may have the following values of Abbr
:
Yes
(956; 100% of non-emptyAbbr
): např, mj, apod, atd, resp, atp, popř, cca, ap, kupř
Abbr
seems to be lexical feature of ADV
. 100% lemmas (22) occur only with one value of Abbr
.
CONJ
182 cs-pos/CONJ tokens (0% of all CONJ
tokens) have a non-empty value of Abbr
.
CONJ
tokens may have the following values of Abbr
:
Yes
(182; 100% of non-emptyAbbr
): tj, n
ADP
23 cs-pos/ADP tokens (0% of all ADP
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which ADP
and Abbr
co-occurred: AdpType=Prep (23; 100%), Case=Ins (16; 70%).
ADP
tokens may have the following values of Abbr
:
Yes
(23; 100% of non-emptyAbbr
): n, v, př, P, m, vč
VERB
22 cs-pos/VERB tokens (0% of all VERB
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which VERB
and Abbr
co-occurred: Negative=Pos (22; 100%), Gender=EMPTY (22; 100%), Number=Sing (22; 100%), VerbForm=Fin (22; 100%), Mood=Ind (17; 77%), Person=3 (17; 77%), Tense=Pres (17; 77%), Voice=Act (17; 77%).
VERB
tokens may have the following values of Abbr
:
Yes
(22; 100% of non-emptyAbbr
): tzn, j, srov
DET
15 cs-pos/DET tokens (0% of all DET
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which DET
and Abbr
co-occurred: Reflex=EMPTY (15; 100%), Gender[psor]=EMPTY (15; 100%), Number=EMPTY (9; 60%), Poss=EMPTY (9; 60%), Person=EMPTY (9; 60%), Case=EMPTY (9; 60%), Gender=EMPTY (9; 60%), Number[psor]=EMPTY (9; 60%), PronType=Dem (9; 60%).
DET
tokens may have the following values of Abbr
:
Yes
(15; 100% of non-emptyAbbr
): t, n
X
12 cs-pos/X tokens (92% of all X
tokens) have a non-empty value of Abbr
.
X
tokens may have the following values of Abbr
:
Yes
(12; 100% of non-emptyAbbr
): A, H, M, S
PRON
6 cs-pos/PRON tokens (0% of all PRON
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which PRON
and Abbr
co-occurred: Reflex=EMPTY (6; 100%), Person=EMPTY (6; 100%), Variant=EMPTY (6; 100%), Gender=Neut (4; 67%), Case=Nom (4; 67%), Number=Sing (4; 67%), PronType=Dem (4; 67%).
PRON
tokens may have the following values of Abbr
:
Yes
(6; 100% of non-emptyAbbr
): t, mn, vš
PART
3 cs-pos/PART tokens (0% of all PART
tokens) have a non-empty value of Abbr
.
PART
tokens may have the following values of Abbr
:
Yes
(3; 100% of non-emptyAbbr
): CA
Relations with Agreement in Abbr
The 10 most frequent relations where parent and child node agree in Abbr
:
PROPN –[conj]–> PROPN (730; 66%),
ADJ –[amod]–> ADJ (48; 77%),
NOUN –[det]–> DET (15; 54%),
X –[nmod]–> X (9; 100%),
PROPN –[nsubj]–> PROPN (3; 100%),
NOUN –[foreign]–> ADV (2; 100%),
PART –[conj]–> NOUN (2; 100%),
ADP –[dep]–> NOUN (1; 100%),
PRON –[amod]–> ADJ (1; 100%),
ADJ –[case]–> ADJ (1; 100%).
Treebank Statistics (UD_Czech-CAC)
This feature is language-specific.
It occurs with 1 different values: Yes
.
6663 tokens (1%) have a non-empty value of Abbr
.
452 types (1%) occur at least once with a non-empty value of Abbr
.
446 lemmas (2%) occur at least once with a non-empty value of Abbr
.
The feature is used with 6 part-of-speech tags: cs-pos/SYM (3783; 1% instances), cs-pos/PROPN (1878; 0% instances), cs-pos/NOUN (982; 0% instances), cs-pos/ADV (10; 0% instances), cs-pos/ADJ (9; 0% instances), cs-pos/PUNCT (1; 0% instances).
SYM
3783 cs-pos/SYM tokens (100% of all SYM
tokens) have a non-empty value of Abbr
.
SYM
tokens may have the following values of Abbr
:
Yes
(3783; 100% of non-emptyAbbr
): *
PROPN
1878 cs-pos/PROPN tokens (19% of all PROPN
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which PROPN
and Abbr
co-occurred: Negative=Pos (1878; 100%), Case=EMPTY (1873; 100%), Number=EMPTY (1865; 99%), NameType=Com (1460; 78%), Animacy=EMPTY (1271; 68%), Gender=Fem (964; 51%).
PROPN
tokens may have the following values of Abbr
:
Yes
(1878; 100% of non-emptyAbbr
): KSČ, ROH, SSSR, ÚJČ, SSM, ČSAV, ČSSR, ČSR, TIBA, NDR
Abbr
seems to be lexical feature of PROPN
. 100% lemmas (287) occur only with one value of Abbr
.
NOUN
982 cs-pos/NOUN tokens (1% of all NOUN
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which NOUN
and Abbr
co-occurred: Negative=Pos (982; 100%), Case=EMPTY (981; 100%), Number=EMPTY (977; 99%), Animacy=EMPTY (510; 52%).
NOUN
tokens may have the following values of Abbr
:
Yes
(982; 100% of non-emptyAbbr
): ÚV, ZV, ZO, JZD, Kčs, ONV, ÚR, MěstNV, BSP, BP
Abbr
seems to be lexical feature of NOUN
. 100% lemmas (160) occur only with one value of Abbr
.
ADV
10 cs-pos/ADV tokens (0% of all ADV
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which ADV
and Abbr
co-occurred: Degree=Pos (10; 100%), Negative=Pos (10; 100%).
ADV
tokens may have the following values of Abbr
:
Yes
(10; 100% of non-emptyAbbr
): kt
ADJ
9 cs-pos/ADJ tokens (0% of all ADJ
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which ADJ
and Abbr
co-occurred: Gender=EMPTY (9; 100%), Degree=Pos (9; 100%), Case=EMPTY (9; 100%), Number=EMPTY (9; 100%), Negative=Pos (9; 100%), Animacy=EMPTY (9; 100%).
ADJ
tokens may have the following values of Abbr
:
Yes
(9; 100% of non-emptyAbbr
): TH, jč, HT, LP, PE, Rh
PUNCT
1 cs-pos/PUNCT tokens (0% of all PUNCT
tokens) have a non-empty value of Abbr
.
PUNCT
tokens may have the following values of Abbr
:
Yes
(1; 100% of non-emptyAbbr
): ?
Relations with Agreement in Abbr
The 10 most frequent relations where parent and child node agree in Abbr
:
SYM –[conj]–> SYM (125; 100%),
PROPN –[conj]–> PROPN (76; 72%),
SYM –[nmod]–> SYM (29; 100%),
PROPN –[nmod]–> NOUN (25; 53%),
SYM –[case]–> SYM (9; 100%),
SYM –[appos]–> SYM (3; 100%),
SYM –[advmod]–> SYM (3; 100%),
SYM –[nsubj]–> SYM (2; 100%),
SYM –[dep]–> SYM (2; 100%),
SYM –[dobj]–> SYM (1; 100%).
Treebank Statistics (UD_Czech-CLTT)
This feature is language-specific.
It occurs with 1 different values: Yes
.
35 tokens (0%) have a non-empty value of Abbr
.
8 types (0%) occur at least once with a non-empty value of Abbr
.
10 lemmas (0%) occur at least once with a non-empty value of Abbr
.
The feature is used with 4 part-of-speech tags: cs-pos/NOUN (27; 0% instances), cs-pos/ADJ (4; 0% instances), cs-pos/ADV (3; 0% instances), cs-pos/PRON (1; 0% instances).
NOUN
27 cs-pos/NOUN tokens (0% of all NOUN
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which NOUN
and Abbr
co-occurred: Negative=Pos (27; 100%), Number=EMPTY (27; 100%), Case=EMPTY (27; 100%), Gender=Fem (20; 74%), Animacy=EMPTY (20; 74%).
NOUN
tokens may have the following values of Abbr
:
Yes
(27; 100% of non-emptyAbbr
): Kč, USD, m, m2, ha, t, ČSSR
ADJ
4 cs-pos/ADJ tokens (0% of all ADJ
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which ADJ
and Abbr
co-occurred: Number=EMPTY (4; 100%), Negative=Pos (4; 100%), Animacy=EMPTY (4; 100%), Gender=EMPTY (4; 100%), Degree=Pos (4; 100%), Case=EMPTY (4; 100%).
ADJ
tokens may have the following values of Abbr
:
Yes
(4; 100% of non-emptyAbbr
): něm
ADV
3 cs-pos/ADV tokens (0% of all ADV
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which ADV
and Abbr
co-occurred: Degree=Pos (3; 100%), Negative=Pos (3; 100%).
ADV
tokens may have the following values of Abbr
:
Yes
(3; 100% of non-emptyAbbr
): něm
PRON
1 cs-pos/PRON tokens (0% of all PRON
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which PRON
and Abbr
co-occurred: Reflex=EMPTY (1; 100%), Case=EMPTY (1; 100%), Gender=EMPTY (1; 100%), PronType=Dem (1; 100%), Number=EMPTY (1; 100%), Variant=EMPTY (1; 100%).
PRON
tokens may have the following values of Abbr
:
Yes
(1; 100% of non-emptyAbbr
): t
Relations with Agreement in Abbr
The 10 most frequent relations where parent and child node agree in Abbr
:
PRON –[conj]–> NOUN (1; 100%).