Abbr: abbreviation
Boolean feature. Is this an abbreviation? Note that the abbreviated word typically belongs to a part of speech other than cs-pos/X.
Yes: it is an abbreviation
Examples
- Acronyms: ČR (Česká republika) “Czech Republic”, LN (Lidové noviny) (a newspaper), ODS (Občanská demokratická strana) “Civic Democratic Party”, OSN (Organizace spojených národů) “United Nations Organization”, ODA (Občanská demokratická aliance) “Civic Democratic Alliance”
- Initials: J, M, V, A, C
- Abbreviations: r. (rok) “year”, např. (například) “for example”, tzv. (takzvaný) “so-called”, a. s. (akciová společnost) “joint-stock company”, tel. (telefon) “phone”
Treebank Statistics (UD_Czech)
This feature is language-specific.
It occurs with 1 value: Yes.
21743 tokens (1%) have a non-empty value of Abbr.
1755 types (1%) occur at least once with a non-empty value of Abbr.
1812 lemmas (3%) occur at least once with a non-empty value of Abbr.
The feature is used with 11 part-of-speech tags: cs-pos/PROPN (13042; 1% instances), cs-pos/NOUN (5768; 0% instances), cs-pos/ADJ (1714; 0% instances), cs-pos/ADV (956; 0% instances), cs-pos/CONJ (182; 0% instances), cs-pos/ADP (23; 0% instances), cs-pos/VERB (22; 0% instances), cs-pos/DET (15; 0% instances), cs-pos/X (12; 0% instances), cs-pos/PRON (6; 0% instances), cs-pos/PART (3; 0% instances).
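Counts of this kind can be reproduced from a treebank file in CoNLL-U format. The following is a minimal sketch, not the tool that generated this page: the helper names `parse_feats` and `count_abbr_by_upos` are hypothetical, and the sample rows are an invented toy fragment rather than data from UD_Czech.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Abbr=Yes|Case=Nom' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def count_abbr_by_upos(conllu_text):
    """Return (tokens with a non-empty Abbr per UPOS, all tokens per UPOS)."""
    with_abbr, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank lines separate sentences, '#' lines are comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if "Abbr" in parse_feats(cols[5]):
            with_abbr[upos] += 1
    return with_abbr, total

# Invented toy rows (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).
ROWS = [
    ("1", "V", "v", "ADP", "_", "AdpType=Prep|Case=Loc", "2", "case", "_", "_"),
    ("2", "ČR", "ČR", "PROPN", "_", "Abbr=Yes|Negative=Pos", "0", "root", "_", "_"),
    ("3", "např", "např", "ADV", "_", "Abbr=Yes", "2", "advmod", "_", "_"),
    ("4", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

with_abbr, total = count_abbr_by_upos(SAMPLE)
for upos, n in with_abbr.most_common():
    print(f"{upos}: {n} of {total[upos]} tokens have a non-empty Abbr")
```

Dividing each count by the per-tag total gives the share of instances reported in parentheses above.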
PROPN
13042 cs-pos/PROPN tokens (16% of all PROPN tokens) have a non-empty value of Abbr.
The most frequent other feature values with which PROPN and Abbr co-occurred: Negative=Pos (13042; 100%), Case=EMPTY (13010; 100%), Number=EMPTY (12219; 94%), Animacy=EMPTY (9687; 74%), Gender=Fem (6911; 53%), NameType=Com (6803; 52%).
PROPN tokens may have the following values of Abbr:
Yes (13042; 100% of non-empty Abbr): ČR, LN, ODS, J, OSN, ODA, M, ČSFR, V, A
Abbr seems to be a lexical feature of PROPN. 100% of lemmas (1236) occur with only one value of Abbr.
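The "lexical feature" observation rests on a per-lemma check: collect the values of Abbr seen with each lemma and count the lemmas that have exactly one. The sketch below illustrates that idea under an assumption, namely that only non-empty values are collected; the page does not state the generator's exact definition. The function name `lemmas_with_single_value` and the sample tokens are hypothetical.

```python
from collections import defaultdict

def lemmas_with_single_value(tokens, upos, feature="Abbr"):
    """Count lemmas of one part of speech that occur with exactly one value.

    tokens: iterable of (lemma, upos, feats_dict) triples.
    Returns (lemmas with exactly one value, lemmas with any non-empty value).
    Assumption: tokens where the feature is empty are ignored.
    """
    values = defaultdict(set)
    for lemma, tok_upos, feats in tokens:
        if tok_upos == upos and feature in feats:
            values[lemma].add(feats[feature])
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

# Invented toy tokens, not taken from UD_Czech.
TOKENS = [
    ("ČR", "PROPN", {"Abbr": "Yes"}),
    ("ČR", "PROPN", {"Abbr": "Yes"}),
    ("ODS", "PROPN", {"Abbr": "Yes"}),
    ("Praha", "PROPN", {}),
    ("r", "NOUN", {"Abbr": "Yes"}),
]

single, seen = lemmas_with_single_value(TOKENS, "PROPN")
print(f"{single} of {seen} PROPN lemmas occur with only one value of Abbr")
```

Because Abbr has a single value (Yes) in this treebank, the check is satisfied by every lemma under this assumption, which is consistent with the 100% figures reported on this page.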
NOUN
5768 cs-pos/NOUN tokens (2% of all NOUN tokens) have a non-empty value of Abbr.
The most frequent other feature values with which NOUN and Abbr co-occurred: Negative=Pos (5768; 100%), Case=EMPTY (5608; 97%), Number=EMPTY (5538; 96%), Gender=Masc (3042; 53%).
NOUN tokens may have the following values of Abbr:
Yes (5768; 100% of non-empty Abbr): r, s, tel, m, č, km, MS, mil, Kčs, cm
Abbr seems to be a lexical feature of NOUN. 100% of lemmas (490) occur with only one value of Abbr.
ADJ
1714 cs-pos/ADJ tokens (1% of all ADJ tokens) have a non-empty value of Abbr.
The most frequent other feature values with which ADJ and Abbr co-occurred: Negative=Pos (1714; 100%), Animacy=EMPTY (1713; 100%), Degree=Pos (1705; 99%), Case=EMPTY (1601; 93%), Number=EMPTY (1601; 93%), Gender=EMPTY (1598; 93%).
ADJ tokens may have the following values of Abbr:
Yes (1714; 100% of non-empty Abbr): tzv, a, čs, o, sv, RM, US, Č, n, k
Abbr seems to be a lexical feature of ADJ. 100% of lemmas (185) occur with only one value of Abbr.
ADV
956 cs-pos/ADV tokens (1% of all ADV tokens) have a non-empty value of Abbr.
The most frequent other feature values with which ADV and Abbr co-occurred: Degree=EMPTY (956; 100%), Negative=EMPTY (956; 100%).
ADV tokens may have the following values of Abbr:
Yes (956; 100% of non-empty Abbr): např, mj, apod, atd, resp, atp, popř, cca, ap, kupř
Abbr seems to be a lexical feature of ADV. 100% of lemmas (22) occur with only one value of Abbr.
CONJ
182 cs-pos/CONJ tokens (0% of all CONJ tokens) have a non-empty value of Abbr.
CONJ tokens may have the following values of Abbr:
Yes (182; 100% of non-empty Abbr): tj, n
ADP
23 cs-pos/ADP tokens (0% of all ADP tokens) have a non-empty value of Abbr.
The most frequent other feature values with which ADP and Abbr co-occurred: AdpType=Prep (23; 100%), Case=Ins (16; 70%).
ADP tokens may have the following values of Abbr:
Yes (23; 100% of non-empty Abbr): n, v, př, P, m, vč
VERB
22 cs-pos/VERB tokens (0% of all VERB tokens) have a non-empty value of Abbr.
The most frequent other feature values with which VERB and Abbr co-occurred: Gender=EMPTY (22; 100%), Number=Sing (22; 100%), Negative=Pos (22; 100%), VerbForm=Fin (22; 100%), Person=3 (17; 77%), Voice=Act (17; 77%), Tense=Pres (17; 77%), Mood=Ind (17; 77%).
VERB tokens may have the following values of Abbr:
Yes (22; 100% of non-empty Abbr): tzn, j, srov
DET
15 cs-pos/DET tokens (0% of all DET tokens) have a non-empty value of Abbr.
The most frequent other feature values with which DET and Abbr co-occurred: Gender[psor]=EMPTY (15; 100%), Reflex=EMPTY (15; 100%), PronType=Dem (9; 60%), Gender=EMPTY (9; 60%), Poss=EMPTY (9; 60%), Number[psor]=EMPTY (9; 60%), Case=EMPTY (9; 60%), Person=EMPTY (9; 60%), Number=EMPTY (9; 60%).
DET tokens may have the following values of Abbr:
Yes (15; 100% of non-empty Abbr): t, n
X
12 cs-pos/X tokens (92% of all X tokens) have a non-empty value of Abbr.
X tokens may have the following values of Abbr:
Yes (12; 100% of non-empty Abbr): A, H, M, S
PRON
6 cs-pos/PRON tokens (0% of all PRON tokens) have a non-empty value of Abbr.
The most frequent other feature values with which PRON and Abbr co-occurred: Variant=EMPTY (6; 100%), Reflex=EMPTY (6; 100%), Person=EMPTY (6; 100%), PronType=Dem (4; 67%), Case=Nom (4; 67%), Number=Sing (4; 67%), Gender=Neut (4; 67%).
PRON tokens may have the following values of Abbr:
Yes (6; 100% of non-empty Abbr): t, mn, vš
PART
3 cs-pos/PART tokens (0% of all PART tokens) have a non-empty value of Abbr.
PART tokens may have the following values of Abbr:
Yes (3; 100% of non-empty Abbr): CA
Relations with Agreement in Abbr
The 10 most frequent relations where parent and child node agree in Abbr:
PROPN –[conj]–> PROPN (730; 66%),
ADJ –[amod]–> ADJ (48; 77%),
NOUN –[det]–> DET (15; 52%),
X –[nmod]–> X (9; 100%),
PROPN –[nsubj]–> PROPN (3; 100%),
NOUN –[foreign]–> ADV (2; 100%),
PART –[conj]–> NOUN (2; 100%),
ADP –[dep]–> NOUN (1; 100%),
ADJ –[case]–> ADJ (1; 100%),
PRON –[amod]–> ADJ (1; 100%).
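Agreement counts like those listed above can be gathered by walking the dependency edges of each sentence and comparing the feature on parent and child. The following is a rough sketch under stated assumptions: a sentence is represented as a list of dicts with hypothetical keys (`id`, `head`, `upos`, `deprel`, `feats`), an edge counts as agreeing when both nodes carry the same non-empty value, and the sample sentence is an invented toy annotation. Only raw counts are computed, since the page does not specify the denominator behind the percentages.

```python
from collections import Counter

def agreeing_relations(sentence, feature="Abbr"):
    """Count (parent UPOS, relation, child UPOS) triples that agree in `feature`.

    sentence: list of token dicts with keys id, head, upos, deprel, feats.
    Assumption: agreement means both nodes have the same non-empty value.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    counts = Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # head 0 is the artificial root, which has no features
        parent_value = parent["feats"].get(feature)
        child_value = child["feats"].get(feature)
        if parent_value is not None and parent_value == child_value:
            counts[(parent["upos"], child["deprel"], child["upos"])] += 1
    return counts

# Invented toy sentence "ODS a ODA" with the second conjunct attached to the first.
SENTENCE = [
    {"id": 1, "head": 0, "upos": "PROPN", "deprel": "root", "feats": {"Abbr": "Yes"}},
    {"id": 2, "head": 1, "upos": "CONJ", "deprel": "cc", "feats": {}},
    {"id": 3, "head": 1, "upos": "PROPN", "deprel": "conj", "feats": {"Abbr": "Yes"}},
]

for (parent_upos, deprel, child_upos), n in agreeing_relations(SENTENCE).most_common(10):
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({n})")
```

Summing the counters over all sentences of a treebank and taking the ten largest entries gives a ranking of the same shape as the list above.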