Treebank Statistics: UD_Czech-PDTC: Features: Abbr
This feature is universal.
It occurs with 1 different values: Yes.
35363 tokens (1%) have a non-empty value of Abbr.
2391 types (1%) occur at least once with a non-empty value of Abbr.
2417 lemmas (3%) occur at least once with a non-empty value of Abbr.
The feature is used with 10 part-of-speech tags: NOUN (25235; 1% instances), PROPN (7415; 0% instances), ADJ (1150; 0% instances), ADV (541; 0% instances), PART (458; 0% instances), NUM (292; 0% instances), CCONJ (221; 0% instances), ADP (32; 0% instances), DET (14; 0% instances), VERB (5; 0% instances).
NOUN
25235 NOUN tokens (3% of all NOUN tokens) have a non-empty value of Abbr.
The most frequent other feature values with which NOUN and Abbr co-occurred: Case=EMPTY (25113; 100%), Number=EMPTY (25113; 100%), Animacy=EMPTY (23518; 93%), Gender=EMPTY (22219; 88%).
NOUN tokens may have the following values of Abbr:
Yes(25235; 100% of non-emptyAbbr): a, p, s, j, Kč, m, r, b, d, cEMPTY(757468): společnosti, společnost, dolarů, roce, roku, let, akcií, trhu, firmy, rok
Abbr seems to be lexical feature of NOUN. 100% lemmas (1993) occur only with one value of Abbr.
PROPN
7415 PROPN tokens (6% of all PROPN tokens) have a non-empty value of Abbr.
The most frequent other feature values with which PROPN and Abbr co-occurred: Case=EMPTY (7408; 100%), Number=EMPTY (7408; 100%), Animacy=EMPTY (7292; 98%), Gender=EMPTY (7159; 97%).
PROPN tokens may have the following values of Abbr:
Yes(7415; 100% of non-emptyAbbr): ČR, USA, LN, ODS, OSN, ČSFR, SR, NATO, ČSSD, ČTEMPTY(123231): Praha, Praze, Prahy, Yorku, Jiří, Evropě, Plzni, Jan, John, Evropy
Abbr seems to be lexical feature of PROPN. 100% lemmas (360) occur only with one value of Abbr.
ADJ
1150 ADJ tokens (0% of all ADJ tokens) have a non-empty value of Abbr.
The most frequent other feature values with which ADJ and Abbr co-occurred: Animacy=EMPTY (1150; 100%), Polarity=Pos (1150; 100%), VerbForm=EMPTY (1150; 100%), Voice=EMPTY (1150; 100%), Degree=Pos (1103; 96%), Case=EMPTY (1035; 90%), Number=EMPTY (1035; 90%), Gender=EMPTY (1033; 90%).
ADJ tokens may have the following values of Abbr:
Yes(1150; 100% of non-emptyAbbr): tzv, a, čs, o, sv, ml, aj, Č, nar, hlEMPTY(359887): další, první, nové, poslední, české, velké, dalších, cenných, obchodní, hlavní
Abbr seems to be lexical feature of ADJ. 100% lemmas (51) occur only with one value of Abbr.
ADV
541 ADV tokens (0% of all ADV tokens) have a non-empty value of Abbr.
The most frequent other feature values with which ADV and Abbr co-occurred: PronType=EMPTY (541; 100%), Degree=EMPTY (529; 98%), Polarity=EMPTY (529; 98%).
ADV tokens may have the following values of Abbr:
Yes(541; 100% of non-emptyAbbr): mj, apod, atd, resp, atp, popř, ap, tzv, kupř, popřípEMPTY(164652): tam, už, tak, jak, kde, pak, kdy, více, ještě, včera
Abbr seems to be lexical feature of ADV. 100% lemmas (17) occur only with one value of Abbr.
PART
458 PART tokens (1% of all PART tokens) have a non-empty value of Abbr.
PART tokens may have the following values of Abbr:
Yes(458; 100% of non-emptyAbbr): např, cca, zejmEMPTY(64332): i, tak, asi, také, ještě, jen, až, taky, ne, už
NUM
292 NUM tokens (0% of all NUM tokens) have a non-empty value of Abbr.
The most frequent other feature values with which NUM and Abbr co-occurred: Case=EMPTY (292; 100%), NumForm=EMPTY (292; 100%), NumType=Card (292; 100%), Number=EMPTY (292; 100%), Gender=Masc (202; 69%).
NUM tokens may have the following values of Abbr:
Yes(292; 100% of non-emptyAbbr): mil, mld, tisEMPTY(104164): 1, milionů, milionu, dva, tři, 2, jeden, miliardy, 3, 4
CCONJ
221 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Abbr.
CCONJ tokens may have the following values of Abbr:
Yes(221; 100% of non-emptyAbbr): tj, tznEMPTY(112411): a, ale, i, nebo, však, takže, či, až, proto, ani
ADP
32 ADP tokens (0% of all ADP tokens) have a non-empty value of Abbr.
The most frequent other feature values with which ADP and Abbr co-occurred: AdpType=Prep (30; 94%).
ADP tokens may have the following values of Abbr:
Yes(32; 100% of non-emptyAbbr): n, vs, v, př, m, včEMPTY(319981): v, na, z, o, s, do, za, ve, pro, k
DET
14 DET tokens (0% of all DET tokens) have a non-empty value of Abbr.
The most frequent other feature values with which DET and Abbr co-occurred: Animacy=EMPTY (14; 100%), Case=EMPTY (8; 57%), Gender=EMPTY (8; 57%), Number=EMPTY (8; 57%), Number[psor]=EMPTY (8; 57%), Person=EMPTY (8; 57%), Poss=EMPTY (8; 57%), PronType=Dem (8; 57%).
DET tokens may have the following values of Abbr:
Yes(14; 100% of non-emptyAbbr): t, nEMPTY(154592): to, které, který, která, jeho, své, jejich, tím, toho, této
VERB
5 VERB tokens (0% of all VERB tokens) have a non-empty value of Abbr.
The most frequent other feature values with which VERB and Abbr co-occurred: Animacy=EMPTY (5; 100%), Aspect=Perf (5; 100%), Gender=EMPTY (5; 100%), Mood=Imp (5; 100%), Number=Sing (5; 100%), Person=2 (5; 100%), Polarity=Pos (5; 100%), Tense=EMPTY (5; 100%), VerbForm=Fin (5; 100%), Voice=EMPTY (5; 100%).
VERB tokens may have the following values of Abbr:
Yes(5; 100% of non-emptyAbbr): srovEMPTY(319937): má, řekl, říká, měl, měli, měla, může, mají, uvedla, uvedl
Relations with Agreement in Abbr
The 10 most frequent relations where parent and child node agree in Abbr:
NOUN –[parataxis]–> ADJ (11; 73%),
ADP –[fixed]–> ADJ (2; 100%),
ADV –[cc]–> ADV (2; 100%).