Abbr

This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.


`Abbr`: abbreviation

Boolean feature. Is this an abbreviation? Note that the abbreviated word typically belongs to a part of speech other than cs-pos/X.

Yes: it is an abbreviation

Examples

Acronyms: ČR (Česká republika) “Czech Republic”, LN (Lidové noviny) (a newspaper), ODS (Občanská demokratická strana) “Civic Democratic Party”, OSN (Organizace spojených národů) “United Nations Organization”, ODA (Občanská demokratická aliance) “Civic Democratic Alliance”
Initials: J, M, V, A, C
Abbreviations: r. (rok) “year”, např. (například) “for example”, tzv. (takzvaný) “so-called”, a. s. (akciová společnost) “joint-stock company”, tel. (telefon) “phone”
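In a CoNLL-U file, the feature appears in the FEATS column as `Abbr=Yes`, joined to any other features with `|`. The following is a minimal sketch of how such a string could be checked. The helper names `parse_feats` and `is_abbreviation` are hypothetical, and the feature strings in the usage lines are illustrative rather than copied from the treebank.

```python
def parse_feats(feats: str) -> dict:
    """Parse a CoNLL-U FEATS string such as 'Abbr=Yes|Negative=Pos' into a dict.

    An underscore (or an empty string) means the token has no features.
    """
    if not feats or feats == "_":
        return {}
    # Split on the first '=' only, so layered names like Gender[psor] stay intact.
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def is_abbreviation(feats: str) -> bool:
    """Return True when the FEATS string carries Abbr=Yes."""
    return parse_feats(feats).get("Abbr") == "Yes"


# Illustrative feature strings (assumed, not taken from the data).
print(is_abbreviation("Abbr=Yes|Negative=Pos"))
print(is_abbreviation("Case=Nom|Number=Sing"))
```

Since `Yes` is the only value, a token that is not an abbreviation simply lacks the feature.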

Treebank Statistics (UD_Czech)

This feature is language-specific. It occurs with one value: Yes.

21743 tokens (1%) have a non-empty value of Abbr. 1755 types (1%) occur at least once with a non-empty value of Abbr. 1812 lemmas (3%) occur at least once with a non-empty value of Abbr. The feature is used with 11 part-of-speech tags: cs-pos/PROPN (13042; 1% of instances), cs-pos/NOUN (5768; 0% of instances), cs-pos/ADJ (1714; 0% of instances), cs-pos/ADV (956; 0% of instances), cs-pos/CONJ (182; 0% of instances), cs-pos/ADP (23; 0% of instances), cs-pos/VERB (22; 0% of instances), cs-pos/DET (15; 0% of instances), cs-pos/X (12; 0% of instances), cs-pos/PRON (6; 0% of instances), cs-pos/PART (3; 0% of instances).
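Per-tag counts of this kind can be approximated by scanning the ten tab-separated CoNLL-U columns and tallying the UPOS tag of every token whose FEATS column contains `Abbr=Yes`. The sketch below is not the script that produced the figures above. The function name `count_abbr_by_upos` is hypothetical, and the sample sentence is a toy example with assumed annotations.

```python
from collections import Counter


def count_abbr_by_upos(lines):
    """Count tokens carrying Abbr=Yes, grouped by UPOS tag.

    `lines` is any iterable of CoNLL-U lines (for example an open file).
    Comment lines, blank lines and multiword-token ranges are skipped.
    """
    counts = Counter()
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep only regular token lines: 10 columns and a plain integer ID.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos, feats = cols[3], cols[5]
        if "Abbr=Yes" in feats.split("|"):
            counts[upos] += 1
    return counts


# Toy input; the annotations here are assumptions made for illustration.
sample = [
    "# text = toy example",
    "\t".join(["1", "ČR", "ČR", "PROPN", "_", "Abbr=Yes|Negative=Pos", "0", "root", "_", "_"]),
    "\t".join(["2", "např", "například", "ADV", "_", "Abbr=Yes", "1", "dep", "_", "_"]),
    "\t".join(["3", "rok", "rok", "NOUN", "_", "Case=Nom", "1", "dep", "_", "_"]),
]
print(count_abbr_by_upos(sample).most_common())
```

For a real treebank, pass an open file object instead of `sample`.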

`PROPN`

13042 cs-pos/PROPN tokens (16% of all PROPN tokens) have a non-empty value of Abbr.

The most frequent other feature values with which PROPN and Abbr co-occurred: Negative=Pos (13042; 100%), Case=EMPTY (13010; 100%), Number=EMPTY (12219; 94%), Animacy=EMPTY (9687; 74%), Gender=Fem (6911; 53%), NameType=Com (6803; 52%).

PROPN tokens may have the following values of Abbr:

Yes (13042; 100% of non-empty Abbr): ČR, LN, ODS, J, OSN, ODA, M, ČSFR, V, A

Abbr seems to be a lexical feature of PROPN. 100% of lemmas (1236) occur with only one value of Abbr.
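A claim of the form "lemmas occur with only one value of Abbr" can be tested by collecting, for each lemma of a given tag, the set of Abbr values it is seen with. The sketch below shows one plausible reading of that test. It treats a missing feature as the distinct value `EMPTY` and only considers lemmas that carry Abbr at least once. How the original statistics handle empty values is not stated here, so this is an assumption, as are the function name `lemmas_with_single_abbr_value` and the toy data.

```python
from collections import defaultdict


def lemmas_with_single_abbr_value(tokens, upos):
    """Return (single, marked) for one UPOS tag.

    `tokens` is an iterable of (lemma, upos, feats) triples.
    `marked` counts lemmas seen at least once with a non-empty Abbr;
    `single` counts those among them seen with exactly one value overall.
    """
    values = defaultdict(set)
    for lemma, tag, feats in tokens:
        if tag != upos:
            continue
        abbr = "EMPTY"  # assumed stand-in for "feature absent"
        for pair in feats.split("|"):
            if pair.startswith("Abbr="):
                abbr = pair.split("=", 1)[1]
        values[lemma].add(abbr)
    marked = {lemma: vals for lemma, vals in values.items() if vals != {"EMPTY"}}
    single = [lemma for lemma, vals in marked.items() if len(vals) == 1]
    return len(single), len(marked)


# Toy data (assumed): 'V' appears both with and without Abbr.
toy = [
    ("ČR", "PROPN", "Abbr=Yes"),
    ("ČR", "PROPN", "Abbr=Yes|Negative=Pos"),
    ("Praha", "PROPN", "Case=Nom"),
    ("V", "PROPN", "Abbr=Yes"),
    ("V", "PROPN", "_"),
]
single, marked = lemmas_with_single_abbr_value(toy, "PROPN")
print(f"{single} of {marked} lemmas occur with only one value of Abbr")
```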

`NOUN`

5768 cs-pos/NOUN tokens (2% of all NOUN tokens) have a non-empty value of Abbr.

The most frequent other feature values with which NOUN and Abbr co-occurred: Negative=Pos (5768; 100%), Case=EMPTY (5608; 97%), Number=EMPTY (5538; 96%), Gender=Masc (3042; 53%).

NOUN tokens may have the following values of Abbr:

Yes (5768; 100% of non-empty Abbr): r, s, tel, m, č, km, MS, mil, Kčs, cm

Abbr seems to be a lexical feature of NOUN. 100% of lemmas (490) occur with only one value of Abbr.

`ADJ`

1714 cs-pos/ADJ tokens (1% of all ADJ tokens) have a non-empty value of Abbr.

The most frequent other feature values with which ADJ and Abbr co-occurred: Negative=Pos (1714; 100%), Animacy=EMPTY (1713; 100%), Degree=Pos (1705; 99%), Number=EMPTY (1601; 93%), Case=EMPTY (1601; 93%), Gender=EMPTY (1598; 93%).

ADJ tokens may have the following values of Abbr:

Yes (1714; 100% of non-empty Abbr): tzv, a, čs, o, sv, RM, US, Č, n, k

Abbr seems to be a lexical feature of ADJ. 100% of lemmas (185) occur with only one value of Abbr.

`ADV`

956 cs-pos/ADV tokens (1% of all ADV tokens) have a non-empty value of Abbr.

The most frequent other feature values with which ADV and Abbr co-occurred: Degree=EMPTY (956; 100%), Negative=EMPTY (956; 100%).

ADV tokens may have the following values of Abbr:

Yes (956; 100% of non-empty Abbr): např, mj, apod, atd, resp, atp, popř, cca, ap, kupř

Abbr seems to be a lexical feature of ADV. 100% of lemmas (22) occur with only one value of Abbr.

`CONJ`

182 cs-pos/CONJ tokens (0% of all CONJ tokens) have a non-empty value of Abbr.

CONJ tokens may have the following values of Abbr:

Yes (182; 100% of non-empty Abbr): tj, n

`ADP`

23 cs-pos/ADP tokens (0% of all ADP tokens) have a non-empty value of Abbr.

The most frequent other feature values with which ADP and Abbr co-occurred: AdpType=Prep (23; 100%), Case=Ins (16; 70%).

ADP tokens may have the following values of Abbr:

Yes (23; 100% of non-empty Abbr): n, v, př, P, m, vč

`VERB`

22 cs-pos/VERB tokens (0% of all VERB tokens) have a non-empty value of Abbr.

The most frequent other feature values with which VERB and Abbr co-occurred: Negative=Pos (22; 100%), Gender=EMPTY (22; 100%), Number=Sing (22; 100%), VerbForm=Fin (22; 100%), Mood=Ind (17; 77%), Person=3 (17; 77%), Tense=Pres (17; 77%), Voice=Act (17; 77%).

VERB tokens may have the following values of Abbr:

Yes (22; 100% of non-empty Abbr): tzn, j, srov

`DET`

15 cs-pos/DET tokens (0% of all DET tokens) have a non-empty value of Abbr.

The most frequent other feature values with which DET and Abbr co-occurred: Reflex=EMPTY (15; 100%), Gender[psor]=EMPTY (15; 100%), Number=EMPTY (9; 60%), Poss=EMPTY (9; 60%), Person=EMPTY (9; 60%), Case=EMPTY (9; 60%), Gender=EMPTY (9; 60%), Number[psor]=EMPTY (9; 60%), PronType=Dem (9; 60%).

DET tokens may have the following values of Abbr:

Yes (15; 100% of non-empty Abbr): t, n

`X`

12 cs-pos/X tokens (92% of all X tokens) have a non-empty value of Abbr.

X tokens may have the following values of Abbr:

Yes (12; 100% of non-empty Abbr): A, H, M, S

`PRON`

6 cs-pos/PRON tokens (0% of all PRON tokens) have a non-empty value of Abbr.

The most frequent other feature values with which PRON and Abbr co-occurred: Reflex=EMPTY (6; 100%), Person=EMPTY (6; 100%), Variant=EMPTY (6; 100%), Gender=Neut (4; 67%), Case=Nom (4; 67%), Number=Sing (4; 67%), PronType=Dem (4; 67%).

PRON tokens may have the following values of Abbr:

Yes (6; 100% of non-empty Abbr): t, mn, vš

`PART`

3 cs-pos/PART tokens (0% of all PART tokens) have a non-empty value of Abbr.

PART tokens may have the following values of Abbr:

Yes (3; 100% of non-empty Abbr): CA

Relations with Agreement in `Abbr`

The 10 most frequent relations where parent and child node agree in Abbr: PROPN –[conj]–> PROPN (730; 66%), ADJ –[amod]–> ADJ (48; 77%), NOUN –[det]–> DET (15; 54%), X –[nmod]–> X (9; 100%), PROPN –[nsubj]–> PROPN (3; 100%), NOUN –[foreign]–> ADV (2; 100%), PART –[conj]–> NOUN (2; 100%), ADP –[dep]–> NOUN (1; 100%), PRON –[amod]–> ADJ (1; 100%), ADJ –[case]–> ADJ (1; 100%).
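Agreement counts like these can be sketched by walking each dependency edge and comparing the Abbr value of the parent with that of the child. The sketch below assumes that "agree" means both nodes carry the same non-empty Abbr value, which the page does not spell out. The names `abbr_value` and `agreement_relations`, and the toy sentence, are hypothetical. Percentages are not computed, because the denominator used above is not stated.

```python
from collections import Counter


def abbr_value(feats):
    """Return the value of Abbr in a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        if pair.startswith("Abbr="):
            return pair.split("=", 1)[1]
    return None


def agreement_relations(sentence):
    """Count (parent UPOS, deprel, child UPOS) edges whose nodes agree in Abbr.

    `sentence` is a list of (id, upos, feats, head, deprel) tuples with
    integer ids and heads; head 0 marks the root and has no parent.
    """
    by_id = {tok[0]: tok for tok in sentence}
    counts = Counter()
    for _tok_id, upos, feats, head, deprel in sentence:
        if head == 0:
            continue
        parent = by_id[head]
        child_val = abbr_value(feats)
        # Assumed definition: both values non-empty and equal.
        if child_val is not None and child_val == abbr_value(parent[2]):
            counts[(parent[1], deprel, upos)] += 1
    return counts


# Toy sentence (assumed annotations): two coordinated abbreviated names.
toy_sentence = [
    (1, "PROPN", "Abbr=Yes", 0, "root"),
    (2, "PROPN", "Abbr=Yes", 1, "conj"),
    (3, "NOUN", "Case=Nom", 1, "nmod"),
]
for (parent, rel, child), n in agreement_relations(toy_sentence).items():
    print(f"{parent} -[{rel}]-> {child}: {n}")
```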

Treebank Statistics (UD_Czech-CAC)

This feature is language-specific. It occurs with one value: Yes.

6663 tokens (1%) have a non-empty value of Abbr. 452 types (1%) occur at least once with a non-empty value of Abbr. 446 lemmas (2%) occur at least once with a non-empty value of Abbr. The feature is used with 6 part-of-speech tags: cs-pos/SYM (3783; 1% of instances), cs-pos/PROPN (1878; 0% of instances), cs-pos/NOUN (982; 0% of instances), cs-pos/ADV (10; 0% of instances), cs-pos/ADJ (9; 0% of instances), cs-pos/PUNCT (1; 0% of instances).

`SYM`

3783 cs-pos/SYM tokens (100% of all SYM tokens) have a non-empty value of Abbr.

SYM tokens may have the following values of Abbr:

Yes (3783; 100% of non-empty Abbr): *

`PROPN`

1878 cs-pos/PROPN tokens (19% of all PROPN tokens) have a non-empty value of Abbr.

The most frequent other feature values with which PROPN and Abbr co-occurred: Negative=Pos (1878; 100%), Case=EMPTY (1873; 100%), Number=EMPTY (1865; 99%), NameType=Com (1460; 78%), Animacy=EMPTY (1271; 68%), Gender=Fem (964; 51%).

PROPN tokens may have the following values of Abbr:

Yes (1878; 100% of non-empty Abbr): KSČ, ROH, SSSR, ÚJČ, SSM, ČSAV, ČSSR, ČSR, TIBA, NDR

Abbr seems to be a lexical feature of PROPN. 100% of lemmas (287) occur with only one value of Abbr.

`NOUN`

982 cs-pos/NOUN tokens (1% of all NOUN tokens) have a non-empty value of Abbr.

The most frequent other feature values with which NOUN and Abbr co-occurred: Negative=Pos (982; 100%), Case=EMPTY (981; 100%), Number=EMPTY (977; 99%), Animacy=EMPTY (510; 52%).

NOUN tokens may have the following values of Abbr:

Yes (982; 100% of non-empty Abbr): ÚV, ZV, ZO, JZD, Kčs, ONV, ÚR, MěstNV, BSP, BP

Abbr seems to be a lexical feature of NOUN. 100% of lemmas (160) occur with only one value of Abbr.

`ADV`

10 cs-pos/ADV tokens (0% of all ADV tokens) have a non-empty value of Abbr.

The most frequent other feature values with which ADV and Abbr co-occurred: Degree=Pos (10; 100%), Negative=Pos (10; 100%).

ADV tokens may have the following values of Abbr:

Yes (10; 100% of non-empty Abbr): kt

`ADJ`

9 cs-pos/ADJ tokens (0% of all ADJ tokens) have a non-empty value of Abbr.

The most frequent other feature values with which ADJ and Abbr co-occurred: Gender=EMPTY (9; 100%), Degree=Pos (9; 100%), Case=EMPTY (9; 100%), Number=EMPTY (9; 100%), Negative=Pos (9; 100%), Animacy=EMPTY (9; 100%).

ADJ tokens may have the following values of Abbr:

Yes (9; 100% of non-empty Abbr): TH, jč, HT, LP, PE, Rh

`PUNCT`

1 cs-pos/PUNCT token (0% of all PUNCT tokens) has a non-empty value of Abbr.

PUNCT tokens may have the following values of Abbr:

Yes (1; 100% of non-empty Abbr): ?

Relations with Agreement in `Abbr`

The 10 most frequent relations where parent and child node agree in Abbr: SYM –[conj]–> SYM (125; 100%), PROPN –[conj]–> PROPN (76; 72%), SYM –[nmod]–> SYM (29; 100%), PROPN –[nmod]–> NOUN (25; 53%), SYM –[case]–> SYM (9; 100%), SYM –[appos]–> SYM (3; 100%), SYM –[advmod]–> SYM (3; 100%), SYM –[nsubj]–> SYM (2; 100%), SYM –[dep]–> SYM (2; 100%), SYM –[dobj]–> SYM (1; 100%).

Treebank Statistics (UD_Czech-CLTT)

This feature is language-specific. It occurs with one value: Yes.

35 tokens (0%) have a non-empty value of Abbr. 8 types (0%) occur at least once with a non-empty value of Abbr. 10 lemmas (0%) occur at least once with a non-empty value of Abbr. The feature is used with 4 part-of-speech tags: cs-pos/NOUN (27; 0% of instances), cs-pos/ADJ (4; 0% of instances), cs-pos/ADV (3; 0% of instances), cs-pos/PRON (1; 0% of instances).

`NOUN`

27 cs-pos/NOUN tokens (0% of all NOUN tokens) have a non-empty value of Abbr.

The most frequent other feature values with which NOUN and Abbr co-occurred: Negative=Pos (27; 100%), Number=EMPTY (27; 100%), Case=EMPTY (27; 100%), Gender=Fem (20; 74%), Animacy=EMPTY (20; 74%).

NOUN tokens may have the following values of Abbr:

Yes (27; 100% of non-empty Abbr): Kč, USD, m, m2, ha, t, ČSSR

`ADJ`

4 cs-pos/ADJ tokens (0% of all ADJ tokens) have a non-empty value of Abbr.

The most frequent other feature values with which ADJ and Abbr co-occurred: Number=EMPTY (4; 100%), Negative=Pos (4; 100%), Animacy=EMPTY (4; 100%), Gender=EMPTY (4; 100%), Degree=Pos (4; 100%), Case=EMPTY (4; 100%).

ADJ tokens may have the following values of Abbr:

Yes (4; 100% of non-empty Abbr): něm

`ADV`

3 cs-pos/ADV tokens (0% of all ADV tokens) have a non-empty value of Abbr.

The most frequent other feature values with which ADV and Abbr co-occurred: Degree=Pos (3; 100%), Negative=Pos (3; 100%).

ADV tokens may have the following values of Abbr:

Yes (3; 100% of non-empty Abbr): něm

`PRON`

1 cs-pos/PRON token (0% of all PRON tokens) has a non-empty value of Abbr.

The most frequent other feature values with which PRON and Abbr co-occurred: Reflex=EMPTY (1; 100%), Case=EMPTY (1; 100%), Gender=EMPTY (1; 100%), PronType=Dem (1; 100%), Number=EMPTY (1; 100%), Variant=EMPTY (1; 100%).

PRON tokens may have the following values of Abbr:

Yes (1; 100% of non-empty Abbr): t

Relations with Agreement in `Abbr`

The 10 most frequent relations where parent and child node agree in Abbr: PRON –[conj]–> NOUN (1; 100%).
