This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home fi/feat issue tracker

Abbr: abbreviation

Boolean feature marking a word as an abbreviation.

Note that UD Finnish does not differentiate between different types of shortened forms. In particular, there is no separate feature identifying acronyms, which are also marked with Abbr=Yes.

Yes: word is abbreviation

Note that there is no No value. If the word is not an abbreviation, the Abbr feature will not appear.

Examples


Treebank Statistics (UD_Finnish)

This feature is language-specific. It occurs with 1 different values: Yes.

906 tokens (1%) have a non-empty value of Abbr. 251 types (1%) occur at least once with a non-empty value of Abbr. 209 lemmas (1%) occur at least once with a non-empty value of Abbr. The feature is used with 7 part-of-speech tags: fi-pos/NOUN (556; 0% instances), fi-pos/PROPN (256; 0% instances), fi-pos/ADV (55; 0% instances), fi-pos/VERB (25; 0% instances), fi-pos/ADJ (11; 0% instances), fi-pos/NUM (2; 0% instances), fi-pos/INTJ (1; 0% instances).

NOUN

556 fi-pos/NOUN tokens (1% of all NOUN tokens) have a non-empty value of Abbr.

The most frequent other feature values with which NOUN and Abbr co-occurred: Number=Sing (518; 93%), Case=Nom (346; 62%).

NOUN tokens may have the following values of Abbr:

Abbr seems to be lexical feature of NOUN. 100% lemmas (152) occur only with one value of Abbr.

PROPN

256 fi-pos/PROPN tokens (2% of all PROPN tokens) have a non-empty value of Abbr.

The most frequent other feature values with which PROPN and Abbr co-occurred: Number=Sing (246; 96%), Case=Gen (164; 64%).

PROPN tokens may have the following values of Abbr:

Abbr seems to be lexical feature of PROPN. 100% lemmas (43) occur only with one value of Abbr.

ADV

55 fi-pos/ADV tokens (0% of all ADV tokens) have a non-empty value of Abbr.

ADV tokens may have the following values of Abbr:

Abbr seems to be lexical feature of ADV. 100% lemmas (11) occur only with one value of Abbr.

VERB

25 fi-pos/VERB tokens (0% of all VERB tokens) have a non-empty value of Abbr.

The most frequent other feature values with which VERB and Abbr co-occurred: PartForm=Past (25; 100%), InfForm=EMPTY (25; 100%), Tense=EMPTY (25; 100%), Number=Sing (25; 100%), Case=Nom (25; 100%), VerbForm=Part (25; 100%), Person=EMPTY (25; 100%), Degree=Pos (25; 100%), Mood=EMPTY (25; 100%), Voice=Act (24; 96%).

VERB tokens may have the following values of Abbr:

ADJ

11 fi-pos/ADJ tokens (0% of all ADJ tokens) have a non-empty value of Abbr.

The most frequent other feature values with which ADJ and Abbr co-occurred: Case=EMPTY (8; 73%), Number=EMPTY (8; 73%), Degree=EMPTY (8; 73%).

ADJ tokens may have the following values of Abbr:

NUM

2 fi-pos/NUM tokens (0% of all NUM tokens) have a non-empty value of Abbr.

The most frequent other feature values with which NUM and Abbr co-occurred: Number=Sing (2; 100%), NumType=Card (2; 100%).

NUM tokens may have the following values of Abbr:

INTJ

1 fi-pos/INTJ tokens (1% of all INTJ tokens) have a non-empty value of Abbr.

INTJ tokens may have the following values of Abbr:

Relations with Agreement in Abbr

The 10 most frequent relations where parent and child node agree in Abbr: NOUN –[name]–> NOUN (1; 100%), NOUN –[acl:relcl]–> NOUN (1; 100%), NOUN –[compound]–> NOUN (1; 100%).


Treebank Statistics (UD_Finnish-FTB)

This feature is language-specific. It occurs with 1 different values: Yes.

484 tokens (0%) have a non-empty value of Abbr. 256 types (1%) occur at least once with a non-empty value of Abbr. 190 lemmas (1%) occur at least once with a non-empty value of Abbr. The feature is used with 4 part-of-speech tags: fi-pos/PROPN (201; 0% instances), fi-pos/NOUN (196; 0% instances), fi-pos/PART (86; 0% instances), fi-pos/ADJ (1; 0% instances).

PROPN

201 fi-pos/PROPN tokens (3% of all PROPN tokens) have a non-empty value of Abbr.

The most frequent other feature values with which PROPN and Abbr co-occurred: Number=Sing (200; 100%).

PROPN tokens may have the following values of Abbr:

Abbr seems to be lexical feature of PROPN. 100% lemmas (104) occur only with one value of Abbr.

NOUN

196 fi-pos/NOUN tokens (1% of all NOUN tokens) have a non-empty value of Abbr.

The most frequent other feature values with which NOUN and Abbr co-occurred: Number=Sing (194; 99%), Case=Nom (125; 64%).

NOUN tokens may have the following values of Abbr:

Abbr seems to be lexical feature of NOUN. 100% lemmas (53) occur only with one value of Abbr.

PART

86 fi-pos/PART tokens (2% of all PART tokens) have a non-empty value of Abbr.

PART tokens may have the following values of Abbr:

Abbr seems to be lexical feature of PART. 100% lemmas (34) occur only with one value of Abbr.

ADJ

1 fi-pos/ADJ tokens (0% of all ADJ tokens) have a non-empty value of Abbr.

The most frequent other feature values with which ADJ and Abbr co-occurred: Case=EMPTY (1; 100%), Number=EMPTY (1; 100%).

ADJ tokens may have the following values of Abbr:

Relations with Agreement in Abbr

The 10 most frequent relations where parent and child node agree in Abbr: NOUN –[conj]–> NOUN (14; 100%), PART –[conj]–> NOUN (1; 100%).