Abbr: abbreviation
Boolean feature marking a word as an abbreviation.
Note that UD Finnish does not differentiate between different types of
shortened forms. In particular, there is no separate feature
identifying acronyms, which are also marked with Abbr=Yes.
Yes: word is abbreviation
Note that there is no No value. If the word is not an abbreviation,
the Abbr feature will not appear.
Examples
- [fi] mm. “among others”
- [fi] esim. “for example”
- [fi] USA, EU
Treebank Statistics (UD_Finnish)
This feature is language-specific.
It occurs with 1 different values: Yes.
906 tokens (1%) have a non-empty value of Abbr.
251 types (1%) occur at least once with a non-empty value of Abbr.
209 lemmas (1%) occur at least once with a non-empty value of Abbr.
The feature is used with 7 part-of-speech tags: fi-pos/NOUN (556; 0% instances), fi-pos/PROPN (256; 0% instances), fi-pos/ADV (55; 0% instances), fi-pos/VERB (25; 0% instances), fi-pos/ADJ (11; 0% instances), fi-pos/NUM (2; 0% instances), fi-pos/INTJ (1; 0% instances).
NOUN
556 fi-pos/NOUN tokens (1% of all NOUN tokens) have a non-empty value of Abbr.
The most frequent other feature values with which NOUN and Abbr co-occurred: Number=Sing (518; 93%), Case=Nom (346; 62%).
NOUN tokens may have the following values of Abbr:
Yes(556; 100% of non-emptyAbbr): N:o, EY, eaa., a, b, kpl, ETY, g, cm, A:n
Abbr seems to be lexical feature of NOUN. 100% lemmas (152) occur only with one value of Abbr.
PROPN
256 fi-pos/PROPN tokens (2% of all PROPN tokens) have a non-empty value of Abbr.
The most frequent other feature values with which PROPN and Abbr co-occurred: Number=Sing (246; 96%), Case=Gen (164; 64%).
PROPN tokens may have the following values of Abbr:
Yes(256; 100% of non-emptyAbbr): EU:n, EKP:n, EKP, SDP:n, YK:n, MTV3, EU, SDP, SDP:tä, EU:ssa
Abbr seems to be lexical feature of PROPN. 100% lemmas (43) occur only with one value of Abbr.
ADV
55 fi-pos/ADV tokens (0% of all ADV tokens) have a non-empty value of Abbr.
ADV tokens may have the following values of Abbr:
Yes(55; 100% of non-emptyAbbr): mm., esim., n., jne, oik., ns., esim, ym
Abbr seems to be lexical feature of ADV. 100% lemmas (11) occur only with one value of Abbr.
VERB
25 fi-pos/VERB tokens (0% of all VERB tokens) have a non-empty value of Abbr.
The most frequent other feature values with which VERB and Abbr co-occurred: PartForm=Past (25; 100%), InfForm=EMPTY (25; 100%), Tense=EMPTY (25; 100%), Number=Sing (25; 100%), Case=Nom (25; 100%), VerbForm=Part (25; 100%), Person=EMPTY (25; 100%), Degree=Pos (25; 100%), Mood=EMPTY (25; 100%), Voice=Act (24; 96%).
VERB tokens may have the following values of Abbr:
Yes(25; 100% of non-emptyAbbr): s., Em., k.
ADJ
11 fi-pos/ADJ tokens (0% of all ADJ tokens) have a non-empty value of Abbr.
The most frequent other feature values with which ADJ and Abbr co-occurred: Case=EMPTY (8; 73%), Number=EMPTY (8; 73%), Degree=EMPTY (8; 73%).
ADJ tokens may have the following values of Abbr:
Yes(11; 100% of non-emptyAbbr): ns., nk., ev.-lut.
NUM
2 fi-pos/NUM tokens (0% of all NUM tokens) have a non-empty value of Abbr.
The most frequent other feature values with which NUM and Abbr co-occurred: Number=Sing (2; 100%), NumType=Card (2; 100%).
NUM tokens may have the following values of Abbr:
Yes(2; 100% of non-emptyAbbr): milj., u18
INTJ
1 fi-pos/INTJ tokens (1% of all INTJ tokens) have a non-empty value of Abbr.
INTJ tokens may have the following values of Abbr:
Yes(1; 100% of non-emptyAbbr): Huom
Relations with Agreement in Abbr
The 10 most frequent relations where parent and child node agree in Abbr:
NOUN –[name]–> NOUN (1; 100%),
NOUN –[acl:relcl]–> NOUN (1; 100%),
NOUN –[compound]–> NOUN (1; 100%).
Treebank Statistics (UD_Finnish-FTB)
This feature is language-specific.
It occurs with 1 different values: Yes.
484 tokens (0%) have a non-empty value of Abbr.
256 types (1%) occur at least once with a non-empty value of Abbr.
190 lemmas (1%) occur at least once with a non-empty value of Abbr.
The feature is used with 4 part-of-speech tags: fi-pos/PROPN (201; 0% instances), fi-pos/NOUN (196; 0% instances), fi-pos/PART (86; 0% instances), fi-pos/ADJ (1; 0% instances).
PROPN
201 fi-pos/PROPN tokens (3% of all PROPN tokens) have a non-empty value of Abbr.
The most frequent other feature values with which PROPN and Abbr co-occurred: Number=Sing (200; 100%).
PROPN tokens may have the following values of Abbr:
Yes(201; 100% of non-emptyAbbr): EU:n, NN:n, EU, EU:hun, SAK, YK:n, SAK:n, SDP:n, TBK, USA:n
Abbr seems to be lexical feature of PROPN. 100% lemmas (104) occur only with one value of Abbr.
NOUN
196 fi-pos/NOUN tokens (1% of all NOUN tokens) have a non-empty value of Abbr.
The most frequent other feature values with which NOUN and Abbr co-occurred: Number=Sing (194; 99%), Case=Nom (125; 64%).
NOUN tokens may have the following values of Abbr:
Yes(196; 100% of non-emptyAbbr): x, x:n, y, klo, A, mk, A:n, Oy:n, B, B:n
Abbr seems to be lexical feature of NOUN. 100% lemmas (53) occur only with one value of Abbr.
PART
86 fi-pos/PART tokens (2% of all PART tokens) have a non-empty value of Abbr.
PART tokens may have the following values of Abbr:
Yes(86; 100% of non-emptyAbbr): mm., ns., esim., n., OK, jne, km/h, 70,00%, em., mk/kg
Abbr seems to be lexical feature of PART. 100% lemmas (34) occur only with one value of Abbr.
ADJ
1 fi-pos/ADJ tokens (0% of all ADJ tokens) have a non-empty value of Abbr.
The most frequent other feature values with which ADJ and Abbr co-occurred: Case=EMPTY (1; 100%), Number=EMPTY (1; 100%).
ADJ tokens may have the following values of Abbr:
Yes(1; 100% of non-emptyAbbr): huumorintaj.
Relations with Agreement in Abbr
The 10 most frequent relations where parent and child node agree in Abbr:
NOUN –[conj]–> NOUN (14; 100%),
PART –[conj]–> NOUN (1; 100%).