Treebank Statistics: UD_Estonian-EDT: Features: Abbr
This feature is universal.
It occurs with 1 different values: Yes
.
4077 tokens (1%) have a non-empty value of Abbr
.
1048 types (1%) occur at least once with a non-empty value of Abbr
.
814 lemmas (2%) occur at least once with a non-empty value of Abbr
.
The feature is used with 9 part-of-speech tags: PROPN (1731; 0% instances), NOUN (1518; 0% instances), ADV (397; 0% instances), X (134; 0% instances), ADJ (119; 0% instances), SYM (105; 0% instances), VERB (48; 0% instances), CCONJ (13; 0% instances), NUM (12; 0% instances).
PROPN
1731 PROPN tokens (7% of all PROPN
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which PROPN
and Abbr
co-occurred: Number=EMPTY (1358; 78%), Case=EMPTY (1357; 78%).
PROPN
tokens may have the following values of Abbr
:
Yes
(1731; 100% of non-emptyAbbr
): USA, AS, NATO, A., LRE, EL, N., HA, de, CDUEMPTY
(24740): eesti, Eestis, Euroopa, Tartu, Tallinna, Peeter, Maa, Vene, Jan, Venemaa
Abbr
seems to be lexical feature of PROPN
. 100% lemmas (389) occur only with one value of Abbr
.
NOUN
1518 NOUN tokens (1% of all NOUN
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which NOUN
and Abbr
co-occurred: Case=EMPTY (1387; 91%), Number=EMPTY (1387; 91%).
NOUN
tokens may have the following values of Abbr
:
Yes
(1518; 100% of non-emptyAbbr
): a., a, USB, p, VD, g, km, cm, m, krEMPTY
(113851): aasta, aastal, aastat, raha, osa, krooni, korda, ajal, sissetulekute, mõju
Abbr
seems to be lexical feature of NOUN
. 100% lemmas (349) occur only with one value of Abbr
.
ADV
397 ADV tokens (1% of all ADV
tokens) have a non-empty value of Abbr
.
ADV
tokens may have the following values of Abbr
:
Yes
(397; 100% of non-emptyAbbr
): jne, jt., jt, s.t, sh, n-ö, nt, st, jm, jmsEMPTY
(41717): ka, siis, nii, kas, juba, välja, aga, veel, väga, mitte
Abbr
seems to be lexical feature of ADV
. 100% lemmas (38) occur only with one value of Abbr
.
X
134 X tokens (11% of all X
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which X
and Abbr
co-occurred: Foreign=Yes (86; 64%).
X
tokens may have the following values of Abbr
:
Yes
(134; 100% of non-emptyAbbr
): al., of, in, a, b, n, to, x, AT, IEMPTY
(1034): 000, et, of, the, in, drive, 900, and, de, 600
Abbr
seems to be lexical feature of X
. 100% lemmas (28) occur only with one value of Abbr
.
ADJ
119 ADJ tokens (0% of all ADJ
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which ADJ
and Abbr
co-occurred: Tense=EMPTY (119; 100%), VerbForm=EMPTY (119; 100%), Voice=EMPTY (119; 100%), Degree=EMPTY (114; 96%), Case=EMPTY (113; 95%), Number=EMPTY (113; 95%).
ADJ
tokens may have the following values of Abbr
:
Yes
(119; 100% of non-emptyAbbr
): nn, nn., van, %-lise, 80’ndate, nim, nim., %-se, %-st, II-gaEMPTY
(36640): suur, hea, võimalik, eesti, suurem, uue, suure, raske, esimene, oluline
SYM
105 SYM tokens (14% of all SYM
tokens) have a non-empty value of Abbr
.
SYM
tokens may have the following values of Abbr
:
Yes
(105; 100% of non-emptyAbbr
): *, §, sulev@ekspress.ee, =, C18:2n-6, C18:3n-3, anne@ekspress.ee, s., ‘i, +EMPTY
(636): %, %-l, %-ni, &, =, ω-3-, &, %-lt, ?, 1-
Abbr
seems to be lexical feature of SYM
. 100% lemmas (80) occur only with one value of Abbr
.
VERB
48 VERB tokens (0% of all VERB
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which VERB
and Abbr
co-occurred: Mood=Imp (48; 100%), Number=EMPTY (48; 100%), Person=EMPTY (48; 100%), Tense=EMPTY (48; 100%), VerbForm=Fin (48; 100%), Voice=EMPTY (48; 100%).
VERB
tokens may have the following values of Abbr
:
Yes
(48; 100% of non-emptyAbbr
): vt, vt., vrdEMPTY
(47947): on, tuleb, teha, ütles, saada, sai, saanud, tuli, saab, jääb
CCONJ
13 CCONJ tokens (0% of all CCONJ
tokens) have a non-empty value of Abbr
.
CCONJ
tokens may have the following values of Abbr
:
Yes
(13; 100% of non-emptyAbbr
): &, e, e.EMPTY
(16074): ja, ning, või, aga, kuid, kui, ega, vaid, ehk, ent
NUM
12 NUM tokens (0% of all NUM
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which NUM
and Abbr
co-occurred: Case=EMPTY (12; 100%), NumForm=Word (12; 100%), NumType=Card (12; 100%), Number=EMPTY (12; 100%).
NUM
tokens may have the following values of Abbr
:
Yes
(12; 100% of non-emptyAbbr
): milj., mln, miljEMPTY
(9030): kaks, 1, üks, 10, 2, kolm, kahe, ühe, 3, miljonit
Relations with Agreement in Abbr
The 10 most frequent relations where parent and child node agree in Abbr
:
NOUN –[conj]–> NOUN (89; 63%),
PROPN –[flat]–> ADV (29; 85%),
NOUN –[fixed]–> NOUN (2; 100%),
ADV –[conj]–> ADV (1; 100%),
NOUN –[list]–> NOUN (1; 100%),
SYM –[conj]–> SYM (1; 100%),
SYM –[list]–> SYM (1; 100%),
SYM –[nmod]–> SYM (1; 100%),
X –[fixed]–> X (1; 100%).