Treebank Statistics: UD_Estonian-EDT: Features: Abbr
This feature is universal.
It occurs with 1 different values: Yes.
3963 tokens (1%) have a non-empty value of Abbr.
1045 types (1%) occur at least once with a non-empty value of Abbr.
811 lemmas (2%) occur at least once with a non-empty value of Abbr.
The feature is used with 9 part-of-speech tags: PROPN (1727; 0% instances), NOUN (1534; 0% instances), ADV (376; 0% instances), ADJ (119; 0% instances), SYM (105; 0% instances), VERB (47; 0% instances), X (30; 0% instances), CCONJ (13; 0% instances), NUM (12; 0% instances).
PROPN
1727 PROPN tokens (7% of all PROPN tokens) have a non-empty value of Abbr.
The most frequent other feature values with which PROPN and Abbr co-occurred: Number=EMPTY (1361; 79%), Case=EMPTY (1360; 79%).
PROPN tokens may have the following values of Abbr:
Yes(1727; 100% of non-emptyAbbr): USA, AS, NATO, A., LRE, EL, N., HA, ETV, deEMPTY(24554): eesti, Eestis, Euroopa, Tartu, Tallinna, Peeter, Maa, Vene, Jan, Venemaa
Abbr seems to be lexical feature of PROPN. 100% lemmas (386) occur only with one value of Abbr.
NOUN
1534 NOUN tokens (1% of all NOUN tokens) have a non-empty value of Abbr.
The most frequent other feature values with which NOUN and Abbr co-occurred: Case=EMPTY (1401; 91%), Number=EMPTY (1400; 91%).
NOUN tokens may have the following values of Abbr:
Yes(1534; 100% of non-emptyAbbr): a., a, USB, p, VD, g, km, cm, m, krEMPTY(114165): aasta, aastal, aastat, raha, osa, krooni, korda, ajal, sissetulekute, mõju
Abbr seems to be lexical feature of NOUN. 100% lemmas (355) occur only with one value of Abbr.
ADV
376 ADV tokens (1% of all ADV tokens) have a non-empty value of Abbr.
ADV tokens may have the following values of Abbr:
Yes(376; 100% of non-emptyAbbr): jne, jt., jt, s.t, sh, n-ö, nt, st, jm, jmsEMPTY(41721): ka, siis, nii, kas, juba, välja, aga, veel, väga, mitte
Abbr seems to be lexical feature of ADV. 100% lemmas (37) occur only with one value of Abbr.
ADJ
119 ADJ tokens (0% of all ADJ tokens) have a non-empty value of Abbr.
The most frequent other feature values with which ADJ and Abbr co-occurred: Tense=EMPTY (119; 100%), VerbForm=EMPTY (119; 100%), Voice=EMPTY (119; 100%), Degree=EMPTY (114; 96%), Case=EMPTY (113; 95%), Number=EMPTY (113; 95%).
ADJ tokens may have the following values of Abbr:
Yes(119; 100% of non-emptyAbbr): nn, nn., van, %-lise, 80’ndate, jm, nim, nim., %-se, %-stEMPTY(36738): suur, hea, võimalik, eesti, suurem, uue, suure, raske, esimene, oluline
Abbr seems to be lexical feature of ADJ. 100% lemmas (10) occur only with one value of Abbr.
SYM
105 SYM tokens (14% of all SYM tokens) have a non-empty value of Abbr.
SYM tokens may have the following values of Abbr:
Yes(105; 100% of non-emptyAbbr): *, §, sulev@ekspress.ee, =, C18:2n-6, C18:3n-3, anne@ekspress.ee, s., ‘i, +EMPTY(635): %, %-l, &, %-ni, =, ω-3-, %-lt, ?, 1-, &
Abbr seems to be lexical feature of SYM. 100% lemmas (80) occur only with one value of Abbr.
VERB
47 VERB tokens (0% of all VERB tokens) have a non-empty value of Abbr.
The most frequent other feature values with which VERB and Abbr co-occurred: Mood=Imp (47; 100%), Number=EMPTY (47; 100%), Person=EMPTY (47; 100%), Tense=EMPTY (47; 100%), VerbForm=Fin (47; 100%), Voice=EMPTY (47; 100%).
VERB tokens may have the following values of Abbr:
Yes(47; 100% of non-emptyAbbr): vt, vt., vrdEMPTY(47811): tuleb, on, teha, ütles, saada, sai, saanud, tuli, saab, jääb
X
30 X tokens (4% of all X tokens) have a non-empty value of Abbr.
The most frequent other feature values with which X and Abbr co-occurred: Foreign=EMPTY (30; 100%).
X tokens may have the following values of Abbr:
Yes(30; 100% of non-emptyAbbr): of, a, b, in, n, x, AT, NB, P., S.EMPTY(735): 000, al., et, 900, in, of, 500, 600, 700, ceteris
Abbr seems to be lexical feature of X. 100% lemmas (22) occur only with one value of Abbr.
CCONJ
13 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Abbr.
CCONJ tokens may have the following values of Abbr:
Yes(13; 100% of non-emptyAbbr): &, e, e.EMPTY(16079): ja, ning, või, aga, kuid, kui, ega, vaid, ehk, ent
NUM
12 NUM tokens (0% of all NUM tokens) have a non-empty value of Abbr.
The most frequent other feature values with which NUM and Abbr co-occurred: Case=EMPTY (12; 100%), NumForm=Word (12; 100%), NumType=Card (12; 100%), Number=EMPTY (12; 100%).
NUM tokens may have the following values of Abbr:
Yes(12; 100% of non-emptyAbbr): milj., mln, miljEMPTY(9009): kaks, 1, üks, 10, 2, kolm, ühe, kahe, 3, miljonit
Relations with Agreement in Abbr
The 10 most frequent relations where parent and child node agree in Abbr:
NOUN –[conj]–> NOUN (84; 58%),
PROPN –[flat]–> ADV (25; 86%),
NOUN –[flat]–> NOUN (13; 65%),
X –[flat]–> X (2; 100%),
ADV –[conj]–> ADV (1; 100%),
NOUN –[list]–> NOUN (1; 100%),
SYM –[conj]–> SYM (1; 100%),
SYM –[list]–> SYM (1; 100%),
SYM –[nmod]–> SYM (1; 100%).