Statistics of Abbr in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Estonian-EDT: Features: `Abbr`

This feature is universal. It occurs with 1 different values: Yes.

3963 tokens (1%) have a non-empty value of Abbr. 1045 types (1%) occur at least once with a non-empty value of Abbr. 811 lemmas (2%) occur at least once with a non-empty value of Abbr. The feature is used with 9 part-of-speech tags: PROPN (1727; 0% instances), NOUN (1534; 0% instances), ADV (376; 0% instances), ADJ (119; 0% instances), SYM (105; 0% instances), VERB (47; 0% instances), X (30; 0% instances), CCONJ (13; 0% instances), NUM (12; 0% instances).

`PROPN`

1727 PROPN tokens (7% of all PROPN tokens) have a non-empty value of Abbr.

The most frequent other feature values with which PROPN and Abbr co-occurred: Number=EMPTY (1361; 79%), Case=EMPTY (1360; 79%).

PROPN tokens may have the following values of Abbr:

Yes (1727; 100% of non-empty Abbr): USA, AS, NATO, A., LRE, EL, N., HA, ETV, de
EMPTY (24554): eesti, Eestis, Euroopa, Tartu, Tallinna, Peeter, Maa, Vene, Jan, Venemaa

Abbr seems to be lexical feature of PROPN. 100% lemmas (386) occur only with one value of Abbr.

`NOUN`

1534 NOUN tokens (1% of all NOUN tokens) have a non-empty value of Abbr.

The most frequent other feature values with which NOUN and Abbr co-occurred: Case=EMPTY (1401; 91%), Number=EMPTY (1400; 91%).

NOUN tokens may have the following values of Abbr:

Yes (1534; 100% of non-empty Abbr): a., a, USB, p, VD, g, km, cm, m, kr
EMPTY (114146): aasta, aastal, aastat, raha, osa, krooni, korda, ajal, sissetulekute, mõju

Abbr seems to be lexical feature of NOUN. 100% lemmas (355) occur only with one value of Abbr.

`ADV`

376 ADV tokens (1% of all ADV tokens) have a non-empty value of Abbr.

ADV tokens may have the following values of Abbr:

Yes (376; 100% of non-empty Abbr): jne, jt., jt, s.t, sh, n-ö, nt, st, jm, jms
EMPTY (41735): ka, siis, nii, kas, juba, välja, aga, veel, väga, mitte

Abbr seems to be lexical feature of ADV. 100% lemmas (37) occur only with one value of Abbr.

`ADJ`

119 ADJ tokens (0% of all ADJ tokens) have a non-empty value of Abbr.

The most frequent other feature values with which ADJ and Abbr co-occurred: Tense=EMPTY (119; 100%), VerbForm=EMPTY (119; 100%), Voice=EMPTY (119; 100%), Degree=EMPTY (114; 96%), Case=EMPTY (113; 95%), Number=EMPTY (113; 95%).

ADJ tokens may have the following values of Abbr:

Yes (119; 100% of non-empty Abbr): nn, nn., van, %-lise, 80’ndate, jm, nim, nim., %-se, %-st
EMPTY (36738): suur, hea, võimalik, eesti, suurem, uue, suure, raske, esimene, oluline

Abbr seems to be lexical feature of ADJ. 100% lemmas (10) occur only with one value of Abbr.

`SYM`

105 SYM tokens (14% of all SYM tokens) have a non-empty value of Abbr.

SYM tokens may have the following values of Abbr:

Yes (105; 100% of non-empty Abbr): *, §, sulev@ekspress.ee, =, C18:2n-6, C18:3n-3, anne@ekspress.ee, s., ‘i, +
EMPTY (635): %, %-l, &, %-ni, =, ω-3-, %-lt, ?, 1-, &

Abbr seems to be lexical feature of SYM. 100% lemmas (80) occur only with one value of Abbr.

`VERB`

47 VERB tokens (0% of all VERB tokens) have a non-empty value of Abbr.

The most frequent other feature values with which VERB and Abbr co-occurred: Mood=Imp (47; 100%), Number=EMPTY (47; 100%), Person=EMPTY (47; 100%), Tense=EMPTY (47; 100%), VerbForm=Fin (47; 100%), Voice=EMPTY (47; 100%).

VERB tokens may have the following values of Abbr:

Yes (47; 100% of non-empty Abbr): vt, vt., vrd
EMPTY (47814): tuleb, on, teha, ütles, saada, sai, saanud, tuli, saab, jääb

`X`

30 X tokens (4% of all X tokens) have a non-empty value of Abbr.

The most frequent other feature values with which X and Abbr co-occurred: Foreign=EMPTY (30; 100%).

X tokens may have the following values of Abbr:

Yes (30; 100% of non-empty Abbr): of, a, b, in, n, x, AT, NB, P., S.
EMPTY (735): 000, al., et, 900, in, of, 500, 600, 700, ceteris

Abbr seems to be lexical feature of X. 100% lemmas (22) occur only with one value of Abbr.

`CCONJ`

13 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Abbr.

CCONJ tokens may have the following values of Abbr:

Yes (13; 100% of non-empty Abbr): &, e, e.
EMPTY (16079): ja, ning, või, aga, kuid, kui, ega, vaid, ehk, ent

`NUM`

12 NUM tokens (0% of all NUM tokens) have a non-empty value of Abbr.

The most frequent other feature values with which NUM and Abbr co-occurred: Case=EMPTY (12; 100%), NumForm=Word (12; 100%), NumType=Card (12; 100%), Number=EMPTY (12; 100%).

NUM tokens may have the following values of Abbr:

Yes (12; 100% of non-empty Abbr): milj., mln, milj
EMPTY (9010): kaks, 1, üks, 10, 2, kolm, ühe, kahe, 3, miljonit

Relations with Agreement in `Abbr`

The 10 most frequent relations where parent and child node agree in Abbr: NOUN –[conj]–> NOUN (84; 58%), PROPN –[flat]–> ADV (25; 86%), NOUN –[flat]–> NOUN (13; 65%), X –[flat]–> X (2; 100%), ADV –[conj]–> ADV (1; 100%), NOUN –[list]–> NOUN (1; 100%), SYM –[conj]–> SYM (1; 100%), SYM –[list]–> SYM (1; 100%), SYM –[nmod]–> SYM (1; 100%).

Treebank Statistics: UD_Estonian-EDT: Features: Abbr

PROPN

NOUN

ADV

ADJ

SYM

VERB

X

CCONJ

NUM

Relations with Agreement in Abbr