Treebank Statistics: UD_Upper_Sorbian-UFAL: Features: Abbr
This feature is universal.
It occurs with 1 different values: Yes.
192 tokens (2%) have a non-empty value of Abbr.
37 types (1%) occur at least once with a non-empty value of Abbr.
45 lemmas (1%) occur at least once with a non-empty value of Abbr.
The feature is used with 11 part-of-speech tags: NOUN (74; 1% instances), ADP (38; 0% instances), DET (37; 0% instances), PROPN (10; 0% instances), ADV (8; 0% instances), X (8; 0% instances), NUM (7; 0% instances), ADJ (6; 0% instances), SYM (2; 0% instances), PRON (1; 0% instances), VERB (1; 0% instances).
NOUN
74 NOUN tokens (3% of all NOUN tokens) have a non-empty value of Abbr.
The most frequent other feature values with which NOUN and Abbr co-occurred: Number=Sing (61; 82%), Animacy=EMPTY (51; 69%).
NOUN tokens may have the following values of Abbr:
Yes(74; 100% of non-emptyAbbr): l, př, km, m, CEST, hodź, jan, dr, nakł, přirEMPTY(2463): město, rěč, woda, rěčow, lěta, stolica, lěće, kilometrow, mócnarstwo, nastawki
Abbr seems to be lexical feature of NOUN. 100% lemmas (12) occur only with one value of Abbr.
ADP
38 ADP tokens (3% of all ADP tokens) have a non-empty value of Abbr.
ADP tokens may have the following values of Abbr:
Yes(38; 100% of non-emptyAbbr): př, nEMPTY(1058): w, na, z, wot, za, do, k, přez, po, při
DET
37 DET tokens (11% of all DET tokens) have a non-empty value of Abbr.
The most frequent other feature values with which DET and Abbr co-occurred: Person=1 (37; 100%), PronType=Prs (37; 100%), Case=Ins (36; 97%), Number=Sing (36; 97%), Number[psor]=Plur (36; 97%), Poss=Yes (36; 97%), Animacy=EMPTY (33; 89%), Gender=Fem (31; 84%).
DET tokens may have the following values of Abbr:
Yes(37; 100% of non-emptyAbbr): nEMPTY(289): kotrež, jeho, jich, tute, kotryž, wjele, kotraž, tutón, swoje, tuta
PROPN
10 PROPN tokens (2% of all PROPN tokens) have a non-empty value of Abbr.
The most frequent other feature values with which PROPN and Abbr co-occurred: Animacy=EMPTY (10; 100%), Gender=EMPTY (7; 70%), Number=EMPTY (7; 70%), Case=EMPTY (6; 60%).
PROPN tokens may have the following values of Abbr:
Yes(10; 100% of non-emptyAbbr): C, GNU, CET, KPD, OZN, H, ISGV, NDREMPTY(586): Mezopotamiskeje, Aššur, Mezopotamiska, Mezopotamiskej, Sumeričanow, Wikimedia, Łužicy, Europje, Assur, Assyriska
ADV
8 ADV tokens (1% of all ADV tokens) have a non-empty value of Abbr.
The most frequent other feature values with which ADV and Abbr co-occurred: Degree=EMPTY (7; 88%), PronType=EMPTY (5; 63%).
ADV tokens may have the following values of Abbr:
Yes(8; 100% of non-emptyAbbr): resp, t, atd, łać, jendźEMPTY(527): tež, tak, hišće, zwjetša, hač, něhdźe, hižo, tu, wjace, najprjedy
X
8 X tokens (4% of all X tokens) have a non-empty value of Abbr.
X tokens may have the following values of Abbr:
Yes(8; 100% of non-emptyAbbr): APG, DDR, PD, SORBISCHES, dr, m, mjEMPTY(189): a, i, Vitis, backen, o, H, al, au, b, c
NUM
7 NUM tokens (2% of all NUM tokens) have a non-empty value of Abbr.
The most frequent other feature values with which NUM and Abbr co-occurred: NumType=Card (7; 100%).
NUM tokens may have the following values of Abbr:
Yes(7; 100% of non-emptyAbbr): III, Mio, 02625EMPTY(375): 2, 1, 6, 4, 3, jedyn, 5, 7, I, 000
ADJ
6 ADJ tokens (0% of all ADJ tokens) have a non-empty value of Abbr.
The most frequent other feature values with which ADJ and Abbr co-occurred: Degree=EMPTY (5; 83%), Animacy=EMPTY (4; 67%), VerbForm=Part (4; 67%), Voice=Pass (4; 67%).
ADJ tokens may have the following values of Abbr:
Yes(6; 100% of non-emptyAbbr): mj, d, jendź, zEMPTY(1413): serbski, druhe, druhich, najwjetše, prěni, prěnje, serbskeje, Serbskeho, wulki, ablawtowych
SYM
2 SYM tokens (6% of all SYM tokens) have a non-empty value of Abbr.
SYM tokens may have the following values of Abbr:
Yes(2; 100% of non-emptyAbbr): O2, O3EMPTY(30): °, %, ‘, *, ², ³, †, :, =, ~
PRON
1 PRON tokens (0% of all PRON tokens) have a non-empty value of Abbr.
The most frequent other feature values with which PRON and Abbr co-occurred: Case=Nom (1; 100%), Gender=Neut (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), PronType=Dem (1; 100%), Reflex=EMPTY (1; 100%).
PRON tokens may have the following values of Abbr:
Yes(1; 100% of non-emptyAbbr): tEMPTY(334): so, to, toho, tym, wona, wón, kiž, je, wone, wono
VERB
1 VERB tokens (0% of all VERB tokens) have a non-empty value of Abbr.
The most frequent other feature values with which VERB and Abbr co-occurred: Mood=Ind (1; 100%), Number=Sing (1; 100%), Person=3 (1; 100%), Tense=Pres (1; 100%), VerbForm=Fin (1; 100%).
VERB tokens may have the following values of Abbr:
Yes(1; 100% of non-emptyAbbr): rEMPTY(817): ma, leži, móže, wobsahuje, móžeš, su, hlej, maja, rěči, běchu
Relations with Agreement in Abbr
The 10 most frequent relations where parent and child node agree in Abbr:
NOUN –[det]–> DET (36; 97%),
NOUN –[case]–> ADP (35; 90%),
ADV –[fixed]–> ADJ (3; 100%),
PRON –[fixed]–> VERB (1; 100%),
SYM –[conj]–> SYM (1; 100%),
X –[fixed]–> X (1; 100%).