Treebank Statistics: UD_Lithuanian-ALKSNIS: Features: Gender
This feature is universal.
It occurs with 4 different values: Com, Fem, Masc, Neut.
34426 tokens (49%) have a non-empty value of Gender.
14372 types (80%) occur at least once with a non-empty value of Gender.
6232 lemmas (72%) occur at least once with a non-empty value of Gender.
The feature is used with 9 part-of-speech tags. Each entry gives the number of tokens with a non-empty Gender and their share of all tokens in the treebank: NOUN (21127; 30%), ADJ (4648; 7%), VERB (3471; 5%), DET (1780; 3%), PROPN (1574; 2%), PRON (1476; 2%), NUM (338; 0%), AUX (10; 0%), X (2; 0%).
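The token, type, lemma and per-tag counts above can be reproduced from a CoNLL-U file along the following lines. This is a minimal sketch, not the script that generated this page: the `SAMPLE` fragment is a toy example invented for illustration, the helper names `parse_feats`, `read_tokens` and `gender_summary` are hypothetical, and types are counted case-sensitively, which may differ from the official statistics.

```python
from collections import Counter

# Toy CoNLL-U fragment invented for illustration; it is not treebank data.
SAMPLE = (
    "# sent_id = toy-1\n"
    "1\tsvarbi\tsvarbus\tADJ\t_\tCase=Nom|Degree=Pos|Gender=Fem|Number=Sing\t2\tamod\t_\t_\n"
    "2\tknyga\tknyga\tNOUN\t_\tCase=Nom|Gender=Fem|Number=Sing\t3\tnsubj\t_\t_\n"
    "3\tyra\tbūti\tAUX\t_\tMood=Ind|Number=Sing|Person=3\t0\troot\t_\t_\n"
)


def parse_feats(feats):
    """Turn 'Case=Nom|Gender=Fem' into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(text):
    """Yield (form, lemma, upos, feats) for each regular token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def gender_summary(text):
    """Count tokens, types and lemmas that carry a non-empty Gender."""
    tokens = list(read_tokens(text))
    with_gender = [t for t in tokens if "Gender" in t[3]]
    return {
        "tokens": len(with_gender),
        "token_share": len(with_gender) / len(tokens),
        "types": len({t[0] for t in with_gender}),
        "lemmas": len({t[1] for t in with_gender}),
        "by_upos": Counter(t[2] for t in with_gender),
    }


summary = gender_summary(SAMPLE)
print(summary)
```

On a real treebank, the same functions would be applied to the contents of the `.conllu` files instead of `SAMPLE`.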
NOUN
21127 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (13634; 65%).
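As a rough illustration of how such a co-occurrence list could be derived, the sketch below counts the other feature values found on tokens of one tag that carry Gender. The function name `cooccurring_values`, the toy token list and the 50% cut-off are assumptions made for this example; the report also counts absent features as `EMPTY`, which this sketch does not attempt.

```python
from collections import Counter


def cooccurring_values(tokens, upos, feature="Gender", min_share=0.5):
    """Return (feature=value, count, share) for values seen on tokens of
    the given tag that also carry `feature`, keeping shares above min_share.

    tokens: iterable of (upos, feats_dict) pairs.
    """
    selected = [feats for tag, feats in tokens if tag == upos and feature in feats]
    total = len(selected)
    counts = Counter()
    for feats in selected:
        for name, value in feats.items():
            if name != feature:
                counts[f"{name}={value}"] += 1
    return [
        (fv, n, n / total)
        for fv, n in counts.most_common()
        if n / total > min_share
    ]


# Toy tokens invented for illustration; they are not treebank data.
TOY_TOKENS = [
    ("NOUN", {"Case": "Nom", "Gender": "Fem", "Number": "Sing"}),
    ("NOUN", {"Case": "Gen", "Gender": "Masc", "Number": "Sing"}),
    ("NOUN", {"Case": "Acc", "Gender": "Masc", "Number": "Plur"}),
    ("NOUN", {"Case": "Nom", "Number": "Sing"}),  # no Gender: ignored
    ("ADJ", {"Case": "Nom", "Gender": "Fem", "Number": "Sing"}),
]

print(cooccurring_values(TOY_TOKENS, "NOUN"))
```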
NOUN tokens may have the following values of Gender:
Com (8; 0% of non-empty Gender): kūdikis, giminės, kiaulė, nauda, pabaisai, valkatos
Fem (9101; 43% of non-empty Gender): kultūros, valstybės, paslaugos, įmonės, apsaugos, politikos, teisės, visuomenės, knygos, šeimos
Masc (12018; 57% of non-empty Gender): duomenų, pašto, darbo, asmens, verslo, metų, tyrimų, komiteto, metu, litų
EMPTY (150): m, Ego, EGO
| Paradigm darželis | Masc | Fem |
|---|---|---|
| Case=Acc\|Number=Plur | darželius | |
| Case=Dat\|Number=Plur | darželiams | |
| Case=Gen\|Number=Sing | darželio | darželio |
| Case=Gen\|Number=Plur | darželių | |
Gender seems to be a lexical feature of NOUN: 100% of lemmas (3399) occur with only one value of Gender.
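The lexicality check amounts to collecting, for each lemma, the set of Gender values it appears with and counting the lemmas whose set has exactly one member. The sketch below assumes the input is a list of `(lemma, gender)` pairs for tokens with a non-empty Gender; `lexical_share` and the toy pairs are hypothetical and only illustrate the idea.

```python
from collections import defaultdict


def lexical_share(pairs):
    """Return (lemmas with exactly one Gender value, all lemmas seen).

    pairs: iterable of (lemma, gender) for tokens with a non-empty Gender.
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        values[lemma].add(gender)
    single = sum(1 for genders in values.values() if len(genders) == 1)
    return single, len(values)


# Toy pairs invented for illustration; they are not treebank data.
TOY_PAIRS = [
    ("knyga", "Fem"),
    ("knyga", "Fem"),
    ("darbas", "Masc"),
    ("svarbus", "Masc"),
    ("svarbus", "Fem"),
]

single, total = lexical_share(TOY_PAIRS)
print(f"{single} of {total} lemmas occur with only one value of Gender")
```

A share close to 100% suggests that the feature is fixed per lemma (as for nouns), while a lower share suggests that it is inflectional (as for adjectives).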
ADJ
4648 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (4360; 94%), Definite=Ind (4300; 93%), Number=Sing (2627; 57%).
ADJ tokens may have the following values of Gender:
Fem (1920; 41% of non-empty Gender): socialinės, geros, metinės, viešosios, naujų, teisinės, gretutinių, skirtingų, svarbi, įvairios
Masc (2523; 54% of non-empty Gender): vidutinio, smulkiojo, mokslinių, socialinių, naujų, pagrindinis, viešojo, finansiniai, didelis, pagrindiniai
Neut (205; 4% of non-empty Gender): būtina, svarbu, sunku, sveika, nesvarbu, tikėtina, verta, aišku, keista, tikslinga
EMPTY (3): makro-, mikro-, mini
| Paradigm svarbus | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc\|Degree=Pos\|Number=Sing | svarbų | svarbią | |
| Case=Acc\|Degree=Pos\|Number=Plur | svarbius | svarbias | |
| Case=Acc\|Degree=Sup\|Number=Plur | svarbiausius | svarbiausias | |
| Case=Dat\|Degree=Sup\|Number=Plur | svarbiausioms | | |
| Case=Gen\|Degree=Pos\|Number=Sing | svarbaus | | |
| Case=Gen\|Degree=Pos\|Number=Plur | svarbių | | |
| Case=Gen\|Degree=Sup\|Number=Plur | svarbiausių | svarbiausių | |
| Case=Ins\|Degree=Pos\|Number=Plur | svarbiais | | |
| Case=Ins\|Degree=Cmp\|Number=Plur | svarbesniais | | |
| Case=Ins\|Degree=Sup\|Number=Sing | svarbiausiu | | |
| Case=Ins\|Degree=Sup\|Number=Plur | svarbiausiomis | | |
| Case=Nom\|Degree=Pos\|Number=Sing | svarbus | svarbi | |
| Case=Nom\|Degree=Pos\|Number=Plur | svarbūs | svarbios | |
| Case=Nom\|Degree=Cmp\|Number=Sing | svarbesnis | | |
| Case=Nom\|Degree=Cmp\|Number=Plur | svarbesni | | |
| Case=Nom\|Degree=Sup\|Number=Sing | svarbiausias | svarbiausia | |
| Case=Nom\|Degree=Sup\|Number=Plur | svarbiausi | svarbiausios | |
| Degree=Pos | svarbu | | |
| Degree=Sup | svarbiausia | | |
VERB
3471 VERB tokens (34% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (3471; 100%), Mood=EMPTY (3445; 99%), Polarity=Pos (3273; 94%), VerbForm=Part (3266; 94%), Reflex=EMPTY (3202; 92%), Definite=Ind (3137; 90%), Aspect=EMPTY (2953; 85%), Voice=Pass (2237; 64%).
VERB tokens may have the following values of Gender:
Fem (1263; 36% of non-empty Gender): nustatyta, žyminčių, aptariamos, pateikiamos, susijusios, suteiktą, įgyvendindama, nustatytos, pateikiama, skirta
Masc (1797; 52% of non-empty Gender): nurodyto, aptariami, susiję, pateikti, nagrinėjamas, analizuojami, gauti, skirti, susijęs, tiriamojo
Neut (411; 12% of non-empty Gender): galima, siekiama, planuota, žinoma, nustatyta, neįmanoma, pateikiama, skiriama, skirta, rašoma
EMPTY (6709): gali, turi, yra, nėra, reikia, siekiant, nebuvo, rodo, buvo, teikti
| Paradigm galėti | Masc | Fem | Neut |
|---|---|---|---|
| Aspect=Perf\|Case=Nom\|Number=Sing\|Tense=Past\|Voice=Act | galėjęs | | |
| Case=Acc\|Number=Plur\|Tense=Pres\|Voice=Act | galinčias | | |
| Case=Acc\|Number=Plur\|Tense=Pres\|Voice=Pass | galimus | | |
| Case=Gen\|Number=Plur\|Tense=Pres\|Voice=Act | galinčių | | |
| Case=Ins\|Number=Sing\|Tense=Pres\|Voice=Pass | galima | | |
| Case=Nom\|Number=Sing\|Tense=Pres\|Voice=Pass | galimas | | |
| Case=Nom\|Number=Plur\|Tense=Pres\|Voice=Pass | galimi | | |
| Tense=Pres\|Voice=Pass | galima | | |
DET
1780 DET tokens (100% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Definite=Ind (1773; 100%), Number=Sing (995; 56%), PronType=Dem (973; 55%).
DET tokens may have the following values of Gender:
Fem (573; 32% of non-empty Gender): kurios, šios, ši, kuri, kurioje, tokia, šią, ta, tokios, tą
Masc (995; 56% of non-empty Gender): to, kurie, šio, kuris, toks, šį, šiame, visą, kurių, visus
Neut (212; 12% of non-empty Gender): tai, visa, tatai, Šitai
| Paradigm tas | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc\|Definite=Ind\|Hyph=Yes\|Number=Sing | tą | | |
| Case=Acc\|Definite=Ind\|Hyph=Yes\|Number=Plur | tuos | | |
| Case=Acc\|Definite=Ind\|Number=Sing | tą | tą | |
| Case=Acc\|Definite=Ind\|Number=Plur | tuos | tas | |
| Case=Dat\|Definite=Ind\|Hyph=Yes\|Number=Plur | tiems | | |
| Case=Dat\|Definite=Ind\|Number=Sing | tam, tuo | | |
| Case=Dat\|Definite=Ind\|Number=Plur | tiems | | |
| Case=Gen\|Definite=Ind\|Hyph=Yes\|Number=Sing | to | | |
| Case=Gen\|Definite=Ind\|Number=Sing | to | tos | |
| Case=Gen\|Definite=Ind\|Number=Plur | tų | tų | |
| Case=Ins\|Definite=Ind\|Hyph=Yes\|Number=Sing | tuo | ta | |
| Case=Ins\|Definite=Ind\|Hyph=Yes\|Number=Plur | tomis | | |
| Case=Ins\|Definite=Ind\|Number=Sing | tuo | ta | |
| Case=Ins\|Definite=Ind\|Number=Plur | tais | tomis | |
| Case=Loc\|Definite=Ind\|Hyph=Yes\|Number=Sing | toje | | |
| Case=Loc\|Definite=Ind\|Number=Sing | tame | toje | |
| Case=Loc\|Definite=Ind\|Number=Plur | tose | | |
| Case=Nom\|Definite=Def\|Number=Sing | Tasai | toji | |
| Case=Nom\|Definite=Ind\|Hyph=Yes\|Number=Sing | tas | ta | |
| Case=Nom\|Definite=Ind\|Hyph=Yes\|Number=Plur | tie | tos | |
| Case=Nom\|Definite=Ind\|Number=Sing | tas | ta | |
| Case=Nom\|Definite=Ind\|Number=Plur | tie | tos | |
| Definite=Def | tatai | | |
| Definite=Ind | tai | | |
PROPN
1574 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1540; 98%).
PROPN tokens may have the following values of Gender:
Fem (901; 57% of non-empty Gender): Lietuvos, Europos, Lietuvoje, Lietuva, Lietuvai, Kalėdų, LIETUVOS, EUROPOS, Lietuvą, Marcinkevičienė
Masc (673; 43% of non-empty Gender): Kauno, Vilniaus, Šengeno, Vilnius, Mažuolis, Vilniuje, Glaveckas, Algirdas, Europolo, Haičio
EMPTY (19): Čornij, Chaime, Golf, Vilmorus, Achenbach, Ford, Garbo, Kahkonen, Laozi, Pjer-Luji
| Paradigm Klaipėda | Masc | Fem |
|---|---|---|
| Case=Loc | Klaipėdoje | |
| Case=Nom | Klaipėda | |
Gender seems to be a lexical feature of PROPN: 100% of lemmas (545) occur with only one value of Gender.
PRON
1476 PRON tokens (61% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Definite=Ind (1470; 100%), PronType=Prs (905; 61%), Person=3 (898; 61%).
PRON tokens may have the following values of Gender:
Fem (510; 35% of non-empty Gender): jos, ji, jų, kitų, ją, viena, jai, tam, kai, kitos
Masc (949; 64% of non-empty Gender): jų, jis, jie, jo, juos, jį, jam, vienas, kitų, kai
Neut (17; 1% of non-empty Gender): visa, kas, viena, tai
EMPTY (943): savo, kas, ką, aš, mano, man, mūsų, mes, tu, mane
| Paradigm vienas | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc\|Number=Sing | vieną | vieną | |
| Case=Acc\|Number=Plur | vienus | | |
| Case=Dat\|Number=Sing | vienam | | |
| Case=Dat\|Number=Plur | vieniems | | |
| Case=Gen\|Number=Sing | vieno | vienos | |
| Case=Gen\|Number=Plur | vienų | | |
| Case=Ins\|Hyph=Yes\|Number=Sing | vienu | | |
| Case=Ins\|Number=Sing | vienu | | |
| Case=Loc\|Number=Sing | viename | vienoje | |
| Case=Loc\|Number=Plur | Vienose | | |
| Case=Nom\|Hyph=Yes\|Number=Sing | vienas | | |
| Case=Nom\|Hyph=Yes\|Number=Plur | vieni | | |
| Case=Nom\|Number=Sing | vienas | viena | |
| Case=Nom\|Number=Plur | vieni | | |
| viena |
NUM
338 NUM tokens (20% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (319; 94%), Definite=EMPTY (190; 56%), NumType=Card (190; 56%).
NUM tokens may have the following values of Gender:
Fem (122; 36% of non-empty Gender): dvi, vieną, pirmoji, antroji, trijų, viena, penkias, 15-oji, antroje, dviejų
Masc (205; 61% of non-empty Gender): du, vieną, milijonų, pirmą, tūkstančių, abu, dviejų, trijų, tris, vieno
Neut (11; 3% of non-empty Gender): antra, pirma, Trečia
EMPTY (1361): 1, 2, 3, 2006, 4, 5, 6, 25, 7, 10
| Paradigm pirmas | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc\|Definite=Def\|Number=Sing\|NumForm=Word | pirmąjį | pirmąją | |
| Case=Acc\|Definite=Ind\|Hyph=Yes\|Number=Sing\|NumForm=Word | pirmą | | |
| Case=Acc\|Definite=Ind\|Number=Sing\|NumForm=Word | pirmą | | |
| Case=Dat\|Definite=Def\|Number=Sing\|NumForm=Word | Pirmajai | | |
| Case=Gen\|Definite=Def\|Number=Sing\|NumForm=Word | pirmojo | | |
| Case=Gen\|Definite=Def\|Number=Plur\|NumForm=Word | pirmųjų | | |
| Case=Gen\|Definite=Ind\|Number=Sing\|NumForm=Word | pirmo | pirmos | |
| Case=Gen\|Definite=Ind\|Number=Plur\|NumForm=Word | pirmo, pirmųjų | | |
| Case=Ins\|Definite=Def\|Number=Plur\|NumForm=Word | pirmaisiais | | |
| Case=Loc\|Definite=Def\|Number=Sing\|NumForm=Word | pirmajame | pirmojoje | |
| Case=Loc\|Definite=Def\|Number=Plur\|NumForm=Word | pirmuosiuose | | |
| Case=Loc\|Definite=Ind\|Number=Sing\|NumForm=Word | pirmame | | |
| Case=Loc\|Definite=Ind\|Number=Plur\|NumForm=Word | Pirmuose | | |
| Case=Nom\|Definite=Def\|Number=Sing\|NumForm=Word | pirmasis | pirmoji | |
| Case=Nom\|Definite=Def\|Number=Sing | pirmoji | | |
| Case=Nom\|Definite=Def\|Number=Plur\|NumForm=Word | pirmieji | pirmosios | |
| Case=Nom\|Definite=Ind\|Number=Sing\|NumForm=Word | pirmas | pirmoji | |
| Case=Nom\|Definite=Ind\|Number=Plur\|NumForm=Word | pirmi | | |
| Definite=Ind\|NumForm=Word | pirma | | |
AUX
10 AUX tokens (1% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (10; 100%), Person=EMPTY (10; 100%), Polarity=Pos (10; 100%), VerbForm=Part (9; 90%), Number=Sing (8; 80%), Aspect=EMPTY (7; 70%), Tense=Pres (6; 60%).
AUX tokens may have the following values of Gender:
Fem (2; 20% of non-empty Gender): buvusi, būdama
Masc (7; 70% of non-empty Gender): esąs, buvęs, esą
Neut (1; 10% of non-empty Gender): buvę
EMPTY (674): yra, buvo, būti, būtų, bus, esu, buvau, būna, esate, esi
| Paradigm būti | Masc | Fem | Neut |
|---|---|---|---|
| Aspect=Perf\|Case=Nom\|Definite=Ind\|Number=Sing\|Tense=Past\|VerbForm=Part\|Voice=Act | buvęs | buvusi | |
| Aspect=Perf\|Definite=Ind\|Tense=Past\|VerbForm=Part\|Voice=Act | buvę | | |
| Case=Nom\|Definite=Ind\|Number=Sing\|Tense=Pres\|VerbForm=Part\|Voice=Act | esąs | | |
| Case=Nom\|Definite=Ind\|Number=Plur\|Tense=Pres\|VerbForm=Part\|Voice=Act | esą | | |
| Number=Sing\|VerbForm=Conv | būdama | | |
X
2 X tokens (0% of all X tokens) have a non-empty value of Gender.
The most frequent other feature values with which X and Gender co-occurred: Abbr=EMPTY (2; 100%), Hyph=EMPTY (2; 100%).
X tokens may have the following values of Gender:
Fem (1; 50% of non-empty Gender): naudos
Masc (1; 50% of non-empty Gender): jaunųjų
EMPTY (1569): pat, ES, d, proc, Nr, nors, a, p, to, tūkst
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (3652; 100%),
NOUN –[acl]–> VERB (1529; 84%),
NOUN –[conj]–> NOUN (1260; 59%),
NOUN –[det]–> DET (841; 100%),
NOUN –[nmod]–> PROPN (527; 65%),
NOUN –[nmod]–> PRON (515; 55%),
VERB –[nsubj:pass]–> NOUN (402; 98%),
ADJ –[conj]–> ADJ (278; 96%),
ADJ –[nsubj]–> NOUN (202; 96%),
NOUN –[flat]–> NOUN (142; 76%).
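
Agreement figures of this kind can be approximated by walking over every dependency edge whose parent and child both carry Gender and comparing the two values. The sketch below assumes that the percentage is the share of agreeing pairs among all such edges of the same type, which is an interpretation rather than something this page states; `agreement_stats` and the toy sentence are hypothetical.

```python
from collections import Counter


def agreement_stats(sentence):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing pairs, share).

    sentence: list of dicts with keys id, upos, head, deprel, gender,
    where gender is None if the token has no Gender feature.
    Only edges where both nodes have a Gender value are counted.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])  # None for the root (head 0)
        if parent is None or child["gender"] is None or parent["gender"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if child["gender"] == parent["gender"]:
            agree[key] += 1
    return {key: (agree[key], agree[key] / both[key]) for key in both}


# Toy sentence invented for illustration; it is not treebank data.
TOY_SENTENCE = [
    {"id": 1, "upos": "DET", "head": 3, "deprel": "det", "gender": "Masc"},
    {"id": 2, "upos": "ADJ", "head": 3, "deprel": "amod", "gender": "Fem"},
    {"id": 3, "upos": "NOUN", "head": 4, "deprel": "nsubj", "gender": "Fem"},
    {"id": 4, "upos": "VERB", "head": 0, "deprel": "root", "gender": None},
]

for (parent, rel, child), (n, share) in agreement_stats(TOY_SENTENCE).items():
    print(f"{parent} -[{rel}]-> {child} ({n}; {share:.0%})")
```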