Treebank Statistics: UD_Faroese-OFT: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
3898 tokens (39%) have a non-empty value of Gender.
2250 types (72%) occur at least once with a non-empty value of Gender.
1557 lemmas (65%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (2434; 24% instances), ADJ (721; 7% instances), PROPN (329; 3% instances), DET (185; 2% instances), PRON (184; 2% instances), NUM (34; 0% instances), VERB (7; 0% instances), ADV (4; 0% instances).
NOUN
2434 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (1875; 77%), Definite=Ind (1496; 61%), Case=Nom (1468; 60%).
NOUN tokens may have the following values of Gender:
Fem(691; 28% of non-emptyGender): kommuna, kommunur, kommunu, ár, oyggin, oynni, øld, bygdini, kommununi, ferðavinnaMasc(1174; 48% of non-emptyGender): býur, høvuðsstaður, býurin, høvuðsstaðurin, landslutinum, partur, týdning, Meginparturin, limur, landsluturNeut(569; 23% of non-emptyGender): fólkinum, fólk, landinum, landi, landið, grundarlagið, mál, Endamálið, fólkatalið, lýðveldiEMPTY(63): USA, ES, t.d., uml., mió, A, á.Kr., FLOT, FU, FUR
| Paradigm ár | Fem | Neut |
|---|---|---|
| Case=Acc|Definite=Def|Number=Plur | árini | |
| Case=Dat|Definite=Ind|Number=Sing | ár | |
| Case=Gen|Definite=Def|Number=Sing | Ársins | |
| Case=Gen|Definite=Ind|Number=Plur | ára | |
| Case=Nom|Definite=Def|Number=Sing | árið | |
| Case=Nom|Definite=Def|Number=Plur | Árarnar | |
| Case=Nom|Definite=Ind|Number=Plur | ár |
Gender seems to be lexical feature of NOUN. 100% lemmas (1154) occur only with one value of Gender.
ADJ
721 ADJ tokens (95% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=EMPTY (616; 85%), Case=Nom (528; 73%), Definite=Ind (485; 67%), Number=Sing (470; 65%).
ADJ tokens may have the following values of Gender:
Fem(153; 21% of non-emptyGender): størsta, fleiri, nógvar, stór, turr, aðrar, føroysku, nógv, onnur, somuMasc(409; 57% of non-emptyGender): størsti, stórur, stóran, nógvur, stórir, føroyskur, aðrir, mangir, amerikanska, einastiNeut(159; 22% of non-emptyGender): nógv, mong, stórt, Flestu, stór, sama, ymisk, annað, fleiri, føroysktEMPTY(39): 2., 1., 18., 19., 11., 12., 16., 17., 29., 3.
| Paradigm stórur | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Definite=Def|Degree=Sup|Number=Plur | Størstu | ||
| Case=Acc|Definite=Ind|Number=Sing | stóran | stóra | stórt |
| Case=Acc|Definite=Ind|Number=Plur | stórar | stór | |
| Case=Dat|Definite=Def|Degree=Sup|Number=Sing | størsta | ||
| Case=Dat|Definite=Def|Degree=Sup|Number=Plur | størstu | ||
| Case=Dat|Definite=Def|Number=Plur | stóru | ||
| Case=Dat|Definite=Ind|Number=Sing | stórum | stórum | |
| Case=Dat|Definite=Ind|Number=Plur | stórum | stórum | |
| Case=Nom|Definite=Def|Degree=Sup|Number=Sing | størsti | størsta | størsta |
| Case=Nom|Definite=Def|Degree=Sup|Number=Plur | Størstu | ||
| Case=Nom|Definite=Def|Number=Sing | stóri | ||
| Case=Nom|Definite=Ind|Number=Sing | stórur | stór | stórt, størri |
| Case=Nom|Definite=Ind|Number=Plur | stórir | stórar | stór |
PROPN
329 PROPN tokens (41% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (251; 76%), Definite=Ind (236; 72%).
PROPN tokens may have the following values of Gender:
Fem(126; 38% of non-emptyGender): Føroyum, Føroya, Føroyar, Danmark, Kina, Keypmannahavn, Florida, Tórshavnar, Tórshavn, BergtóraMasc(90; 27% of non-emptyGender): Kalifornia, Tróndur, Jákupsson, Bergur, Dávid, Gásadali, Hanus, Jóannes, Jógvan, MagnusNeut(113; 34% of non-emptyGender): Noregi, Fraklandi, Niðurlondum, Noregs, Grønlandi, Hordalandi, Island, Russlandi, Estlandi, GrønlandEMPTY(476): Kanada, Amerika, Italia, New, Nigeria, York, Asia, Jackson, Kastrup, Mississippi
Gender seems to be lexical feature of PROPN. 100% lemmas (131) occur only with one value of Gender.
DET
185 DET tokens (99% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (172; 93%), Case=Nom (145; 78%).
DET tokens may have the following values of Gender:
Fem(32; 17% of non-emptyGender): ein, eina, øll, Allar, eini, sína, Summi, allari, ei, einariMasc(94; 51% of non-emptyGender): ein, einum, allir, Summir, allan, allur, sínum, mínirNeut(59; 32% of non-emptyGender): eitt, einum, annað, síni, sínum, ØllEMPTY(1): alt
| Paradigm ein | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc | ein | eina | eitt |
| Case=Dat | einum | eini, einari | einum |
| Case=Nom | ein | ein, ei | eitt |
PRON
184 PRON tokens (63% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (168; 91%), Case=Nom (146; 79%), Person=3 (143; 78%), PronType=Prs (143; 78%).
PRON tokens may have the following values of Gender:
Fem(68; 37% of non-emptyGender): hon, henni, hennara, hana, onga, tærMasc(83; 45% of non-emptyGender): hann, teir, hansara, honum, nakrir, Allir, Báðir, Summir, hesir, nakarNeut(33; 18% of non-emptyGender): hetta, Hatta, Hettar, hvat, okkurtEMPTY(108): tað, sum, seg, tey, ið, sær, vit, eg, Hon, man
| Paradigm hesin | Masc | Neut |
|---|---|---|
| Case=Acc|Number=Sing | hetta | |
| Case=Nom|Number=Sing | hetta, Hettar | |
| Case=Nom|Number=Plur | hesir |
NUM
34 NUM tokens (14% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Number=Sing (34; 100%), Case=Nom (27; 79%).
NUM tokens may have the following values of Gender:
Fem(23; 68% of non-emptyGender): ein, tvær, trimum, tríggjarMasc(6; 18% of non-emptyGender): tveirNeut(5; 15% of non-emptyGender): trý, tveimum, tveyEMPTY(205): %, 2005, 2011, 10, 2010, 4, 18, 20, 2008, 26
| Paradigm tvey | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc | tveir | tvær | |
| Case=Dat | tveimum | ||
| Case=Nom | tveir | tvær | tvey |
VERB
7 VERB tokens (1% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=Ind (7; 100%), Person=EMPTY (7; 100%), Tense=Past (7; 100%), VerbForm=Part (7; 100%), Number=Plur (5; 71%).
VERB tokens may have the following values of Gender:
Fem(2; 29% of non-emptyGender): Sameindu, nevndarMasc(4; 57% of non-emptyGender): flettir, kendastur, keyptir, prentaðirNeut(1; 14% of non-emptyGender): samlaðaEMPTY(566): býr, hevur, kom, liggur, Sí, eru, fer, varð, fór, er
ADV
4 ADV tokens (1% of all ADV tokens) have a non-empty value of Gender.
ADV tokens may have the following values of Gender:
Masc(1; 25% of non-emptyGender): vanligaNeut(3; 75% of non-emptyGender): størsta, vanliga, veldigaEMPTY(422): eisini, ikki, har, nú, so, enn, tó, tá, fyrr, ofta
| Paradigm vanligur | Masc | Neut |
|---|---|---|
| vanliga | vanliga |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (558; 92%),
NOUN –[det]–> DET (180; 98%),
NOUN –[conj]–> NOUN (86; 57%),
ADJ –[nsubj]–> NOUN (64; 85%),
NOUN –[nsubj]–> PRON (50; 56%),
NOUN –[parataxis]–> NOUN (23; 51%),
ADJ –[conj]–> ADJ (20; 91%),
ADJ –[nsubj]–> PRON (12; 100%),
ADJ –[nmod]–> NOUN (4; 80%),
ADJ –[nsubj]–> PROPN (4; 67%).