home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Ruuli-RDT: Features: NounClass

This feature is language-specific. It occurs with 19 different values: Bantu1, Bantu10, Bantu11, Bantu12, Bantu13, Bantu14, Bantu15, Bantu16, Bantu17, Bantu18, Bantu2, Bantu23, Bantu3, Bantu4, Bantu5, Bantu6, Bantu7, Bantu8, Bantu9.

This is a layered feature with the following layers: NounClass, NounClass[iobj], NounClass[obj], NounClass[psed], NounClass[psor].

3184 tokens (51%) have a non-empty value of NounClass. 1736 types (75%) occur at least once with a non-empty value of NounClass. 858 lemmas (77%) occur at least once with a non-empty value of NounClass. The feature is used with 11 part-of-speech tags: NOUN (1108; 18% instances), VERB (769; 12% instances), ADV (305; 5% instances), PRON (246; 4% instances), ADP (170; 3% instances), AUX (160; 3% instances), PART (126; 2% instances), ADJ (100; 2% instances), PROPN (96; 2% instances), DET (94; 1% instances), NUM (10; 0% instances).

NOUN

1108 NOUN tokens (100% of all NOUN tokens) have a non-empty value of NounClass.

The most frequent other feature values with which NOUN and NounClass co-occurred: Referent=Yes (684; 62%).

NOUN tokens may have the following values of NounClass:

Paradigm kiduumaBantu14Bantu7Bantu8
obuduumaekiduumaebiduuma

VERB

769 VERB tokens (54% of all VERB tokens) have a non-empty value of NounClass.

The most frequent other feature values with which VERB and NounClass co-occurred: VerbForm=Fin (734; 95%), Referent=EMPTY (714; 93%), Mood=Ind (692; 90%), Person=3 (661; 86%), Number=EMPTY (604; 79%), Person[obj]=EMPTY (511; 66%), Tense=Pres (424; 55%), Aspect=EMPTY (406; 53%).

VERB tokens may have the following values of NounClass:

Paradigm bbaBantu1Bantu10Bantu14Bantu16Bantu2Bantu3Bantu6Bantu7Bantu8Bantu9
Aspect=Hab|Mood=Ind|Polarity=Neg|Referent=Yes|Tense=Past|VerbForm=Finabataabbaanga
Aspect=Hab|Mood=Ind|Referent=Yes|Tense=Past|VerbForm=Finebyabbanga
Aspect=Hab|Mood=Ind|Tense=Past|VerbForm=Finbwabbangabyabbanga
Aspect=Perf|Mood=Ind|Tense=Past|VerbForm=Finyabbaire
Aspect=Perf|Mood=Ind|Tense=Pres|VerbForm=Finabbaire, Abairebabbairegubaire
Aspect=Pers|Mood=Ind|Polarity=Neg|Tense=Pres|VerbForm=Fintebakyabba
Mood=Ind|Polarity=Neg|Tense=Nar|VerbForm=Finnetabbaa
Mood=Ind|Polarity=Neg|Tense=Pres|VerbForm=Fintabbatigubba
Mood=Ind|Tense=Fut|VerbForm=Finkyabba
Mood=Ind|Tense=Nar|VerbForm=FinNaabba, nabbaniwabba
Mood=Ind|Tense=Pres|VerbForm=Finabba, abbaazibbababbagubbagabbakibba, kibabibbaaebba
Mood=Sub|Tense=Preskibbee
Mood=Sub|VerbForm=Finebbee
Tense=Presabbakibaeba

ADV

305 ADV tokens (55% of all ADV tokens) have a non-empty value of NounClass.

The most frequent other feature values with which ADV and NounClass co-occurred: Deixis=EMPTY (229; 75%), PronType=EMPTY (221; 72%).

ADV tokens may have the following values of NounClass:

Paradigm tyaiBantu1Bantu2Bantu8Bantu9
atyaibatyaibityaietyai

PRON

246 PRON tokens (62% of all PRON tokens) have a non-empty value of NounClass.

The most frequent other feature values with which PRON and NounClass co-occurred: Deixis=EMPTY (206; 84%), Person=EMPTY (167; 68%), Person[psed]=EMPTY (163; 66%), Person[psor]=EMPTY (163; 66%), Poss=EMPTY (163; 66%), Number=EMPTY (161; 65%), PronType=Prs (127; 52%).

PRON tokens may have the following values of NounClass:

Paradigm eBantu1Bantu10Bantu12Bantu14Bantu16Bantu23Bantu4Bantu6Bantu7Bantu8Bantu9
gwe, gw'zek'bw', bwewe, wa, w'gye, gy', jejeg'kye, ky'byegye, gy', je, Ze

ADP

170 ADP tokens (75% of all ADP tokens) have a non-empty value of NounClass.

The most frequent other feature values with which ADP and NounClass co-occurred: Referent=Yes (107; 63%).

ADP tokens may have the following values of NounClass:

AUX

160 AUX tokens (78% of all AUX tokens) have a non-empty value of NounClass.

The most frequent other feature values with which AUX and NounClass co-occurred: Number=EMPTY (160; 100%), Person=3 (160; 100%), Aspect=EMPTY (145; 91%), Tense=Pres (127; 79%), InfStruct=EMPTY (118; 74%).

AUX tokens may have the following values of NounClass:

Paradigm liBantu1Bantu10Bantu11Bantu14Bantu15Bantu16Bantu17Bantu18Bantu2Bantu3Bantu4Bantu5Bantu6Bantu7Bantu8Bantu9
Aspect=Pers|Polarity=Neg|Tense=Prestiekyali
Aspect=Pers|Tense=Presakyalibukyalibakyali, gukyaaligakyali
InfStruct=Foc|Tense=Presntali
mulu
Polarity=Neg|Referent=Yes|Tense=Pastegitaali
Polarity=Neg|Referent=Yes|Tense=Presataali
Polarity=Neg|Tense=Pasttiyali, taaloTibwaalitiwaaliTikyali
Polarity=Neg|Tense=Prestalitiguliteri
Referent=Yes|Tense=Pastolwaliokwaliabaaliebyalo
Referent=Yes|Tense=Presabalioguliekiri
Tense=Pastyaali, yaliBwalibaaligwali
Tense=Presali, alaa, erizirilulibuliwali, waalokulimuli, Mulubali, balaagulilirigalikiri, kiruBuli, biri, birueri, ero

PART

126 PART tokens (48% of all PART tokens) have a non-empty value of NounClass.

The most frequent other feature values with which PART and NounClass co-occurred: InfStruct=EMPTY (126; 100%).

PART tokens may have the following values of NounClass:

Paradigm aBantu1Bantu10Bantu11Bantu12Bantu14Bantu15Bantu16Bantu2Bantu3Bantu4Bantu5Bantu6Bantu7Bantu8Bantu9
_wa, owaeza, zalwa, olwakabwakwawabagwayalyagakyabyaya
Referent=Yesowaezaobwaokwaowaabaogwaeyaekyaebyaeya

ADJ

100 ADJ tokens (100% of all ADJ tokens) have a non-empty value of NounClass.

The most frequent other feature values with which ADJ and NounClass co-occurred: Referent=EMPTY (81; 81%).

ADJ tokens may have the following values of NounClass:

Paradigm saiBantu1Bantu2Bantu3Bantu7Bantu9
_musaibasaigusaikisai
Referent=Yesegisai

PROPN

96 PROPN tokens (100% of all PROPN tokens) have a non-empty value of NounClass.

The most frequent other feature values with which PROPN and NounClass co-occurred: Referent=EMPTY (61; 64%).

PROPN tokens may have the following values of NounClass:

Paradigm NakasongolaBantu1Bantu9
NakasongolaNakasongola

NounClass seems to be lexical feature of PROPN. 94% lemmas (61) occur only with one value of NounClass.

DET

94 DET tokens (94% of all DET tokens) have a non-empty value of NounClass.

The most frequent other feature values with which DET and NounClass co-occurred: PronType=Dem (77; 82%).

DET tokens may have the following values of NounClass:

Paradigm niBantu1Bantu10Bantu11Bantu12Bantu14Bantu17Bantu2Bantu3Bantu4Bantu6Bantu7Bantu8Bantu9
onu, oniziniluniKanibunilunibaniguniginiganu, ganikinibini, byanieni

NUM

10 NUM tokens (19% of all NUM tokens) have a non-empty value of NounClass.

The most frequent other feature values with which NUM and NounClass co-occurred: Referent=EMPTY (9; 90%), NumForm=Word (8; 80%), NumType=Card (8; 80%).

NUM tokens may have the following values of NounClass:

Paradigm mweBantu1Bantu3Bantu4Bantu6Bantu7Bantu9
_omweigimwei
NumForm=Word|NumType=Cardogumweigamweikimwei
NumForm=Word|NumType=Card|Referent=Yesemwe

Relations with Agreement in NounClass

The 10 most frequent relations where parent and child node agree in NounClass: VERB –[nsubj]–> NOUN (161; 91%), NOUN –[det]–> DET (72; 92%), NOUN –[nmod:poss]–> PRON (64; 73%), NOUN –[amod]–> ADJ (34; 97%), VERB –[nsubj]–> PRON (33; 65%), NOUN –[acl:relcl]–> VERB (26; 55%), ADJ –[nsubj]–> NOUN (20; 100%), VERB –[conj]–> VERB (16; 52%), NOUN –[conj]–> NOUN (13; 72%), VERB –[aux]–> AUX (12; 52%).