home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Irish-Cadhan: Features: Form

This feature is language-specific. It occurs with 7 different values: Direct, Ecl, Emp, HPref, Indirect, Len, VF. Some words have combined values of the feature; 3 combinations have been observed: Direct|Len, Ecl|Emp, Emp|Len.

845 tokens (18%) have a non-empty value of Form. 566 types (30%) occur at least once with a non-empty value of Form. 364 lemmas (33%) occur at least once with a non-empty value of Form. The feature is used with 11 part-of-speech tags: NOUN (375; 8% instances), VERB (241; 5% instances), PART (90; 2% instances), PROPN (40; 1% instances), ADJ (31; 1% instances), ADP (22; 0% instances), NUM (21; 0% instances), PRON (12; 0% instances), AUX (11; 0% instances), DET (1; 0% instances), SCONJ (1; 0% instances).

NOUN

375 NOUN tokens (33% of all NOUN tokens) have a non-empty value of Form.

The most frequent other feature values with which NOUN and Form co-occurred: VerbForm=EMPTY (326; 87%), Number=Sing (275; 73%), Case=Nom (241; 64%), Gender=Masc (211; 56%), Definite=EMPTY (208; 55%).

NOUN tokens may have the following values of Form:

Paradigm duineEclLen
Case=Dat|Definite=Def|Number=Plurdhaoinibh
Case=Dat|Number=Plurdhaoinibh
Case=Gen|Definite=Def|NounType=Strong|Number=Plurndáoine
Case=Nom|Definite=Def|Number=Singnduine
Case=Nom|Number=Singdhuine

VERB

241 VERB tokens (56% of all VERB tokens) have a non-empty value of Form.

The most frequent other feature values with which VERB and Form co-occurred: Mood=Ind (197; 82%), Number=EMPTY (184; 76%), Person=EMPTY (172; 71%), Tense=Past (159; 66%).

VERB tokens may have the following values of Form:

Paradigm DirectEclLen
Aspect=Hab|Mood=Ind|Number=Plur|Person=3|Tense=Presbhíd
Aspect=Hab|Mood=Ind|Polarity=Neg|Tense=Presbhíonn
Aspect=Imp|Number=Plur|Person=3|Tense=Pastbhídís
Aspect=Imp|Tense=Pastmbiodhbhíodh
Mood=Cndmbeadh, mbeathbheadh
Mood=Cnd|Number=Sing|Person=1mbéinn
Mood=Cnd|Number=Plur|Person=3mbéidís
Mood=Cnd|Polarity=Negbheadh, bhiadh
Mood=Ind|Number=Sing|Person=1|Tense=PastBhíos
Mood=Ind|Number=Plur|Person=1|Tense=PastBhamar
Mood=Ind|PronType=Rel|Tense=Futbhias
Mood=Ind|PronType=Rel|Tense=Presatá, tá, a-ta, áta
Mood=Ind|Tense=Futbheidh, bhéas
Mood=Ind|Tense=Pastbhí, bhi
Mood=Ind|Tense=Presbhfuil, bhfhuilimfhuil
Mood=Sub|Tense=Presmbeith

PART

90 PART tokens (29% of all PART tokens) have a non-empty value of Form.

The most frequent other feature values with which PART and Form co-occurred: Polarity=EMPTY (89; 99%), PronType=Rel (87; 97%), PartType=Vb (81; 90%).

PART tokens may have the following values of Form:

Paradigm aDirectDirect,LenIndirectLen
PartType=Infdho
PartType=Vb|PronType=Rela, do, noch, rodhoa

PROPN

40 PROPN tokens (18% of all PROPN tokens) have a non-empty value of Form.

The most frequent other feature values with which PROPN and Form co-occurred: Definite=Def (36; 90%), Number=Sing (31; 78%), Gender=Masc (28; 70%).

PROPN tokens may have the following values of Form:

Paradigm ÉireEclHPref
Case=Datn-Éirinn
Case=GenhÉireann

Form seems to be lexical feature of PROPN. 93% lemmas (28) occur only with one value of Form.

ADJ

31 ADJ tokens (15% of all ADJ tokens) have a non-empty value of Form.

The most frequent other feature values with which ADJ and Form co-occurred: Degree=EMPTY (21; 68%), Number=Sing (21; 68%), Case=Nom (16; 52%).

ADJ tokens may have the following values of Form:

Form seems to be lexical feature of ADJ. 100% lemmas (23) occur only with one value of Form.

ADP

22 ADP tokens (3% of all ADP tokens) have a non-empty value of Form.

The most frequent other feature values with which ADP and Form co-occurred: Number=Sing (15; 68%), Gender=EMPTY (14; 64%).

ADP tokens may have the following values of Form:

NUM

21 NUM tokens (35% of all NUM tokens) have a non-empty value of Form.

The most frequent other feature values with which NUM and Form co-occurred: NumType=Card (18; 86%).

NUM tokens may have the following values of Form:

Paradigm tríEclLen
ttríthrí, tri

PRON

12 PRON tokens (6% of all PRON tokens) have a non-empty value of Form.

The most frequent other feature values with which PRON and Form co-occurred: PronType=EMPTY (11; 92%), Number=Sing (9; 75%), Person=3 (8; 67%), Gender=Masc (7; 58%).

PRON tokens may have the following values of Form:

AUX

11 AUX tokens (14% of all AUX tokens) have a non-empty value of Form.

The most frequent other feature values with which AUX and Form co-occurred: Polarity=EMPTY (11; 100%), PronType=EMPTY (11; 100%), VerbForm=Cop (11; 100%), Tense=Pres (6; 55%).

AUX tokens may have the following values of Form:

Paradigm isEclVF
Tense=Pastmbagurab, dárab
Tense=Presgurab, darab

DET

1 DET tokens (0% of all DET tokens) have a non-empty value of Form.

The most frequent other feature values with which DET and Form co-occurred: Case=EMPTY (1; 100%), Definite=Def (1; 100%), Gender=EMPTY (1; 100%), Number=EMPTY (1; 100%), Person=EMPTY (1; 100%), Poss=EMPTY (1; 100%), PronType=EMPTY (1; 100%).

DET tokens may have the following values of Form:

SCONJ

1 SCONJ tokens (1% of all SCONJ tokens) have a non-empty value of Form.

SCONJ tokens may have the following values of Form:

Relations with Agreement in Form

The 10 most frequent relations where parent and child node agree in Form: VERB –[conj]–> VERB (21; 55%), NOUN –[obl]–> PROPN (2; 67%), NOUN –[vocative]–> NOUN (2; 67%), NUM –[nmod]–> NUM (2; 100%), NUM –[nummod]–> NUM (2; 67%), PART –[fixed]–> PART (2; 67%), VERB –[xcomp:pred]–> NUM (2; 100%), NOUN –[advcl]–> ADJ (1; 100%), NOUN –[ccomp]–> NOUN (1; 100%), NOUN –[xcomp:pred]–> PROPN (1; 100%).