home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Irish-IDT: Features: Form

This feature is language-specific. It occurs with 7 different values: Direct, Ecl, Emp, HPref, Indirect, Len, VF. Some words have combined values of the feature; 5 combinations have been observed: Direct|Emp, Ecl|Emp, Ecl|Indirect, Ecl|VF, Emp|Len.

19755 tokens (17%) have a non-empty value of Form. 4830 types (32%) occur at least once with a non-empty value of Form. 2788 lemmas (31%) occur at least once with a non-empty value of Form. The feature is used with 13 part-of-speech tags: NOUN (9937; 9% instances), VERB (4511; 4% instances), PART (2324; 2% instances), PROPN (1443; 1% instances), ADJ (924; 1% instances), NUM (261; 0% instances), AUX (180; 0% instances), PRON (66; 0% instances), DET (61; 0% instances), ADP (33; 0% instances), SCONJ (7; 0% instances), ADV (6; 0% instances), X (2; 0% instances).

NOUN

9937 NOUN tokens (30% of all NOUN tokens) have a non-empty value of Form.

The most frequent other feature values with which NOUN and Form co-occurred: VerbForm=EMPTY (8030; 81%), Case=Nom (6549; 66%), Number=Sing (6429; 65%), Definite=EMPTY (5327; 54%).

NOUN tokens may have the following values of Form:

Paradigm tuairimEclEmp,LenLen
Definite=Def|Number=Singdtuairimthuairimsethuairim
Definite=Def|Number=Plurdtuairimíthuairimí
Number=Singthuairim
Number=Plurthuairimí

VERB

4511 VERB tokens (51% of all VERB tokens) have a non-empty value of Form.

The most frequent other feature values with which VERB and Form co-occurred: Mood=Ind (3931; 87%), Person=EMPTY (3873; 86%).

VERB tokens may have the following values of Form:

Paradigm DirectDirect,EmpEclEcl,EmpEmpLen
Aspect=Hab|Mood=Ind|Number=Plur|Person=1|Tense=Presmbímidbhímid
Aspect=Hab|Mood=Ind|Polarity=Neg|Tense=Presmbíonnbhíonn
Aspect=Hab|Mood=Ind|Tense=Presmbíonnbhíonn, bhíos
Aspect=Imp|Number=Sing|Person=1|Tense=Pastmbínnmbínnse
Aspect=Imp|Number=Plur|Person=1|Tense=Pastmbímis
Aspect=Imp|Number=Plur|Person=3|Polarity=Neg|Tense=Pastbhídís
Aspect=Imp|Number=Plur|Person=3|Tense=Pastbhídís
Aspect=Imp|Polarity=Neg|Tense=Pastmbíodhbhíodh
Aspect=Imp|PronType=Rel|Tense=Pastbhíos
Aspect=Imp|Tense=Pastmbíodhbhíodh
Dialect=Munster|Mood=Ind|Number=Plur|Person=3|Tense=Presbhfuilid
Mood=Cndmbeadhbheadh
Mood=Cnd,Intmbeadh
Mood=Cnd|Number=Sing|Person=1mbeinnBheinn
Mood=Cnd|Number=Sing|Person=2bheifeá, bheitheá
Mood=Cnd|Number=Sing|Person=2|Polarity=Negmbeifeá
Mood=Cnd|Number=Plur|Person=1mbeimisbheimís
Mood=Cnd|Number=Plur|Person=3mbeidísbheidís
Mood=Cnd|Number=Plur|Person=3|Polarity=Negmbeidís
Mood=Cnd|Person=0bheifí
Mood=Cnd|Polarity=Negmbeadhbheadh
Mood=Ind|Number=Sing|Person=1|PronType=Rel|Tense=Presatáimatáimse
Mood=Ind|Number=Sing|Person=1|Tense=Pastbhíos
Mood=Ind|Number=Sing|Person=1|Tense=PresTáimse
Mood=Ind|Number=Sing|Person=2|Polarity=Neg|Tense=Presnílirse
Mood=Ind|Number=Plur|Person=1|Polarity=Neg|Tense=Presbhfuilimíd
Mood=Ind|Number=Plur|Person=1|Tense=Futmbeimidbheimid
Mood=Ind|Number=Plur|Person=1|Tense=PastBhíomar
Mood=Ind|Number=Plur|Person=1|Tense=Presbhfuilimid
Mood=Ind|Number=Plur|Person=3|PronType=Rel|Tense=Presatáid
Mood=Ind|Number=Plur|Person=3|Tense=Pastbhíodar
Mood=Ind|Number=Plur|Person=3|Tense=Presbhfuilid
Mood=Ind|Person=0|Polarity=Neg|Tense=Futmbeifear
Mood=Ind|Person=0|PronType=Rel|Tense=Presatáthar
Mood=Ind|Person=0|Tense=Pastbhíothas
Mood=Ind|Person=0|Tense=Presbhfuiltear
Mood=Ind|Polarity=Neg|Tense=Futmbeidhbheidh
Mood=Ind|Polarity=Neg|Tense=Presbhfuil
Mood=Ind|PronType=Rel|Tense=Futbheas, bhéas
Mood=Ind|PronType=Rel|Tense=Presatá, tá
Mood=Ind|PronType=Rel|Tense=Pres|Typo=Yesata
Mood=Ind|Tense=Futmbeidhbheidh, bhéidh
Mood=Ind|Tense=Pastbhí
Mood=Ind|Tense=Presbhfuil

PART

2324 PART tokens (33% of all PART tokens) have a non-empty value of Form.

The most frequent other feature values with which PART and Form co-occurred: PronType=Rel (2305; 99%), PartType=Vb (2151; 93%).

PART tokens may have the following values of Form:

Paradigm aDirectEcl,IndirectIndirect
a, don-aa, go

PROPN

1443 PROPN tokens (24% of all PROPN tokens) have a non-empty value of Form.

The most frequent other feature values with which PROPN and Form co-occurred: Definite=Def (1439; 100%), Number=Sing (1368; 95%), Gender=Fem (730; 51%).

PROPN tokens may have the following values of Form:

Paradigm GaeltachtEclLen
Case=Gen|NounType=Strong|Number=PlurnGaeltachtaíGhaeltachtaí
Case=Gen|Number=SingGhaeltachta
Case=Nom|Number=SingnGaeltachtGhaeltacht
Number=SingGhaeltacht

ADJ

924 ADJ tokens (14% of all ADJ tokens) have a non-empty value of Form.

The most frequent other feature values with which ADJ and Form co-occurred: VerbForm=EMPTY (897; 97%), NounType=EMPTY (839; 91%), Degree=EMPTY (594; 64%), Case=Nom (500; 54%), Number=Sing (479; 52%).

ADJ tokens may have the following values of Form:

Paradigm céannaEclLen
Gender=Masc|NounType=Slender|Number=Plurchéanna
Gender=Masc|Number=Singgcéannachéanna
Gender=Fem|Number=Singchéanna

Form seems to be lexical feature of ADJ. 99% lemmas (292) occur only with one value of Form.

NUM

261 NUM tokens (13% of all NUM tokens) have a non-empty value of Form.

The most frequent other feature values with which NUM and Form co-occurred: NumType=Card (152; 58%).

NUM tokens may have the following values of Form:

Paradigm céadEclLen
NumType=Cardgcéadchéad
NumType=Ordgcéadchéad

AUX

180 AUX tokens (12% of all AUX tokens) have a non-empty value of Form.

The most frequent other feature values with which AUX and Form co-occurred: VerbForm=Cop (180; 100%), Polarity=EMPTY (162; 90%), Tense=Past (145; 81%).

AUX tokens may have the following values of Form:

Paradigm isEclEcl,VFLenVF
Dialect=Ulster|Polarity=Neg|Tense=Preschan
Mood=CndmbaB'
Mood=Int|Tense=Pastarbh, arb
Polarity=Neg|PronType=Rel|Tense=Pastnárbh
Polarity=Neg|Tense=Pastníorbh, nárbh
Polarity=Neg|Tense=PresChan
PronType=Rel|Tense=Pastab
Tense=Pastmbamb'b', gurbh, b’, darbh, arb
Tense=Presgurb, darb

PRON

66 PRON tokens (2% of all PRON tokens) have a non-empty value of Form.

The most frequent other feature values with which PRON and Form co-occurred: Gender=EMPTY (54; 82%), Number=EMPTY (43; 65%), Person=EMPTY (43; 65%), PronType=Dem (34; 52%).

PRON tokens may have the following values of Form:

DET

61 DET tokens (1% of all DET tokens) have a non-empty value of Form.

The most frequent other feature values with which DET and Form co-occurred: Case=EMPTY (61; 100%), Gender=EMPTY (60; 98%), Number=EMPTY (59; 97%), Person=EMPTY (59; 97%), Poss=EMPTY (59; 97%), PronType=EMPTY (50; 82%), Definite=Def (48; 79%).

DET tokens may have the following values of Form:

ADP

33 ADP tokens (0% of all ADP tokens) have a non-empty value of Form.

The most frequent other feature values with which ADP and Form co-occurred: PronType=EMPTY (28; 85%), Person=3 (22; 67%).

ADP tokens may have the following values of Form:

SCONJ

7 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Form.

SCONJ tokens may have the following values of Form:

ADV

6 ADV tokens (0% of all ADV tokens) have a non-empty value of Form.

ADV tokens may have the following values of Form:

X

2 X tokens (1% of all X tokens) have a non-empty value of Form.

The most frequent other feature values with which X and Form co-occurred: Foreign=Yes (2; 100%).

X tokens may have the following values of Form:

Relations with Agreement in Form

The 10 most frequent relations where parent and child node agree in Form: ADJ –[conj]–> ADJ (17; 57%), ADJ –[advcl]–> ADJ (4; 100%), NOUN –[ccomp]–> ADJ (3; 60%), ADJ –[obl]–> NUM (2; 100%), PRON –[vocative]–> NOUN (2; 67%), ADJ –[csubj:cleft]–> NOUN (1; 100%), ADJ –[parataxis]–> ADJ (1; 100%), ADJ –[vocative]–> NOUN (1; 100%), NOUN –[xcomp:pred]–> PROPN (1; 100%), VERB –[dislocated]–> VERB (1; 100%).