Treebank Statistics: UD_Catalan: Features: Number
This feature is universal.
It occurs with 2 different values: Plur
, Sing
.
This is a layered feature with the following layers: Number, Number[psor].
260657 tokens (49%) have a non-empty value of Number
.
20447 types (63%) occur at least once with a non-empty value of Number
.
11540 lemmas (49%) occur at least once with a non-empty value of Number
.
The feature is used with 10 part-of-speech tags: NOUN (88924; 17% instances), DET (72502; 14% instances), ADJ (29655; 6% instances), VERB (24400; 5% instances), AUX (21349; 4% instances), ADP (14673; 3% instances), PRON (6427; 1% instances), NUM (2666; 1% instances), ADV (60; 0% instances), PROPN (1; 0% instances).
NOUN
88924 NOUN tokens (90% of all NOUN
tokens) have a non-empty value of Number
.
NOUN
tokens may have the following values of Number
:
Plur
(27193; 31% of non-emptyNumber
): anys, milions, persones, obres, mesos, joves, dies, empreses, agents, activitatsSing
(61731; 69% of non-emptyNumber
): any, president, part, terme, grup, projecte, cap, lloc, cas, portaveuEMPTY
(9827): pessetes, any, través, temps, euros, juny, partir, dia, fa, tal
Paradigm any | Sing | Plur |
---|---|---|
any | anys |
DET
72502 DET tokens (100% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: PronType=Art (53752; 74%), Definite=Def (48972; 68%).
DET
tokens may have the following values of Number
:
Plur
(14641; 20% of non-emptyNumber
): les, els, seus, altres, aquests, seves, tots, aquestes, uns, diferentsSing
(57861; 80% of non-emptyNumber
): la, el, l’, un, una, aquest, seva, aquesta, seu, totEMPTY
(94): meva, prou, gaire, massa, meves, cada, força, teus, teva, Que
Paradigm el | Sing | Plur |
---|---|---|
Definite=Def|Gender=Masc|PronType=Art | el | els |
Definite=Def|Gender=Fem|PronType=Art | la, L' | les |
Definite=Def|PronType=Art | l' | |
Gender=Masc|PronType=Art | el | els |
Gender=Fem|Person=3|Poss=Yes|PronType=Prs | les | |
Gender=Fem|PronType=Art | la | les |
PronType=Art | l' |
ADJ
29655 ADJ tokens (99% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: VerbForm=EMPTY (24066; 81%).
ADJ
tokens may have the following values of Number
:
Plur
(9066; 31% of non-emptyNumber
): grans, principals, importants, municipals, noves, nous, socials, locals, últims, culturalsSing
(20589; 69% of non-emptyNumber
): gran, general, passat, primer, nou, primera, actual, nova, important, socialEMPTY
(390): baix, gran, clau, especial, directe, nord, pilot, límit, xàrter, sud
Paradigm nou | Sing | Plur |
---|---|---|
Gender=Masc | nou | nous |
Gender=Fem | nova | noves |
VERB
24400 VERB tokens (61% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Gender=EMPTY (17731; 73%), VerbForm=Fin (17731; 73%), Person=3 (16919; 69%), Mood=Ind (15501; 64%), Tense=Pres (12488; 51%).
VERB
tokens may have the following values of Number
:
Plur
(5174; 21% of non-emptyNumber
): tenen, fan, tenim, faran, volen, formen, van, consideren, destaquen, tenienSing
(19226; 79% of non-emptyNumber
): té, ha, fet, explicat, dit, fa, considera, cal, farà, volEMPTY
(15514): fer, tenir, dir, donar, arribar, aconseguir, passar, presentar, veure, anar
Paradigm fer | Sing | Plur |
---|---|---|
Gender=Masc|Tense=Past|VerbForm=Part | fet | |
Gender=Fem|Tense=Past|VerbForm=Part | fetes | |
Mood=Cnd|Person=1|VerbForm=Fin | faria | faríem |
Mood=Cnd|Person=3|VerbForm=Fin | faria | farien |
Mood=Imp|Person=1|VerbForm=Fin | fem | |
Mood=Imp|Person=3|VerbForm=Fin | Faci | |
Mood=Ind|Person=1|Tense=Fut|VerbForm=Fin | faré | farem |
Mood=Ind|Person=1|Tense=Imp|VerbForm=Fin | feia | fèiem |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | faig | fem |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | fas | |
Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | farà | faran |
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | feia | feien |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | fa | fan |
Mood=Sub|Person=1|Tense=Pres|VerbForm=Fin | faci | |
Mood=Sub|Person=3|Tense=Imp|VerbForm=Fin | fes | fessin |
Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | faci | facin |
AUX
21349 AUX tokens (89% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: VerbForm=Fin (20553; 96%), Person=3 (19974; 94%), Mood=Ind (19394; 91%), Tense=Pres (18118; 85%).
AUX
tokens may have the following values of Number
:
Plur
(5053; 24% of non-emptyNumber
): van, han, són, estan, hem, poden, havien, seran, podran, erenSing
(16296; 76% of non-emptyNumber
): va, ha, és, estat, està, havia, pot, serà, era, faEMPTY
(2677): ser, haver, poder, fer, estar, tornar, començar, deixar, sent, intentar
Paradigm haver | Sing | Plur |
---|---|---|
Gender=Masc|Tense=Past|VerbForm=Part | hagut | |
Mood=Cnd|Person=1|VerbForm=Fin | hauríem | |
Mood=Cnd|Person=3|VerbForm=Fin | hauria | haurien |
Mood=Ind|Person=1|Tense=Fut|VerbForm=Fin | hauré | haurem |
Mood=Ind|Person=1|Tense=Imp|VerbForm=Fin | havia | havíem |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | he | hem |
Mood=Ind|Person=2|Tense=Fut|VerbForm=Fin | haureu | |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | has | heu |
Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | haurà | hauran |
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | havia | havien |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | ha | han |
Mood=Sub|Person=1|Tense=Imp|VerbForm=Fin | haguéssim | |
Mood=Sub|Person=1|Tense=Pres|VerbForm=Fin | haguem, hàgim | |
Mood=Sub|Person=3|Tense=Imp|VerbForm=Fin | hagués | haguessin, haguéssin |
Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | hagi | hagin |
ADP
14673 ADP tokens (17% of all ADP
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADP
and Number
co-occurred: AdpType=Preppron (14673; 100%), Gender=Masc (14670; 100%).
ADP
tokens may have the following values of Number
:
Plur
(3850; 26% of non-emptyNumber
): dels, als, pelsSing
(10823; 74% of non-emptyNumber
): del, al, pel, do, daEMPTY
(73302): de, a, d’, en, per, amb, entre, sobre, segons, des
Paradigm del | Sing | Plur |
---|---|---|
del | dels |
PRON
6427 PRON tokens (28% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: Person=EMPTY (3663; 57%).
PRON
tokens may have the following values of Number
:
Plur
(1844; 29% of non-emptyNumber
): els, quals, ens, altres, uns, les, ells, los, alguns, nosaltresSing
(4583; 71% of non-emptyNumber
): un, li, tot, això, qual, la, el, l’, ell, unaEMPTY
(16942): que, es, s’, hi, se, on, ho, què, qui, n’
Paradigm ell | Sing | Plur |
---|---|---|
Case=Acc|Gender=Masc | el, lo, 'l, li, -lo | els, 'ls |
Case=Acc|Gender=Fem | la, -la | les |
Case=Acc | l' | |
Case=Dat | li | |
Gender=Masc | ell | ells, els |
Gender=Fem | ella | elles |
els, los, 'ls |
NUM
2666 NUM tokens (29% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumForm=EMPTY (2666; 100%), NumType=Card (2666; 100%), Gender=EMPTY (1345; 50%).
NUM
tokens may have the following values of Number
:
Plur
(2219; 83% of non-emptyNumber
): dos, tres, dues, quatre, cinc, sis, set, vuit, deu, nouSing
(447; 17% of non-emptyNumber
): un, una, mig, mitja, doble, tercer, quart, triple, desena, cinquenaEMPTY
(6595): cent, 10, 15, 30, 20, 5, 4, 12, 2, 2000
Paradigm dos | Sing | Plur |
---|---|---|
Gender=Masc | dos | dos |
Gender=Fem | dues | |
dos |
Number
seems to be lexical feature of NUM
. 95% lemmas (63) occur only with one value of Number
.
ADV
60 ADV tokens (0% of all ADV
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADV
and Number
co-occurred: Polarity=EMPTY (60; 100%).
ADV
tokens may have the following values of Number
:
Plur
(18; 30% of non-emptyNumber
): més, enfront, fins, quantSing
(42; 70% of non-emptyNumber
): fins, més, entorn, enfront, enllà, prop, quant, enmigEMPTY
(15396): no, més, també, ja, després, ahir, molt, avui, només, ara
Paradigm més | Sing | Plur |
---|---|---|
més | més |
PROPN
1 PROPN tokens (0% of all PROPN
tokens) have a non-empty value of Number
.
PROPN
tokens may have the following values of Number
:
Sing
(1; 100% of non-emptyNumber
): JustíciaEMPTY
(46731): Catalunya, Barcelona, Generalitat, Govern, sant, Ajuntament, Girona, Josep, CiU, PP
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[det]–> DET (56456; 96%),
NOUN –[amod]–> ADJ (22797; 97%),
NOUN –[nmod]–> NOUN (13181; 52%),
VERB –[nsubj]–> NOUN (8077; 65%),
NOUN –[acl]–> VERB (3961; 51%),
NOUN –[conj]–> NOUN (3922; 78%),
ADJ –[cop]–> AUX (1720; 86%),
VERB –[conj]–> VERB (1536; 67%),
NOUN –[cop]–> AUX (1486; 70%),
DET –[det]–> DET (1449; 100%).