Treebank Statistics: UD_Romanian-TueCL: Features: Typo
This feature is language-specific.
It occurs with 1 different values: Yes.
430 tokens (10%) have a non-empty value of Typo.
262 types (17%) occur at least once with a non-empty value of Typo.
216 lemmas (19%) occur at least once with a non-empty value of Typo.
The feature is used with 13 part-of-speech tags: NOUN (90; 2% instances), VERB (80; 2% instances), ADP (40; 1% instances), ADJ (35; 1% instances), PART (34; 1% instances), SCONJ (34; 1% instances), PRON (30; 1% instances), CCONJ (29; 1% instances), ADV (27; 1% instances), AUX (18; 0% instances), DET (8; 0% instances), PROPN (3; 0% instances), NUM (2; 0% instances).
NOUN
90 NOUN tokens (11% of all NOUN tokens) have a non-empty value of Typo.
The most frequent other feature values with which NOUN and Typo co-occurred: Number=Sing (66; 73%), Case=Acc,Nom (60; 67%), Definite=Ind (56; 62%), Gender=Fem (51; 57%).
NOUN tokens may have the following values of Typo:
Yes(90; 100% of non-emptyTypo): barbat, barbati, carucior, fata, fisa, soti, tarfa, viata, ESCORTA, FUNDULET
Typo seems to be lexical feature of NOUN. 100% lemmas (74) occur only with one value of Typo.
VERB
80 VERB tokens (15% of all VERB tokens) have a non-empty value of Typo.
The most frequent other feature values with which VERB and Typo co-occurred: Gender=EMPTY (68; 85%), VerbForm=Fin (66; 83%), Tense=Pres (59; 74%), Mood=Ind (53; 66%).
VERB tokens may have the following values of Typo:
Yes(80; 100% of non-emptyTypo): facut, stiu, injure, vad, ACCEPTI, BATA, Incepuse, Intreb, MERITI, Multumesc
Typo seems to be lexical feature of VERB. 100% lemmas (60) occur only with one value of Typo.
ADP
40 ADP tokens (8% of all ADP tokens) have a non-empty value of Typo.
The most frequent other feature values with which ADP and Typo co-occurred: AdpType=Prep (40; 100%), Case=Acc (37; 93%).
ADP tokens may have the following values of Typo:
Yes(40; 100% of non-emptyTypo): in, dupa, ex, fara, intre, Intr, ca, fata, fu, impotriva
Typo seems to be lexical feature of ADP. 100% lemmas (11) occur only with one value of Typo.
ADJ
35 ADJ tokens (15% of all ADJ tokens) have a non-empty value of Typo.
The most frequent other feature values with which ADJ and Typo co-occurred: Definite=Ind (32; 91%), Degree=Pos (30; 86%), Gender=Fem (27; 77%), Number=Sing (27; 77%), Case=Acc,Nom (24; 69%).
ADJ tokens may have the following values of Typo:
Yes(35; 100% of non-emptyTypo): frumoasa, FRUMOSE, SEXSY, apetisanta, arsa, buna, ciudati, comunista, desteapta, divin
Typo seems to be lexical feature of ADJ. 100% lemmas (28) occur only with one value of Typo.
PART
34 PART tokens (20% of all PART tokens) have a non-empty value of Typo.
The most frequent other feature values with which PART and Typo co-occurred: Mood=Sub (32; 94%), Polarity=EMPTY (32; 94%).
PART tokens may have the following values of Typo:
Yes(34; 100% of non-emptyTypo): sa, n
SCONJ
34 SCONJ tokens (33% of all SCONJ tokens) have a non-empty value of Typo.
The most frequent other feature values with which SCONJ and Typo co-occurred: Polarity=Pos (33; 97%).
SCONJ tokens may have the following values of Typo:
Yes(34; 100% of non-emptyTypo): ca, daca, cand
PRON
30 PRON tokens (8% of all PRON tokens) have a non-empty value of Typo.
The most frequent other feature values with which PRON and Typo co-occurred: Reflex=EMPTY (27; 90%), Variant=EMPTY (22; 73%), Gender=EMPTY (20; 67%), PronType=Prs (19; 63%), Strength=Weak (19; 63%), Number=Sing (18; 60%).
PRON tokens may have the following values of Typo:
Yes(30; 100% of non-emptyTypo): isi, ma, va, astia, iti, te, ti, Ala, Nici, aia
Typo seems to be lexical feature of PRON. 100% lemmas (12) occur only with one value of Typo.
CCONJ
29 CCONJ tokens (21% of all CCONJ tokens) have a non-empty value of Typo.
The most frequent other feature values with which CCONJ and Typo co-occurred: Polarity=Pos (28; 97%).
CCONJ tokens may have the following values of Typo:
Yes(29; 100% of non-emptyTypo): si, da
ADV
27 ADV tokens (10% of all ADV tokens) have a non-empty value of Typo.
The most frequent other feature values with which ADV and Typo co-occurred: PronType=EMPTY (20; 74%), Degree=Pos (16; 59%).
ADV tokens may have the following values of Typo:
Yes(27; 100% of non-emptyTypo): asa, cand, niciodata, ca, decat, dupa, parca, Alaltaieri, Cat, MACAR
Typo seems to be lexical feature of ADV. 100% lemmas (18) occur only with one value of Typo.
AUX
18 AUX tokens (7% of all AUX tokens) have a non-empty value of Typo.
The most frequent other feature values with which AUX and Typo co-occurred: Number=Sing (15; 83%), Mood=Ind (10; 56%), Tense=Pres (10; 56%), VerbForm=Fin (10; 56%).
AUX tokens may have the following values of Typo:
Yes(18; 100% of non-emptyTypo): esti, as, Find, ati, ii, o, s
DET
8 DET tokens (4% of all DET tokens) have a non-empty value of Typo.
The most frequent other feature values with which DET and Typo co-occurred: Number[psor]=EMPTY (6; 75%), Poss=EMPTY (6; 75%), Person=3 (5; 63%), Position=EMPTY (5; 63%).
DET tokens may have the following values of Typo:
Yes(8; 100% of non-emptyTypo): asta, niste, aceleasi, tai, tau, unui
PROPN
3 PROPN tokens (4% of all PROPN tokens) have a non-empty value of Typo.
PROPN tokens may have the following values of Typo:
Yes(3; 100% of non-emptyTypo): Franta, parazitii, romania
NUM
2 NUM tokens (6% of all NUM tokens) have a non-empty value of Typo.
The most frequent other feature values with which NUM and Typo co-occurred: NumType=Card (2; 100%).
NUM tokens may have the following values of Typo:
Yes(2; 100% of non-emptyTypo): 2,5, doua
Relations with Agreement in Typo
The 10 most frequent relations where parent and child node agree in Typo:
VERB –[mark]–> PART (16; 52%),
ADJ –[conj]–> ADJ (4; 100%),
NOUN –[compound]–> ADP (2; 100%),
NOUN –[list]–> NOUN (2; 100%),
VERB –[xcomp]–> NOUN (2; 100%),
ADJ –[obl]–> ADJ (1; 100%),
NOUN –[advmod:tmod]–> ADV (1; 100%),
NOUN –[advmod]–> PART (1; 100%),
NOUN –[conj]–> ADJ (1; 100%),
NOUN –[mark]–> PART (1; 100%).