Treebank Statistics: UD_Latvian-LVTB: POS Tags: DET
There are 55 DET lemmas (0%), 260 DET types (0%) and 7549 DET tokens (2%).
Out of 17 observed tags, the rank of DET is: 11 in number of lemmas, 8 in number of types and 11 in number of tokens.
The 10 most frequent DET lemmas: šis, šī, sava, tas, savs, viss, tā, kāds, visa, cita
The 10 most frequent DET types: savu, šo, to, šī, šajā, šīs, tā, tās, kādu, savas
The 10 most frequent ambiguous lemmas: šis (DET 805, PRON 99), šī (DET 662, PRON 37), sava (DET 607, PRON 3), tas (PRON 2830, DET 535, ADV 2), savs (DET 448, PRON 10), viss (PRON 472, DET 381), tā (PRON 996, ADV 418, DET 362, SCONJ 46, PART 34, CCONJ 11), kāds (DET 292, PRON 177), visa (DET 279, PRON 17), cita (DET 218, PRON 33)
The 10 most frequent ambiguous types: savu (DET 447, PRON 7), šo (DET 322, PRON 11), to (PRON 998, DET 246, X 4, PART 1), šī (DET 200, PRON 12), šajā (DET 151, PRON 2), šīs (DET 162, PRON 6), tā (PRON 410, ADV 341, DET 165, PART 22, SCONJ 13, CCONJ 10), tās (PRON 248, DET 160), kādu (DET 144, PRON 34), visu (DET 126, PRON 125)
- savu
- šo
- to
  - PRON 998: Viņš par to jau bija iedomājies .
  - DET 246: Nu to saticību – to mēs piedomāsim klāt .
  - X 4: Aktrise Keita Hadsone izdevusi grāmatu « Pretty happy : Healthy ways to love your body » .
  - PART 1: A to sanāk , ka tikai tie kādi pieci vai desmit , vai piecdesmit gudrinieki tur , augšā , to jēgu zina .
- šī
- šajā
- šīs
- tā
  - PRON 410: Es zināju , ka tā ir nejauka provokācija .
  - ADV 341: Kā tu tā , bračkiņ ?!
  - DET 165: - Klau , tā sieviete .
  - PART 22: - Nu nav taču kur , jau tā kājas slapjas .
  - SCONJ 13: Viņai ir negatīvais rēzuss , tā ka jūs nākat kā saukts .
  - CCONJ 10: Kā pirka lepnas mājas par skaidru naudu agrāk , tā pērk tagad .
- tās
- kādu
- visu
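The per-type tag counts listed above could be reproduced roughly as follows. This is a minimal sketch, not the script that generated this page: it assumes the treebank is available as CoNLL-U text, and the function names `tag_counts_by_form` and `ambiguous_forms` are hypothetical. Lowercasing the forms before counting is also an assumption; the page does not state how case is handled.

```python
from collections import Counter, defaultdict


def tag_counts_by_form(conllu_text):
    """Map each lowercased word form to a Counter of the UPOS tags it received."""
    counts = defaultdict(Counter)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        # Skip malformed lines, multiword token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, upos = cols[1], cols[3]
        counts[form.lower()][upos] += 1  # lowercasing is an assumption
    return counts


def ambiguous_forms(counts, upos):
    """Forms tagged `upos` at least once and with at least one other tag,
    sorted by how often they occur as `upos` (most frequent first)."""
    hits = [(form, c) for form, c in counts.items() if c[upos] > 0 and len(c) > 1]
    return sorted(hits, key=lambda item: -item[1][upos])
```

Calling `ambiguous_forms(tag_counts_by_form(text), "DET")[:10]` would give a list comparable to the one above, up to whatever normalization the original tool applies.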
Morphology
The form / lemma ratio of DET is 4.727273 (the average of all parts of speech is 2.340184).
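The ratio is the number of distinct forms divided by the number of distinct lemmas for the tag. A minimal sketch of that computation, assuming CoNLL-U input and a hypothetical helper name `form_lemma_ratio` (whether the original tool compares forms case-sensitively is not stated here, so the sketch keeps forms as written):

```python
def form_lemma_ratio(conllu_text, upos):
    """Distinct forms divided by distinct lemmas among tokens tagged `upos`."""
    forms, lemmas = set(), set()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        # Skip malformed lines, multiword token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            forms.add(cols[1])
            lemmas.add(cols[2])
    # Return 0.0 when the tag does not occur, to avoid dividing by zero.
    return len(forms) / len(lemmas) if lemmas else 0.0
```

With the counts reported at the top of this page, 260 types over 55 lemmas gives 260 / 55 ≈ 4.727273, which matches the figure above.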
The 1st highest number of forms (13) was observed with the lemma “tas”: t, tai, tais, tajos, tajā, tam, tanī, tas, tie, tiem, to, tos, tā.
The 2nd highest number of forms (11) was observed with the lemma “šis”: šai, šajos, šajā, šie, šiem, šim, šis, šo, šos, šā, šī.
The 3rd highest number of forms (10) was observed with the lemma “daudzi”: daudzajiem, daudzas, daudzi, daudziem, daudzo, daudzos, daudzu, daudzus, daudzām, daudzās.
DET occurs with 9 features: Case (7549; 100% instances), Number (7470; 99% instances), Gender (7469; 99% instances), PronType (7184; 95% instances), Person (2368; 31% instances), Poss (1366; 18% instances), Definite (365; 5% instances), Degree (365; 5% instances), Typo (12; 0% instances)
DET occurs with 23 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Definite=Def, Definite=Ind, Degree=Pos, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Ind,Neg, PronType=Int, PronType=Prs, PronType=Rel, PronType=Tot, Typo=Yes
DET occurs with 182 feature combinations.
The most frequent feature combination is Case=Gen|Gender=Masc|Number=Sing|Person=3|PronType=Dem (284 tokens).
Examples: tā, šī, šā
Relations
DET nodes are attached to their parents using 1 relation: det (7549; 100% instances)
Parents of DET nodes belong to 11 different parts of speech: NOUN (7163; 95% instances), ADJ (187; 2% instances), VERB (75; 1% instances), PROPN (55; 1% instances), NUM (34; 0% instances), PRON (17; 0% instances), DET (9; 0% instances), ADV (3; 0% instances), INTJ (3; 0% instances), X (2; 0% instances), SYM (1; 0% instances)
7097 (94%) DET nodes are leaves.
384 (5%) DET nodes have one child.
63 (1%) DET nodes have two children.
5 (0%) DET nodes have three or more children.
The highest child degree of a DET node is 3.
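The leaf and child-degree counts above amount to a histogram of how many dependents each DET node has. A minimal sketch of that count, assuming CoNLL-U input with sentences separated by blank lines; the function name `child_degree_histogram` is hypothetical and this is not the tool that produced the page:

```python
from collections import Counter


def child_degree_histogram(conllu_text, upos):
    """Return a Counter mapping number-of-children -> number of `upos` nodes."""
    hist = Counter()
    sentence = []  # (token id, head id, UPOS) for the sentence being read

    def flush():
        # Count how many tokens name each id as their head, then bucket
        # every node tagged `upos` by that count (0 means the node is a leaf).
        children = Counter(head for _, head, _ in sentence)
        for tok_id, _, tag in sentence:
            if tag == upos:
                hist[children[tok_id]] += 1
        sentence.clear()

    for line in conllu_text.splitlines():
        if not line.strip():
            flush()  # a blank line ends the sentence
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip malformed lines, multiword token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        sentence.append((cols[0], cols[6], cols[3]))
    flush()  # handle a final sentence with no trailing blank line
    return hist
```

In the returned histogram, the entry for key 0 corresponds to the leaves, and the largest key is the highest child degree.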
Children of DET nodes are attached using 19 different relations: discourse (148; 28% instances), compound (90; 17% instances), advmod (81; 15% instances), case (62; 12% instances), acl (58; 11% instances), conj (21; 4% instances), fixed (20; 4% instances), ccomp (13; 2% instances), det (9; 2% instances), nmod (6; 1% instances), obl (5; 1% instances), punct (4; 1% instances), dep (2; 0% instances), advcl (1; 0% instances), amod (1; 0% instances), cc (1; 0% instances), goeswith (1; 0% instances), iobj (1; 0% instances), parataxis (1; 0% instances)
Children of DET nodes belong to 13 different parts of speech: PART (148; 28% instances), PRON (106; 20% instances), ADV (83; 16% instances), VERB (65; 12% instances), ADP (62; 12% instances), SCONJ (20; 4% instances), ADJ (13; 2% instances), NOUN (11; 2% instances), DET (9; 2% instances), PUNCT (4; 1% instances), PROPN (2; 0% instances), CCONJ (1; 0% instances), X (1; 0% instances)