Treebank Statistics: UD_Lithuanian-ALKSNIS: POS Tags: DET
There are 11 DET lemmas (0%), 154 DET types (1%) and 1780 DET tokens (3%).
Out of 17 observed tags, the rank of DET is: 15 in number of lemmas, 9 in number of types and 9 in number of tokens.
The 10 most frequent DET lemmas: tas, kuris, šis, visas, toks, pats, koks, kiekvienas, joks, šitas
The 10 most frequent DET types: tai, to, kurie, kurios, šio, šios, kuris, kurių, toks, visą
The 10 most frequent ambiguous lemmas: tas (DET 436, X 2, PART 1), kuris (DET 382, X 50), pats (DET 84, X 27), koks (DET 77, X 1)
The 10 most frequent ambiguous types: tai (DET 156, PART 42, X 17, CCONJ 7, ADV 1, PRON 1), to (DET 62, X 35), kurie (DET 61, X 17), kurios (DET 57, X 10), kurių (DET 38, X 10), visų (DET 31, ADV 9), kuri (DET 27, X 1), tą (DET 24, X 1), pats (DET 24, X 3), tuo (DET 19, SCONJ 12, ADV 2)
- tai
- DET 156: Prezidentūra aiškina , kad V . Adamkus siūlė ne tai .
- PART 42: Darbas namuose – tai ne karjera tradicine šio žodžio prasme .
- X 17: Visa tai sukelia tik milžinišką įtampą ir savotiškas varžybas .
- CCONJ 7: Jei ji tikrai yra stambi , tai pati seniai tą žino , todėl svarbu to neakcentuoti .
- ADV 1: Anot ULPKC specialistų , ši grėsminga liga dažniau nustatoma vyresniame amžiuje , tai yra 55 - 64 ir per 65 metų amžiaus žmonėms .
- PRON 1: Tačiau demografinių charakteristikų nustatymu tyrime neapsiribojama , o siekiama atsakyti į klausimą , „ kokia dalis išvykusiųjų neišnaudoja savo turimos kvalifikacijos ir užsienyje dirba žemesnės kvalifikacijos darbus “ ir kokią tai daro įtaką išvykusiųjų pajamoms , žmogiškajam kapitalui ir galimybėms grįžti ( p . 8 ) .
- to
- kurie
- kurios
- kurių
- visų
- kuri
- tą
- pats
- tuo
Morphology
The form / lemma ratio of DET is 14.000000 (the average of all parts of speech is 2.065341).
The 1st highest number of forms (23) was observed with the lemma “kuris”: kuri, kuria, kuriai, kuriais, kuriam, kuriame, kurias, kurie, kuriems, kurio, kurioj, kurioje, kuriomis, kurioms, kurios, kuriose, kuris, kuriuo, kuriuos, kuriuose, kurią, kurių, kurį.
The 2nd highest number of forms (21) was observed with the lemma “šis”: Šiai, ši, šia, šiais, šiam, šiame, šias, šie, šio, šioje, šiomis, šioms, šios, šiose, šis, šiuo, šiuos, šiuose, šią, šių, šį.
The 3rd highest number of forms (20) was observed with the lemma “tas”: Tasai, ta, tai, tais, tam, tame, tas, tatai, tie, tiems, to, toje, toji, tomis, tos, tose, tuo, tuos, tą, tų.
DET occurs with 6 features: Definite (1780; 100% instances), Gender (1780; 100% instances), PronType (1780; 100% instances), Case (1568; 88% instances), Number (1568; 88% instances), Hyph (55; 3% instances)
DET occurs with 19 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Gender=Neut, Hyph=Yes, Number=Plur, Number=Sing, PronType=Dem, PronType=Emp, PronType=Int,Rel, PronType=Neg, PronType=Tot
DET occurs with 126 feature combinations.
The most frequent feature combination is Definite=Ind|Gender=Neut|PronType=Dem (206 tokens).
Examples: tai, Šitai
Relations
DET nodes are attached to their parents using 15 different relations: det (926; 52% instances), nsubj (316; 18% instances), obl:arg (189; 11% instances), nmod (79; 4% instances), obj (77; 4% instances), obl (62; 3% instances), amod (46; 3% instances), root (23; 1% instances), nsubj:pass (22; 1% instances), conj (21; 1% instances), dep (10; 1% instances), xcomp (4; 0% instances), ccomp (2; 0% instances), flat (2; 0% instances), advcl (1; 0% instances)
Parents of DET nodes belong to 11 different parts of speech: NOUN (1017; 57% instances), VERB (590; 33% instances), ADJ (84; 5% instances), PRON (26; 1% instances), (23; 1% instances), PROPN (14; 1% instances), ADV (10; 1% instances), DET (7; 0% instances), X (5; 0% instances), NUM (3; 0% instances), PART (1; 0% instances)
1451 (82%) DET nodes are leaves.
223 (13%) DET nodes have one child.
56 (3%) DET nodes have two children.
50 (3%) DET nodes have three or more children.
The highest child degree of a DET node is 7.
Children of DET nodes are attached using 24 different relations: case (111; 21% instances), nmod (73; 14% instances), advmod:emph (60; 11% instances), punct (53; 10% instances), acl:relcl (40; 8% instances), ccomp (27; 5% instances), nsubj (26; 5% instances), conj (24; 5% instances), cop (20; 4% instances), cc (12; 2% instances), csubj (12; 2% instances), dep (12; 2% instances), acl (11; 2% instances), advmod (11; 2% instances), advcl (7; 1% instances), mark (6; 1% instances), amod (5; 1% instances), obl (5; 1% instances), appos (3; 1% instances), det (2; 0% instances), flat (2; 0% instances), obl:arg (2; 0% instances), nummod (1; 0% instances), parataxis (1; 0% instances)
Children of DET nodes belong to 16 different parts of speech: ADP (111; 21% instances), VERB (105; 20% instances), PART (59; 11% instances), X (55; 10% instances), NOUN (54; 10% instances), PUNCT (53; 10% instances), AUX (20; 4% instances), PRON (15; 3% instances), ADJ (13; 2% instances), ADV (12; 2% instances), CCONJ (12; 2% instances), DET (7; 1% instances), SCONJ (6; 1% instances), NUM (2; 0% instances), INTJ (1; 0% instances), PROPN (1; 0% instances)