Treebank Statistics: UD_Lithuanian-ALKSNIS: POS Tags: DET
There are 11 DET
lemmas (0%), 154 DET
types (1%) and 1780 DET
tokens (3%).
Out of 17 observed tags, the rank of DET
is: 15 in number of lemmas, 9 in number of types and 9 in number of tokens.
The 10 most frequent DET
lemmas: tas, kuris, šis, visas, toks, pats, koks, kiekvienas, joks, šitas
The 10 most frequent DET
types: tai, to, kurie, kurios, šio, šios, kuris, kurių, toks, visą
The 10 most frequent ambiguous lemmas: tas (DET 436, X 2, PART 1), kuris (DET 382, X 50), pats (DET 84, X 27), koks (DET 77, X 1)
The 10 most frequent ambiguous types: tai (DET 156, PART 42, X 17, CCONJ 7, ADV 1, PRON 1), to (DET 62, X 35), kurie (DET 61, X 17), kurios (DET 57, X 10), kurių (DET 38, X 10), visų (DET 31, ADV 9), kuri (DET 27, X 1), tą (DET 24, X 1), pats (DET 24, X 3), tuo (DET 19, SCONJ 12, ADV 2)
- tai
- DET 156: Prezidentūra aiškina , kad V . Adamkus siūlė ne tai .
- PART 42: Darbas namuose – tai ne karjera tradicine šio žodžio prasme .
- X 17: Visa tai sukelia tik milžinišką įtampą ir savotiškas varžybas .
- CCONJ 7: Jei ji tikrai yra stambi , tai pati seniai tą žino , todėl svarbu to neakcentuoti .
- ADV 1: Anot ULPKC specialistų , ši grėsminga liga dažniau nustatoma vyresniame amžiuje , tai yra 55 - 64 ir per 65 metų amžiaus žmonėms .
- PRON 1: Tačiau demografinių charakteristikų nustatymu tyrime neapsiribojama , o siekiama atsakyti į klausimą , „ kokia dalis išvykusiųjų neišnaudoja savo turimos kvalifikacijos ir užsienyje dirba žemesnės kvalifikacijos darbus “ ir kokią tai daro įtaką išvykusiųjų pajamoms , žmogiškajam kapitalui ir galimybėms grįžti ( p . 8 ) .
- to
- kurie
- kurios
- kurių
- visų
- kuri
- tą
- pats
- tuo
Morphology
The form / lemma ratio of DET
is 14.000000 (the average of all parts of speech is 2.065341).
The 1st highest number of forms (23) was observed with the lemma “kuris”: kuri, kuria, kuriai, kuriais, kuriam, kuriame, kurias, kurie, kuriems, kurio, kurioj, kurioje, kuriomis, kurioms, kurios, kuriose, kuris, kuriuo, kuriuos, kuriuose, kurią, kurių, kurį.
The 2nd highest number of forms (21) was observed with the lemma “šis”: Šiai, ši, šia, šiais, šiam, šiame, šias, šie, šio, šioje, šiomis, šioms, šios, šiose, šis, šiuo, šiuos, šiuose, šią, šių, šį.
The 3rd highest number of forms (20) was observed with the lemma “tas”: Tasai, ta, tai, tais, tam, tame, tas, tatai, tie, tiems, to, toje, toji, tomis, tos, tose, tuo, tuos, tą, tų.
DET
occurs with 6 features: Definite (1780; 100% instances), Gender (1780; 100% instances), PronType (1780; 100% instances), Case (1568; 88% instances), Number (1568; 88% instances), Hyph (55; 3% instances)
DET
occurs with 19 feature-value pairs: Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Definite=Def
, Definite=Ind
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Hyph=Yes
, Number=Plur
, Number=Sing
, PronType=Dem
, PronType=Emp
, PronType=Int,Rel
, PronType=Neg
, PronType=Tot
DET
occurs with 126 feature combinations.
The most frequent feature combination is Definite=Ind|Gender=Neut|PronType=Dem
(206 tokens).
Examples: tai, Šitai
Relations
DET
nodes are attached to their parents using 15 different relations: det (988; 56% instances), nsubj (318; 18% instances), obl:arg (204; 11% instances), obj (77; 4% instances), obl (63; 4% instances), amod (47; 3% instances), root (23; 1% instances), nsubj:pass (22; 1% instances), conj (21; 1% instances), dep (7; 0% instances), xcomp (4; 0% instances), ccomp (2; 0% instances), flat (2; 0% instances), advcl (1; 0% instances), csubj:pass (1; 0% instances)
Parents of DET
nodes belong to 11 different parts of speech: NOUN (1017; 57% instances), VERB (590; 33% instances), ADJ (84; 5% instances), PRON (26; 1% instances), (23; 1% instances), PROPN (14; 1% instances), ADV (10; 1% instances), DET (7; 0% instances), X (5; 0% instances), NUM (3; 0% instances), PART (1; 0% instances)
1451 (82%) DET
nodes are leaves.
222 (12%) DET
nodes have one child.
56 (3%) DET
nodes have two children.
51 (3%) DET
nodes have three or more children.
The highest child degree of a DET
node is 7.
Children of DET
nodes are attached using 24 different relations: case (111; 21% instances), nmod (67; 13% instances), advmod:emph (60; 11% instances), punct (53; 10% instances), acl:relcl (39; 7% instances), nsubj (29; 5% instances), ccomp (27; 5% instances), conj (24; 5% instances), cop (20; 4% instances), csubj (20; 4% instances), advmod (12; 2% instances), cc (12; 2% instances), acl (11; 2% instances), advcl (7; 1% instances), amod (7; 1% instances), mark (6; 1% instances), obl:arg (6; 1% instances), obl (5; 1% instances), appos (3; 1% instances), det (3; 1% instances), dep (2; 0% instances), flat (2; 0% instances), nummod (1; 0% instances), parataxis (1; 0% instances)
Children of DET
nodes belong to 16 different parts of speech: ADP (111; 21% instances), VERB (105; 20% instances), PART (59; 11% instances), X (55; 10% instances), NOUN (54; 10% instances), PUNCT (53; 10% instances), AUX (20; 4% instances), PRON (16; 3% instances), ADJ (13; 2% instances), ADV (13; 2% instances), CCONJ (12; 2% instances), DET (7; 1% instances), SCONJ (6; 1% instances), NUM (2; 0% instances), INTJ (1; 0% instances), PROPN (1; 0% instances)