
This page pertains to UD version 2.

Treebank Statistics: UD_Lithuanian-ALKSNIS: POS Tags: DET

There are 11 DET lemmas (0%), 154 DET types (1%) and 1780 DET tokens (3%). Out of 17 observed tags, the rank of DET is: 15 in number of lemmas, 9 in number of types and 9 in number of tokens.

The 10 most frequent DET lemmas: tas, kuris, šis, visas, toks, pats, koks, kiekvienas, joks, šitas

The 10 most frequent DET types: tai, to, kurie, kurios, šio, šios, kuris, kurių, toks, visą

The 10 most frequent ambiguous lemmas: tas (DET 436, X 2, PART 1), kuris (DET 382, X 50), pats (DET 84, X 27), koks (DET 77, X 1)

The 10 most frequent ambiguous types: tai (DET 156, PART 42, X 17, CCONJ 7, ADV 1, PRON 1), to (DET 62, X 35), kurie (DET 61, X 17), kurios (DET 57, X 10), kurių (DET 38, X 10), visų (DET 31, ADV 9), kuri (DET 27, X 1), (DET 24, X 1), pats (DET 24, X 3), tuo (DET 19, SCONJ 12, ADV 2)

Morphology

The form / lemma ratio of DET is 14.000000 (the average of all parts of speech is 2.065341).
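The reported ratio is consistent with the counts given earlier on this page: 154 DET types divided by 11 DET lemmas is exactly 14. As a minimal sketch of how such a ratio could be recomputed from a CoNLL-U file, the following assumes that a "form" is a case-sensitive distinct word form and that multiword-token ranges and empty nodes are skipped; the function name, the helper, and the toy data are illustrative and are not taken from the script that generated this page.

```python
def form_lemma_ratio(conllu_text, upos):
    """Return (distinct forms, distinct lemmas, forms / lemmas) for one UPOS tag."""
    forms, lemmas = set(), set()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular word lines: 10 columns and a plain integer ID.
        # This skips comments, blank lines, ranges like "1-2" and empty nodes like "1.1".
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            forms.add(cols[1])   # FORM column (case-sensitive, an assumption)
            lemmas.add(cols[2])  # LEMMA column
    ratio = len(forms) / len(lemmas) if lemmas else 0.0
    return len(forms), len(lemmas), ratio


def toy_row(idx, form, lemma, upos, head, deprel):
    """Build one hypothetical CoNLL-U word line with unused columns set to '_'."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])


# Toy data for illustration only; not drawn from the treebank.
TOY = "\n".join([
    "# text = toy example",
    toy_row(1, "tai", "tas", "DET", 4, "det"),
    toy_row(2, "to", "tas", "DET", 4, "det"),
    toy_row(3, "kurie", "kuris", "DET", 4, "det"),
    toy_row(4, "namas", "namas", "NOUN", 0, "root"),
])

print(form_lemma_ratio(TOY, "DET"))
```

On the full treebank, the same function would be called on the contents of the treebank's CoNLL-U files; whether it reproduces 14.0 exactly depends on the assumptions above matching those of the original statistics script.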

The 1st highest number of forms (23) was observed with the lemma “kuris”: kuri, kuria, kuriai, kuriais, kuriam, kuriame, kurias, kurie, kuriems, kurio, kurioj, kurioje, kuriomis, kurioms, kurios, kuriose, kuris, kuriuo, kuriuos, kuriuose, kurią, kurių, kurį.

The 2nd highest number of forms (21) was observed with the lemma “šis”: Šiai, ši, šia, šiais, šiam, šiame, šias, šie, šio, šioje, šiomis, šioms, šios, šiose, šis, šiuo, šiuos, šiuose, šią, šių, šį.

The 3rd highest number of forms (20) was observed with the lemma “tas”: Tasai, ta, tai, tais, tam, tame, tas, tatai, tie, tiems, to, toje, toji, tomis, tos, tose, tuo, tuos, tą, tų.

DET occurs with 6 features: Definite (1780; 100% instances), Gender (1780; 100% instances), PronType (1780; 100% instances), Case (1568; 88% instances), Number (1568; 88% instances), Hyph (55; 3% instances)

DET occurs with 19 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Gender=Neut, Hyph=Yes, Number=Plur, Number=Sing, PronType=Dem, PronType=Emp, PronType=Int,Rel, PronType=Neg, PronType=Tot

DET occurs with 126 feature combinations. The most frequent feature combination is Definite=Ind|Gender=Neut|PronType=Dem (206 tokens). Examples: tai, Šitai

Relations

DET nodes are attached to their parents using 15 different relations: det (988; 56% instances), nsubj (318; 18% instances), obl:arg (204; 11% instances), obj (77; 4% instances), obl (63; 4% instances), amod (47; 3% instances), root (23; 1% instances), nsubj:pass (22; 1% instances), conj (21; 1% instances), dep (7; 0% instances), xcomp (4; 0% instances), ccomp (2; 0% instances), flat (2; 0% instances), advcl (1; 0% instances), csubj:pass (1; 0% instances)
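A distribution like the one above can be sketched as a simple count over the DEPREL column, restricted to tokens with the tag of interest. The snippet below is an illustration under stated assumptions: percentages are rounded half up to whole numbers (a guess at how this page rounds), and the function name, helper, and toy data are hypothetical.

```python
from collections import Counter


def relation_distribution(conllu_text, upos):
    """Return [(deprel, count, rounded percent)] for tokens tagged `upos`, most frequent first."""
    counts = Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Regular word lines only: 10 columns and a plain integer ID.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            counts[cols[7]] += 1  # DEPREL column
    total = sum(counts.values())
    # int(x + 0.5) rounds half up; this rounding rule is an assumption.
    return [(rel, n, int(100 * n / total + 0.5)) for rel, n in counts.most_common()]


def toy_word(idx, form, lemma, upos, head, deprel):
    """Build one hypothetical CoNLL-U word line with unused columns set to '_'."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])


# Toy data for illustration only; not drawn from the treebank.
TOY_RELS = "\n".join([
    toy_word(1, "tas", "tas", "DET", 2, "det"),
    toy_word(2, "namas", "namas", "NOUN", 0, "root"),
    toy_word(3, "šis", "šis", "DET", 4, "det"),
    toy_word(4, "miestas", "miestas", "NOUN", 2, "conj"),
    toy_word(5, "tai", "tas", "DET", 2, "nsubj"),
])

for rel, n, pct in relation_distribution(TOY_RELS, "DET"):
    print(f"{rel} ({n}; {pct}% instances)")
```

With the figures reported above, the same arithmetic gives 988 / 1780 = 55.5%, which rounds to the 56% shown for det.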

Parents of DET nodes belong to 11 different parts of speech: NOUN (1017; 57% instances), VERB (590; 33% instances), ADJ (84; 5% instances), PRON (26; 1% instances), (23; 1% instances), PROPN (14; 1% instances), ADV (10; 1% instances), DET (7; 0% instances), X (5; 0% instances), NUM (3; 0% instances), PART (1; 0% instances)

1451 (82%) DET nodes are leaves.

222 (12%) DET nodes have one child.

56 (3%) DET nodes have two children.

51 (3%) DET nodes have three or more children.

The highest child degree of a DET node is 7.
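The leaf and child-degree figures above can be sketched as a per-sentence count of how many tokens name each node as their head. The snippet below is a minimal illustration that assumes sentences are separated by blank lines and that only regular word lines (integer IDs) are counted; the function name, helper, and toy data are hypothetical.

```python
from collections import Counter


def child_degrees(conllu_text, upos):
    """Map number of children -> number of `upos` nodes with that many children."""
    degrees = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()]
        # Regular word lines only: 10 columns and a plain integer ID.
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        # HEAD column: count how often each node ID is used as a head in this sentence.
        children_of = Counter(r[6] for r in rows)
        for r in rows:
            if r[3] == upos:
                degrees[children_of[r[0]]] += 1
    return degrees


def toy_token(idx, form, lemma, upos, head, deprel):
    """Build one hypothetical CoNLL-U word line with unused columns set to '_'."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])


# Toy data for illustration only; not drawn from the treebank.
SENT_1 = "\n".join([
    toy_token(1, "tas", "tas", "DET", 2, "det"),       # leaf DET
    toy_token(2, "namas", "namas", "NOUN", 0, "root"),
])
SENT_2 = "\n".join([
    toy_token(1, "apie", "apie", "ADP", 2, "case"),
    toy_token(2, "tai", "tas", "DET", 3, "obl"),       # DET with one child (the ADP)
    toy_token(3, "kalba", "kalbėti", "VERB", 0, "root"),
])
TOY_TREES = SENT_1 + "\n\n" + SENT_2

degrees = child_degrees(TOY_TREES, "DET")
print("leaves:", degrees[0], "one child:", degrees[1], "max degree:", max(degrees))
```

Grouping by sentence matters here because token IDs restart at 1 in every sentence, so head counts must not be pooled across sentence boundaries.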

Children of DET nodes are attached using 24 different relations: case (111; 21% instances), nmod (67; 13% instances), advmod:emph (60; 11% instances), punct (53; 10% instances), acl:relcl (39; 7% instances), nsubj (29; 5% instances), ccomp (27; 5% instances), conj (24; 5% instances), cop (20; 4% instances), csubj (20; 4% instances), advmod (12; 2% instances), cc (12; 2% instances), acl (11; 2% instances), advcl (7; 1% instances), amod (7; 1% instances), mark (6; 1% instances), obl:arg (6; 1% instances), obl (5; 1% instances), appos (3; 1% instances), det (3; 1% instances), dep (2; 0% instances), flat (2; 0% instances), nummod (1; 0% instances), parataxis (1; 0% instances)

Children of DET nodes belong to 16 different parts of speech: ADP (111; 21% instances), VERB (105; 20% instances), PART (59; 11% instances), X (55; 10% instances), NOUN (54; 10% instances), PUNCT (53; 10% instances), AUX (20; 4% instances), PRON (16; 3% instances), ADJ (13; 2% instances), ADV (13; 2% instances), CCONJ (12; 2% instances), DET (7; 1% instances), SCONJ (6; 1% instances), NUM (2; 0% instances), INTJ (1; 0% instances), PROPN (1; 0% instances)