
This page pertains to UD version 2.

Treebank Statistics: UD_Lithuanian-ALKSNIS: POS Tags: X

There are 307 X lemmas (3%), 341 X types (2%) and 1571 X tokens (2%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 12 in number of tokens.
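The lemma, type and token counts above can be reproduced from a CoNLL-U file with a few lines of Python. The following is a minimal sketch, not the script that generated this page: it assumes that a "type" is a distinct word form compared case-sensitively, the function name `upos_counts` is hypothetical, and the sample rows are made up for illustration.

```python
from collections import defaultdict

def upos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for CoNLL-U text."""
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue  # skip multiword-token ranges and empty nodes
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

# Made-up toy rows (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).
SAMPLE = "\n".join("\t".join(row) for row in [
    ("1", "5", "5", "NUM", "_", "_", "2", "nummod", "_", "_"),
    ("2", "proc", "proc.", "X", "_", "Abbr=Yes", "0", "root", "_", "_"),
    ("3", "ES", "ES", "X", "_", "Abbr=Yes", "2", "nmod", "_", "_"),
    ("4", "ES", "ES", "X", "_", "Abbr=Yes", "3", "conj", "_", "_"),
])

counts = upos_counts(SAMPLE)
print(counts["X"])  # (lemmas, types, tokens) for the tag X
```

Ranking the tags by any of the three numbers then gives the rank figures quoted above.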

The 10 most frequent X lemmas: pat, ES, d., proc., nr., kuris, tikras, nors, p., to

The 10 most frequent X types: pat, ES, d, proc, Nr, nors, a, p, to, tūkst

The 10 most frequent ambiguous lemmas: pat (X 105, ADV 1), kuris (DET 382, X 50), tikras (X 38, ADJ 20), nors (SCONJ 45, X 37, PART 1), pats (DET 84, X 27), kas (PRON 159, X 20, PART 5), tai (PART 44, X 16, CCONJ 7, ADV 1, PRON 1), tiek (ADV 56, X 16), pirma (ADV 11, X 11), t. (ADV 16, X 8)

The 10 most frequent ambiguous types: pat (X 105, ADV 1), nors (X 37, SCONJ 26, PART 1), to (DET 62, X 35), V (X 19, NUM 3), kas (PRON 62, X 18, PART 5), kurie (DET 61, X 17), tai (DET 156, PART 42, X 17, CCONJ 7, ADV 1, PRON 1), tiek (ADV 53, X 16), esmės (X 15, NOUN 4), pirma (X 11, NUM 1)
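A lemma or type is ambiguous when it occurs with more than one tag. The two lists above appear to be ordered by the number of X occurrences, with the tags of each item ordered by frequency. The sketch below follows that reading; the function name `ambiguous_items` and the toy counts are hypothetical.

```python
from collections import Counter, defaultdict

def ambiguous_items(pairs, upos="X"):
    """pairs: iterable of (lemma_or_form, upos_tag).

    Returns the items seen with `upos` and at least one other tag,
    sorted by how often they carry `upos` (most frequent first).
    """
    tags = defaultdict(Counter)
    for item, tag in pairs:
        tags[item][tag] += 1
    ambiguous = [(item, c) for item, c in tags.items() if upos in c and len(c) > 1]
    return sorted(ambiguous, key=lambda kv: -kv[1][upos])

# Toy observations, not the real treebank counts.
toy = ([("pat", "X")] * 3 + [("pat", "ADV")]
       + [("nors", "SCONJ")] * 2 + [("nors", "X")]
       + [("ES", "X")])

result = ambiguous_items(toy)
for item, tag_counts in result:
    print(item, tag_counts.most_common())
```

Here "ES" is left out because it only ever appears as X.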

Morphology

The form / lemma ratio of X is 1.110749 (the average of all parts of speech is 2.065341).
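The reported ratio is consistent with dividing the number of X types by the number of X lemmas given at the top of this page. A quick arithmetic check, assuming that this is how the ratio is defined:

```python
x_types = 341   # X types reported on this page
x_lemmas = 307  # X lemmas reported on this page

ratio = x_types / x_lemmas
print(f"{ratio:.6f}")  # prints 1.110749
```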

The highest number of forms (13) was observed with the lemma “tikras”: tikra, tikrai, tikrais, tikras, tikri, tikro, tikroje, tikromis, tikros, tikru, tikrus, tikrą, tikrų.

The 2nd highest number of forms (12) was observed with the lemma “pats”: pati, patiems, paties, pats, patys, pačia, pačioje, pačiomis, pačios, pačiu, pačius, pačią.

The 3rd highest number of forms (11) was observed with the lemma “kuris”: kuri, kurias, kurie, kurioms, kurios, kuriose, kuriuo, kuriuos, kuriuose, kurią, kurių.
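Rankings like the three above come from grouping the distinct forms of each lemma and sorting by group size. A minimal sketch, assuming forms are compared as written; `forms_per_lemma` is a hypothetical name and the input is a small hand-picked subset of the forms listed above.

```python
from collections import defaultdict

def forms_per_lemma(rows, upos="X"):
    """rows: iterable of (form, lemma, upos_tag).

    Returns [(lemma, sorted_forms)] ordered by number of distinct forms,
    largest first; ties are broken alphabetically by lemma.
    """
    forms = defaultdict(set)
    for form, lemma, tag in rows:
        if tag == upos:
            forms[lemma].add(form)
    ranked = sorted(forms.items(), key=lambda kv: (-len(kv[1]), kv[0]))
    return [(lemma, sorted(fs)) for lemma, fs in ranked]

toy_rows = [
    ("tikra", "tikras", "X"), ("tikrai", "tikras", "X"), ("tikras", "tikras", "X"),
    ("pati", "pats", "X"), ("pats", "pats", "X"), ("pats", "pats", "X"),
    ("ES", "ES", "X"),
]

ranking = forms_per_lemma(toy_rows)
for lemma, fs in ranking:
    print(lemma, len(fs), fs)
```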

X occurs with 8 features: Abbr (866; 55% instances), Hyph (579; 37% instances), Foreign (130; 8% instances), Case (2; 0% instances), Gender (2; 0% instances), Number (2; 0% instances), Definite (1; 0% instances), Degree (1; 0% instances)

X occurs with 10 feature-value pairs: Abbr=Yes, Case=Gen, Definite=Def, Degree=Pos, Foreign=Yes, Gender=Fem, Gender=Masc, Hyph=Yes, Number=Plur, Number=Sing

X occurs with 7 feature combinations. The most frequent feature combination is Abbr=Yes (864 tokens). Examples: ES, d, proc, Nr, p, a, tūkst, R, mln, pan
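The three feature statistics above (features, feature-value pairs, feature combinations) can all be derived from the FEATS column, where pairs are joined with `|` and `_` marks a token without features. The sketch below is one possible way to count them; whether the page counts the empty feature set as a combination is not stated, so skipping it here is an assumption, and the input values are made up.

```python
from collections import Counter

def feature_stats(feats_values):
    """feats_values: FEATS strings such as 'Case=Gen|Number=Sing' or '_'."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_values:
        if feats == "_":
            continue  # assumption: tokens without features are not counted
        combos[feats] += 1
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            features[name] += 1
            pairs[pair] += 1
    return features, pairs, combos

toy_feats = [
    "Abbr=Yes", "Abbr=Yes", "Hyph=Yes", "Foreign=Yes",
    "Case=Gen|Gender=Masc|Number=Sing", "_",
]

features, pairs, combos = feature_stats(toy_feats)
print(len(features), len(pairs), len(combos))
print(combos.most_common(1))
```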

Relations

X nodes are attached to their parents using 13 different relations: nmod (1122; 71% instances), conj (130; 8% instances), obl (102; 6% instances), parataxis (43; 3% instances), flat:foreign (31; 2% instances), obl:arg (28; 2% instances), flat (26; 2% instances), nsubj (26; 2% instances), obj (24; 2% instances), root (21; 1% instances), appos (15; 1% instances), nsubj:pass (2; 0% instances), dep (1; 0% instances)

Parents of X nodes belong to 12 different parts of speech: NOUN (439; 28% instances), ADV (245; 16% instances), VERB (182; 12% instances), X (157; 10% instances), PROPN (146; 9% instances), PRON (133; 8% instances), PART (113; 7% instances), NUM (67; 4% instances), DET (55; 4% instances), no part of speech, i.e. the artificial root (21; 1% instances), ADJ (11; 1% instances), INTJ (2; 0% instances)
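Both distributions above come from following each X token's HEAD column to its parent. A minimal sketch under stated assumptions: sentences are given as lists of `(id, upos, head, deprel)` tuples, the function name `parent_stats` is hypothetical, and tokens with head 0 (which have no parent word) are counted under the made-up label `ROOT`.

```python
from collections import Counter

def parent_stats(sentences, upos="X"):
    """Count incoming relations and parent tags for tokens tagged `upos`."""
    relations, parent_pos = Counter(), Counter()
    for sent in sentences:
        pos_by_id = {tok_id: tag for tok_id, tag, _head, _rel in sent}
        for _tok_id, tag, head, rel in sent:
            if tag != upos:
                continue
            relations[rel] += 1
            # head 0 is not a real token, so it has no part of speech
            parent_pos[pos_by_id.get(head, "ROOT")] += 1
    return relations, parent_pos

# Two made-up sentences.
toy_sentences = [
    [(1, "NUM", 2, "nummod"), (2, "X", 3, "nmod"), (3, "NOUN", 0, "root")],
    [(1, "X", 0, "root"), (2, "X", 1, "flat:foreign"), (3, "PUNCT", 1, "punct")],
]

relations, parent_pos = parent_stats(toy_sentences)
print(relations.most_common())
print(parent_pos.most_common())
```

In the figures above, the 21 X nodes attached with the root relation match the 21 parents listed without a part of speech.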

806 (51%) X nodes are leaves.

292 (19%) X nodes have one child.

190 (12%) X nodes have two children.

283 (18%) X nodes have three or more children.

The highest child degree of an X node is 9.
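The four child-count figures above partition all X nodes (806 + 292 + 190 + 283 = 1571). They can be computed by counting, for each X token, how many tokens in the same sentence name it as their head. The sketch below assumes sentences are lists of `(id, upos, head)` tuples; `child_degrees` is a hypothetical name and the sentence is made up.

```python
from collections import Counter

def child_degrees(sentences, upos="X"):
    """Return (buckets, highest); bucket 3 means three or more children."""
    buckets, highest = Counter(), 0
    for sent in sentences:
        n_children = Counter(head for _id, _tag, head in sent)
        for tok_id, tag, _head in sent:
            if tag == upos:
                degree = n_children[tok_id]  # 0 for leaves
                buckets[min(degree, 3)] += 1
                highest = max(highest, degree)
    return buckets, highest

# Made-up sentence: token 2 has four children, token 4 is a leaf.
toy_sentence = [
    (1, "NUM", 2), (2, "X", 0), (3, "PUNCT", 2), (4, "X", 2), (5, "PUNCT", 2),
]

buckets, highest = child_degrees([toy_sentence])
print(dict(buckets), highest)
```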

Children of X nodes are attached using 23 different relations: punct (920; 53% instances), nummod (244; 14% instances), nmod (192; 11% instances), conj (117; 7% instances), cc (70; 4% instances), case (56; 3% instances), flat:foreign (31; 2% instances), obl (16; 1% instances), acl (14; 1% instances), advmod (12; 1% instances), amod (7; 0% instances), appos (6; 0% instances), parataxis (6; 0% instances), det (5; 0% instances), flat (5; 0% instances), mark (4; 0% instances), obl:arg (4; 0% instances), advcl (3; 0% instances), advmod:emph (3; 0% instances), acl:relcl (2; 0% instances), nsubj (2; 0% instances), dep (1; 0% instances), obj (1; 0% instances)

Children of X nodes belong to 15 different parts of speech: PUNCT (920; 53% instances), NUM (279; 16% instances), X (157; 9% instances), NOUN (154; 9% instances), CCONJ (68; 4% instances), ADP (56; 3% instances), VERB (30; 2% instances), ADV (12; 1% instances), PROPN (12; 1% instances), ADJ (11; 1% instances), SCONJ (6; 0% instances), DET (5; 0% instances), SYM (5; 0% instances), PART (3; 0% instances), PRON (3; 0% instances)