home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latvian-LVTB: POS Tags: X

There are 718 X lemmas (3%), 721 X types (1%) and 1004 X tokens (0%). Out of 17 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: the, Pillar, Imprimatur, of, Capital, fund, al, twitter, Initiatives, Spatial

The 10 most frequent X types: the, Pillar, Imprimatur, of, Capital, fund, Twitter, al, Initiatives, Management

The 10 most frequent ambiguous lemmas: _ (PUNCT 122, X 6), to (X 4, PART 1), Instagram (X 4, PROPN 3, SYM 1), Gazprom (X 3, SYM 1), Real (X 3, SYM 1), a (SYM 17, CCONJ 6, X 2, INTJ 1), art (X 3, PART 1, VERB 1), Dance (X 2, PROPN 1), Katy (X 2, PROPN 1, SYM 1), da (X 2, INTJ 1, PART 1)

The 10 most frequent ambiguous types: to (PRON 827, DET 209, X 4, PART 1), Instagram (X 4, PROPN 3, SYM 1), Gazprom (X 3, SYM 1), Real (X 3, SYM 1), a (SYM 16, X 2, CCONJ 1, INTJ 1), la (X 2, SCONJ 1), Dance (X 2, PROPN 1), EX (NUM 2, X 2), Katy (X 2, PROPN 1, SYM 1), der (VERB 5, X 1)

Morphology

The form / lemma ratio of X is 1.004178 (the average of all parts of speech is 2.233228).

The 1st highest number of forms (5) was observed with the lemma “_”: būt, ko, pat, traumatiskas, vienu.

The 2nd highest number of forms (1) was observed with the lemma “&”: &.

The 3rd highest number of forms (1) was observed with the lemma “A”: A.

X occurs with 2 features: Foreign (998; 99% instances), Typo (9; 1% instances)

X occurs with 2 feature-value pairs: Foreign=Yes, Typo=Yes

X occurs with 3 feature combinations. The most frequent feature combination is Foreign=Yes (995 tokens). Examples: the, Pillar, Imprimatur, of, Capital, fund, Twitter, al, Initiatives, Management

Relations

X nodes are attached to their parents using 18 different relations: flat:name (377; 38% instances), nmod (181; 18% instances), flat:foreign (108; 11% instances), nsubj (90; 9% instances), conj (54; 5% instances), obl (49; 5% instances), parataxis (49; 5% instances), iobj (34; 3% instances), obj (13; 1% instances), dep (12; 1% instances), root (10; 1% instances), nsubj:pass (7; 1% instances), goeswith (6; 1% instances), flat (5; 0% instances), appos (3; 0% instances), xcomp (3; 0% instances), acl (2; 0% instances), amod (1; 0% instances)

Parents of X nodes belong to 13 different parts of speech: X (444; 44% instances), NOUN (236; 24% instances), VERB (195; 19% instances), PROPN (76; 8% instances), SYM (20; 2% instances), (10; 1% instances), ADJ (6; 1% instances), ADV (6; 1% instances), NUM (6; 1% instances), SCONJ (2; 0% instances), DET (1; 0% instances), PART (1; 0% instances), PRON (1; 0% instances)

528 (53%) X nodes are leaves.

48 (5%) X nodes have one child.

116 (12%) X nodes have two children.

312 (31%) X nodes have three or more children.

The highest child degree of a X node is 18.

Children of X nodes are attached using 19 different relations: punct (741; 46% instances), flat:name (347; 21% instances), nmod (203; 13% instances), flat:foreign (109; 7% instances), conj (52; 3% instances), case (34; 2% instances), cc (32; 2% instances), acl (24; 1% instances), dep (21; 1% instances), parataxis (15; 1% instances), discourse (11; 1% instances), amod (9; 1% instances), orphan (5; 0% instances), flat (4; 0% instances), nummod (3; 0% instances), appos (2; 0% instances), det (2; 0% instances), cop (1; 0% instances), nsubj (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: PUNCT (741; 46% instances), X (444; 27% instances), NOUN (219; 14% instances), PROPN (49; 3% instances), NUM (38; 2% instances), ADP (29; 2% instances), CCONJ (28; 2% instances), VERB (26; 2% instances), SYM (13; 1% instances), PART (9; 1% instances), ADJ (8; 0% instances), SCONJ (6; 0% instances), ADV (2; 0% instances), PRON (2; 0% instances), AUX (1; 0% instances), DET (1; 0% instances)