home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latvian-LVTB: POS Tags: PART

There are 96 PART lemmas (0%), 97 PART types (0%) and 5457 PART tokens (2%). Out of 17 observed tags, the rank of PART is: 9 in number of lemmas, 11 in number of types and 13 in number of tokens.

The 10 most frequent PART lemmas: arī, tikai, pat, vai, ne, kaut, vien, gan, tieši, jau

The 10 most frequent PART types: arī, tikai, pat, vai, ne, kaut, vien, gan, tieši, jau

The 10 most frequent ambiguous lemmas: arī (PART 1494, CCONJ 266, SCONJ 30), tikai (PART 591, CCONJ 2), vai (SCONJ 482, CCONJ 448, PART 258, INTJ 2), ne (PART 251, CCONJ 218, SCONJ 1), kaut (PART 232, SCONJ 31), vien (PART 192, CCONJ 11, ADV 2), gan (CCONJ 430, PART 178, SCONJ 57), tieši (PART 169, ADV 17), jau (ADV 631, PART 165, CCONJ 3), nu (PART 143, ADV 96, INTJ 5, CCONJ 1)

The 10 most frequent ambiguous types: arī (PART 1360, CCONJ 265, SCONJ 30), tikai (PART 547, CCONJ 2), pat (PART 283, X 2, ADP 1), vai (SCONJ 478, CCONJ 422, PART 101, INTJ 1), ne (PART 231, CCONJ 212, DET 1, SCONJ 1), kaut (PART 218, SCONJ 21), vien (PART 188, CCONJ 11, ADV 2), gan (CCONJ 414, PART 175, SCONJ 57), tieši (PART 144, ADV 10, ADJ 1), jau (ADV 564, PART 165, CCONJ 3)

Morphology

The form / lemma ratio of PART is 1.010417 (the average of all parts of speech is 2.244795).

The 1st highest number of forms (4) was observed with the lemma “arī”: ar, ar’, ari, arī.

The 2nd highest number of forms (2) was observed with the lemma “ar”: ar, ar’.

The 3rd highest number of forms (2) was observed with the lemma “nu”: no, nu.

PART occurs with 2 features: Polarity (380; 7% instances), Typo (9; 0% instances)

PART occurs with 3 feature-value pairs: Polarity=Neg, Polarity=Pos, Typo=Yes

PART occurs with 5 feature combinations. The most frequent feature combination is _ (5069 tokens). Examples: arī, tikai, pat, vai, kaut, vien, gan, tieši, jau, nu

Relations

PART nodes are attached to their parents using 15 different relations: discourse (4908; 90% instances), fixed (401; 7% instances), root (41; 1% instances), mark (29; 1% instances), dep (25; 0% instances), parataxis (18; 0% instances), conj (16; 0% instances), flat (8; 0% instances), cc (4; 0% instances), det (2; 0% instances), advmod (1; 0% instances), case (1; 0% instances), iobj (1; 0% instances), obj (1; 0% instances), obl (1; 0% instances)

Parents of PART nodes belong to 16 different parts of speech: VERB (1861; 34% instances), NOUN (1573; 29% instances), ADV (581; 11% instances), PRON (359; 7% instances), ADJ (218; 4% instances), PART (189; 3% instances), CCONJ (179; 3% instances), PROPN (151; 3% instances), DET (129; 2% instances), NUM (82; 2% instances), SCONJ (68; 1% instances), (41; 1% instances), SYM (13; 0% instances), X (9; 0% instances), AUX (2; 0% instances), INTJ (2; 0% instances)

4888 (90%) PART nodes are leaves.

420 (8%) PART nodes have one child.

117 (2%) PART nodes have two children.

32 (1%) PART nodes have three or more children.

The highest child degree of a PART node is 7.

Children of PART nodes are attached using 17 different relations: punct (480; 63% instances), fixed (196; 26% instances), discourse (26; 3% instances), cc (22; 3% instances), conj (8; 1% instances), flat (7; 1% instances), dep (5; 1% instances), advmod (4; 1% instances), obl (3; 0% instances), advcl (2; 0% instances), iobj (2; 0% instances), parataxis (2; 0% instances), cop (1; 0% instances), goeswith (1; 0% instances), mark (1; 0% instances), nsubj (1; 0% instances), obj (1; 0% instances)

Children of PART nodes belong to 13 different parts of speech: PUNCT (480; 63% instances), PART (189; 25% instances), SCONJ (38; 5% instances), CCONJ (20; 3% instances), ADV (9; 1% instances), PRON (7; 1% instances), VERB (7; 1% instances), INTJ (4; 1% instances), NOUN (4; 1% instances), ADJ (1; 0% instances), AUX (1; 0% instances), SYM (1; 0% instances), X (1; 0% instances)