home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Albanian-TSA: POS Tags: PART

There are 8 PART lemmas (2%), 8 PART types (2%) and 36 PART tokens (4%). Out of 14 observed tags, the rank of PART is: 8 in number of lemmas, 8 in number of types and 10 in number of tokens.

The 10 most frequent PART lemmas: të, më, duke, madje, s’, nuk, se, t’

The 10 most frequent PART types: të, më, duke, madje, s’, nuk, se, t’

The 10 most frequent ambiguous lemmas: (PART 10, ADP 1), se (SCONJ 2, PART 1)

The 10 most frequent ambiguous types: (DET 47, PART 17), (PART 10, ADP 1), se (SCONJ 2, PART 1)

Morphology

The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.167464).

The 1st highest number of forms (1) was observed with the lemma “duke”: duke.

The 2nd highest number of forms (1) was observed with the lemma “madje”: madje.

The 3rd highest number of forms (1) was observed with the lemma “më”: .

PART occurs with 1 features: Polarity (3; 8% instances)

PART occurs with 1 feature-value pairs: Polarity=Neg

PART occurs with 2 feature combinations. The most frequent feature combination is _ (33 tokens). Examples: të, më, duke, madje, se, t’

Relations

PART nodes are attached to their parents using 3 different relations: mark (20; 56% instances), advmod (15; 42% instances), case (1; 3% instances)

Parents of PART nodes belong to 4 different parts of speech: VERB (24; 67% instances), ADJ (7; 19% instances), ADV (3; 8% instances), NOUN (2; 6% instances)

36 (100%) PART nodes are leaves.

The highest child degree of a PART node is 0.