This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-ArT: POS Tags: PART

There are 3 PART lemmas (1%), 11 PART types (3%) and 50 PART tokens (9%). Out of 14 observed tags, the rank of PART is: 13 in number of lemmas, 6 in number of types and 6 in number of tokens.

The 10 most frequent PART lemmas: să, nu, sâ

The 10 most frequent PART types: s-, nu, să, z-, -s, n-, no, no-, nu-, sa

The 10 most frequent ambiguous lemmas: (none)

The 10 most frequent ambiguous types: s- (PART 20, PRON 9), z- (PART 2, PRON 1), -s (PART 1, PRON 1), si (PRON 2, PART 1)

Morphology

The form / lemma ratio of PART is 3.666667 (the average of all parts of speech is 1.341667).
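The reported ratio is consistent with dividing the number of distinct PART types by the number of distinct PART lemmas (11 / 3, using the counts given earlier on this page). A minimal Python sketch of that arithmetic, assuming this reading of the figure; the function name `form_lemma_ratio` is illustrative and not part of any UD tool:

```python
def form_lemma_ratio(num_types: int, num_lemmas: int) -> float:
    """Distinct word forms (types) per distinct lemma."""
    if num_lemmas == 0:
        raise ValueError("no lemmas observed")
    return num_types / num_lemmas


# Counts reported on this page for PART: 11 types, 3 lemmas.
part_ratio = form_lemma_ratio(11, 3)
print(f"{part_ratio:.6f}")  # 3.666667
```

Note that the per-lemma form counts listed below sum to 12 rather than 11, because the form s- is shared by the lemmas “să” and “sâ”.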

The highest number of forms (6) was observed with the lemma “să”: -s, s-, sa, si, să, z-.

The 2nd highest number of forms (5) was observed with the lemma “nu”: n-, no, no-, nu, nu-.

The 3rd highest number of forms (1) was observed with the lemma “sâ”: s-.

PART occurs with 3 features: Mood (30; 60% of instances), Polarity (20; 40% of instances), Variant (14; 28% of instances)

PART occurs with 4 feature-value pairs: Mood=Cnd, Mood=Sub, Polarity=Neg, Variant=Short

PART occurs with 5 feature combinations. The most frequent feature combination is Polarity=Neg (19 tokens). Examples: nu, no, no-, nu-, n-
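Feature statistics of this kind can be derived from the FEATS column of a CoNLL-U file, where feature-value pairs are joined with `|` and `_` marks an empty feature set. The following is a rough sketch of one way to count features, feature-value pairs, and full combinations; the function name `feature_stats` and the sample values are hypothetical and are not taken from the treebank or from the script that generated this page:

```python
from collections import Counter


def feature_stats(feats_column):
    """Count feature names, feature=value pairs, and full combinations."""
    names, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1          # the whole FEATS string is one combination
        if feats == "_":            # "_" means the token has no features
            continue
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            names[name] += 1
            pairs[pair] += 1
    return names, pairs, combos


# Hypothetical FEATS values for four PART tokens (illustration only).
sample = ["Polarity=Neg", "Mood=Sub", "Mood=Sub|Variant=Short", "Polarity=Neg"]
names, pairs, combos = feature_stats(sample)
print(names)                  # occurrences per feature name
print(combos.most_common(1))  # most frequent feature combination
```

Dividing each count in `names` by the number of tokens gives percentages of the kind shown above (for example, 30 / 50 = 60% for Mood).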

Relations

PART nodes are attached to their parents using 2 different relations: mark (30; 60% of instances), advmod (20; 40% of instances)

Parents of PART nodes belong to 2 different parts of speech: VERB (49; 98% of instances), ADJ (1; 2% of instances)

50 (100%) PART nodes are leaves.

The highest child degree of a PART node is 0.
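The relation, parent, and leaf counts in this section can in principle be recomputed from the HEAD and DEPREL columns of the treebank's CoNLL-U files. Below is a minimal sketch under stated assumptions: it handles a single sentence, uses only the standard library, and the function name `part_relation_stats` and the three-token sample sentence are hypothetical rather than drawn from UD_Romanian-ArT or from the page generator.

```python
from collections import Counter

# Hypothetical CoNLL-U fragment for illustration ("să nu vină").
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
SAMPLE = """\
1\tsă\tsă\tPART\t_\tMood=Sub\t3\tmark\t_\t_
2\tnu\tnu\tPART\t_\tPolarity=Neg\t3\tadvmod\t_\t_
3\tvină\tveni\tVERB\t_\t_\t0\troot\t_\t_
"""


def part_relation_stats(conllu_sentence, upos="PART"):
    """Count relations, parent POS tags, and leaves for nodes with the given UPOS."""
    rows = [line.split("\t") for line in conllu_sentence.splitlines()
            if line and not line.startswith("#")]
    # Keep basic word lines only; skip multiword ranges (1-2) and empty nodes (1.1).
    rows = [r for r in rows if r[0].isdigit()]
    upos_by_id = {r[0]: r[3] for r in rows}
    child_count = Counter(r[6] for r in rows)  # children per head ID
    relations, parent_pos, leaves = Counter(), Counter(), 0
    for r in rows:
        if r[3] != upos:
            continue
        relations[r[7]] += 1
        parent_pos[upos_by_id.get(r[6], "ROOT")] += 1
        if child_count[r[0]] == 0:  # no token names this node as its head
            leaves += 1
    return relations, parent_pos, leaves


relations, parent_pos, leaves = part_relation_stats(SAMPLE)
print(relations, parent_pos, leaves)
```

In the sample, both PART tokens attach to the verb and have no children, which mirrors the pattern reported above: PART nodes attach as mark or advmod, almost always to a VERB, and are always leaves.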