home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-RRT: POS Tags: PART

There are 4 PART lemmas (0%), 8 PART types (0%) and 4885 PART tokens (2%). Out of 16 observed tags, the rank of PART is: 15 in number of lemmas, 16 in number of types and 13 in number of tokens.

The 10 most frequent PART lemmas: să, nu, a, o

The 10 most frequent PART types: să, nu, a, n-, s-, a-, o, -a

The 10 most frequent ambiguous lemmas: (PART 2409, SCONJ 1), a (PART 840, NOUN 65, ADP 30, DET 2), o (PART 10, INTJ 1)

The 10 most frequent ambiguous types: (PART 2336, PRON 1), a (AUX 2130, DET 1596, PART 811, NOUN 65, ADP 30), s- (PRON 590, PART 41), o (DET 1814, PRON 187, NUM 27, PART 9, AUX 7, ADV 1), -a (AUX 26, DET 23, PART 1, X 1)

Morphology

The form / lemma ratio of PART is 2.000000 (the average of all parts of speech is 1.814756).

The 1st highest number of forms (3) was observed with the lemma “a”: -a, a, a-.

The 2nd highest number of forms (2) was observed with the lemma “nu”: n-, nu.

The 3rd highest number of forms (2) was observed with the lemma “să”: s-, să.

PART occurs with 5 features: Mood (2409; 49% instances), Polarity (1626; 33% instances), PartType (840; 17% instances), Variant (140; 3% instances), Tense (10; 0% instances)

PART occurs with 5 feature-value pairs: Mood=Sub, PartType=Inf, Polarity=Neg, Tense=Fut, Variant=Short

PART occurs with 7 feature combinations. The most frequent feature combination is Mood=Sub (2368 tokens). Examples: să, s-

Relations

PART nodes are attached to their parents using 13 different relations: mark (3247; 66% instances), advmod (1591; 33% instances), fixed (17; 0% instances), conj (12; 0% instances), root (7; 0% instances), obj (3; 0% instances), amod (2; 0% instances), advcl (1; 0% instances), case (1; 0% instances), cc:preconj (1; 0% instances), expl:pv (1; 0% instances), obl (1; 0% instances), orphan (1; 0% instances)

Parents of PART nodes belong to 13 different parts of speech: VERB (4516; 92% instances), ADJ (132; 3% instances), NOUN (106; 2% instances), ADV (70; 1% instances), PRON (22; 0% instances), ADP (12; 0% instances), AUX (10; 0% instances), (7; 0% instances), SCONJ (4; 0% instances), DET (2; 0% instances), NUM (2; 0% instances), PART (1; 0% instances), PROPN (1; 0% instances)

4854 (99%) PART nodes are leaves.

17 (0%) PART nodes have one child.

6 (0%) PART nodes have two children.

8 (0%) PART nodes have three or more children.

The highest child degree of a PART node is 5.

Children of PART nodes are attached using 12 different relations: punct (17; 30% instances), cc (11; 19% instances), fixed (11; 19% instances), conj (6; 11% instances), ccomp (3; 5% instances), parataxis (3; 5% instances), advmod (1; 2% instances), amod (1; 2% instances), case (1; 2% instances), det (1; 2% instances), obj (1; 2% instances), orphan (1; 2% instances)

Children of PART nodes belong to 11 different parts of speech: PUNCT (17; 30% instances), VERB (15; 26% instances), CCONJ (11; 19% instances), DET (4; 7% instances), ADV (3; 5% instances), NOUN (2; 4% instances), ADJ (1; 2% instances), ADP (1; 2% instances), PART (1; 2% instances), PRON (1; 2% instances), SCONJ (1; 2% instances)