
This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-RRT: POS Tags: X

There are 76 X lemmas (0%), 112 X types (0%) and 152 X tokens (0%). Out of 16 observed tags, the rank of X is: 7 in number of lemmas, 9 in number of types and 15 in number of tokens.
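As a rough illustration of how counts like these could be derived, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one UPOS tag in CoNLL-U text. The function name `upos_stats` and the sample rows are hypothetical and are not taken from UD_Romanian-RRT. The script that generated this page may normalize forms differently (for example by case), so treat this as an approximation rather than the official procedure.

```python
def upos_stats(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:  # column 4 is UPOS
            tokens += 1
            types.add(cols[1])   # column 2 is FORM
            lemmas.add(cols[2])  # column 3 is LEMMA
    return len(lemmas), len(types), tokens


# Hypothetical rows: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC
rows = [
    ("1", "10", "10", "NUM", "_", "_", "0", "root", "_", "_"),
    ("2", "000", "_", "X", "_", "_", "1", "goeswith", "_", "_"),
    ("3", "in", "in", "X", "_", "Foreign=Yes", "1", "nmod", "_", "_"),
    ("4", "vitro", "vitro", "X", "_", "Foreign=Yes", "3", "flat", "_", "_"),
    ("5", "000", "_", "X", "_", "_", "1", "goeswith", "_", "_"),
]
sample = "\n".join("\t".join(r) for r in rows)

print(upos_stats(sample, "X"))  # (3, 3, 4)
```

In the sample, the form "000" occurs twice with the lemma "_", so it contributes two tokens but only one type and one lemma.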

The 10 most frequent X lemmas: _, 5a, American, alia, in, inter, -a, ACTIVE, Awards, A­

The 10 most frequent X types: 000, 500, 100, 0, 2, 5a, American, K., alia, dată

The 10 most frequent ambiguous lemmas: _ (X 72, NUM 2, PUNCT 1), 5a (ADV 3, X 2, NUM 1, PROPN 1), in (ADP 23, NOUN 1, X 1), -a (DET 19, X 1), Awards (PROPN 1, X 1), Book (PROPN 1, X 1), Klebsiella (PROPN 1, X 1), New (PROPN 7, X 1), al (DET 2851, X 1), car (NOUN 3, X 1)

The 10 most frequent ambiguous types: 000 (X 26, NUM 1), 500 (NUM 8, X 4), 100 (NUM 21, X 3), 0 (NUM 21, X 2), 2 (NUM 279, X 2), 5a (ADV 3, X 2, NUM 1, PROPN 1), dată (NOUN 76, VERB 6, X 2, ADJ 1), in (ADP 18, NOUN 1, X 1), un (DET 1616, NUM 16, X 2), -a (DET 20, AUX 17, X 1)

Morphology

The form / lemma ratio of X is 1.473684 (the average of all parts of speech is 1.819791).
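The reported ratio matches the number of X types divided by the number of X lemmas given earlier on this page (112 and 76). The short check below reproduces it. The variable names are illustrative only.

```python
x_types = 112   # distinct X word forms reported on this page
x_lemmas = 76   # distinct X lemmas reported on this page

ratio = x_types / x_lemmas
print(f"{ratio:.6f}")  # 1.473684
```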

The 1st highest number of forms (38) was observed with the lemma “_”: 0, 000, 065, 100, 2, 230, 2C9, 3, 307, 390, 391, 400, 463, 500, 672, 720, 736, 770, 867, 9, 900, 914, 957, G-CSF, alpine, amiezei, apune, dată, dopei, glicozidice, glicozidică, operativă, spre, traumatice, un, una, zicochimice, zise.

The 2nd highest number of forms (1) was observed with the lemma “-a”: -a.

The 3rd highest number of forms (1) was observed with the lemma “5a”: 5a.

X occurs with 2 features: Foreign (31; 20% instances), Abbr (5; 3% instances)

X occurs with 2 feature-value pairs: Abbr=Yes, Foreign=Yes

X occurs with 3 feature combinations. The most frequent feature combination is _ (116 tokens). Examples: 000, 500, 100, 0, 2, American, dată, un, -a, 065

Relations

X nodes are attached to their parents using 12 different relations: goeswith (72; 47% instances), flat (42; 28% instances), nmod (12; 8% instances), conj (7; 5% instances), appos (6; 4% instances), amod (3; 2% instances), dep (3; 2% instances), nsubj (2; 1% instances), obl (2; 1% instances), case (1; 1% instances), obj (1; 1% instances), root (1; 1% instances)

Parents of X nodes belong to 11 different parts of speech: NUM (56; 37% instances), X (36; 24% instances), NOUN (27; 18% instances), PROPN (11; 7% instances), ADJ (8; 5% instances), VERB (5; 3% instances), ADV (4; 3% instances), DET (2; 1% instances), ADP (1; 1% instances), PRON (1; 1% instances), no parent tag, i.e. the one X node attached as root (1; 1% instances)

116 (76%) X nodes are leaves.

9 (6%) X nodes have one child.

14 (9%) X nodes have two children.

13 (9%) X nodes have three or more children.
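The four child-count categories above partition all X tokens, so their counts should sum to the 152 X tokens reported at the top of the page. The following sanity check, with illustrative variable names, confirms the total and the rounded percentages.

```python
# Counts of X nodes by number of children, as reported on this page
degree_counts = {
    "leaves": 116,
    "one child": 9,
    "two children": 14,
    "three or more children": 13,
}

total = sum(degree_counts.values())
percentages = {k: round(100 * n / total) for k, n in degree_counts.items()}

print(total)        # 152
print(percentages)  # {'leaves': 76, 'one child': 6, 'two children': 9, 'three or more children': 9}
```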

The highest child degree of an X node is 6.

Children of X nodes are attached using 11 different relations: flat (32; 37% instances), punct (27; 31% instances), case (7; 8% instances), conj (5; 6% instances), det (4; 5% instances), cc (3; 3% instances), amod (2; 2% instances), nmod (2; 2% instances), nummod (2; 2% instances), advmod (1; 1% instances), appos (1; 1% instances)

Children of X nodes belong to 10 different parts of speech: X (36; 42% instances), PUNCT (27; 31% instances), ADP (6; 7% instances), DET (4; 5% instances), PROPN (4; 5% instances), CCONJ (3; 3% instances), ADJ (2; 2% instances), NUM (2; 2% instances), ADV (1; 1% instances), NOUN (1; 1% instances)