home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-SiMoNERo: POS Tags: ADP

There are 33 ADP lemmas (1%), 35 ADP types (1%) and 1941 ADP tokens (13%). Out of 15 observed tags, the rank of ADP is: 7 in number of lemmas, 9 in number of types and 4 in number of tokens.

The 10 most frequent ADP lemmas: de, în, cu, la, prin, pentru, din, pe, dintre, după

The 10 most frequent ADP types: de, în, cu, la, prin, pentru, din, pe, dintre, după

The 10 most frequent ambiguous lemmas: fără (ADP 15, SCONJ 1), peste (ADP 9, ADV 6), drept (ADJ 5, ADP 2, NOUN 1), a (PART 19, DET 3, ADP 1, NOUN 1, X 1)

The 10 most frequent ambiguous types: fără (ADP 15, SCONJ 1), peste (ADP 9, ADV 6), drept (ADJ 3, ADP 2), a (DET 191, AUX 52, PART 19, ADP 1, NOUN 1, X 1)

Morphology

The form / lemma ratio of ADP is 1.060606 (the average of all parts of speech is 1.477080).

The 1st highest number of forms (2) was observed with the lemma “de”: de, de-.

The 2nd highest number of forms (2) was observed with the lemma “întru”: Într, într-.

The 3rd highest number of forms (1) was observed with the lemma “(WF)coform”: coform.

ADP occurs with 3 features: AdpType (1941; 100% instances), Case (1941; 100% instances), Variant (19; 1% instances)

ADP occurs with 5 feature-value pairs: AdpType=Prep, Case=Acc, Case=Dat, Case=Gen, Variant=Short

ADP occurs with 4 feature combinations. The most frequent feature combination is AdpType=Prep|Case=Acc (1889 tokens). Examples: de, în, cu, la, prin, pentru, din, pe, dintre, după

Relations

ADP nodes are attached to their parents using 7 different relations: case (1781; 92% instances), advmod (64; 3% instances), fixed (62; 3% instances), mark (27; 1% instances), amod (5; 0% instances), conj (1; 0% instances), nmod (1; 0% instances)

Parents of ADP nodes belong to 10 different parts of speech: NOUN (1672; 86% instances), NUM (70; 4% instances), VERB (61; 3% instances), PRON (56; 3% instances), ADP (39; 2% instances), ADV (19; 1% instances), ADJ (12; 1% instances), X (8; 0% instances), PROPN (3; 0% instances), CCONJ (1; 0% instances)

1798 (93%) ADP nodes are leaves.

95 (5%) ADP nodes have one child.

38 (2%) ADP nodes have two children.

10 (1%) ADP nodes have three or more children.

The highest child degree of a ADP node is 4.

Children of ADP nodes are attached using 6 different relations: fixed (176; 87% instances), punct (21; 10% instances), cc (3; 1% instances), conj (1; 0% instances), nummod (1; 0% instances), obj (1; 0% instances)

Children of ADP nodes belong to 11 different parts of speech: NOUN (77; 38% instances), ADP (39; 19% instances), ADV (24; 12% instances), PRON (22; 11% instances), PUNCT (21; 10% instances), ADJ (9; 4% instances), CCONJ (4; 2% instances), DET (3; 1% instances), SCONJ (2; 1% instances), NUM (1; 0% instances), VERB (1; 0% instances)