home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-SiMoNERo: POS Tags: ADP

There are 41 ADP lemmas (0%), 44 ADP types (0%) and 20080 ADP tokens (14%). Out of 16 observed tags, the rank of ADP is: 8 in number of lemmas, 10 in number of types and 2 in number of tokens.

The 10 most frequent ADP lemmas: de, în, la, cu, din, pentru, prin, pe, dintre, după

The 10 most frequent ADP types: de, în, la, cu, din, pentru, prin, pe, dintre, după

The 10 most frequent ambiguous lemmas: de (ADP 6724, X 2), în (ADP 3627, NOUN 2), pentru (ADP 797, VERB 1), fără (ADP 135, SCONJ 3), sub (ADP 126, X 2, ADV 1), peste (ADP 125, ADV 32), până (ADP 92, SCONJ 28), versus (ADP 36, ADV 3, PROPN 2, X 1), drept (ADJ 23, ADP 20, NOUN 2, ADV 1), a (PART 302, DET 13, ADP 12, NOUN 8, X 5)

The 10 most frequent ambiguous types: de (ADP 6607, X 1), în (ADP 3145, NOUN 1), pentru (ADP 747, VERB 1), fără (ADP 135, SCONJ 3), sub (ADP 124, X 2, ADV 1), peste (ADP 123, ADV 32), până (ADP 86, SCONJ 27), versus (ADP 36, ADV 3, PROPN 2, X 1), drept (ADP 19, ADJ 12, ADV 1), a (DET 1826, AUX 793, PART 302, ADP 12, NOUN 8, X 5)

Morphology

The form / lemma ratio of ADP is 1.073171 (the average of all parts of speech is 1.666462).

The 1st highest number of forms (2) was observed with the lemma “de”: de, de-.

The 2nd highest number of forms (2) was observed with the lemma “după”: Dupa, după.

The 3rd highest number of forms (2) was observed with the lemma “întru”: într, într-.

ADP occurs with 4 features: AdpType (20077; 100% instances), Case (20077; 100% instances), Variant (157; 1% instances), Abbr (3; 0% instances)

ADP occurs with 6 feature-value pairs: Abbr=Yes, AdpType=Prep, Case=Acc, Case=Dat, Case=Gen, Variant=Short

ADP occurs with 5 feature combinations. The most frequent feature combination is AdpType=Prep|Case=Acc (19619 tokens). Examples: de, în, la, cu, din, pentru, prin, pe, dintre, după

Relations

ADP nodes are attached to their parents using 19 different relations: case (18081; 90% instances), fixed (833; 4% instances), advmod (674; 3% instances), mark (409; 2% instances), amod (34; 0% instances), conj (16; 0% instances), obl (9; 0% instances), nmod (7; 0% instances), xcomp (4; 0% instances), appos (3; 0% instances), goeswith (2; 0% instances), acl (1; 0% instances), advcl (1; 0% instances), compound (1; 0% instances), dep (1; 0% instances), det (1; 0% instances), flat (1; 0% instances), obj (1; 0% instances), root (1; 0% instances)

Parents of ADP nodes belong to 13 different parts of speech: NOUN (16525; 82% instances), NUM (869; 4% instances), VERB (867; 4% instances), ADP (557; 3% instances), PRON (511; 3% instances), ADV (292; 1% instances), PROPN (153; 1% instances), ADJ (145; 1% instances), X (118; 1% instances), SCONJ (27; 0% instances), DET (13; 0% instances), CCONJ (2; 0% instances), (1; 0% instances)

18370 (91%) ADP nodes are leaves.

1055 (5%) ADP nodes have one child.

463 (2%) ADP nodes have two children.

192 (1%) ADP nodes have three or more children.

The highest child degree of a ADP node is 8.

Children of ADP nodes are attached using 18 different relations: fixed (2193; 84% instances), punct (332; 13% instances), conj (21; 1% instances), cc (20; 1% instances), advmod (18; 1% instances), nummod (16; 1% instances), nmod (4; 0% instances), amod (3; 0% instances), cop (3; 0% instances), mark (3; 0% instances), nsubj (3; 0% instances), det (2; 0% instances), obj (2; 0% instances), appos (1; 0% instances), case (1; 0% instances), goeswith (1; 0% instances), iobj (1; 0% instances), obl (1; 0% instances)

Children of ADP nodes belong to 13 different parts of speech: NOUN (923; 35% instances), ADP (557; 21% instances), PUNCT (332; 13% instances), PRON (300; 11% instances), ADV (229; 9% instances), ADJ (91; 3% instances), VERB (70; 3% instances), DET (39; 1% instances), NUM (32; 1% instances), CCONJ (29; 1% instances), SCONJ (14; 1% instances), PART (6; 0% instances), AUX (3; 0% instances)