This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-DANTEStocks: POS Tags: ADP

There are 31 ADP lemmas (0%), 43 ADP types (0%) and 8760 ADP tokens (11%). Out of 16 observed tags, the rank of ADP is: 12 in number of lemmas, 13 in number of types and 4 in number of tokens.

The 10 most frequent ADP lemmas: de, em, a, para, com, por, sobre, até, sem, após

The 10 most frequent ADP types: de, em, a, com, para, por, c/, pra, sobre, até
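Counts like these can in principle be recomputed from the treebank's CoNLL-U files, where each token line has ten tab-separated columns (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC). The following is a minimal sketch, not the script that generated this page: the sample sentences are invented for illustration, the helper name `upos_stats` is hypothetical, and treating forms as case-sensitive is an assumption.

```python
from collections import Counter

# Invented three-sentence sample in CoNLL-U layout (not taken from the treebank).
SAMPLE = (
    "# text = de olho na bolsa\n"
    "1\tde\tde\tADP\t_\t_\t2\tcase\t_\t_\n"
    "2\tolho\tolho\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "3-4\tna\t_\t_\t_\t_\t_\t_\t_\t_\n"
    "3\tem\tem\tADP\t_\t_\t5\tcase\t_\t_\n"
    "4\ta\to\tDET\t_\t_\t5\tdet\t_\t_\n"
    "5\tbolsa\tbolsa\tNOUN\t_\t_\t2\tnmod\t_\t_\n"
    "\n"
    "# text = dinheiro pra viagem\n"
    "1\tdinheiro\tdinheiro\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "2\tpra\tpara\tADP\t_\t_\t3\tcase\t_\t_\n"
    "3\tviagem\tviagem\tNOUN\t_\t_\t1\tnmod\t_\t_\n"
    "\n"
    "# text = para a casa de Ana\n"
    "1\tpara\tpara\tADP\t_\t_\t3\tcase\t_\t_\n"
    "2\ta\to\tDET\t_\t_\t3\tdet\t_\t_\n"
    "3\tcasa\tcasa\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "4\tde\tde\tADP\t_\t_\t5\tcase\t_\t_\n"
    "5\tAna\tAna\tPROPN\t_\t_\t3\tnmod\t_\t_\n"
)


def upos_stats(conllu: str, upos: str):
    """Return (lemma counts, form counts) for tokens tagged with `upos`."""
    lemmas, forms = Counter(), Counter()
    for line in conllu.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (3-4) and empty nodes (1.1)
        if cols[3] == upos:
            forms[cols[1]] += 1   # FORM column; kept case-sensitive (assumption)
            lemmas[cols[2]] += 1  # LEMMA column
    return lemmas, forms


lemmas, forms = upos_stats(SAMPLE, "ADP")
n_tokens = sum(forms.values())
print(len(lemmas), "lemmas,", len(forms), "types,", n_tokens, "tokens")
print(lemmas.most_common(10))
```

On the sample, "pra" and "para" count as two types but share the lemma "para", which is the same effect that makes the type list above differ from the lemma list.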

The 10 most frequent ambiguous lemmas: a (ADP 634, X 3, DET 1), para (ADP 598, SCONJ 1), com (ADP 567, X 2, NOUN 1), por (ADP 283, VERB 1, X 1), até (ADP 67, ADV 16), sem (ADP 49, PRON 1), como (ADP 36, ADV 35, SCONJ 13, CCONJ 1), segundo (ADP 21, ADJ 9, NOUN 1), o (DET 5527, PRON 176, ADP 19, X 4, INTJ 1, NOUN 1), contra (ADP 17, ADV 1)

The 10 most frequent ambiguous types: de (ADP 3525, ADV 1, NOUN 1, VERB 1), a (DET 2452, ADP 619, PRON 27, X 3, NOUN 2, VERB 1), com (ADP 418, X 2, NOUN 1), para (ADP 394, VERB 7), por (ADP 253, X 1), até (ADP 51, ADV 14), sem (ADP 36, PRON 1), como (ADP 31, ADV 23, SCONJ 8, CCONJ 1), entre (ADP 25, VERB 19), d (ADP 26, NOUN 1, VERB 1, X 1)

Morphology

The form / lemma ratio of ADP is 1.387097 (the average of all parts of speech is 1.238049).
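The reported ratio is consistent with dividing the number of ADP types by the number of ADP lemmas given earlier on this page (43 and 31). A small sketch of that arithmetic, assuming this is how the ratio is defined; the function name is hypothetical:

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


# 43 ADP types and 31 ADP lemmas, as reported on this page.
ratio = form_lemma_ratio(43, 31)
print(f"{ratio:.6f}")
```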

The highest number of forms (7) was observed with the lemma “para”: P., p, p/, pa, para, pr, pra.

The second-highest number of forms (4) was observed with the lemma “com”: c, c/, cm, com.

The third-highest number of forms (3) was observed with the lemma “como”: c/, com, como.

ADP occurs with 3 features: ExtPos (76; 1% instances), Number (20; 0% instances), Gender (19; 0% instances)

ADP occurs with 7 feature-value pairs: ExtPos=ADP, ExtPos=ADV, ExtPos=CCONJ, ExtPos=SCONJ, Gender=Fem, Number=Plur, Number=Sing

ADP occurs with 8 feature combinations. The most frequent feature combination is _ (8664 tokens). Examples: de, em, a, com, para, por, c/, pra, sobre, até

Relations

ADP nodes are attached to their parents using 14 different relations: case (8172; 93% instances), mark (451; 5% instances), fixed (72; 1% instances), advmod (34; 0% instances), nmod (13; 0% instances), obl (6; 0% instances), dep (4; 0% instances), flat:name (2; 0% instances), advcl (1; 0% instances), amod (1; 0% instances), cc (1; 0% instances), parataxis (1; 0% instances), reparandum (1; 0% instances), xcomp (1; 0% instances)
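As a sanity check, the relation counts above can be summed and converted to shares of all ADP tokens. This sketch assumes the percentages are rounded to the nearest whole number, which matches the figures shown:

```python
# Counts copied from the relation list above.
relations = {
    "case": 8172, "mark": 451, "fixed": 72, "advmod": 34, "nmod": 13,
    "obl": 6, "dep": 4, "flat:name": 2, "advcl": 1, "amod": 1,
    "cc": 1, "parataxis": 1, "reparandum": 1, "xcomp": 1,
}

total = sum(relations.values())  # should equal the ADP token count
shares = {rel: round(100 * n / total) for rel, n in relations.items()}

print(total)
for rel, pct in shares.items():
    print(f"{rel}: {pct}%")
```

The 14 counts add up to the 8760 ADP tokens reported at the top of the page, so every ADP node is covered by exactly one incoming relation.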

Parents of ADP nodes belong to 12 different parts of speech: NOUN (4582; 52% instances), PROPN (1922; 22% instances), SYM (780; 9% instances), NUM (559; 6% instances), VERB (460; 5% instances), ADV (198; 2% instances), PRON (190; 2% instances), X (31; 0% instances), ADJ (28; 0% instances), ADP (8; 0% instances), AUX (1; 0% instances), DET (1; 0% instances)

8683 (99%) ADP nodes are leaves.

44 (1%) ADP nodes have one child.

29 (0%) ADP nodes have two children.

4 (0%) ADP nodes have three or more children.

The highest child degree of an ADP node is 4.

Children of ADP nodes are attached using 3 different relations: fixed (106; 92% instances), punct (8; 7% instances), advmod (1; 1% instances)

Children of ADP nodes belong to 8 different parts of speech: NOUN (30; 26% instances), PRON (21; 18% instances), SCONJ (19; 17% instances), DET (15; 13% instances), ADV (11; 10% instances), ADP (8; 7% instances), PUNCT (8; 7% instances), ADJ (3; 3% instances)
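The child-degree figures and the two child breakdowns can be cross-checked against each other. The sketch below uses only numbers from this page; the variable names are illustrative:

```python
# Number of ADP nodes with exactly k children (k = 0, 1, 2).
degree_counts = {0: 8683, 1: 44, 2: 29}
three_or_more = 4   # ADP nodes with three or more children
max_degree = 4      # highest child degree reported

child_relations = {"fixed": 106, "punct": 8, "advmod": 1}
child_upos = {"NOUN": 30, "PRON": 21, "SCONJ": 19, "DET": 15,
              "ADV": 11, "ADP": 8, "PUNCT": 8, "ADJ": 3}

total_nodes = sum(degree_counts.values()) + three_or_more
total_children = sum(child_relations.values())

# Children accounted for by nodes with one or two children.
known = sum(k * n for k, n in degree_counts.items())
# Children left over for the nodes with three or more children.
remaining = total_children - known

# Each of those nodes has between 3 and max_degree children.
consistent = 3 * three_or_more <= remaining <= max_degree * three_or_more

print(total_nodes, total_children, remaining, consistent)
```

Both breakdowns give the same number of children, and the leftover children fit the four high-degree nodes, for example as one node with four children and three nodes with three.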