home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-PetroGold: POS Tags: ADP

There are 53 ADP lemmas (0%), 64 ADP types (0%) and 43294 ADP tokens (17%). Out of 16 observed tags, the rank of ADP is: 8 in number of lemmas, 10 in number of types and 2 in number of tokens.

The 10 most frequent ADP lemmas: de, em, a, para, por, com, como, entre, sobre, até

The 10 most frequent ADP types: de, em, a, para, por, com, como, entre, sobre, até

The 10 most frequent ambiguous lemmas: de (ADP 23043, SCONJ 19, PUNCT 1), em (ADP 7350, PROPN 2), a (ADP 3391, CCONJ 74, SCONJ 20, NOUN 5, NUM 3, PROPN 2), para (ADP 2336, SCONJ 4), por (ADP 2200, SCONJ 13), com (ADP 2068, SCONJ 4, NOUN 2, X 1), como (ADP 1013, ADV 35, SCONJ 30, CCONJ 18), até (ADP 209, ADV 7), após (ADP 151, ADV 2), segundo (ADP 128, ADJ 58, NOUN 18, ADV 1, PROPN 1)

The 10 most frequent ambiguous types: de (ADP 22831, SCONJ 19), a (DET 10528, ADP 3222, PRON 99, CCONJ 73, SCONJ 20, NOUN 4, NUM 3, ADV 2, PROPN 2, VERB 1), para (ADP 2104, SCONJ 3), por (ADP 2110, SCONJ 12), com (ADP 2005, SCONJ 4, NOUN 2, CCONJ 1, X 1), como (ADP 916, ADV 31, CCONJ 17, SCONJ 17), entre (ADP 445, VERB 2), até (ADP 205, ADV 7), após (ADP 91, ADV 2), segundo (ADP 59, ADJ 31, NOUN 5, ADV 1)

Morphology

The form / lemma ratio of ADP is 1.207547 (the average of all parts of speech is 1.452143).

The 1st highest number of forms (4) was observed with the lemma “de”: de, des, e, se.

The 2nd highest number of forms (3) was observed with the lemma “a”: a, as, á.

The 3rd highest number of forms (2) was observed with the lemma “até”: ate, até.

ADP does not occur with any features.

Relations

ADP nodes are attached to their parents using 9 different relations: case (38716; 89% instances), mark (1676; 4% instances), fixed (1424; 3% instances), flat:name (1031; 2% instances), obl (357; 1% instances), cc (84; 0% instances), root (4; 0% instances), advmod (1; 0% instances), appos (1; 0% instances)

Parents of ADP nodes belong to 12 different parts of speech: NOUN (34027; 79% instances), PROPN (4298; 10% instances), VERB (2116; 5% instances), ADP (829; 2% instances), PRON (621; 1% instances), NUM (503; 1% instances), ADV (404; 1% instances), SYM (239; 1% instances), ADJ (198; 0% instances), DET (40; 0% instances), X (15; 0% instances), (4; 0% instances)

41846 (97%) ADP nodes are leaves.

246 (1%) ADP nodes have one child.

926 (2%) ADP nodes have two children.

276 (1%) ADP nodes have three or more children.

The highest child degree of a ADP node is 5.

Children of ADP nodes are attached using 6 different relations: fixed (2669; 90% instances), punct (299; 10% instances), obl (3; 0% instances), nummod (2; 0% instances), conj (1; 0% instances), nmod (1; 0% instances)

Children of ADP nodes belong to 10 different parts of speech: NOUN (956; 32% instances), ADP (829; 28% instances), PUNCT (299; 10% instances), VERB (287; 10% instances), DET (258; 9% instances), PRON (162; 5% instances), SCONJ (150; 5% instances), ADJ (23; 1% instances), ADV (9; 0% instances), NUM (2; 0% instances)