
This page pertains to UD version 2.

Treebank Statistics: UD_English-CHILDES: POS Tags: ADV

There are 315 ADV lemmas (5%), 314 ADV types (4%) and 17142 ADV tokens (6%). Out of 17 observed tags, the rank of ADV is: 5 in number of lemmas, 5 in number of types and 7 in number of tokens.

The 10 most frequent ADV lemmas: here, there, where, now, just, how, too, why, on, so

The 10 most frequent ADV types: here, there, where, now, just, how, too, why, on, so

The 10 most frequent ambiguous lemmas: here (ADV 1379, PRON 6), there (ADV 1241, PRON 639, DET 1, INTJ 1), where (ADV 1135, SCONJ 123), now (ADV 885, INTJ 4), just (ADV 839, ADJ 9), how (ADV 694, SCONJ 167), why (ADV 575, SCONJ 42, INTJ 1), on (ADP 1798, ADV 538, SCONJ 4), so (ADV 504, SCONJ 44, INTJ 11, NOUN 1), right (ADV 482, ADJ 199, NOUN 12, INTJ 5, PROPN 1, VERB 1)

The 10 most frequent ambiguous types: here (ADV 1024, PRON 1), there (ADV 1135, PRON 312), where (ADV 290, SCONJ 110), just (ADV 661, ADJ 6), how (ADV 236, SCONJ 164), why (ADV 129, SCONJ 42), on (ADP 1690, ADV 537, SCONJ 4), so (ADV 332, SCONJ 35), right (ADV 405, ADJ 187, NOUN 12, INTJ 3, PROPN 1, VERB 1), down (ADV 421, ADP 205, NOUN 3)

Morphology

The form / lemma ratio of ADV is 0.996825 (the average of all parts of speech is 1.232942).
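The ratio above matches the number of ADV types divided by the number of ADV lemmas (314 / 315). As a rough sketch of how such a figure could be recomputed from a CoNLL-U file, the hypothetical helper `form_lemma_ratio` below counts distinct forms and distinct lemmas for one UPOS tag. The function name, the case-sensitive comparison of forms, and the skipping of multiword-token ranges and empty nodes are assumptions of this sketch, not a description of the script that generated this page.

```python
def form_lemma_ratio(conllu_text, upos="ADV"):
    """Distinct word forms divided by distinct lemmas for one UPOS tag."""
    forms, lemmas = set(), set()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # assumption: ignore ranges like 1-2 and empty nodes like 1.1
        if cols[3] == upos:
            forms.add(cols[1])   # FORM column, compared case-sensitively
            lemmas.add(cols[2])  # LEMMA column
    return len(forms) / len(lemmas) if lemmas else 0.0


# Tiny invented sample: two forms of the lemma "late".
SAMPLE = "\n".join([
    "1\tlater\tlate\tADV\t_\t_\t2\tadvmod\t_\t_",
    "2\tlate\tlate\tADV\t_\t_\t0\troot\t_\t_",
])
print(form_lemma_ratio(SAMPLE))

# Ratio implied by the counts reported on this page.
print(round(314 / 315, 6))
```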

The 1st highest number of forms (3) was observed with the lemma “late”: late, lately, later.

The 2nd highest number of forms (3) was observed with the lemma “not”: ’t, n’t, not.

The 3rd highest number of forms (3) was observed with the lemma “too”: Ta, to, too.

ADV occurs with 2 features: ExtPos (88; 1% instances), Typo (2; 0% instances)

ADV occurs with 4 feature-value pairs: ExtPos=ADP, ExtPos=ADV, ExtPos=SCONJ, Typo=Yes

ADV occurs with 5 feature combinations. The most frequent feature combination is _ (17052 tokens). Examples: here, there, where, now, just, how, too, why, on, so

Relations

ADV nodes are attached to their parents using 32 different relations: advmod (13080; 76% instances), root (1850; 11% instances), obl (466; 3% instances), compound:prt (418; 2% instances), discourse (295; 2% instances), conj (171; 1% instances), case (119; 1% instances), mark (114; 1% instances), parataxis (101; 1% instances), amod (79; 0% instances), reparandum (73; 0% instances), nmod (63; 0% instances), ccomp (60; 0% instances), xcomp (59; 0% instances), obj (54; 0% instances), advcl (37; 0% instances), nsubj (34; 0% instances), fixed (26; 0% instances), acl:relcl (11; 0% instances), nsubj:outer (8; 0% instances), compound (4; 0% instances), obl:unmarked (4; 0% instances), cc (3; 0% instances), vocative (3; 0% instances), acl (2; 0% instances), cc:preconj (2; 0% instances), appos (1; 0% instances), csubj (1; 0% instances), dep (1; 0% instances), flat (1; 0% instances), nmod:npmod (1; 0% instances), obl:npmod (1; 0% instances)

Parents of ADV nodes belong to 16 different parts of speech (counting the artificial root, which has no tag): VERB (9969; 58% instances), [root] (1850; 11% instances), ADV (1565; 9% instances), ADJ (1332; 8% instances), NOUN (1198; 7% instances), AUX (465; 3% instances), PRON (394; 2% instances), PROPN (128; 1% instances), NUM (78; 0% instances), ADP (52; 0% instances), DET (51; 0% instances), INTJ (31; 0% instances), SCONJ (19; 0% instances), PART (6; 0% instances), CCONJ (3; 0% instances), PUNCT (1; 0% instances)
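Counts like the relation and parent-tag tallies above can be approximated with a short pass over a CoNLL-U file. The sketch below is illustrative only: `adv_attachment_counts` is a hypothetical name, sentences are assumed to be separated by exactly one empty line, and a node whose HEAD is 0 is recorded with an empty parent tag because the artificial root has no part of speech.

```python
from collections import Counter


def adv_attachment_counts(conllu_text, upos="ADV"):
    """Tally DEPREL and parent UPOS for every node carrying the given tag."""
    rel_counts, parent_counts = Counter(), Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = [l.split("\t") for l in block.splitlines()
                if l and not l.startswith("#")]
        # Assumption: keep only regular word lines (integer IDs, 10 columns).
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        upos_by_id = {r[0]: r[3] for r in rows}
        for r in rows:
            if r[3] == upos:
                rel_counts[r[7]] += 1
                # HEAD 0 is not a real token, so its tag is the empty string.
                parent_counts[upos_by_id.get(r[6], "")] += 1
    return rel_counts, parent_counts


# Invented two-sentence sample: "where is it" and "come here".
SAMPLE = "\n".join([
    "1\twhere\twhere\tADV\t_\t_\t0\troot\t_\t_",
    "2\tis\tbe\tAUX\t_\t_\t1\tcop\t_\t_",
    "3\tit\tit\tPRON\t_\t_\t1\tnsubj\t_\t_",
    "",
    "1\tcome\tcome\tVERB\t_\t_\t0\troot\t_\t_",
    "2\there\there\tADV\t_\t_\t1\tadvmod\t_\t_",
])
rels, parents = adv_attachment_counts(SAMPLE)
print(rels.most_common())
print(parents.most_common())
```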

13223 (77%) ADV nodes are leaves.

1771 (10%) ADV nodes have one child.

775 (5%) ADV nodes have two children.

1373 (8%) ADV nodes have three or more children.

The highest child degree of an ADV node is 13.
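The four child-degree figures above sum to the 17142 ADV tokens, and their shares can be checked with simple arithmetic. The following sketch also shows one way such a histogram might be derived from CoNLL-U data. `child_degree_histogram` is a hypothetical helper, sentences are assumed to be separated by exactly one empty line, and degrees of three or more are pooled into one bucket to mirror the grouping used on this page.

```python
from collections import Counter


def child_degree_histogram(conllu_text, upos="ADV"):
    """Count nodes of one UPOS by number of children: 0, 1, 2, or 3 (= 3+)."""
    hist = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = [l.split("\t") for l in block.splitlines()
                if l and not l.startswith("#")]
        # Assumption: keep only regular word lines (integer IDs, 10 columns).
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        n_children = Counter(r[6] for r in rows)  # HEAD id -> number of dependents
        for r in rows:
            if r[3] == upos:
                hist[min(n_children[r[0]], 3)] += 1
    return hist


# Invented sample: "where" has two children, "here" is a leaf.
SAMPLE = "\n".join([
    "1\twhere\twhere\tADV\t_\t_\t0\troot\t_\t_",
    "2\tis\tbe\tAUX\t_\t_\t1\tcop\t_\t_",
    "3\tit\tit\tPRON\t_\t_\t1\tnsubj\t_\t_",
    "",
    "1\tcome\tcome\tVERB\t_\t_\t0\troot\t_\t_",
    "2\there\there\tADV\t_\t_\t1\tadvmod\t_\t_",
])
print(child_degree_histogram(SAMPLE))

# Consistency check on the counts reported on this page.
page_counts = [13223, 1771, 775, 1373]  # leaves, one child, two children, three or more
total = sum(page_counts)
shares = [round(100 * c / total) for c in page_counts]
print(total, shares)
```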

Children of ADV nodes are attached using 35 different relations: punct (1852; 22% instances), advmod (1462; 18% instances), nsubj (1436; 17% instances), cop (1216; 15% instances), case (708; 9% instances), discourse (286; 3% instances), obl (205; 2% instances), conj (178; 2% instances), cc (151; 2% instances), vocative (148; 2% instances), parataxis (128; 2% instances), fixed (86; 1% instances), reparandum (78; 1% instances), aux (60; 1% instances), mark (60; 1% instances), advcl (43; 1% instances), obl:npmod (37; 0% instances), acl:relcl (33; 0% instances), det (32; 0% instances), nmod (18; 0% instances), amod (16; 0% instances), obl:unmarked (15; 0% instances), compound:prt (13; 0% instances), xcomp (10; 0% instances), compound (6; 0% instances), acl (5; 0% instances), nummod (5; 0% instances), appos (3; 0% instances), ccomp (3; 0% instances), det:predet (2; 0% instances), goeswith (2; 0% instances), nmod:poss (2; 0% instances), dislocated (1; 0% instances), nsubj:outer (1; 0% instances), obj (1; 0% instances)

Children of ADV nodes belong to 16 different parts of speech: PUNCT (1852; 22% instances), ADV (1565; 19% instances), AUX (1293; 16% instances), NOUN (898; 11% instances), ADP (712; 9% instances), PRON (655; 8% instances), INTJ (260; 3% instances), PROPN (247; 3% instances), VERB (196; 2% instances), CCONJ (147; 2% instances), PART (140; 2% instances), DET (120; 1% instances), ADJ (105; 1% instances), SCONJ (65; 1% instances), NUM (42; 1% instances), X (5; 0% instances)