Treebank Statistics: UD_English-CHILDES: POS Tags: PUNCT
There are 23 PUNCT lemmas (0%), 23 PUNCT types (0%) and 48217 PUNCT tokens (16%).
Out of 17 observed tags, the rank of PUNCT is: 13 in number of lemmas, 14 in number of types and 2 in number of tokens.
The 10 most frequent PUNCT lemmas: ., ?, !, dere, :, o, that, Swww, a, dis
The 10 most frequent PUNCT types: ., ?, !, dere, :, dat, o, Swww, a, dis
The 10 most frequent ambiguous lemmas: dere (ADV 44, PROPN 33, INTJ 21, NOUN 21, PRON 12, PUNCT 8, DET 2, ADJ 1), o (NOUN 14, X 5, PUNCT 3, INTJ 2, PROPN 1, SYM 1), that (PRON 4425, DET 1080, SCONJ 361, INTJ 39, ADV 28, PROPN 13, ADP 10, NOUN 10, PART 7, PUNCT 3, SYM 1), Swww (PROPN 58, PUNCT 2), a (DET 7218, NOUN 36, X 15, SYM 10, PRON 8, INTJ 6, PROPN 4, ADP 2, PUNCT 2), dis (PRON 74, DET 34, PROPN 23, INTJ 14, NOUN 13, ADV 4, ADJ 2, PART 2, PUNCT 2, ADP 1, X 1), t (NOUN 13, X 3, ADV 2, INTJ 2, PART 2, PROPN 2, PUNCT 2, ADJ 1), Dwww (PROPN 20, NOUN 1, PUNCT 1), b (NOUN 46, PROPN 9, NUM 5, X 3, INTJ 2, ADJ 1, ADP 1, PUNCT 1, SYM 1), horsie (NOUN 84, PROPN 6, ADJ 4, ADV 2, INTJ 1, PUNCT 1, VERB 1, X 1)
The 10 most frequent ambiguous types: dere (ADV 64, PROPN 33, NOUN 21, INTJ 12, PRON 10, PUNCT 8, VERB 5, DET 2, ADJ 1), dat (PRON 108, DET 42, NOUN 8, PROPN 7, PART 6, INTJ 5, ADP 4, PUNCT 3, SCONJ 2, SYM 1), o (NOUN 14, X 5, PUNCT 3, ADP 1, INTJ 1, PROPN 1, SYM 1), Swww (PROPN 58, PUNCT 2, INTJ 1, PRON 1), a (DET 5737, NOUN 32, PART 14, X 13, ADP 12, SYM 9, PRON 6, PROPN 4, INTJ 3, PUNCT 2, ADV 1, AUX 1), dis (PRON 56, DET 34, PROPN 21, NOUN 13, ADV 4, INTJ 4, ADJ 2, PART 2, PUNCT 2, ADP 1, X 1), t (NOUN 13, PART 2, PUNCT 2, ADJ 1, ADV 1, PROPN 1, X 1), Dwww (PROPN 20, NOUN 2, PRON 2, INTJ 1, PUNCT 1), b (NOUN 45, PROPN 9, X 3, NUM 2, ADJ 1, ADP 1, PUNCT 1, SYM 1), horsie (NOUN 87, PROPN 10, ADJ 4, ADV 2, PUNCT 1, X 1)
- dere
- dat
- PRON 108: And I ‘m gon na turn into a knight if you do dat .
- DET 42: Mommy look at dat camera .
- NOUN 8: A dat ?
- PROPN 7: Give dat a Ursula give dat Ursula .
- PART 6: You have dat okay ?
- INTJ 5: Dat dat dat my neck .
- ADP 4: Who riding dat Mommy ?
- PUNCT 3: Like like dat .
- SCONJ 2: Dat a box dat dey put it in .
- SYM 1: Look at dat .
- o
- Swww
- a
- DET 5737: Adam a home .
- NOUN 32: Make the s then make the a .
- PART 14: Here you trying a get out .
- X 13: Leopard ‘s spelled l e o p a r d .
- ADP 12: It ‘s a lott a years yep .
- SYM 9: Um d a .
- PRON 6: Uma um how come a um um a man is a scuba diver ?
- PROPN 4: Write pencil Adam a .
- INTJ 3: Cock a doodle doo .
- PUNCT 2: Oh that one ‘s the wrong a .
- ADV 1: What makes that kind a noise ?
- AUX 1: You a you be a front Julian be a back .
- dis
- PRON 56: I want dis big .
- DET 34: He want a go dis side .
- PROPN 21: Dis is for r r and dis is for p .
- NOUN 13: I want to keep dis on the floor .
- ADV 4: Look is dis pretty ?
- INTJ 4: Like dis you do n’t because de table ‘s high .
- ADJ 2: Mommy what is dis for ?
- PART 2: What does dis spell ?
- PUNCT 2: Um dis .
- ADP 1: I found dis a racing car .
- X 1: No dis de mother duck in dere .
- t
- Dwww
- b
- horsie
Morphology
The form / lemma ratio of PUNCT is 1.000000 (the average of all parts of speech is 1.232942).
The 1st highest number of forms (1) was observed with the lemma “!”: !.
The 2nd highest number of forms (1) was observed with the lemma “-”: -.
The 3rd highest number of forms (1) was observed with the lemma “.”: ..
PUNCT does not occur with any features.
Relations
PUNCT nodes are attached to their parents using 2 different relations: punct (48186; 100% instances), root (31; 0% instances)
Parents of PUNCT nodes belong to 18 different parts of speech: VERB (27919; 58% instances), NOUN (9694; 20% instances), ADJ (2623; 5% instances), PRON (1928; 4% instances), ADV (1852; 4% instances), AUX (1317; 3% instances), PROPN (1248; 3% instances), INTJ (790; 2% instances), NUM (367; 1% instances), DET (172; 0% instances), ADP (90; 0% instances), SCONJ (81; 0% instances), PART (33; 0% instances), PUNCT (31; 0% instances), (31; 0% instances), CCONJ (19; 0% instances), X (16; 0% instances), SYM (6; 0% instances)
48186 (100%) PUNCT nodes are leaves.
0 (0%) PUNCT nodes have one child.
15 (0%) PUNCT nodes have two children.
16 (0%) PUNCT nodes have three or more children.
The highest child degree of a PUNCT node is 6.
Children of PUNCT nodes are attached using 11 different relations: punct (31; 35% instances), nsubj (21; 24% instances), cop (11; 13% instances), discourse (8; 9% instances), det (6; 7% instances), case (3; 3% instances), nummod (3; 3% instances), amod (2; 2% instances), advmod (1; 1% instances), compound (1; 1% instances), mark (1; 1% instances)
Children of PUNCT nodes belong to 11 different parts of speech: PUNCT (31; 35% instances), PRON (12; 14% instances), AUX (11; 13% instances), INTJ (10; 11% instances), NOUN (9; 10% instances), DET (6; 7% instances), NUM (4; 5% instances), ADJ (2; 2% instances), ADP (1; 1% instances), ADV (1; 1% instances), SCONJ (1; 1% instances)