Statistics of NOUN in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_English-EWT: POS Tags: `NOUN`

There are 6097 NOUN lemmas (34%), 7817 NOUN types (35%) and 43084 NOUN tokens (17%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: time, year, service, place, day, people, thanks, food, way, number

The 10 most frequent NOUN types: time, people, service, place, thanks, food, way, year, day, number

The 10 most frequent ambiguous lemmas: time (NOUN 557, VERB 1), place (NOUN 337, VERB 23), day (NOUN 330, PROPN 1), food (NOUN 265, PROPN 1), way (NOUN 241, ADV 19), number (NOUN 219, PROPN 1, VERB 1), price (NOUN 197, VERB 5), question (NOUN 154, VERB 6), work (VERB 276, NOUN 154), one (NUM 424, NOUN 148, PRON 49)

The 10 most frequent ambiguous types: time (NOUN 467, X 1), place (NOUN 263, VERB 12), food (NOUN 223, PROPN 1), way (NOUN 221, ADV 17, X 3), day (NOUN 199, X 3, PROPN 1), number (NOUN 116, PROPN 1), work (NOUN 140, VERB 135), price (NOUN 126, VERB 1), world (NOUN 131, PROPN 4), am (AUX 278, NOUN 30)

time
- NOUN 467: Game tonight at 7 , it ‘s time to kick some ass .
- X 1: However , we waited and waited and in the mean time , saw 4 groups of people simply just paraded in without signing there names .
place
- NOUN 263: The games will have to take place on Fri , Sat , or Sun .
- VERB 12: We need to change the lock and place posted signs at the gate .
food
- NOUN 223: Find him before he finds the dog food .
- PROPN 1: Such a convenient location as well with coffee shop and bradley food and beverage right around corner .
way
- NOUN 221: I was on my way to my wedding fearing death , basically . “
- ADV 17: There are way more stranger names in the U.S for areas than Miramar .
- X 3: I am half way through the first and you can borrow it when I am through .
day
- NOUN 199: I am looking forward to hearing from you soon and have a nice day .
- X 3: They were chirping and doing their every day business .
- PROPN 1: Also how much do compact system cameras drop on boxing day ?
number
- NOUN 116: i got her number though .
- PROPN 1: Came the disintegration of the Beatles ‘ minds with LSD which has caused , among others , schizophrenic lyrics such as “ I am the Walrus “ and incoherent schizophrenic musical expositions like “ Revolution number 9 “ .
work
- NOUN 140: Anybody up for happy hour after work ?
- VERB 135: This afternoon at 2 PM or later would work for us .
price
- NOUN 126: Go to Goldstar.com and get tickets for about a third of the price .
- VERB 1: Secondly , he will still out price performers that you have in the same job group that are excellent and strong performers respectively eg. Paul Thomas , Jason Choate , Todd DeCook and Peter Makkai .
world
- NOUN 131: Are there any new developments in the trader world ?
- PROPN 4: how much does it cost to join world resorts international ?
am
- AUX 278: i am not going unless lisa promises to get all wasted and boob out .
- NOUN 30: October 4 , ENA orientation in the am .

Morphology

The form / lemma ratio of NOUN is 1.282106 (the average of all parts of speech is 1.250484).

The 1st highest number of forms (6) was observed with the lemma “service”: $ervice, sercvice, serivce, service, services, svce.

The 2nd highest number of forms (6) was observed with the lemma “thanks”: thaks, thank, thanks, thanx, thx, tks.

The 3rd highest number of forms (5) was observed with the lemma “company”: companie, companie$, companies, company, company’s.

NOUN occurs with 8 features: Number (43084; 100% instances), Typo (231; 1% instances), NumForm (151; 0% instances), NumType (151; 0% instances), Abbr (135; 0% instances), ExtPos (16; 0% instances), Foreign (8; 0% instances), Style (8; 0% instances)

NOUN occurs with 16 feature-value pairs: Abbr=Yes, ExtPos=ADV, ExtPos=PROPN, Foreign=Yes, NumForm=Combi, NumForm=Digit, NumForm=Word, NumType=Card, NumType=Frac, NumType=Ord, Number=Plur, Number=Ptan, Number=Sing, Style=Expr, Style=Vrnc, Typo=Yes

NOUN occurs with 24 feature combinations. The most frequent feature combination is Number=Sing (32395 tokens). Examples: time, service, place, thanks, food, way, year, day, number, pm

Relations

NOUN nodes are attached to their parents using 35 different relations: obj (8492; 20% instances), obl (7252; 17% instances), compound (5543; 13% instances), nmod (5526; 13% instances), nsubj (4793; 11% instances), conj (3074; 7% instances), root (2753; 6% instances), obl:unmarked (1030; 2% instances), appos (809; 2% instances), nsubj:pass (719; 2% instances), nmod:unmarked (530; 1% instances), parataxis (371; 1% instances), list (301; 1% instances), nmod:poss (277; 1% instances), ccomp (270; 1% instances), obl:agent (220; 1% instances), advcl (215; 0% instances), xcomp (198; 0% instances), nsubj:outer (174; 0% instances), iobj (122; 0% instances), nmod:desc (87; 0% instances), acl:relcl (84; 0% instances), fixed (59; 0% instances), vocative (42; 0% instances), acl (33; 0% instances), flat (31; 0% instances), discourse (27; 0% instances), advmod (15; 0% instances), orphan (14; 0% instances), advcl:relcl (8; 0% instances), dislocated (6; 0% instances), reparandum (4; 0% instances), csubj (3; 0% instances), csubj:pass (1; 0% instances), dep (1; 0% instances)

Parents of NOUN nodes belong to 16 different parts of speech: VERB (20398; 47% instances), NOUN (14647; 34% instances), (2753; 6% instances), ADJ (2309; 5% instances), PROPN (933; 2% instances), NUM (764; 2% instances), ADV (519; 1% instances), PRON (313; 1% instances), DET (157; 0% instances), SYM (146; 0% instances), ADP (61; 0% instances), AUX (54; 0% instances), INTJ (10; 0% instances), SCONJ (10; 0% instances), X (9; 0% instances), PART (1; 0% instances)

6574 (15%) NOUN nodes are leaves.

11388 (26%) NOUN nodes have one child.

11865 (28%) NOUN nodes have two children.

13257 (31%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 17.

Children of NOUN nodes are attached using 40 different relations: det (17512; 20% instances), case (14125; 16% instances), amod (10807; 12% instances), nmod (7021; 8% instances), punct (7019; 8% instances), compound (6968; 8% instances), nmod:poss (4311; 5% instances), conj (3098; 4% instances), cc (2442; 3% instances), cop (1995; 2% instances), acl:relcl (1809; 2% instances), nsubj (1779; 2% instances), acl (1693; 2% instances), nummod (1413; 2% instances), advmod (1377; 2% instances), appos (883; 1% instances), parataxis (428; 0% instances), mark (426; 0% instances), nmod:unmarked (269; 0% instances), aux (244; 0% instances), det:predet (212; 0% instances), flat (197; 0% instances), advcl (191; 0% instances), obl (188; 0% instances), list (158; 0% instances), discourse (139; 0% instances), csubj (57; 0% instances), vocative (57; 0% instances), expl (44; 0% instances), cc:preconj (38; 0% instances), obl:unmarked (30; 0% instances), goeswith (27; 0% instances), advcl:relcl (18; 0% instances), orphan (18; 0% instances), fixed (15; 0% instances), nsubj:outer (11; 0% instances), reparandum (10; 0% instances), dep (4; 0% instances), compound:prt (1; 0% instances), dislocated (1; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: DET (17793; 20% instances), NOUN (14647; 17% instances), ADP (13648; 16% instances), ADJ (10569; 12% instances), PUNCT (7019; 8% instances), PRON (5057; 6% instances), VERB (4990; 6% instances), PROPN (4007; 5% instances), CCONJ (2371; 3% instances), AUX (2260; 3% instances), NUM (2088; 2% instances), ADV (1285; 1% instances), PART (552; 1% instances), SCONJ (326; 0% instances), SYM (260; 0% instances), X (82; 0% instances), INTJ (81; 0% instances)

Treebank Statistics: UD_English-EWT: POS Tags: NOUN

Morphology

Relations

Treebank Statistics: UD_English-EWT: POS Tags: `NOUN`