Statistics of NOUN in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Romanian-TueCL: POS Tags: `NOUN`

There are 469 NOUN lemmas (39%), 578 NOUN types (35%) and 844 NOUN tokens (19%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: femeie, bărbat, fată, pupic, ban, curvă, fund, soț, copil, picior

The 10 most frequent NOUN types: femeie, femeia, femeile, femei, fetele, bărbat, PUPICI, bărbatul, bărbații, fată

The 10 most frequent ambiguous lemmas: domn (NOUN 2, PROPN 1), față (ADP 2, NOUN 2), sexy (ADJ 5, NOUN 2), vină (NOUN 2, VERB 1), feminist (ADJ 2, NOUN 1), frumoasă (ADJ 2, NOUN 1), frumos (ADJ 22, NOUN 1), misogin (ADJ 6, NOUN 1), prost (ADJ 1, NOUN 1), șef (ADJ 1, NOUN 1)

The 10 most frequent ambiguous types: frumoasă (ADJ 8, NOUN 3), fata (NOUN 2, ADP 1), iubit (NOUN 2, VERB 1), era (AUX 6, VERB 2, NOUN 1), față (ADP 1, NOUN 1), feministe (ADJ 2, NOUN 1), misogini (ADJ 4, NOUN 1), sexy (ADJ 4, NOUN 1), vină (NOUN 1, VERB 1)

frumoasă
- ADJ 8: Degeaba ești frumoasă dacă nu ești cu mine …
- NOUN 3: @Utilizator_fem Simona ești cea mai frumoasă și sexy
fata
- NOUN 2: mergeam si eu liniștită spre gara și mai departe pe o banca stăteau 2 bărbati și se holbau la mine asa ciudat și face unu “ cea mai frumoasa fata “ 🤢 🤮
- ADP 1: masculii romani chiar nu au nici un pic de respect fata de femei pe tiktok 😐
iubit
- NOUN 2: Dacă întrebi o fată dacă are iubit iar ea îți zice ‘ n- am , nici măcar unul ‘ , e clar că are mai multe posturi de ocupat decât are autocarul scaune
- VERB 1: @Utilizator_fem Mmm ce bună ești de iubit 😉
era
- AUX 6: Îți urez o noapte că dacă era bună , erai în pat cu mine .
- VERB 2: era un mos beat mort în autobuz și se ținea în continuu dupa mine
- NOUN 1: @Utilizator_fem in ăsta urla misoginismul si lipsa de sex si de asta face pe interesantul . nu mai traim in era medievala cand femeia era tratata ca o mizerie de barbati , treziti va in plm daca tu crezi ca esti mai putin “ barbat ” dupa ce tratezi o femeie cu egalitate , nu ai fost niciodata trust me
față
- ADP 1: Întotdeauna am preferat curvă adevărată față de sfinții falși ! 🤭
- NOUN 1: @Utilizator_fem Ah , pune -mi -o pe față sa ii simt gustul
feministe
- ADJ 2: După o zi oribila să vină unu la tine să ti zică “ auzi esti frumoasa da păcat ca n ai tate “ 😐 nu mai criticați fetele ca s feministe ca stiu ele ce stiu 💗
- NOUN 1: Eu știu cel puțin 3 feministe care asculta Future si totusi el e cel mai misogin rapper / personalitate
misogini
- ADJ 4: Fotbalul : patroni misogini , antrenori misogini , fotbaliști misogini , femei trofeu . Lumea : de ce sunt unii suporteri misogini cu Ioana Cosma ? WTF ? !
- NOUN 1: @Utilizator_x1 Utilizator_x2 Îs misogini doar cu femeile , vorba lui @Utilizator_x3
sexy
- ADJ 4: @Utilizator_fem Arăți super sexy frumușico 💋 ☺️ 😱
- NOUN 1: @Utilizator_fem Simona ești cea mai frumoasă și sexy
vină
- NOUN 1: @Utilizator_fem Așa este . Nici nu știu cum să mă exprim când aud la știri despre cazuri cu femei omorâte de perechea lor . Azi au dat alt caz de o femeie ucisă , au ajuns la 20 de la începutul anului . 😡 Mare parte din vină o are justiția .
- VERB 1: După o zi oribila să vină unu la tine să ti zică “ auzi esti frumoasa da păcat ca n ai tate “ 😐 nu mai criticați fetele ca s feministe ca stiu ele ce stiu 💗

Morphology

The form / lemma ratio of NOUN is 1.232409 (the average of all parts of speech is 1.367279).

The 1st highest number of forms (11) was observed with the lemma “bărbat”: barbat, barbati, barbatu, barbatul, bărbat, bărbati, bărbatul, bărbaţii, bărbați, bărbații, bărbaților.

The 2nd highest number of forms (6) was observed with the lemma “femeie”: femei, femeia, femeie, femeii, femeile, femeilor.

The 3rd highest number of forms (5) was observed with the lemma “picior”: picioare, picioarele, picioarelor, picior, piciorul.

NOUN occurs with 8 features: Number (843; 100% instances), Gender (827; 98% instances), Definite (821; 97% instances), Case (517; 61% instances), Typo (90; 11% instances), Foreign (20; 2% instances), Degree (14; 2% instances), Abbr (2; 0% instances)

NOUN occurs with 15 feature-value pairs: Abbr=Yes, Case=Acc, Case=Acc,Nom, Case=Dat,Gen, Case=Nom, Case=Voc, Definite=Def, Definite=Ind, Degree=Pos, Foreign=Yes, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Typo=Yes

NOUN occurs with 51 feature combinations. The most frequent feature combination is Case=Acc,Nom|Definite=Ind|Gender=Fem|Number=Sing (162 tokens). Examples: femeie, fată, iubire, mamă, minte, nevoie, bere, noapte, stradă, zi

Relations

NOUN nodes are attached to their parents using 26 different relations: obl (174; 21% instances), nsubj (150; 18% instances), obj (150; 18% instances), nmod (110; 13% instances), conj (68; 8% instances), root (35; 4% instances), fixed (19; 2% instances), list (19; 2% instances), parataxis (14; 2% instances), xcomp (12; 1% instances), ccomp (11; 1% instances), appos (10; 1% instances), obl:agent (10; 1% instances), vocative (10; 1% instances), orphan (9; 1% instances), advcl (7; 1% instances), nsubj:pass (7; 1% instances), flat (6; 1% instances), iobj (6; 1% instances), acl (5; 1% instances), amod (4; 0% instances), csubj (2; 0% instances), discourse (2; 0% instances), obl:pmod (2; 0% instances), compound (1; 0% instances), obl:tmod (1; 0% instances)

Parents of NOUN nodes belong to 11 different parts of speech: VERB (498; 59% instances), NOUN (226; 27% instances), (35; 4% instances), ADJ (33; 4% instances), PRON (13; 2% instances), PROPN (11; 1% instances), AUX (9; 1% instances), ADP (8; 1% instances), ADV (5; 1% instances), INTJ (4; 0% instances), SYM (2; 0% instances)

171 (20%) NOUN nodes are leaves.

331 (39%) NOUN nodes have one child.

189 (22%) NOUN nodes have two children.

153 (18%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 18.

Children of NOUN nodes are attached using 32 different relations: case (297; 23% instances), det (205; 16% instances), amod (126; 10% instances), punct (126; 10% instances), nmod (120; 9% instances), conj (63; 5% instances), cop (52; 4% instances), cc (46; 4% instances), acl (43; 3% instances), advmod (39; 3% instances), parataxis (38; 3% instances), nsubj (23; 2% instances), list (20; 2% instances), nummod (20; 2% instances), mark (17; 1% instances), vocative:mention (13; 1% instances), obl (10; 1% instances), orphan (10; 1% instances), advcl (7; 1% instances), appos (6; 0% instances), discourse:emo (4; 0% instances), aux (3; 0% instances), compound (3; 0% instances), obj (3; 0% instances), advmod:tmod (2; 0% instances), cc:preconj (1; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances), iobj (1; 0% instances), obl:agent (1; 0% instances), vocative (1; 0% instances), xcomp (1; 0% instances)

Children of NOUN nodes belong to 16 different parts of speech: ADP (298; 23% instances), NOUN (226; 17% instances), DET (201; 15% instances), ADJ (133; 10% instances), PUNCT (126; 10% instances), VERB (82; 6% instances), AUX (56; 4% instances), CCONJ (51; 4% instances), ADV (35; 3% instances), NUM (20; 2% instances), PRON (19; 1% instances), PROPN (16; 1% instances), SYM (16; 1% instances), SCONJ (13; 1% instances), PART (10; 1% instances), X (1; 0% instances)

Treebank Statistics: UD_Romanian-TueCL: POS Tags: NOUN

Morphology

Relations

Treebank Statistics: UD_Romanian-TueCL: POS Tags: `NOUN`