Statistics of NOUN in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Beja-NSC: POS Tags: `NOUN`

There are 1 NOUN lemmas (6%), 285 NOUN types (23%) and 894 NOUN tokens (15%). Out of 16 observed tags, the rank of NOUN is: 8 in number of lemmas, 2 in number of types and 4 in number of tokens.

The 10 most frequent NOUN lemmas: _

The 10 most frequent NOUN types: tak, mhiːn, doːr, meːk, ʔoːr, jam, kaːm, naː, na, dhaj

The 10 most frequent ambiguous lemmas: _ (PUNCT 1126, VERB 1097, DET 933, NOUN 894, ADP 408, PRON 395, SCONJ 298, PART 167, CCONJ 160, AUX 125, ADV 104, ADJ 77, PROPN 32, INTJ 28, NUM 26, X 18)

The 10 most frequent ambiguous types: naː (NOUN 21, PRON 1), na (NOUN 20, SCONJ 1), dhaj (NOUN 17, ADP 1), =na (NOUN 14, PART 1), raw (NOUN 4, ADJ 2), deː (NOUN 3, DET 1), nafs (NOUN 2, PRON 2), suːr (ADV 9, ADP 3, NOUN 2), wari (ADJ 2, NOUN 2), ʔeːgrim (ADJ 2, NOUN 2)

naː
- NOUN 21: naː =t =ka manri / əddew# ##
- PRON 1: naː =t timrija jhaka rhisa =heːb diːt / w= ʔoːr =oː ahiːt jʔiːni //
na
- NOUN 20: tuːt t= ʔaːdeː =t =i naː =t / matig / iːfi =jeːt toː= na / timiːr //
- SCONJ 1: hoːj sallamaman =eːt toː= na nuːn / w= ʔaraːw =oː ba= aakaj =eːt / toː= na / tikan =heːb //
dhaj
- NOUN 17: uːn uː= dhaj jʔeːtiːt barjoː geːb askineːna eːn / i= dawaːhi =eːb //
- ADP 1: uː= jhaːm oː= mhiːn w= ʔani angad iːkti =jeːb / t= ʔalaːma / uː= na / əgəg# / i= xawaːʤa dhaj hija iːkti =jeːb / hasamja =ajt /
=na
- NOUN 14: manniima ini =jeːb / i= manniimti =iː imri =jeː =na tikati //
- PART 1: ʔakraː =ka =na han dh =eː atʔi =jajt / ʔaːlaʤani ini //
raw
- NOUN 4: hoːj libaːban i= ɖa~ɖib =i i= raw =eːb /
- ADJ 2: oː= mijʔat haːra~riwtiːt / areː i= jam =eː / eː= jam i= raw =eː =da hantajaː =b iːkti =jeːb oː= mijʔat amanri =hoːb / i= jam =eːb andiːf //
deː
- NOUN 3: oːn aneːb / gaːt deː =t =i tiːfi / tak titdʔaːr /
- DET 1: aneːb / deː =t =i baːɖinaː =t =oː =t =u / naː =t ki= tfirʔa indi =hoːb / deː =t =ijoːk oː= mhiːn mitjajaː =heːb indi eːn /
nafs
- NOUN 2: i= tak =iː =ka ʔaɖami =ka =b akajeː i= nafs =i rhi /
- PRON 2: ti= ʃartija / kaːm =i meːs =iːt / i= nafs =i hasama =b akajeː amri /
suːr
- ADV 9: uːn ani suːr / nifri ʤaːntaːji atoːrbi =b =u /
- ADP 3: tʔa mhasi suːr aba =aj hoːj hiːreːran =eːk / t= ʔaba =t =i dh =i tʔiit tirib /
- NOUN 2: i= ragad / w= harʔiː =iːb =wa i= suːr =iːb =wa / taɖoːmaː =b =a =ajt //
wari
- ADJ 2: wari =t ʔaba dh =i titʔi /
- NOUN 2: ɖa~ɖibti =eːk wari =t hariwa naː =t teːtgiːm =eːk /
ʔeːgrim
- ADJ 2: beːn uː= tak w= ʔeːgrim / idir oː= jhaːm // ini =t / hadiːdjaː eːn //
- NOUN 2: ʔeːgrim tak / oːn w= ʔoːr =i mhiːn jʔeːtiːt bʔiːni eːn //

Morphology

The form / lemma ratio of NOUN is 285.000000 (the average of all parts of speech is 76.500000).

The 1st highest number of forms (285) was observed with the lemma “_”: =na, =naː, alla, allaː, allaːji, baji, balad, balami, bani, baraːm, baxit, baʃar, baːb, baːgi, bhali, bhar, bilbil, biri, bissa, bri, buːn, bʔaɖ, bʔaɖaɖ, bʔeː, da, daba, damaːn, dammʔara, dar, dara, darab, dawaːhi, deː, deːm, dhaj, dikʷkʷaːn, dirʔa, dirʔaː, diwaːn, doːr, doːra, duːr, dwaːn, dʔawaː, faras, findikʷ, finʤan, finʤaːn, firha, firkaːk, firʔa, gab, gabal, gabaːteː, gahwat, gamiːs, ganaːj, garb, gaw, gaɖʔa, ginh, ginha, ginʔ, girma, gʷargʷadi, gʷaːb, gʷbi, gʷʔanaːti, hajʔa, halak, halaka, halla, hamoː, handi, hanʤar, harnaːjeːt, harroː, hasir, hawat, hawlijaːj, haˈwaːd, haːda~doːjeː, haːl, haːʃ, heːlaj, hi, his, hoː, hoːb, hus, iːjʔa, iːjʔaː, i̠ːjʔaː, jad, jaf, jam, jhaːm, jinaː, jiwaːʃi, kam, kantuːr, karaːj, karaːma, karaːmaː, kaːm, kiraːj, kiʃja, kjas, koːba, koːlaj, koːma, kʷinha, liːlaːw, liːli, luːbja, madar, magʷal, majʔa, mana, manan, mangaːj, manniimti, maːl, mbaːba, mbiɖeːj, mbʔaɖ, mbʔi, meːk, meːs, mha, mhallaga, mhawaj, mhijeː, mhiːn, mijaːd, mijʔat, mindikʷijaːj, mirkʷaːj, mirʔafi, misuːs, mittia, miʃʔari, miːmaʃa, mʔakʷara, mʔam, mʔari, mʔariː, mʔaːdami, na, nafara, nafs, naweː, naː, nda, ndeː, ndi, nfʔa, ngirab, nifri, nihaːs, noːs, nʔandaː, rabameːkaːji, ragad, ragada, raw, raːw, rba, reːr, reːw, rhat, riba, rifkaːk, sak, samaːr, sana, saraːt, saroːj, sasuːbajaː, siganfoːj, sijaːm, sikka, sitoːboːj, siʤin, sjaːm, suːfa, suːg, suːr, tak, takat, taktʔi, taktʔiː, talga, tam, tarabeː, tarʤimaːl, tijoː, tirig, tji, trig, tʔiit, wanas, wari, wast, waʤʤa, waːw, weːnaː, wjaː, xadaːra, xaddam, xawaːʤa, zirʔa, ɖa~ɖib, ɖiwa, ʃa, ʃabaka, ʃakeː, ʃaki, ʃakʷiːn, ʃamat, ʃanha, ʃartija, ʃawweː, ʃawwia, ʃaː, ʃaːk, ʃibibat, ʃinhat, ʃkaːm, ʃuːk, ʃʔaː, ʈibin, ʈiːn, ʈʔa, ʔaba, ʔabaː, ʔadeː, ʔadi, ʔagja, ʔaj, ʔajajdhaja, ʔajaːj, ʔalaːma, ʔalba, ʔamaːr, ʔamuːl, ʔangʷil, ʔani, ʔannuːr, ʔanoː, ʔar, ʔarabijaːj, ʔaraw, ʔaraːw, ʔarːbi, ʔasir, ʔaweː, ʔawi, ʔaːda, ʔaːdeː, ʔaːmanaːj, ʔaːrbi, ʔaːrbiː, ʔaːʃoː, ʔeːga, ʔeːgirim, ʔeːgrim, ʔeːtrig, ʔidda, ʔimir, ʔiʃa, ʔiʤir, ʔiːbaːb, ʔiːbaːbkina, ʔiːd, ʔoːr, ʔoːraj, ʔoːt, ʤabanaː, ʤaːntaːji, ʤimʔa, ʤineːnaː, ʤinsa, ʤoːharaaːji, ʤoːharajaːj.

NOUN occurs with 4 features: Gender (685; 77% instances), Number (89; 10% instances), ExtPos (2; 0% instances), Foreign (2; 0% instances)

NOUN occurs with 6 feature-value pairs: ExtPos=ADV, Foreign=Yes, Gender=Fem, Gender=Masc, Number=Coll, Number=Plur

NOUN occurs with 10 feature combinations. The most frequent feature combination is Gender=Masc (443 tokens). Examples: tak, mhiːn, doːr, jhaːm, mijʔat, jam, bhar, heːlaj, gaw, handi

Relations

NOUN nodes are attached to their parents using 21 different relations: obj (292; 33% instances), nsubj (186; 21% instances), dep:comp (185; 21% instances), dep:conj (34; 4% instances), obl:mod (33; 4% instances), nmod (28; 3% instances), dislocated:obj (26; 3% instances), obl:arg (21; 2% instances), dislocated:subj (20; 2% instances), reparandum (17; 2% instances), root (11; 1% instances), xcomp (10; 1% instances), parataxis (7; 1% instances), dislocated (5; 1% instances), vocative (5; 1% instances), acl:relcl (4; 0% instances), fixed (4; 0% instances), advmod (2; 0% instances), appos (2; 0% instances), discourse (1; 0% instances), parataxis:parenth (1; 0% instances)

Parents of NOUN nodes belong to 11 different parts of speech: VERB (599; 67% instances), ADP (170; 19% instances), NOUN (72; 8% instances), SCONJ (16; 2% instances), (11; 1% instances), ADJ (9; 1% instances), X (9; 1% instances), AUX (3; 0% instances), PROPN (3; 0% instances), INTJ (1; 0% instances), PRON (1; 0% instances)

89 (10%) NOUN nodes are leaves.

334 (37%) NOUN nodes have one child.

261 (29%) NOUN nodes have two children.

210 (23%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 9.

Children of NOUN nodes are attached using 30 different relations: det (734; 47% instances), punct (213; 14% instances), nmod:poss (127; 8% instances), acl:relcl (111; 7% instances), nmod (73; 5% instances), discourse (40; 3% instances), amod (38; 2% instances), dep (36; 2% instances), cc (32; 2% instances), advmod (30; 2% instances), dep:conj (27; 2% instances), case (16; 1% instances), cop (16; 1% instances), reparandum (13; 1% instances), dep:comp (12; 1% instances), nummod (8; 1% instances), acl (7; 0% instances), nsubj (7; 0% instances), appos (6; 0% instances), aux (4; 0% instances), dislocated:mod (4; 0% instances), obj (4; 0% instances), obl:arg (4; 0% instances), vocative (4; 0% instances), fixed (2; 0% instances), csubj (1; 0% instances), dislocated:subj (1; 0% instances), obl:mod (1; 0% instances), parataxis (1; 0% instances), parataxis:parenth (1; 0% instances)

Children of NOUN nodes belong to 16 different parts of speech: DET (738; 47% instances), PUNCT (213; 14% instances), PRON (150; 10% instances), NOUN (72; 5% instances), SCONJ (68; 4% instances), VERB (62; 4% instances), ADP (59; 4% instances), ADJ (54; 3% instances), PART (45; 3% instances), CCONJ (30; 2% instances), ADV (22; 1% instances), AUX (21; 1% instances), NUM (17; 1% instances), INTJ (9; 1% instances), PROPN (8; 1% instances), X (5; 0% instances)

Treebank Statistics: UD_Beja-NSC: POS Tags: NOUN

Morphology

Relations

Treebank Statistics: UD_Beja-NSC: POS Tags: `NOUN`