For full functionality of Sketch Engine it is necessary to
enable JavaScript
corbama-brut
corbama-net-non-tonal
corbama-net-tonal
corbama-ud
corbamafara
corfarabama
corbama-net-non-tonal
defaults
Reset settings
Home
Search
Word list
Corpus info
My jobs
User guide
All words
All lemmas
Find x
Menu position
This action may take several minutes for large corpora, please wait.
Word list options
Corpus:
corbama-brut
corbama-net-non-tonal
corbama-net-tonal
corbama-ud
corbamafara
corfarabama
Subcorpus:
create new
Search attribute:
word
lemma
tag
gloss
parts
original
tonal
polisemy
tagstring
doc.id
doc.wordcount
doc.text_genre
doc.source_type
doc.source_year
doc.text_translation
doc.text_medium
doc.author_name
doc.text_title
use n-grams
. Value of n: from
2
3
4
5
6
to
2
3
4
5
6
hide/nest sub-n-grams
Filter options:
Filter word list by:
Regular expression:
Minimum frequency:
Maximum frequency:
(0 = no maximum frequency)
Whitelist:
Blacklist:
format
Word list whitelists and blacklists must be plain text (.txt), encoded in UTF-8, with one item per line. The items must correspond to the selected attribute, so, eg, if 'lemma' is selected from the attribute menu, then the list should be a list of lemmas. We use exact matching, not regular-expression matching, for file input.
Include non-words
Output options:
Frequency figures:
Hit counts
Document counts
ARF
Output type:
Simple
Keywords
Reference (sub)corpus
corbama-net-non-tonal
corbama-brut
corbama-net-tonal
corbama-ud
corbamafara
corfarabama
(whole corpus)
Prefer:
rare words
common words
Change output attribute(s)
---
word
lemma
tag
gloss
parts
original
tonal
polisemy
tagstring
---
word
lemma
tag
gloss
parts
original
tonal
polisemy
tagstring
---
word
lemma
tag
gloss
parts
original
tonal
polisemy
tagstring
You can select one or more output attributes. Please note that this option can be time-consuming.