These alignments are designed for very early spellers of Czech. Stimuli should be presented in IPA, as exemplified in the tables below.
The model of alignment is that we align only one letter to any given phone. Therefore several letters are omitted from the alignment. Even obvious digraphs such as <ch> /x/ have one aligned letter and one unaligned one: [c=x][=h]. These letters can be handled by setting the Insertion penalty to 0.
Where possible, only one phoneme is aligned with a given letter. Affricates are treated as a single phoneme and are so indicated by using a tie bar: e.g. /t͡ʃ/ is one phoneme. Occasionally, two phonemes have to be aligned with a single letter to make the whole word parse. This happens primarily with foreign words, but notably the native sequence <mě> /mɲe/ requires polyphonic alignment.
Spellings found in the Weslalex Czech corpus have a penalty of 0. Other alignments are scored as follows:
This table gives for each alignment a sample word:
stimulus | response | penalty | example |
---|---|---|---|
a | a | 0 | adresa ‘address’ |
aː | á | 0 | já ‘I’ |
aː | a | 0 | piano |
b | b | 0 | bílý ‘white’ |
b | p | .1 | voiceless b |
c | ť | 0 | chuť ‘appetite’ |
c | t | 0 | ti ‘you’ |
c | ď | 0 | hruď ‘chest’ |
c | d | .1 | ď sans diacritic |
d | d | 0 | dobrý ‘good’ |
d | t | 0 | věštba |
d͡z | dz | 0 | podzimní ‘autumn’ |
d͡z | d | .2 | dz sans z |
d͡z | z | .2 | dz sans d |
d͡z | c | .1 | voiceless |
d͡z | ts | .2 | separated c |
d͡z | t | .4 | ts sans s |
d͡z | s | .4 | ts sans t |
d͡z | ds | .1 | half-voiceless |
d͡z | tz | .1 | half-voiceless |
d͡ʒ | dž | 0 | džem ‘jam’ |
d͡ʒ | dz | .1 | dž sans diacritic |
d͡ʒ | d | .2 | dž sans ž |
d͡ʒ | ž | .2 | dž sans d |
d͡ʒ | z | .3 | ž sans diacritic |
d͡ʒ | č | .1 | voiceless |
d͡ʒ | c | .2 | č sans diacritic |
d͡ʒ | tš | .2 | separated č |
d͡ʒ | ts | .3 | tš sans diacritic |
d͡ʒ | t | .4 | tš sans š |
d͡ʒ | š | .4 | tš sans t |
d͡ʒ | s | .5 | š sans diacritic |
d͡ʒ | dš | .1 | half voiceless |
d͡ʒ | ds | .2 | dš sans diacritic |
e | e | 0 | ekonom ‘economist’ |
eː | é | 0 | dobré ‘good’ |
eː | e | 0 | tematika |
f | f | 0 | fax ‘fax’ |
f | v | 0 | ovce ‘sheep’ |
ɡ | g | 0 | gól ‘goal’ |
ɡ | k | 0 | kdo ‘who’ |
ɡz | x | 0 | existence ‘existence’ |
h | h | 0 | harfa ‘harp’ |
i | i | 0 | idea ‘idea’ |
i | y | 0 | vydra ‘otter’ |
iː | í | 0 | jíst ‘eat’ |
iː | i | 0 | arktida ‘arctic’ |
iː | ý | 0 | výr ‘owl’ |
iː | y | 0 | ? |
ij | i | 0 | viola ‘viola’ |
j | j | 0 | jablko ‘apple’ |
je | ě | 0 | pět ‘five’ |
je | e | .1 | ě sans diacritic |
ɟ | ď | 0 | ďábel ‘devil’ |
ɟ | d | 0 | díky ‘thanks’ |
ɟ | ť | .1 | voiceless |
ɟ | t | .2 | ť sans diacritic |
k | g | 0 | biolog ‘biologist’ |
k | k | 0 | kniha ‘book’ |
ks | x | 0 | text ‘text’ |
l | l | 0 | létat ‘to fly’ |
l̩ | l | 0 | vlk ‘wolf’ |
m | m | 0 | matka ‘mother’ |
m̩ | m | 0 | osm ‘eight’ |
n | n | 0 | nádoba ‘vessel’, čítanka ‘reader’ |
ɲ | ň | 0 | kůň ‘horse’ |
ɲ | n | 0 | něco ‘something’ |
ɲe | ě | 0 | mě ‘me’ |
ɲe | e | .1 | ě sans diacritic |
o | o | 0 | otřes ‘tremor’ |
oː | ó | 0 | gól ‘goal’ |
oː | o | 0 | Cecioni |
p | p | 0 | plat ‘pay’ |
p | b | 0 | obchod ‘business’, chléb ‘bread’ |
r | r | 0 | roztok ‘solution’ |
r̝ | ř | 0 | hřát ‘to warm’ |
r̝ | r | .1 | ř sans diacritic |
r̩ | r | 0 | krk ‘neck’ |
s | s | 0 | směs ‘mixture’ |
s | z | 0 | zpívat ‘sing’ |
ʃ | š | 0 | štěstí ‘happiness’ |
ʃ | s | .1 | š without diacritic |
ʃ | ž | 0 | lež ‘lie’ |
ʃ | z | .1 | ž without diacritic |
t | t | 0 | tráva ‘grass’ |
t | d | 0 | odpadnout ‘fall away’ |
t͡s | c | 0 | co ‘what’ |
t͡s | dz | .1 | voiced |
t͡s | d | .3 | dz sans z |
t͡s | z | .3 | dz sans d |
t͡s | ts | .1 | split affricate |
t͡s | t | .3 | ts sans s |
t͡s | s | .3 | ts sans t |
t͡ʃ | č | 0 | čeština ‘Czech’ |
t͡ʃ | c | .1 | č without diacritic |
t͡ʃ | tš | .1 | split affricate |
t͡ʃ | ts | .2 | tš sans diacritic |
t͡ʃ | t | .3 | tš sans š |
t͡ʃ | š | .3 | tš sans t |
t͡ʃ | s | .4 | š sans diacritic |
t͡ʃ | dž | 0 | poněvadž ‘because’ |
t͡ʃ | dz | .1 | dž without diacritic |
t͡ʃ | d | .2 | dž sans ž |
t͡ʃ | ž | .2 | dž sans d |
t͡ʃ | z | .3 | ž sans diacritic |
u | u | 0 | učesat ‘comb’ |
uː | ú | 0 | účes ‘hair style’ |
uː | ů | 0 | kůň ‘horse’ |
uː | u | 0 | ? |
v | v | 0 | vejce ‘egg’ |
v | f | .1 | voiceless |
w | u | 0 | soud ‘court’, auto ‘car’ |
x | ch | 0 | chyba ‘mistake’ |
x | c | .2 | part of ch |
x | h | 0 | lehký ‘light’ |
z | z | 0 | zajíc ‘hare’ |
z | s | 0 | prosba ‘request’, Josef |
ʒ | ž | 0 | žena ‘woman’ |
ʒ | z | .1 | ž without diacritic |
ʒ | š | 0 | věštba |
ʒ | s | .1 | š without diacritic |