Model of alignment is that we align only one letter to any given phone. Therefore several letters are omitted from the alignment. Even obvious digraphs such as <th> /θ/ have one aligned letter and one unaligned one: [θ=t][=h]. These letters can be handled by setting the Insertion penalty to 0.
Preferably only one phoneme is aligned with a given letter, when that is at all reasonable; but there are exceptional cases such as /wən/ <one> where blindly following that rule would be silly; thus [wə=o].
We try to make commonsense assignments. In case of doubt, the leftmost letter is assumed to bear the main weight of expressing the sound. This seems to be a guiding principle of the spelling system; e.g., [θ=t][=h] makes more sense than [=t][θ=h], because <h> appears after several letters to modify their basic sound). In a few cases it doesn't seem historically accurate, e.g. -ation gets the schwa assigned to the <i>; but hard to say how people interpret that nowadays.
stimulus | response | penalty |
---|---|---|
b | b | 0 |
d | d | 0 |
d | t | 0 |
dʒ | d | 0 |
dʒ | g | 0 |
dʒ | j | 0 |
e | a | 0 |
e | e | 0 |
f | f | 0 |
f | g | 0 |
f | p | 0 |
f | v | 0 |
g | g | 0 |
gz | x | 0 |
gʒ | x | 0 |
ɡ | g | 0 |
ɡz | x | 0 |
ɡʒ | x | 0 |
h | h | 0 |
h | j | 0 |
h | x | 0 |
i | a | 0 |
i | e | 0 |
i | i | 0 |
i | y | 0 |
j | e | 0 |
j | i | 0 |
j | j | 0 |
j | u | 0 |
j | y | 0 |
ji | i | 0 |
ju | u | 0 |
juw | u | 0 |
jə | u | 0 |
jəw | u | 0 |
jʊ | u | 0 |
k | c | 0 |
k | g | 0 |
k | k | 0 |
k | q | 0 |
ks | x | 0 |
kʃ | x | 0 |
l | l | 0 |
m | m | 0 |
m | n | 0 |
mp | m | 0 |
n | n | 0 |
o | a | 0 |
o | e | 0 |
o | g | 0 |
o | o | 0 |
o | u | 0 |
p | p | 0 |
r | r | 0 |
s | c | 0 |
s | s | 0 |
s | z | 0 |
t | d | 0 |
t | t | 0 |
ts | z | 0 |
tʃ | c | 0 |
tʃ | s | 0 |
tʃ | t | 0 |
tθ | t | 0 |
u | o | 0 |
u | u | 0 |
u | w | 0 |
uw | u | 0 |
v | f | 0 |
v | p | 0 |
v | v | 0 |
v | w | 0 |
w | o | 0 |
w | u | 0 |
w | w | 0 |
wə | o | 0 |
z | s | 0 |
z | x | 0 |
z | z | 0 |
æ | a | 0 |
æ | e | 0 |
ð | d | 0 |
ð | t | 0 |
ŋ | n | 0 |
ɑ | a | 0 |
ɑ | e | 0 |
ɑ | i | 0 |
ɑ | o | 0 |
ɑj | i | 0 |
ɑj | y | 0 |
ɔ | a | 0 |
ɔ | e | 0 |
ɔ | o | 0 |
ə | a | 0 |
ə | c | 0 |
ə | e | 0 |
ə | i | 0 |
ə | o | 0 |
ə | u | 0 |
ə | y | 0 |
ək | c | 0 |
əl | l | 0 |
əm | m | 0 |
əw | u | 0 |
ɚ | l | 0 |
ɚ | r | 0 |
ɛ | a | 0 |
ɛ | e | 0 |
ɛ | i | 0 |
ɛ | u | 0 |
ɪ | a | 0 |
ɪ | e | 0 |
ɪ | i | 0 |
ɪ | o | 0 |
ɪ | u | 0 |
ɪ | y | 0 |
ʃ | c | 0 |
ʃ | s | 0 |
ʃ | t | 0 |
ʊ | o | 0 |
ʊ | u | 0 |
ʒ | g | 0 |
ʒ | j | 0 |
ʒ | s | 0 |
ʒ | t | 0 |
ʒ | z | 0 |
θ | t | 0 |
This list is based on words from Zeno et al.’s Educators Word Frequency Guide. It uses the words where the WFG breaks the statistics down by grade, and the word is found in one of the grades between kindergarten and Grade 12, inclusive. Pronunciations were derived from the Carnegiue Mellon Pronouncing Dictionary. An alignment program was used to give the most plausible single-letter alignments.
This table gives for each alignment a sample word, as well as a token (frequency-weighted) count of the number of words that use that correspondence.
stimulus | response | frequency | example | |
---|---|---|---|---|
b | b | 2640 | æb | ab |
d | d | 6933 | əbændən | abandon |
d | t | 1708 | əbriviedəd,abbreviated | |
dʒ | d | 246 | ædʒrɛs,address | |
dʒ | g | 857 | æbərɪdʒəniz,aborigines | |
dʒ | j | 350 | ədʒɑr,ajar | |
e | a | 2497 | e,a | |
e | e | 130 | æləgeni,allegheny | |
f | f | 2184 | edɑlf,adolf | |
f | g | 26 | kɑf,cough | |
f | p | 226 | ælfə,alpha | |
f | v | 1 | krustʃɔf,khrushchev | |
g | g | 1465 | æbəgel,abigail | |
gz | x | 61 | æləgzændɚ,alexander | |
gʒ | x | 3 | ləgʒɚiz,luxuries | |
h | h | 978 | ebrəhæm,abraham | |
h | j | 10 | bɑhɑ,baja | |
h | x | 1 | kihodi,quixote | |
i | a | 20 | ældʒi,algae | |
i | e | 2112 | æbi,abbey | |
i | i | 1096 | əbriviedəd,abbreviated | |
i | y | 1852 | əbɪlədi,ability | |
j | e | 17 | bɑrθɑləmju,bartholomew | |
j | i | 274 | ədʒojnɪŋ,adjoining | |
j | j | 1 | johɑn,johann | |
j | u | 22 | ækjɚəsi,accuracy | |
j | y | 235 | æloj,alloy | |
ji | i | 1 | nɑjiv,naive | |
ju | u | 372 | əbjus,abuse | |
juw | u | 5 | fɛbjuwɛri,february | |
jə | u | 177 | əkjumjəlet,accumulate | |
jəw | u | 7 | ɪvækjəweʃən,evacuation | |
jʊ | u | 51 | bjʊro,bureau | |
k | c | 4760 | æbstrækt,abstract | |
k | g | 13 | lɛŋkθ,length | |
k | k | 922 | əkɪn,akin | |
k | q | 250 | ædəkwət,adequate | |
ks | x | 405 | æləks,alex | |
kʃ | x | 6 | æŋkʃəs,anxious | |
l | l | 7072 | æbdɑmənəl,abdominal | |
m | m | 4416 | æbdomən,abdomen | |
m | n | 4 | græmpɑ,grandpa | |
mp | m | 4 | mɛmpfəs,memphis | |
n | n | 10741 | ɛrən,aaron | |
o | a | 111 | əword,award | |
o | e | 5 | frojd,freud | |
o | g | 1 | ɛdənbɚo,edinburgh | |
o | o | 2884 | æbdomən,abdomen | |
o | u | 1 | pordə,puerto | |
p | p | 4530 | əbrəpt,abrupt | |
r | r | 8873 | ɛrən,aaron | |
s | c | 1361 | æbsəns,absence | |
s | s | 7491 | æbəlɪʃənəst,abolitionist | |
s | z | 11 | blɪts,blitz | |
t | d | 462 | əbɑlɪʃt,abolished | |
t | t | 8006 | æbət,abbot | |
ts | z | 6 | lərɛnts,lorenz | |
tʃ | c | 508 | ətʃiv,achieve | |
tʃ | s | 5 | ɪkspæntʃən,expansion | |
tʃ | t | 803 | æktʃrəs,actress | |
tθ | t | 1 | etθ,eighth | |
u | o | 333 | æftɚnun,afternoon | |
u | u | 610 | æbsəlut,absolute | |
u | w | 110 | ændru,andrew | |
uw | u | 4 | fɛbruwɛri,february | |
v | f | 3 | əv,of | |
v | p | 1 | stivən,stephen | |
v | v | 2017 | əbriviedəd,abbreviated | |
v | w | 1 | vɑgnɚ,wagner | |
w | o | 14 | ɑntwɑn,antoine | |
w | u | 636 | əbɑwt,about | |
w | w | 1116 | æftɚwɚd,afterward | |
wə | o | 10 | ɛniwən,anyone | |
z | s | 4504 | əbrivieʃənz,abbreviations | |
z | x | 2 | æŋzɑjədiz,anxieties | |
z | z | 330 | ægənɑjzɪŋ,agonizing | |
æ | a | 3381 | æb,ab | |
æ | e | 7 | ʃɑjæn,cheyenne | |
ð | d | 1 | fluðɚ,fflewddur | |
ð | t | 157 | ɔlðo,although | |
ŋ | n | 2334 | əbzorbɪŋ,absorbing | |
ɑ | a | 942 | əfɑr,afar | |
ɑ | e | 34 | ʃɑjæn,cheyenne | |
ɑ | i | 17 | ɑntwɑn,antoine | |
ɑ | o | 2045 | æbdɑmənəl,abdominal | |
ɑj | i | 1713 | əbɑjd,abide | |
ɑj | y | 219 | ælɑj,ally | |
ɔ | a | 417 | ɛrənɔdɪks,aeronautics | |
ɔ | e | 1 | krustʃɔf,khrushchev | |
ɔ | o | 243 | əbrɔd,abroad | |
ə | a | 3606 | ə,a | |
ə | c | 4 | məkɑrθi,mccarthy | |
ə | e | 2765 | əbriviedəd,abbreviated | |
ə | i | 1578 | æbdɑmənəl,abdominal | |
ə | o | 2966 | ɛrən,aaron | |
ə | u | 1889 | əbrəpt,abrupt | |
ə | y | 31 | ənæləsəs,analysis | |
ək | c | 1 | məkdɑwəl,mcdowell | |
əl | l | 639 | ebəl,able | |
əm | m | 37 | ælkəhɔlɪzəm,alcoholism | |
əw | u | 28 | æktʃəwəl,actual | |
ɚ | l | 1 | kɚnəl,colonel | |
ɚ | r | 4044 | əbsɚd,absurd | |
ɛ | a | 469 | ɛrən,aaron | |
ɛ | e | 3825 | əbrɛst,abreast | |
ɛ | i | 18 | frɛnd,friend | |
ɛ | u | 5 | bɛriəl,burial | |
ɪ | a | 223 | ækjɚɪt,accurate | |
ɪ | e | 3895 | əbriviedɪd,abbreviated | |
ɪ | i | 7081 | əbɪlədiz,abilities | |
ɪ | o | 30 | æbɪt,abbot | |
ɪ | u | 20 | bɪzid,busied | |
ɪ | y | 151 | əbɪs,abyss | |
ʃ | c | 248 | æpəleʃən,appalachian | |
ʃ | s | 708 | əbɑlɪʃ,abolish | |
ʃ | t | 856 | əbrivieʃən,abbreviation | |
ʊ | o | 156 | ədəlthʊd,adulthood | |
ʊ | u | 130 | æmbʊʃ,ambush | |
ʒ | g | 86 | æknɑlɪdʒ,acknowledge | |
ʒ | j | 12 | ədʒesənt,adjacent | |
ʒ | s | 108 | eʒə,asia | |
ʒ | t | 2 | ɪkweʒən,equation | |
ʒ | z | 1 | siʒɚ,seizure | |
θ | t | 520 | ɛsθɛdɪk,aesthetic |