logo ELT Concourse teacher training
Concourse 2



If anything in the first part of this guide is unfamiliar to you, you should probably take a little time to refresh your memory concerning the essential concepts in phonology.  You can open that guide in a new tab by clicking here.

Two questions:

  1. Can you define 'consonant'?
  2. What are the consonant sounds of English?

Click here when you have an answer.

Only 7 of the 24 sounds need a special symbol to represent them.  It is quite a simple matter to learn how to read and write the phonemic script for consonant sounds.  The only ones which differ from the letters of the Latin alphabet are:

You will need to learn these seven easily to understand what follows.

There are some other things to note:

Now we can go on to /'klæ.sɪ.faɪ.ɪŋ 'kɒn.sə.nənts/.


Classifying consonants

There are three areas to consider when classifying consonant sounds:

  1. Voice
  2. Place of articulation
  3. Manner of articulation



Voicing describes how phonemes may be different depending on whether the vocal cords vibrate or not at the time of pronunciation.  For example, the /k/ sound is made without voicing but the /ɡ/ sound is made with the mouth parts in the same place but with voice added.  If you put your hand on your throat and say the words sue and zoo, you will see what is meant and feel a slight vibration on the second word (/s/ is unvoiced but /z/ is voiced).
The same phenomenon is noticeable when saying log vs. lock (although the voicing of the /ɡ/ in the first is less obvious).

Sixteen of the consonant phonemes form voiced / unvoiced pairings:

Unvoiced Voiced Minimal pairs
/p/ /b/ pat vs. bat
/tʃ/ /dʒ/ chin vs. gin
/f/ /v/ fan vs. van
/s/ /z/ sip vs. zip
/k/ /ɡ/ cut vs. gut
/t/ /d/ tab vs. dab
/θ/ /ð/ loath vs. loathe
/ʃ/ /ʒ/ leash on vs. lesion

Voicing is not a digital, on-off phenomenon; it exists on a cline from fully voiced to fully unvoiced.  In some circumstances, the consonants normally considered voiced are only partially voiced and, more rarely and in very rapid speech, not voiced at all.
In initial and final positions, as in words like had, sob, dig, do, be and go, the consonants /d/, /b/ and /ɡ/ are only partially voiced but in the mid-position, as in words like ladder, rubber and bigger, voicing is more pronounced.
This variation in the level of voicing has led some to use two different terms for the phenomenon:

  1. fortis (meaning strong) which alludes to the fact that unvoiced consonants are, allegedly, pronounced with more energy.  The consonants /p/, /t/ and /k/ are described as fortis consonants.
  2. lenis (meaning weak) alludes to the opposite phenomenon of the consonants /b/, /d/ and /ɡ/ which are variable in the amount of voicing they take and often produced with little force.

What this implies is that phonemes are a way of digitalising the information.  Although a sound may, in fact, be very variably pronounced (shouted, whispered, mumbled etc.) and may be affected by its environment vis-à-vis other sounds, it will still be instantly recognisable by a native speaker of a language.  Phonemes are, in other words, sets of allophones, not simple sounds.

There are two other things to know about any consonant:

  1. Where is it pronounced?  This is called place of articulation
  2. How is it pronounced?  This is called manner of articulation


Place of articulation

To figure this out, we need to do a bit of physiology to get the terms right.  As you read the following guide, move your tongue around to identify the parts we are talking about.  Technically, the various parts you identify are called articulators.

  1. Start at the front of your mouth, where it meets the outside world and you have found your lips.  Sounds which require the use of your lips are called labial.  Sounds which require both lips are called bilabial.  An example is the /m/ sound in member.
  2. Behind your lips are your teeth and sounds produced here are, unsurprisingly, called dental.  An example is the th (/ð/) sound in that.
  3. Behind your top front teeth, there is a bony ridge called the alveolar ridge and sounds produced here are called alveolar.  An example is the /t/ sound in teeth.
  4. Behind that, the roof of the mouth has two sections:
    1. the hard palate (where palatal sounds are made) to the front.  An example is the sh (/ʃ/) sound in ship.
    2. the soft palate or vellum to the rear (where we make velar sounds).  An example is the /k/ sound in cake.
  5. Your tongue can reach no further but pause to note that the tongue has three areas: the tip, the front and the back.
  6. At the back of your mouth is a teardrop-shaped fleshy part called the uvula.  It is, unsurprisingly, where uvular sounds are made.
  7. Right at the back of the mouth is the glottis where we make glottal sounds.  The only true glottal in English is the /h/ in, for example, horse.  In rapid speech and some varieties of English, there is also the glottal stop (/ʔ/), however, that appears when a consonant is dropped as in, e.g., the Scots and Southern British pronunciation of better as be'er (/ˈbe.ʔə/ rather than /ˈbe.tə/.
  8. The nasal cavity which is connected to the mouth and involved in nasal sounds.  An example is the sound on ng (/ŋ/)in swing.

Here's a picture:

vocal tract

A copy of that diagram is available.  Download it here.

Now pronounce some consonants and see if you can identify which parts of the mouth are involved in making the sounds.  Can you put the following sounds in the table?
/s/ as in seem
/t/ as in tent
/f/ as in fine
/ɡ/ as in gone
/θ/ as in think
/l/ as in link
/ŋ/ as in sing
/w/ as in went
/h/ happy
/p/ as in pin
/ʃ/ as in shine
In the third column, put in your best guess at the adjective for the type of sound.
You can download a printable version of this and the next activity here.

Click on the table to get the right answer.

place of articulation table


Manner of articulation

There is, unfortunately, no universally recognised system to describe how sounds are produced.  However, English sounds are all produced pulmonically (i.e., by expelling air) and by restricting the airflow in some way.

Stops or plosives
These sounds are produced by completely blocking the air flow and then releasing the blockage.  For example, to produce a /p/ sound, we close both lips, let a little breath build up and then release it by opening the lips.  These sounds can't be made continuously.
There are four phases to their production:
  1. the articulators are closed (e.g., the lips are pressed together for /p/)
  2. the air behind the articulators is compressed
  3. the articulators are moved apart to allow the air to be released
  4. the air, once released, often makes an audible sound or aspiration.  That is the difference between the sound of the 't' and 'p' in top [/thɒp/] and pat [/phæt/].
English has seven plosive consonants: /p/, /b/, /t/, /d/, /k/, /ɡ/ and /ʔ/.  The last of these is called the glottal plosive and is often an alternative to /p/, /t/ and /k/.
/p/ and /b/ are bilabial, formed by both lips, and the second is voiced.
/t/ and /d/ are alveolar, formed by the tongue pressing against the alveolar ridge (not the teeth), and the second is voiced.
/k/ and /ɡ/ are velar, formed by the back of the tongue pressing against the juncture of the hard and soft palate, and the second is voiced.
The glottal plosive (/ʔ/) is also known as a glottal stop (because the airflow is entirely blocked) and is voiceless.  It has to be voiceless because it is formed by compressing the vocal tract entirely and holding the vocal folds rigid.  It occurs in many words, often replacing a plosive as in the London and Scots pronunciation of butter which may be transcribed as /ˈbʌʔ.ə/ with the /t/ plosive replaced by the glottal.
It is not always a signal of non-standard speech patterns as, in rapid speech, the stop is commonly used.
To make these sounds, we close off the airflow (as we do for plosives) but allow the air to enter and flow out through the nasal cavity.  Try saying /ŋ/ and feeling how the soft palate is lowered to allow air to flow upwards.
There are three nasal consonants in English, two of which, /m/ and /n/, cause few problems because they are common to many languages.
The third, /ŋ/, does cause difficulties because it is quite unusual.
It never occurs initially in English.
It occurs frequently in mid-position but is only pronounced as /ŋ/ when the morphology of the word allows it.  It is pronounced /ŋ/ in bringer [/brɪŋ.ə/] because the word is formed from bring + er but it is not pronounced that way in finger [/ˈfɪn.ɡə/] because the word is morphologically different, and not formed from fing + er.
In other words, when it occurs at the end of a morpheme 'ng' is pronounced as /ŋ/ but in other circumstances, 'ng' is pronounced /nɡ/.
To make these sounds, the air flow is not completely cut but is restricted with air flowing continuously and turbulently between two mouth parts.  What you hear is the result of friction, hence the name.  The term sibilant is used to refer to  the sounds such as /s/ and /z/ which are produced by allowing the air to flow across the tip of the tongue between it and the alveolar ridge.
Fricatives include:
The fricatives in English are:
  • labiodental fricatives:
    /f/ and /v/ formed by the lips and top teeth.  The second is voiced.  Compare fine and vine.
  • dental fricatives:
    /θ/ and /ð/ formed by the tongue touching the teeth.  The second is voiced.  Compare breath and breathe.
  • alveolar fricatives:
    /s/ and /z/ formed as sibilants with the air compressed between between the tongue and the alveolar ridge.  The second is voiced.  Compare bus and buzz.
  • palatal or post-alveolar fricatives:
    /ʃ/ and /ʒ/ formed by the tongue compressing the air slightly further back.  The second is voiced.  Compare mesh and measure.
  • glottal fricatives:
    /h/ formed by air compressed in the glottis.  It is not voiced in English but a voiced equivalent exists in some languages, including Basque, Chinese, Czech, Finish, Korean, Polish, Portuguese, Romanian, Slovak and Slovene.  The sound is usually transcribed as [ɦ] and occurs, incidentally, in some South African speakers' production.  Speakers of those languages may be tempted to insert it into English and, although this rarely causes comprehension issues, it contributes to a foreign accent.
  • velar and uvular fricatives:
    The fricative /x/ is not usually included in a list of standard English consonants but appears on words of Scots and Welsh origin and in South African varieties where the words are borrowed from Afrikaans or Xhosa.
    The Scots sound at the end of, e.g., loch [lake] is a voiceless velar fricative but in Welsh the sound at the beginning of llyn [lake] is pronounced further back in the mouth with air flowing between the tongue and the uvula (the teardrop shaped bit at the back of your throat).
    English speakers will often substitute /k/.
    Other languages (such as German and Dutch) have a voiced version for the /x/ which is transcribed as /ɣ/.
These are formed as a combination of a plosive and a fricative.  First there is closure of the airflow but release is allowed in a restricted way, extending the sounds.  For example, the /dʒ/ sound in bridge is formed in this way.
There are only two affricatives in English:
/tʃ/ and /dʒ/ which have the /t/ the /d/ sounds produced a little further back than in /t/ and /d/ (so the phonemes are called palatal or post-alveolar).  The second is voiced.  Compare chump and jump.
These sounds are all voiced and are produced by small obstructions of the airflow.  There are two types of these:
Glides or semi-vowels: for example /w/ is  produced by slightly restricting the airflow by moving the tongue upwards.  It occurs at the beginning of the word would.
Lateral approximants: for example, the /l/ sound in English is formed by using the tongue to stop air moving directly forward and out and forcing it to run along the side of the tongue.  It occurs in the word lullaby in both its allophones.

There are some other ways to make sounds and languages are quite inventive.  These include trills (the Spanish rolled /r/) in which the tongue vibrates and flaps (for example, the 'dd' sound in madder in US English) when the airflow is momentarily interrupted.

Retroflex sounds

Retroflex sounds are formed in many languages with the tongue concave and/or curled back on itself to block the air flow, like this:
(Image adapted from Wikipedia)

For example:
    Russian and Polish have a retroflex /z/, transcribed as [ʐ].
    Hindi and other Indian languages have a retroflex /t/ transcribed as [ʈ].
    Swedish has both a retroflex /ŋ/ transcribed as [ɳ] and a retroflex /d/ transcribed as [ɖ].
    Chinese languages have a retroflex /s/ transcribed as [ʂ].
If speakers of these languages import the retroflex sounds into English it contributes greatly to a foreign accent.
It is usually helpful to make learners aware of the differences.


Markedness and phonemic substitution

Markedness in this sense refers to how widely consonants are represented in the world's languages.  That, it is sometimes averred, is a measure of how hard they are to acquire.  The common sounds will give few problems but consonants which are not represented in the learners' first language(s) will, understandably, cause significant problems.

There is evidence to suggest that the unvoiced consonant sounds, especially, /t/, /s/, /p/, and /k/ are common to nearly all languages and are, therefore, considered unmarked.  They should cause few learners any trouble at all except in terms of their allophonic varieties (with and without aspiration, retroflex or not).
The consonant /n/ is also an unmarked form which appears in many languages.
On the other hand, the equivalent voiced sounds, /d/, /z/, /b/ and /ɡ/ are marked in that they do not universally occur with anything like the same frequency so they require more attention as does the nasalised /ŋ/ which is also less common and causes some learners a good deal of difficulty.

Where a sound may occur also plays a role.  Final voiced consonants are rare in many languages, including German and Dutch, for example and this may tempt learners of those backgrounds to pronounce dog as dock, cab as cap, cadge as catch and so on.

There is a guide on this site to teaching troublesome sounds (new tab) which considers many of the more marked, i.e., less common, vowel sounds.


Allophones, reductions and regional variations

individuals vary
No two speakers pronounce all consonants in exactly the same way.  Individual speakers will also pronounce some consonants slightly differently depending on how they feel, how carefully they wish to speak and how quickly.  So, for example, we might pronounce take in:
    I want to take it home
as /teɪk/
with no aspiration on the /t/ sound and on another occasion, we might pronounce the word in
    Take it!
as /theɪk/
with the aspiration on the /t/ prominent.
/t/, /p/ and /k/ are all variously aspirated depending on the phonological environment in which they occur and the speaker's attitude.
Voicing, too, is variable with some individuals using more (because there is a cline from unvoiced to voiced, not an either-or distinction).  So, for example, some speakers may pronounce pub as /pʌb/ with a clearly voiced final consonant but others may reduce the amount of voicing until word approximates to /pʌp/.  Some may even remove the final consonant and substitute a glottal stop as in /pʌʔ/.  No-one will mistake the word, however it is pronounced, so we are dealing with allophonic variation.
the positions of consonants vary
Where a consonant occurs in a word may also affect how it is pronounced.  For example:
  1. /b/, /d/, /dʒ/ and /ɡ/ which are all voiced in most transcriptions may become wholly or partially de-voiced when they fall at the end of a word or phrase so, for example
        It's my job
    may be transcribed as
        /ɪts.maɪ.dʒɒp/ or /ɪts.maɪ.dʒɒb/
        I'll be the judge
    may be transcribed as
        /aɪl.bi.ðə.dʒʌtʃ/ or /aɪl.bi.ðə.dʒʌdʒ/
    The /ɡ/ sound is clearly voiced in, e.g.
    but much less so in
    so the first may be transcribed as
    and the second is nearer to (but not identical with)
    and so on.
  2. The /l/ sound also exhibits variations in what is called velarization (the amount it is pronounced by partial closure of the velum at the back of the mouth).  So for, e.g.:
        Let me go
    the transcription would be
    but the transcription of
        Let me fall
    will be:
    with a velarized final consonant (the so-called dark [ɫ]).
    In standard BrE, the sound is light (/l/) before a vowel and dark elsewhere but that disguises changes in connected speech because the sound will be light in pull it (/pʊl.ɪt/) but dark in pull that (/pʊɫ.ðæt/).  The transcription may safely be left as /l/ in all cases because it is simpler and we have a rule for the pronunciation of the allophones.
    Most native speakers of English are unaware of the two pronunciations of /l/ because they make no phonemic difference.  A Turkish speaker, in whose language the sounds are phonemes, will be much more aware of the distinction having been trained since childhood to recognise it.
  3. The /t/ sound often becomes glottalized when it occurs finally.  In other words, it is replaced by the stop /ʔ/.  So the transcription of
        I got it
    is either
    or, even
  4. The amount of aspiration is also dependent on the position of the consonant vis-à-vis other sounds.  We saw above that this aspiration affect /t/, /k/ and /p/ in particular.  When these sounds are the first in a word or the first in a stressed syllable, they are aspirated so the sounds followed by the elevated /h/ in the following will be aspirated:
        peter /ˈphiː.tə/
        tap /thæp/
        kill /khɪl/
    but will remain unaspirated in these:
        couple /ˈkʌp.l̩/
        hate /heɪt/
        sicken /ˈsɪkən/
    Because the sounds are not full phonemes in English, most speakers are unaware of the differences in pronunciation and may be surprised to discover it but to speakers of languages (such as Mandarin) where aspiration is a phonemic characteristic, the change in pronunciation will be very obvious because they have been brought up to recognise it.
    (In fact the phoneme /t/ has six possible pronunciations in English:
    At the end of a hat it is called an unreleased /t/ and transcribed phonetically as [t̚].
    At the beginning of task it is aspirated [th].
    It may be glottalised in, e.g., butter and got [ʔ].
    It may be flapped as in the AmE later [ɾ].
    It may be nasalised and flapped as in the AmE counter [ɾ̃] (because it is following a nasalised consonant /n/).
    It may just be a plain [t] sound as in stitch.)
reductions and elisions of consonants and clusters
When consonants occur in clusters such as at the end of a word like clothes (/kləʊðz/) there is a tendency in English to elide one of the consonants so the pronunciation is often as /kləʊz/ with the elision of the /ð/.  (If learners always say it that way, they will never be misunderstood and it's a good deal easier for them.)
Some clusters such as the one at the end of sixths, are simply difficult to pronounce.  The result is usually something like /sɪkθs/ or even /sɪkfs/.  Learners whose languages do not allow the same clusters as English are often tempted to use cluster reduction inappropriately, for example, pronouncing crisps as /krɪps/ rather than /krɪsps/.
It is usually /t/, /d/, /p/ and /k/ which are elided in this respect, so, for example:
    text message
becomes /teks.ˈme.sɪdʒ/
becomes /mɪst/
becomes /ɡlɪms/
    and asked can be pronounced /ˈɑːst/.
The same phenomenon is observable with the unvoiced /θ/ sound so asthma is pronounced as /ˈæ.smə/.
Occasionally, elision can become fixed in the language so, for example, the confection now known as ice cream was originally iced cream but the /t/ sound of the letter 'd' was routinely elided and the phrase took on its current spelling.
accents vary
Where people come from may also have a significant effect.  In some parts of Britain, for example, a final letter 'r' will be pronounced quite obviously so, e.g.
    My father is
will be pronounced as
by lots of people because the /r/ precedes the vowel, but many people will pronounce it as
without the /r/ sound  However, even those who do pronounce the /r/ would not pronounce
    He is my father
because there is no following vowel.
In Standard AmE, the /r/ is usually produced so the transcription is
with a syllabic /r/ as the final consonant and no preceding schwa.
Alternatively, the transcription appends a tiny /r/ to the vowel so we have, e.g., nurse transcribed not as /nɜːs/ but as /nɝːs/.
Another significant difference between Standard American and British is the pronunciation of the letter 't' when it occurs in the middle of words so, for example, we find:
Word British American
butter /ˈbʌt.̩ə/ /ˈbʌd.r̩/
Peter /ˈpiː.tə/ /ˈpiː.dər/
There are a few other significant (and some not very significant) variations in how consonants are pronounced between BrE and AmE.  For a list of the differences, see either the guide to teaching yourself to transcribe or download the PDF document for this area.  (Both those links open in new tabs.)
A regional difference in parts of Britain is that the central /t/ sound may be replaced by a glottal stop (/ˈbʌʔ.ə/ and /ˈpiː.ʔə/, respectively).
/hw/ vs. /w/
Now almost extinct except in some varieties of English spoken in Scotland, parts of Ireland and the southern United States, is a variant of /w/ usually transcribed as /hw/ (or you may see it as [ʍ]).  It appears at the beginning of words spelled wh- but has for almost all speakers of English now merged with /w/.  The result is that apart from a small minority of speakers, there is no distinction in pronunciation between weather and whether, wine and whine etc.  The merger is generally called the whine-wine merger.)


A summary and test

Now we have all three ways to classify the consonants and can describe them properly.  These three ways are:

  1. Voicing
  2. Place of articulation
  3. Manner of articulation

Can you complete this chart?  If you have your downloaded and printed activity sheet to hand, do it there.  If you would like to download that now, click here.  When you have filled in all the consonant sounds, click on the chart to reveal the answer.


The voiced consonants are in bold.
Notice, too, that /t/ and /d/ are alveolar stops in English, not dental sounds as they are in a range of other languages.  Making them dental sounds contributes to a foreign accent in English.

If you would like to hear these sounds, the ideal place to go has been kindly provided by the British Council.

Of course there's a test (two, to be honest on what has been covered up to now).


Spelling consonant sounds

What follows is a guide to how the consonant sounds of English are realised in its orthography.  If you have followed the general guide to spelling in English, you will be aware that English is often described, sometimes despairingly, as a wholly inconsistently spelled language with no discernible connections between sound and spelling.  You will also be aware that that is only very partially true.
In the case of consonant sounds, there are clear consistencies and these are teachable.

The following takes each consonant in turn and suggests the commonest way that the sounds are realised in the morphology as well as noting some rarities, often loan words from other languages, which have to be learned individually.
A silent final 'e' has been ignored in this list and the ordering is as for the list of consonants in the table above.

Sound Common spellings Rarities and varieties Sound Common spellings Rarities and varieties
/p/ p or pp:
/z/ z, zz, or s:
(also tsarina)
/d/ d, dd or ed:
AmE: tt:
*/h/ h or wh:
/tʃ/ ch, tch or t:

tch is never initial
/ŋ/ ng, n or ngue:
/v/ v, vv or f:
/j/ y or i:
/s/ s, ss or c:
/t/ t, tt, bt, ght or ed:
/ʒ/ g, j or s:
/ɡ/ g, gue or gh:
/n/ n, nn or kn:
kn is only initial
nn is only medial
/f/ f, ff, gh or ph:
lieutenant (BrE)
/r/ r, rr or wr:
/ð/ th:
/b/ b or bb:
/ʃ/ sh, s, ss, c, ce, ch or ti:
*/k/ c, k, kk or cc:
ck is never initial
/m/ m, mm or mb:
/dʒ/ g, j, dg or dj:
*/l/ l or ll:
/θ/ th:
/w/ w, wh or u:
* The /h/, /k/ or /l/ sounds are often the ways in which the /x/ sound in loch, chutzpah, llyn and other loan words are rendered in Standard English.  Many speakers of Standard English do, however, make the effort to produce /x/ in these cases.

Go to the index for the pronunciation section of the in-service guides