cmudict-ipa

CMU Pronouncing Dictionary converted to IPA by Alexander Piperski

cmudict-0.7b-ipa.txt contains all the entries from the CMU Pronouncing Dictionary (version 0.7b; http://www.speech.cs.cmu.edu/cgi-bin/cmudict) automatically converted to the International Phonetic Alphabet (IPA). Each line consists of a headword and its transcription(s) separated by a tab symbol; variant forms are separated by ", ".

Examples:

ALPINE ˈælˌpaɪn
APPALACHIAN ˌæpəˈleɪtʃən, ˌæpəˈleɪʃən, ˌæpəˈlætʃən
HIMALAYAN ˌhɪməˈleɪən, ˌhɪməˈlaɪən

Apart from the standard Arpabet to IPA conversion rules (https://en.wikipedia.org/wiki/Arpabet), the following ones were observed:

AH is converted to ʌ when bearing primary stress and to ə otherwise (AH1 -> ʌ; AH0, AH2 -> ə)
Vowel length is indicated (AA -> ɑː, ER -> ɝː, IY -> iː, UW -> uː). However, unstressed word-final ER and IY are short (i.e., ER0 -> ɝ and IY -> i when word-final).

brown-frequency-list-with-ipa.txt contains 33862 words occurring in the Brown Corpus of American English (https://archive.org/details/BrownCorpus) sorted by their frequency of occurrence. This list may be of use to English learners willing to pronounce most frequent words correctly.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
brown-frequency-list-with-ipa.txt		brown-frequency-list-with-ipa.txt
cmudict-0.7b-ipa.txt		cmudict-0.7b-ipa.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

cmudict-ipa

About

Releases

Packages

menelik3/cmudict-ipa

Folders and files

Latest commit

History

Repository files navigation

cmudict-ipa

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages