Skip to content

DQNEO/apple-dictionary-parser

Repository files navigation

Apple Dictionary Parser

Apple Dictionary Parser is a command line tool and a library to parse and analyze MacOS's built-in dictionary files.

Currently only "New Oxford American Dictionary" is supported.

Installation

go install github.com/DQNEO/apple-dictionary-parser@latest

Usage

Export dictionary contents in raw format

apple-dictionary-parser dump

This dump subcommand automatically finds the location of the dictionary file in your MacOS, and extract the binary content into a raw dump file (/tmp/noad.cache).

The format of the dump file is TSV (tab separated values), each line representing a word with definition.

<world title>\t<definition of the word in XML>\n

If you are just interested in the raw contents of the dictionary and want to process the data on your own, this will be all you want.

Export dictionary contents into a text file

apple-dictionary-parser text /tmp/all.txt

Export dictionary contents into a HTML file

apple-dictionary-parser html  /tmp/all.html

Export dictionary contents into alphabetically separated HTML files

apple-dictionary-parser htmlsplit /tmp/

This generates a.html, b.html, ..., z.html files in a given directory.

Analyze etymology data

apple-dictionary-parser etym /tmp/

This analyzes etymology graph and make outputs in various formats (yaml, html)

Show phonetics information

apple-dictionary-parser phonetics

License

MIT

Author

@DQNEO