Skip to content

Releases: mozilla/DeepSpeech

v0.2.0-alpha.3

28 Jul 13:34
bcde01f
Compare
Choose a tag to compare
v0.2.0-alpha.3 Pre-release
Pre-release
Merge pull request #1369 from lissyx/newver

Bump to v0.2.0-alpha.3

v0.2.0-alpha.2

28 Jul 13:30
26a0904
Compare
Choose a tag to compare
v0.2.0-alpha.2 Pre-release
Pre-release
Merge pull request #1367 from lissyx/arm64-nomore

Also remove aarch64 from packages pulling

v0.2.0-alpha.1

28 Jul 13:23
74f458a
Compare
Choose a tag to compare
v0.2.0-alpha.1 Pre-release
Pre-release
Merge pull request #1366 from lissyx/new-ver

Bump to v0.2.0-alpha.1

0.2.0-alpha.0

28 Jul 13:09
57f2371
Compare
Choose a tag to compare
0.2.0-alpha.0 Pre-release
Pre-release
Merge pull request #1364 from lissyx/tag_more_more_fix

Avoid duplicated tag name for routes

Deep Speech 0.1.0

24 Nov 15:39
b7abbd9
Compare
Choose a tag to compare

General

This is the initial release of Deep Speech, an open speech-to-text engine. This release includes source code

v0.1.0.tar.gz

and a model, not yet optimized for size,

deepspeech-0.1.0-models.tar.gz

trained on American English which achieves a 6.0% word error rate (The language model included some test data.) on the LibriSpeech clean test corpus, and example audio

audio-0.1.0.tar.gz

which can be used to test the engine.

Bindings

In addition it includes a Python based command line tool deepspeech, installed through

pip install deepspeech

Alternatively, quicker inference can be performed using a supported NVIDIA GPU on Linux. (See below to find which GPU's are supported.) This is done by instead installing the GPU specific package:

pip install deepspeech-gpu

Also, it exposes bindings for the following languages

  • Python (Versions 2.7, 3.4, 3.5, and 3.6) installed via
    pip install deepspeech
    Alternatively, quicker inference can be performed using a supported NVIDIA GPU on Linux. (See below to find which GPU's are supported.) This is done by instead installing the GPU specific package:
    pip install deepspeech-gpu
  • NodeJS (Versions 4.x, 5.x, and 6.x) installed via
    npm install deepspeech
    
    Alternatively, quicker inference can be performed using a supported NVIDIA GPU on Linux. (See below to find which GPU's are supported.) This is done by instead installing the GPU specific package:
    npm install deepspeech-gpu
    
  • C++ which requires the appropriate shared objects are installed from native_client.tar.xz (See the section in the main README which describes native_client.tar.xz installation.)

In addition there are third party bindings that are supported by external developers, for example

  • Rust which is installed by following the instructions on the external Rust repo.

Supported Platforms

  • OS X 10.12 and 10.13
  • Linux x86 64 bit with a modern CPU (Supports up to AVX2/FMA)
  • Linux x86 64 bit with a modern CPU + NVIDIA GPU (Compute Capability at least 3.0, see NVIDIA docs)
  • Raspbian Jessie on Raspberry Pi 3

Contact/Getting Help

  1. FAQ - We have a list of common questions, and their answers, in our FAQ. When just getting started, it's best to first check the FAQ to see if your question is addressed.
  2. Discourse Forums - If your question is not addressed in the FAQ, the Discourse Forums is the next place to look. They contain conversations on General Topics, Using Deep Speech, Alternative Platforms, and Deep Speech Development.
  3. IRC - If your question is not addressed by either the FAQ or Discourse Forums, you can contact us on the #machinelearning channel on Mozilla IRC; people there can try to answer/help
  4. Issues - Finally, if all else fails, you can open an issue in our repo if there is a bug with the current code base.