A tool for translating texts using a statistical dictionary.

midobal/sd

Statistical Dictionary

This software translates a text using a statistical dictionary.

Dependencies

To generate IBM Model 1 alignments, mgiza is needed.
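For readers unfamiliar with IBM Model 1, the following is a minimal sketch of the idea behind it: an EM loop that estimates word translation probabilities from sentence pairs. It is only an illustration, not mgiza's implementation and not part of this repository. The function name `train_ibm1`, the toy corpus, and the omission of the NULL word are assumptions made for brevity.

```python
def train_ibm1(pairs, iterations=10):
    """Estimate t[src][tgt] ~ P(tgt | src) with IBM Model 1 EM (no NULL word).

    pairs: list of (source_tokens, target_tokens) sentence pairs.
    """
    # Uniform initialisation over every co-occurring word pair.
    tgt_vocab = {tw for _, tgt in pairs for tw in tgt}
    uniform = 1.0 / len(tgt_vocab)
    t = {}
    for src, tgt in pairs:
        for sw in src:
            for tw in tgt:
                t.setdefault(sw, {})[tw] = uniform

    for _ in range(iterations):
        count = {}  # expected counts of (source word, target word)
        total = {}  # expected counts per source word
        # E-step: distribute each target word over the source words
        # of its sentence, proportionally to the current probabilities.
        for src, tgt in pairs:
            for tw in tgt:
                norm = sum(t[sw][tw] for sw in src)
                for sw in src:
                    delta = t[sw][tw] / norm
                    row = count.setdefault(sw, {})
                    row[tw] = row.get(tw, 0.0) + delta
                    total[sw] = total.get(sw, 0.0) + delta
        # M-step: renormalise the expected counts into probabilities.
        t = {sw: {tw: c / total[sw] for tw, c in row.items()}
             for sw, row in count.items()}
    return t


# Toy corpus (hypothetical data, for illustration only).
corpus = [
    ("das haus".split(), "the house".split()),
    ("das buch".split(), "the book".split()),
    ("ein buch".split(), "a book".split()),
]
table = train_ibm1(corpus)
best = {sw: max(row, key=row.get) for sw, row in table.items()}
```

In this toy run, `best` maps each source word to its most probable target word. A real corpus needs a dedicated tool such as mgiza.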

Usage

Alignments

IBM1Alignments.sh source_file target_file dest_dir

Where:

  • source_file is the source from the parallel training data.
  • target_file is the target from the parallel training data.
  • dest_dir is the directory in which to save the alignments.

Note: the script expects the variable $GIZA to point to mgiza's bin directory.
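As a sketch of how the alignment step might be driven from Python, the helper below checks that GIZA is set before building the command line. The helper name `alignment_command`, the `./IBM1Alignments.sh` path (which assumes the script is executable and in the current directory), and the file names in the example are all hypothetical.

```python
import os


def alignment_command(source_file, target_file, dest_dir, env=None):
    """Return the argv list for IBM1Alignments.sh, failing early if GIZA is unset."""
    env = os.environ if env is None else env
    if not env.get("GIZA"):
        raise RuntimeError("GIZA must point to mgiza's bin directory")
    # Assumption: the script lives in the current working directory.
    return ["./IBM1Alignments.sh", source_file, target_file, dest_dir]


# Example with placeholder paths; nothing is executed here.
cmd = alignment_command(
    "train.src", "train.tgt", "alignments_dir",
    env={"GIZA": "/path/to/mgiza/bin"},
)
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` once the placeholder paths are replaced with real ones.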

Text translation

SD.py -t text_file -a alignments

Where:

  • text_file is the text to translate.
  • alignments is the file created at the alignments step (dest_dir/alignments).
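To illustrate what translating with a statistical dictionary means, here is a minimal sketch: each word is replaced by its most probable translation, and unknown words are copied unchanged. This is not the code in SD.py. The in-memory table layout, the function name `translate`, the probabilities, and the handling of unknown words are assumptions, and the format of the alignments file is not covered.

```python
def translate(tokens, table):
    """Translate word by word, picking the most probable entry for each token.

    table: dict mapping a source word to {target word: probability}.
    Words missing from the table are copied unchanged (an assumption).
    """
    output = []
    for word in tokens:
        candidates = table.get(word)
        if candidates:
            output.append(max(candidates, key=candidates.get))
        else:
            output.append(word)
    return output


# Hypothetical dictionary with made-up probabilities.
toy_table = {
    "das": {"the": 0.9, "that": 0.1},
    "buch": {"book": 0.95, "the": 0.05},
}
result = translate("das buch hier".split(), toy_table)
```

Here `result` is `["the", "book", "hier"]`: the first two words are looked up, and the third is kept as-is because it has no entry.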

Run through Docker

Alternatively, to run this software with Docker, have a look at this repo.
