Skip to content
This repository has been archived by the owner on Jul 22, 2024. It is now read-only.

Replaces existing SED and PY file #61

Open
wants to merge 3 commits into
base: master
Choose a base branch
from
Open

Conversation

ctiijima
Copy link

@ctiijima ctiijima commented Apr 2, 2019

Still needs the newline after every sentence function worked in to finish a complete replacement.

Still needs the newline after every sentence function worked in.
@yhwang
Copy link
Member

yhwang commented Apr 3, 2019

one question: where can we use this csv file?

@ctiijima
Copy link
Author

ctiijima commented Apr 3, 2019

It will write the CSV in the folder where the RTF files were sourced. If you would prefer to continue to use individual TXT files, that can be modified into the script functionality. I am more or less providing this to you because it was part of the preliminary work to bring the ezDI dataset to the Tensorflow/Datasets project before I drop from the group.

@tonanhngo
Copy link
Contributor

Hi Chris, thanks for putting this script together. It appears the csv usage along with the listing of wav file name is really for the TF Datasets for ezDI that we are working on now. You might want to check into my fork instead: https://github.com/tonanhngo/datasets.git
For the code pattern, it's also good to consolidate some of the shell script commands into the python script as you mention, but we would want the output to be a regular text file without the wav filename. If you want to make another version for this usage, that would be good. In this case, please update the README file also to reflect the new way to process the data.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants