Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Source code for Mining Transliterations from parallel corpus #1

Open
GokulNC opened this issue May 4, 2020 · 5 comments
Open

Source code for Mining Transliterations from parallel corpus #1

GokulNC opened this issue May 4, 2020 · 5 comments
Assignees

Comments

@GokulNC
Copy link

GokulNC commented May 4, 2020

Can you please share the source code for mining transliterations from parallel corpus, like say Wiki Eng and Wiki Telugu? @bedapudi6788

Also, is it possible to mine for transliterations at the article level rather than just at the titles level?

It would be really useful for other Indian languages too.
Thanks in advance :)

@bedapudi6788 bedapudi6788 self-assigned this May 8, 2020
@bedapudi6788
Copy link
Contributor

bedapudi6788 commented May 8, 2020

I will update the repo with the scripts and data for other languages.

Also, is it possible to mine for transliterations at the article level rather than just at the titles level?

I have not explored this yet. I don't think it will be as easy or as useful though.

@GokulNC
Copy link
Author

GokulNC commented May 9, 2020

I will update the repo with the scripts and data for other languages.

@bedapudi6788 Sure, thanks a lot :)

@GokulNC
Copy link
Author

GokulNC commented Jun 18, 2020

Hi @bedapudi6788

Any info on this? :)

@bedapudi6788
Copy link
Contributor

Sorry about the delay, I have been working on some other things and re-factoring is a tedious task. I will try to finish this in a couple of weeks.

@GokulNC
Copy link
Author

GokulNC commented Jun 21, 2020

Sure thanks. :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants