Welcome to Project W!

What is this?

Project W is a self-hostable platform on which users can create transcripts of their audio files (speech-to-text). It uses OpenAI's Whisper model for the actual transcription and provides an easy-to-use interface for creating and managing transcription jobs.

Why do we need this? Why not just use OpenAI's own service?

In short: OpenAI's service is not good enough when it comes to data privacy.

In some research fields at our university, many interviews need to be conducted and transcribed. Traditionally these transcriptions are done manually; however, with recent advancements, AI can do this job as well as or even better than the average human, and fully automatically. Since the faculties are required to keep these interviews private, they can't simply hand them to third parties like OpenAI. Everything must stay inside the university. Furthermore, Whisper has high hardware requirements (the large models need powerful GPUs), making it difficult or impossible for the average person to run on their work laptop or desktop. Also, setting up CUDA and Whisper and then using it (it only has a command-line interface) is not something the average user would want to do.

This is where Project W comes in: it is designed so that everything can be hosted by the university itself on powerful hardware (like an A100 GPU) while remaining very easy for the average person to use. Just go to the website, sign up, and upload some files!

Why are there three repositories?

Project W consists of three components: the frontend/client, the backend, and the runner. We decided to host them in different Git repositories to keep them better separated.

The backend and runner are written in Python, and we use Flask for the backend's HTTP server. The frontend is written in Svelte with svelte-spa-router so that it can be compiled into plain JavaScript, HTML and CSS. Nothing other than a web server (e.g. nginx) is required to serve the frontend; in particular, Node.js is not needed. Of course, you can also write your own client with anything that can communicate with a REST API. This means you can use Project W from a Bash or Python script to automate certain tasks.
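As a rough illustration of what such a script could look like, here is a minimal sketch of a client that uses only the Python standard library. Everything specific in it is an assumption made for illustration: the instance URL, the route names `/api/login` and `/api/jobs/submit`, the form-encoded login body, and the bearer-token header are all hypothetical. Consult the API documentation of your instance for the real routes and authentication scheme.

```python
import json
import urllib.parse
import urllib.request

# Hypothetical values: replace them with your instance's URL and the
# routes listed in the Project W API documentation.
BASE_URL = "https://project-w.example.org"
LOGIN_PATH = "/api/login"          # assumed route name
SUBMIT_PATH = "/api/jobs/submit"   # assumed route name


def build_login_request(base_url, email, password):
    """Build (but do not send) a form-encoded POST request for logging in."""
    body = urllib.parse.urlencode({"email": email, "password": password}).encode()
    return urllib.request.Request(base_url + LOGIN_PATH, data=body, method="POST")


def build_authorized_request(base_url, path, token, data=None):
    """Build a request carrying a bearer token (assumed auth scheme).

    The request is a POST when a body is given and a GET otherwise.
    """
    headers = {"Authorization": f"Bearer {token}"}
    method = "POST" if data is not None else "GET"
    return urllib.request.Request(
        base_url + path, data=data, headers=headers, method=method
    )


def send(request):
    """Send a prepared request and decode the JSON response body."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Building the requests separately from sending them keeps the script easy to inspect before it talks to a real server. A call such as `send(build_login_request(BASE_URL, email, password))` would then perform the login, provided the assumed route and body format match your instance.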

Documentation: RTFM!

You can access the full documentation for administrators and developers here. Most notably this includes installation and configuration instructions for all three components if you want to host them yourself.

Presentation

This project was created as part of the software practical "Research Software Engineering" at the University of Heidelberg during the winter term 2023/24. At the end of this practical we also gave a presentation, which you can find here.

Acknowledgments

This repository was set up using the SSC Cookiecutter for Python Packages.
