Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about input data for training SkyGPT. #2

Open
GabbySuwichaya opened this issue Feb 28, 2024 · 0 comments
Open

Question about input data for training SkyGPT. #2

GabbySuwichaya opened this issue Feb 28, 2024 · 0 comments

Comments

@GabbySuwichaya
Copy link

Hi!
Thanks for the great work and thanks so much for releasing the implementation of your work and related algorithms. I am learning a lot from reading your paper. Also, your model and how you shaped the problem into a probabilistic model is quite interesting.

Therefore, I am currently trying to run the training of SkyGPT.

But I don't know that what are going to be the inputs for

  1. Training the transformer ? (the data has to be specified by users).
  2. Training the VQVAE ? I am guessing it is a hdf5 called GPT_full_2min.hdf5...

My understanding is that I will have to generate the samples for training SkyGPT

  1. by running SkyGPT/script/reformat_input.py and then
  2. Then, use the result from previous step for SkyGPT/script/sample_gen.py

Then, I will get a hdf5 file containing

  • 'train_data': [B, H, W, 3] np.uint8,
  • 'train_idx': [B], np.int64 (start indexes for each video)
  • 'test_data': [B', H, W, 3] np.uint8,
  • 'test_idx': [B'], np.int64

But here are the problems....

  1. What is the input for SkyGPT/script/reformat_input.py ?
  2. Is GPT_full_2min.hdf5 a resulting file from SkyGPT/script/sample_gen.py ?
  3. What is the input for Training the transformer ?
  4. And how are these data related to the files that you provided in the Google drive ?
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant