You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Nov 1, 2021. It is now read-only.
Kaggle is a lot of hoops for a user to jump through just to get an example working: install, set up key, run data getter script.
It's also work for us: We have to ensure datasets haven't moved or changed a lot, and we sometimes have to perform extra steps to process them.
These datasets are generally under creative commons licenses or similar. There's no reason why we can't:
Download a subset for example purposes (this keeps things light)
Process that subset ourselves (saves users time and effort)
Store it either in data/ (for light stuff like text which can go directly in repo) or use get_data.sh to download from somewhere we control (for larger stuff like images)
Affected examples
wikipedia-sentences
multires-lyrics-search
cross-modal-search
query-while-indexing
The text was updated successfully, but these errors were encountered:
As discussed in various meetings with @lusloher , @aga11313 , @FionnD
Kaggle is a lot of hoops for a user to jump through just to get an example working: install, set up key, run data getter script.
It's also work for us: We have to ensure datasets haven't moved or changed a lot, and we sometimes have to perform extra steps to process them.
These datasets are generally under creative commons licenses or similar. There's no reason why we can't:
data/
(for light stuff like text which can go directly in repo) or useget_data.sh
to download from somewhere we control (for larger stuff like images)Affected examples
The text was updated successfully, but these errors were encountered: