Skip to content

Navigation Menu

Explore
By size
By industry
By use case
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

tyronechen / genomenlp Public

Notifications You must be signed in to change notification settings
Fork 3
Star 5

Code
Issues 5
Pull requests
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Releases: tyronechen/genomenlp

Releases · tyronechen/genomenlp

v2.8.5

04 Oct 06:38

tyronechen

Compare

Choose a tag to compare

Loading

v2.8.5 Latest

Latest

Fix a bug where passing a config to train did not override output URL
Raise a warning when local and remote sources clash
Fix a bug where automated download does not work for cross validation

Assets 2

Loading

All reactions

v2.8.2

07 Sep 08:30

tyronechen

Compare

Choose a tag to compare

Loading

v2.8.2

Enable fit_powerlaw to pull models from wandb directly.

Assets 2

Loading

All reactions

v2.8.1

30 Aug 23:49

tyronechen

Compare

Choose a tag to compare

Loading

v2.8.1

Enable pooling of different token files
Update documentation for associated scripts

Assets 2

Loading

All reactions

v2.8.0

27 Aug 02:53

tyronechen

Compare

Choose a tag to compare

Loading

v2.8.0

Can now compare empirical token distributions across runs

Assets 2

Loading

All reactions

v2.7.2

24 Aug 04:41

tyronechen

Compare

Choose a tag to compare

Loading

v2.7.2

Fix a bug where casing and sequence splitting did not occur correctly during tokenisation

Assets 2

Loading

All reactions

v2.7.1

12 Aug 14:41

tyronechen

Compare

Choose a tag to compare

Loading

v2.7.1

Fix a bug where tokenise_bio did not load dependencies correctly
Add sequence breaking functionality to tokenise_bio -b for splitting long seqs that may cause memory issues (eg chr1)
Add sequence casing functionality to tokenise_bio -c for changing data input during tokeniser training to upper or lower case

Assets 2

Loading

All reactions

v2.6.3

10 Aug 11:24

tyronechen

Compare

Choose a tag to compare

Loading

v2.6.3

Can now sweep on a subset of data by using the --partition_percent option

Assets 2

Loading

All reactions

v2.5.0

09 Aug 05:18

tyronechen

Compare

Choose a tag to compare

Loading

v2.5.0

Fix a bug where train was not functioning correctly (class label encodings)

Assets 2

Loading

All reactions

v2.4.4

04 Aug 02:51

tyronechen

Compare

Choose a tag to compare

Loading

v2.4.4

Fix a bug where csv files were truncated if the input sequence is too long
Fix a bug where embeddings were not generated correctly in create_embedding_bio_sp.py

Assets 2

Loading

All reactions

v2.4.3

31 Jul 08:12

tyronechen

Compare

Choose a tag to compare

Loading

v2.4.3

Add support for reverse complementing some non-standard nucleotides
Fix bug in k-merisation process where a non-existing tokeniser file was parsed

Assets 2

Loading

All reactions

Previous 1 2 Next

Previous Next

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.