Skip to content
View bauwenst's full-sized avatar
:octocat:
#SigmaMaleGrindset
:octocat:
#SigmaMaleGrindset

Organizations

@LAGoM-NLP
Block or Report

Block or report bauwenst

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
bauwenst/README.md

I'm Thomas. This is my ✨professional✨ GitHub account, which I use to host my website plus repos I want associated with my name. If this account shows days without any commits, it's likely because I'm cooking up gruesome code in private repos on my original account @GitMew.

  • PhD student in natural language processing at the KU Leuven in Belgium. I work in the LAGoM • NLP research group, which is part of the Human-Computer Interaction (HCI) division at our department of Computer Science.
  • I tokenise stuff, and if you're not careful, I will tokenise you next 💀
  • Contact me using the information on this page.

Pinned Loading

  1. TkTkT TkTkT Public

    A collection of Pythonic subword tokenisers and text preprocessing tools.

    Python

  2. LaMoTO LaMoTO Public

    Language Modelling Tasks as Objects (LaMoTO) treats the pretraining and finetuning of causal and masked language models as classes themselves, not just the models.

    Python

  3. fiject fiject Public

    Object-oriented, two-stage PDF figure generation library for Python.

    Python 3

  4. MoDeST MoDeST Public

    Morphological Decomposition & Segmentation Trove

    Python